TY - GEN
T1 - A neural network for discovery of record layouts
AU - Bahrami, Azita
AU - Hashemi, Ray R.
AU - Talburt, John R.
AU - Burachevsky, Zhebnya
N1 - Publisher Copyright: © 2009 IADIS.
PY - 2009
Y1 - 2009
AB - For automated systems to interact with ASCII files, they must be able to discover the layout of the records. Record layouts come in three formats: fixed, delimited, and mixed. The goal of this paper is to discover the record layout of fixed-format files. Each record of such a file is treated as a character string in which the start and end positions of the fields are unknown and two adjacent fields may or may not be separated by a space. A new neural network was devised that accepts a random sample of the file's records as a working set and discovers the record layout for the file. The methodology was validated on 20 synthesized files that differ in number of fields, length of fields, order of fields, content of fields, or any combination of these. The methodology discovers the record layouts with 95% accuracy.
KW - Record layout discovery
KW - Knowledge discovery
KW - Neural network
KW - Record layout
UR - http://www.scopus.com/inward/record.url?scp=84946014262&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:84946014262
T3 - Proceedings of the IADIS International Conference WWW/Internet 2009, ICWI 2009
SP - 319
EP - 323
BT - Proceedings of the IADIS International Conference WWW/Internet 2009, ICWI 2009
A2 - Barbosa, Patricia
A2 - Nunes, Miguel Baptista
A2 - Isaias, Pedro
A2 - White, Bebo
A2 - Rodrigues, Luis
PB - IADIS
T2 - IADIS International Conference WWW/Internet 2009, ICWI 2009
Y2 - 19 November 2009 through 22 November 2009
ER -