Automated Metadata Creation: Possibilities and Pitfalls

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Automated indexing—using a computer to look at individual documents and assign metadata without a person looking at every document—was used to build an interactive online database to store and retrieve pages from a loose-leaf resource (i.e., a resource that changes state over time). A database was designed and more than 30,000 pages in the database were indexed. Digitization, optical character recognition, and computer scripting to extract metadata were the methods used to assign most metadata. In places where the computer program could not assign metadata, a person looked at the document and assigned metadata. The index was audited for errors. The computer script and the human indexer had comparable error rates but computer indexing spent much less time per value assigned. It is recommended that automated indexing be considered in indexing projects, especially where a large number of similar documents are to be indexed.

Original languageAmerican English
Pages (from-to)267-284
Number of pages18
JournalThe Serials Librarian
Volume64
Issue number1-4
DOIs
StatePublished - Apr 8 2013

Disciplines

  • Social and Behavioral Sciences
  • Library and Information Science

Keywords

  • Artificial Intelligence
  • Automated Indexing
  • Digital Libraries
  • Loose-leaf
  • Metadata Creation
  • Relational Database

Fingerprint

Dive into the research topics of 'Automated Metadata Creation: Possibilities and Pitfalls'. Together they form a unique fingerprint.

Cite this