Searching multi-hierarchical XML documents: The case of fragmentation

Alex Dekhtyar, Ionut E. Iacob, Srikanth Methuku

Research output: Contribution to journal › Conference article › peer-review

Abstract

To properly encode the properties of textual documents in XML, multiple markup hierarchies must be used, which often leads to conflicting markup in the encoding. The Text Encoding Initiative (TEI) Guidelines [1] recognize this problem and suggest a number of ways to incorporate multiple hierarchies in a single well-formed XML document. In this paper, we present a framework for processing XPath queries over multi-hierarchical XML documents represented using fragmentation, one of the TEI-suggested techniques. We define the semantics of XPath over DOM trees of fragmented XML, extend the path expression language to cover overlap in markup, and describe FragXPath, our implementation of the proposed XPath semantics over fragmented markup.
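As a rough illustration of the problem the abstract describes, the sketch below shows why a plain path query over fragmented markup can miscount elements. It is not FragXPath and does not reproduce the paper's semantics. The sample document, the `fid` attribute used to group fragments, and the `logical_elements` helper are all hypothetical; the `part` values (`I` for initial, `F` for final, `N` for unfragmented) are assumed to follow the TEI fragmentation convention.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: sentences (<s>) overlap physical lines (<line>).
# The sentence "s1" crosses a line boundary, so it is split into two fragments.
DOC = """
<text>
  <line n="1"><s fid="s1" part="I">one two</s></line>
  <line n="2"><s fid="s1" part="F">three.</s> <s fid="s2" part="N">four five.</s></line>
</text>
""".strip()

root = ET.fromstring(DOC)


def logical_elements(root, tag):
    """Merge fragments of `tag` that share a `fid` (assumed attribute)
    into one logical element, returning (fid, text) pairs."""
    merged = {}  # insertion order follows the first fragment's document order
    for el in root.iter(tag):  # iter() walks the tree in document order
        text = "".join(el.itertext()).strip()
        merged.setdefault(el.get("fid"), []).append(text)
    return [(fid, " ".join(parts)) for fid, parts in merged.items()]


# A plain path query sees physical fragments, not logical sentences.
physical = root.findall(".//s")
logical = logical_elements(root, "s")

print(len(physical))
print(logical)
```

Here the plain query counts three `<s>` elements although the text contains only two sentences. A fragmentation-aware query semantics has to bridge this gap between physical fragments and logical elements.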

Original language: English
Pages (from-to): 576-585
Number of pages: 10
Journal: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 3588
State: Published - 2005
Event: 16th International Conference on Database and Expert Systems Applications, DEXA 2005 - Copenhagen, Denmark
Duration: Aug 22 2005 → Aug 26 2005
