Please use this identifier to cite or link to this item: http://hdl.handle.net/1901/41
Full metadata record
| DC Field | Value | Language |
|---|---|---|
| contributor.advisor | Stephanie W. Hass | en |
| creator | Susan E. Teague Rector | en |
| date.accessioned | 2004-04-12T17:18:18Z | - |
| date.available | 2004-04-12 | en |
| date.issued | 2004-04-12T17:18:18Z | - |
| date.submitted | April 2004 | en |
| identifier.uri | http://hdl.handle.net/1901/41 | - |
| description.abstract | This study explores the challenges of using traditional information retrieval methods to retrieve document-centric XML encoded text. It demonstrates how coupling structure and content in query and index formulation improves retrieval performance. Native XML database (NXD) and search engine technologies were evaluated in a baseline experiment, and in a second test after alterations were made to their respective indexes. Documents were retrieved for simple and complex forms of 30 XPath and keyword queries from a corpus of 95 XML/TEI encoded texts. Overall results indicated that query augmentation using document structure improves retrieval performance. Complex queries submitted to the NXD produced the most satisfying results, with an average precision of 93.3% and an average recall of 86.3%. Performance improvements were also achieved using complex, structured queries and indexes in the search engine. Study findings suggest that effective XML retrieval models might result from a combination of unstructured and structured retrieval techniques. | en |
| format | application/pdf | en |
| format.extent | 1160373 bytes | - |
| format.mimetype | application/pdf | - |
| language.iso | en_US | en |
| publisher | School of Information and Library Science | en |
| rights | Attribution-NonCommercial 1.0 | en |
| subject | Information Retrieval XML search & retrieval Semistructured Data Indexing Full Text Searching | en |
| title | Accessing Information Based on a Combination of Document Structure and Content: Exploiting XML tags in indexing and searching to enhance content retrieval of online document-centric XML encoded texts | en |
| type | Electronic Theses and Dissertations | en |
| degree.discipline | Information Science | en |
| degree.grantor | University of North Carolina at Chapel Hill | en |
| degree.level | Master | en |
| degree.name | Master of Science | en |
| Appears in Collections: | SILS Master's Papers |
Files in This Item:
|
All items in SILS-ETD are protected by copyright, with all rights reserved.