Please use this identifier to cite or link to this item: http://hdl.handle.net/1901/88
| Title: | AUTOMATIC EXTRACTION OF AUTHOR SELF CONTRIBUTED METADATA FOR ELECTRONIC THESES AND DISSERTATIONS |
| Authors: | Mao Ni |
| Keyword: | Metadata -- Automatic Metadata Extraction |
| Keyword: | Metadata -- Author Self Contributed Metadata |
| Keyword: | Digital Library -- Electronic Theses and Dissertations, ETDs |
| Issue Date: | 4-May-2004 |
| Publisher: | School of Information and Library Science |
| Abstract: | This paper discusses the design and implement of an automatic way to extract the metadata from PDF files in the process of the submission to the Electronic and Theses Dissertations (ETDs). During the submission, each ETDs system requires some metadata about the theses to facilitate the metadata search after it is archived. Those metadata, like creator, title, data, abstract, subject and publisher, comply with the Dublin Core Metadata Initiative. In most of all existing ETDs repositories, students are required to manually type in these metadata, which discourages students' submission, especially when resubmissions are needed due to the errors found in the theses, because they have to type all the metadata again each time they submit the theses. By standardizing a method for capturing the metadata from the original documents, our project aims to enable digital repository, which hosts the ETDs collection, to automatically extract the metadata from the theses, making the submissions much... |
| URI: | http://hdl.handle.net/1901/88 |
| Appears in Collections: | SILS Master's Papers |
Files in This Item:
|
All items in SILS-ETD are protected by copyright, with all rights reserved.