SILS-ETD

Please use this identifier to cite or link to this item: http://hdl.handle.net/1901/88

Title: AUTOMATIC EXTRACTION OF AUTHOR SELF CONTRIBUTED METADATA FOR ELECTRONIC THESES AND DISSERTATIONS
Authors: Mao Ni
Keyword: Metadata -- Automatic Metadata Extraction
Keyword: Metadata -- Author Self Contributed Metadata
Keyword: Digital Library -- Electronic Theses and Dissertations, ETDs
Issue Date: 4-May-2004
Publisher: School of Information and Library Science
Abstract: This paper discusses the design and implement of an automatic way to extract the metadata from PDF files in the process of the submission to the Electronic and Theses Dissertations (ETDs). During the submission, each ETDs system requires some metadata about the theses to facilitate the metadata search after it is archived. Those metadata, like creator, title, data, abstract, subject and publisher, comply with the Dublin Core Metadata Initiative. In most of all existing ETDs repositories, students are required to manually type in these metadata, which discourages students' submission, especially when resubmissions are needed due to the errors found in the theses, because they have to type all the metadata again each time they submit the theses. By standardizing a method for capturing the metadata from the original documents, our project aims to enable digital repository, which hosts the ETDs collection, to automatically extract the metadata from the theses, making the submissions much...
URI: http://hdl.handle.net/1901/88
Appears in Collections:SILS Master's Papers

Files in This Item:

File SizeFormat 
MaoNi.pdf669KbAdobe PDFView/Open

Show full item record

All items in SILS-ETD are protected by copyright, with all rights reserved.