SILS-ETD

Please use this identifier to cite or link to this item: http://hdl.handle.net/1901/208

Full metadata record

DC FieldValueLanguage
contributor.advisorRobert M. Loseeen
creatorJonathan L. Elsasen
date.accessioned2005-07-06T19:04:23Z-
date.available2005-07-06en
date.available2005-07-06T19:04:23Z-
date.issued2005-07-06T19:04:23Z-
date.submittedJuly 6, 2005en
identifier.urihttp://hdl.handle.net/1901/208-
description.abstractDimensionality reduction in the bag-of-words vector space document representation model has been widely studied for the purposes of improving accuracy and reducing computational load of document retrieval tasks. These techniques, however, have not been studied to the same degree with regard to document clustering tasks. This study evaluates the effectiveness of two popular dimensionality reduction techniques for clustering, and their effect on discovering accurate and understandable topical groupings of documents. The two techniques studied are Latent Semantic Analysis and Independent Component Analysis, each of which have been shown to be effective in the past for retrieval purposes.en
formatapplication/pdfen
format.extent1008122 bytes-
format.mimetypeapplication/pdf-
language.isoen_USen
publisherSchool of Information and Library Scienceen
subjectInformation Retrieval, Statistical Methods/Evaluationen
titleAn Evaluation of Projection Techniques for Document Clustering: Latent Semantic Analysis and Independent Component Analysisen
typeElectronic Theses and Dissertationsen
degree.disciplineInformation Scienceen
degree.grantorUniversity of North Carolina at Chapel Hillen
degree.levelMasteren
degree.nameMaster of Scienceen
licensehttp://creativecommons.org/licenses/by-nc/1.0/en
Appears in Collections:SILS Master's Papers

Files in This Item:

File SizeFormat 
MastersPaperFinal.pdf984KbAdobe PDFView/Open

Show simple item record

All items in SILS-ETD are protected by copyright, with all rights reserved.