Text retrieval using linear algebra : an honors thesis [(HONRS 499)]

Loading...
Thumbnail Image

Authors

Drew, Joshua L.

Advisor

Baglama, James

Issue Date

2002

Keyword

Degree

Thesis (B.?.)

Department

Honors College

Other Identifiers

Abstract

Text retrieval is an important area of research. As information and methods of its storage have proliferated, the need to have efficient methods of locating subsets of this information has increased as well.The Internet is serving to catapult the size of present-day text collections past those of only fifty years ago. Accordingly, Internet search engines are a hotbed for information retrieval research.A widely-researched text searching method involves modeling a text collection in a term-bydocument matrix, and evaluating the documents' relevance to a query with simple linear algebra.This document presents one such system, the possibilities for future research incorporating ideas of that system, and computer code with a Web interface, written in C++ and Perl, that implements the Web search engine.