Back to category: Miscellaneous

Limited version - please login or register to view the entire paper.

Design Docutment for Information Retreival Engine

Project Overview:

Information Retrieval is the way of finding documents within a collection relevant to a specific query and also detecting which are more relevant than the others. An Information Retrieval System includes indexing, searching and recalling data particularly text from files/documents. The project given to us aims to implement the above using the Inverted Index model. The search engine thus developed should be able to build an index from specified files, and search them based on a user input query.

Approach:

The Information Retrieval Engine is supposed to read in from text files, and build an index for each word and create a corresponding posting list for the same. In such a case, it is easier to use Tree Data Structure as it will allow you input words in a specified order, and will contain a pointer to another Link List Data Structure which will store the posting list.
While parsing (which is the building of index), the text file will be read and...

Posted by: Angelia Holliday

Limited version - please login or register to view the entire paper.