Opentopia Directory Encyclopedia Tools

Text retrieval

Encyclopedia : T : TE : TEX : Text retrieval


Text retrieval is a branch of computerised information retrieval where the information is stored primarily in the form of text. In short it is the study of techniques of retrieving textual documents relevant to the needs of a searcher.

Text retrieval became a field of study in the 1960s. Initially documents (typically legal or scientific) were indexed by trained indexers, and assigned keywords. The keywords and a reference to the document would then be stored in a computerised database, and the user could retrieve any documents to which given keywords had been attached. Both indexing and searching were relatively skilled occupations.

The advent of full text searching made the job of the indexer redundant during the 1980s. Text databases moved from being large and centralised to local and personal, thanks to the personal computer and the CD-ROM.

Text retrieval is a critical area of study today, since it is the fundamental basis of all internet search engines.

Example: PubMed

The PubMed form interface features the "related articles" search which works through a comparison of words from the documents' title, abstract, and MeSH terms using a word-weighted algorithm. The details of this algorithm are explicated here [link].

See also

External links

 


From Wikipedia, the Free Encyclopedia. Original article here. Support Wikipedia by contributing or donating.
All text is available under the terms of the GNU Free Documentation License See Wikipedia Copyrights for details.

Search Titles
0123456789
ABCDEFGHIJ
KLMNOPQRST
UVWXYZ?

E-mail this article to:

Personal Message: