Abstract
The SMART document retrieval system is used to investigate algorithms for text analysis and request searching. Results from three document collections indicate that word normalization is efficiently performed by automatic thesaurus lookup, while phrase matching procedures, statistical association methods, and concept hierarchies are useful for special applications. Automatic document clustering schemes and user-interactive feedback methods permit rapid searches of large collections. Abstracts are found to be superior to titles as a base for content analysis in a document retrieval system and almost as good as complete texts. Proper procedures for designing dictionaries and searching requests are discussed. The practicality of large-scale document centers and their proper design are considered in light of these results.
Original language | English (US) |
---|---|
Pages (from-to) | 201-218 |
Number of pages | 18 |
Journal | Information Storage and Retrieval |
Volume | 4 |
Issue number | 2 |
State | Published - Jun 1968 |
Externally published | Yes |
All Science Journal Classification (ASJC) codes
- General Engineering