Accession Number : AD0820801
Title : COMPUTER CLASSIFICATION OF INTELLIGENCE-TYPE DOCUMENTS.
Descriptive Note : Final rept.,
Corporate Author : IBM FEDERAL SYSTEMS DIV GAITHERSBURG MD
Personal Author(s) : Perriens, Mathew P. ; Williams, John H., Jr
Report Date : SEP 1967
Pagination or Media Count : 92
Abstract : A computer classification technique was successfully tested on intelligence-type documents. Results of experiments on technical data bases in the English and German languages are also reported. Because the technique is statistical rather than syntactical, it can classify documents in any language without requiring translation. In addition to the usual tests on sample and control data bases, a successful test was performed on an additional data base that had not been used to generate the classification statistics. The statistical technique is based upon multiple discriminant functions, which can classify into any number of categories; the technique also provides for classification to several levels of detail. A user may select any set of subject categories suiting his need and provide a set of sample documents for each category. A subset of words to form the classification bases is selected from the sample in accordance with their statistical properties. Classification applications are not limited to document retrieval, but may include document routing, screening, or disseminating functions. (Author)
Descriptors : (*MILITARY INTELLIGENCE, AUTOMATION), (*INDEXES, COMPUTER PROGRAMMING), (*WORD ASSOCIATION, STATISTICAL ANALYSIS), (*VOCABULARY, STATISTICAL ANALYSIS), (*DATA PROCESSING, MILITARY INTELLIGENCE), SYSTEMS ENGINEERING, CLASSIFICATION, GERMAN LANGUAGE, ENGLISH LANGUAGE, FACTOR ANALYSIS, FLOW CHARTING, AUTOMATIC, INFORMATION RETRIEVAL.
Subject Categories : Information Science
Distribution Statement : APPROVED FOR PUBLIC RELEASE
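
Illustrative note : The abstract outlines a pipeline: the user supplies sample documents per category, a subset of words is selected by statistical properties, and discriminant functions assign new documents to a category. The Python sketch below is one minimal reading of that outline, not the report's actual procedure. Everything in it is an assumption made for illustration: the function names, the toy sample documents, the word-selection statistic (spread of relative frequency across categories), equal prior probabilities, and the use of a diagonal pooled variance in place of a full covariance matrix.

```python
from collections import Counter

def tokenize(text):
    # Whitespace tokenization only: no grammar or translation is involved.
    return text.lower().split()

def select_words(samples, k):
    # Assumed selection statistic: keep the k words whose relative
    # frequency differs most between categories.
    rel = {}
    for cat, docs in samples.items():
        counts = Counter(w for d in docs for w in tokenize(d))
        total = sum(counts.values())
        rel[cat] = {w: c / total for w, c in counts.items()}
    vocab = set().union(*rel.values())

    def spread(word):
        vals = [rel[c].get(word, 0.0) for c in rel]
        mean = sum(vals) / len(vals)
        return sum((v - mean) ** 2 for v in vals)

    # Ties are broken alphabetically so the result is deterministic.
    return sorted(vocab, key=lambda w: (-spread(w), w))[:k]

def vectorize(doc, words):
    # Represent a document by the relative frequency of each selected word.
    toks = tokenize(doc)
    counts = Counter(toks)
    n = max(len(toks), 1)
    return [counts[w] / n for w in words]

def train(samples, k=4):
    words = select_words(samples, k)
    vecs = {c: [vectorize(d, words) for d in docs]
            for c, docs in samples.items()}
    means = {c: [sum(col) / len(v) for col in zip(*v)]
             for c, v in vecs.items()}
    # Pooled within-category variance per word (diagonal simplification).
    n_total = sum(len(v) for v in vecs.values())
    dof = max(n_total - len(vecs), 1)
    var = []
    for j in range(len(words)):
        ss = sum((x[j] - means[c][j]) ** 2
                 for c, v in vecs.items() for x in v)
        var.append(ss / dof + 1e-6)  # small ridge avoids division by zero
    return words, means, var

def classify(doc, model):
    words, means, var = model
    x = vectorize(doc, words)

    def score(m):
        # One linear discriminant function per category (equal priors).
        return sum((xi * mi - 0.5 * mi * mi) / vi
                   for xi, mi, vi in zip(x, m, var))

    return max(means, key=lambda c: score(means[c]))

# Toy sample documents, invented purely for illustration.
samples = {
    "aviation": ["aircraft engine thrust test", "aircraft wing engine design"],
    "naval": ["ship hull sonar test", "ship sonar harbor design"],
}
model = train(samples, k=4)
print(model[0])
print(classify("engine thrust of the aircraft", model))
print(classify("sonar contact near the ship", model))
```

Because the features are word frequencies only, the same sketch could be run on German or any other whitespace-delimited text without change, which is the property the abstract emphasizes. A fuller treatment would use the full pooled covariance matrix and a separate held-out data base for evaluation.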