Accession Number : AD0608574

Title :   IS AUTOMATIC CLASSIFICATION A REASONABLE APPLICATION OF STATISTICAL ANALYSIS OF TEXT?

Descriptive Note : Scientific rept.

Corporate Author : SYSTEM DEVELOPMENT CORP SANTA MONICA CALIF

Personal Author(s) : Doyle,Lauren B.

Report Date : 31 AUG 1964

Pagination or Media Count : 34

Abstract : The crucial question of the quality of automatic classification is treated at considerable length, and empirical data are introduced to support the hypothesis that classification quality improves as more information about each document is used for input to the classification program. Six non-judgmental criteria are used in testing the hypothesis for 100 keyword lists (each list representing a document) for a series of computer runs in which the number of words per document is increased progressively from 12 to 36. Four of the six criteria indicate the hypothesis holds, and two point to no effect. Finally, the future of automatic classification and some of the practical problems to be faced are outlined.

Descriptors :   (*CLASSIFICATION, SUBJECT INDEXING), (*STATISTICAL ANALYSIS, CLASSIFICATION), INFORMATION RETRIEVAL, AUTOMATIC, DATA PROCESSING, INPUT OUTPUT DEVICES, COMPUTERS, DOCUMENTS

Distribution Statement : APPROVED FOR PUBLIC RELEASE