Accession Number : AD0739873

Title :   Automatic Typeset Input Technique Evaluation.

Descriptive Note : Final technical rept. Jun-Sep 71,

Corporate Author : COMPUSCAN INC TETERBORO N J

Personal Author(s) : Bantz,David ; Clarkson,W. ; Schapira,A.

Report Date : DEC 1971

Pagination or Media Count : 82

Abstract : The need to design multi-font print readers is becoming critical for the input conversion activities of the Air Force Foreign Technology Division Machine Translation facilities. This effort was to evaluate existing optical character recognition capabilities toward the total requirement of a Russian typeset print reader. In this research four pages of original scientific Russian text were used as the data base. The contractor demonstrated that the scanning and conversion of Russian text by OCR is feasible and potentially economical. The second objective was to identify the problem areas and compile solutions to them. The results of the analysis of scanning the four samples indicate that secondary recognition and increased digital resolution will be most effective in reducing the total error rate. An ultimate total error rate of less than .5% appears achievable. (Author)

Descriptors :   (*READING MACHINES, *INPUT OUTPUT DEVICES), (*CHARACTER RECOGNITION, *RUSSIAN LANGUAGE), (*MACHINE TRANSLATION, RUSSIAN LANGUAGE), OPTICAL SCANNING, COMPUTER PROGRAMMING

Subject Categories : Linguistics
      Computer Programming and Software
      Computer Hardware

Distribution Statement : APPROVED FOR PUBLIC RELEASE