Accession Number : ADA309542
Title : Induction as Knowledge Integration.
Descriptive Note : Research rept.,
Corporate Author : UNIVERSITY OF SOUTHERN CALIFORNIA MARINA DEL REY INFORMATION SCIENCES INST
Personal Author(s) : Smith, Benjamin D.
PDF Url : ADA309542
Report Date : DEC 1995
Pagination or Media Count : 186
Abstract : Accuracy and efficiency are the two main evaluation criteria for induction algorithms. One of the most powerful ways to improve performance along these dimensions is by integrating additional knowledge into the induction process. However, integrating knowledge that differs significantly from the knowledge already used by the algorithm usually requires rewriting the algorithm. This dissertation presents Kil, a Knowledge Integration framework for Induction, that provides a straightforward method for integrating knowledge into induction, and provides new insights into the effects of knowledge on the accuracy and complexity of induction. The idea behind Kil is to express all knowledge uniformly as constraints and preferences on hypotheses. Knowledge is integrated by conjoining constraints and disjoining preferences. A hypothesis is induced from the integrated knowledge by finding a hypothesis consistent with all of the constraints and maximally preferred by the preferences. Theoretically, just about any knowledge can be expressed in this manner. In practice, the constraint and preference languages determine both the knowledge that can be expressed and the complexity of identifying a consistent hypothesis. RS-KII, an instantiation of Kll based on a very expressive set representation, is described. RS-Kll can utilize the knowledge of at least two disparate induction algorithms-AQ-ll and CEA ('version spaces') in addition to knowledge neither algorithm can utilize. It seems likely that RS-NII can utilize knowledge from other induction algorithms, as well as novel kinds of knowledge, but this is left for future work. RS-Kll's complexity is comparable to these algorithms when using only the knowledge of a given algorithm, and in some cases RS-Kll's complexity is dramatically superior.
Descriptors : *ALGORITHMS, *INDUCTION SYSTEMS, TEST AND EVALUATION, ACCURACY, EFFICIENCY, LANGUAGE, HYPOTHESES.
Subject Categories : Cybernetics
Distribution Statement : APPROVED FOR PUBLIC RELEASE