Show simple item record  

dc.contributor.authorSmith, Tony C.
dc.contributor.authorWitten, Ian H.
dc.date.accessioned2008-11-17T03:10:57Z
dc.date.available2008-11-17T03:10:57Z
dc.date.issued1996
dc.identifier.citationSmith, T.C. & Witten, I.H. (1996). Learning language using genetic algorithms. In Connectionist, statistical and symbolic approaches to learning for natural language processing (pp. 132-145). Berlin: Springer.en_US
dc.identifier.urihttps://hdl.handle.net/10289/1360
dc.description.abstractStrict pattern-based methods of grammar induction are often frustrated by the apparently inexhaustible variety of novel word combinations in large corpora. Statistical methods offer a possible solution by allowing frequent well-formed expressions to overwhelm the infrequent ungrammatical ones. They also have the desirable property of being able to construct robust grammars from positive instances alone. Unfortunately, the zero-frequency problem entails assigning a small probability to all possible word patterns, thus ungrammatical n-grams become as probable as unseen grammatical ones. Further, such grammars are unable to take advantage of inherent lexical properties that should allow infrequent words to inherit the syntactic properties of the class to which they belong. This paper describes a genetic algorithm (GA) that adapts a population of hypothesis grammars towards a more effective model of language structure. The GA is statistically sensitive in that the utility of frequent patterns is reinforced by the persistence of efficient substructures. It also supports the view of language learning as a bootstrapping problem, a learning domain where it appears necessary to simultaneously discover a set of categories and a set of rules defined over them. Results from a number of tests indicate that the GA is a robust, fault-tolerant method for inferring grammars from positive examples of natural language.en_US
dc.language.isoen
dc.publisherSpringeren_US
dc.relation.urihttp://www.springerlink.com/content/7380v8117n80mw60/en_US
dc.subjectcomputer scienceen_US
dc.subjectMachine learning
dc.titleLearning language using genetic algorithmsen_US
dc.typeConference Contributionen_US
dc.identifier.doi10.1007/3-540-60925-3_43en_US
dc.relation.isPartOfConnectionist, Statistical and Symbolic Approaches to Learning for Natural Language Processingen_NZ
pubs.begin-page132en_NZ
pubs.elements-id31556
pubs.end-page145en_NZ
pubs.volumeLNCS 1040en_NZ


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record