Large-Scale Modeling of Wordform Learning and Representation.

Sibley, Daragh E.; T. Kello, Christopher; Plaut, David; L. Elman, Jeffrey

doi:10.1184/R1/6616919.v1

File(s) stored somewhere else

http://www.ncbi.nlm.nih.gov/pmc/articles/pmid/20107621/

Please note: Linked content is NOT stored on Carnegie Mellon University and we can't guarantee its availability, quality, security or accept any liability.

Large-Scale Modeling of Wordform Learning and Representation.

journal contribution

posted on 2008-06-01, 00:00 authored by Daragh E. Sibley, Christopher T. Kello, David PlautDavid Plaut, Jeffrey L. Elman

The forms of words as they appear in text and speech are central to theories and models of lexical processing. Nonetheless, current methods for simulating their learning and representation fail to approach the scale and heterogeneity of real wordform lexicons. A connectionist architecture termed the sequence encoder is used to learn nearly 75,000 wordform representations through exposure to strings of stress-marked phonemes or letters. First, the mechanisms and efficacy of the sequence encoder are demonstrated and shown to overcome problems with traditional slot-based codes. Then, two large-scale simulations are reported that learned to represent lexicons of either phonological or orthographic word-forms. In doing so, the models learned the statistics of their lexicons as shown by better processing of well-formed pseudowords as opposed to ill-formed (scrambled) pseudowords, and by accounting for variance in well-formedness ratings. It is discussed how the sequence encoder may be integrated into broader models of lexical processing.

History

Date

2008-06-01

Usage metrics

Keywords

Large-scale connectionist modeling Sequence encoder Simple recurrent network Lexical processing Orthography Phonology Wordforms

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

File(s) stored somewhere else

Large-Scale Modeling of Wordform Learning and Representation.

History

Date

Usage metrics

Categories

Keywords

Licence

Exports