Carnegie Mellon University
Browse
file.pdf (98.45 kB)

Data Collection and Language Technologies for Mapudungun

Download (98.45 kB)
journal contribution
posted on 2006-05-01, 00:00 authored by Lorraine LevinLorraine Levin, Rodolfo M Vega, Jaime G. Carbonell, Ralf D Brown, Alon LavieAlon Lavie, Eliseo Canulef, Carolina Huenchullan
Mapudungun is spoken by over 900,000 people (Mapuche) in Chile and Argentina. Thanks to an active bilingual and multicultural education program, Mapuche children are now being taught to be literate in both Mapudungun and Spanish. The Chilean Ministry of Education has teamed up with the Language Technologies Institute's AVENUE project to collect data and produce language technologies that support bilingual education. The main resource that has come out of the Mineduc-LTI partnership is Mapudungun-Spanish parallel corpus consisting of approximately 200,000 words of text and 120 hours of transcribed speech. Plans are being made for machine translation and computer-assisted instruction.

History

Date

2006-05-01

Usage metrics

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC