Summary of the paper

Title The Study of Writing Variants in an Under-resourced Language: Some Evidence from Mobile N-Deletion in Luxembourgish
Authors Natalie D. Snoeren, Martine Adda-Decker and Gilles Adda
Abstract The national language of the Grand-Duchy of Luxembourg, Luxembourgish, hasoften been characterized as one of Europe's under-described and under-resourcedlanguages. Because of a limited written production of Luxembourgish, poorlyobserved writing standardization (as compared to other languages such asEnglish and French) and a large diversity of spoken varieties, the study ofLuxembourgish poses many interesting challenges to automatic speech processingstudies as well as to linguistic enquiries. In the present paper, we make useof large corpora to focus on typical writing and derived pronunciation variantsin Luxembourgish, elicited by mobile -n deletion (hereafter shortened to MND).Using transcriptions from the House of Parliament debates and 10k words fromnews reports, we examine the reality of MND variants in written transcripts ofspeech. The goal of this study is manyfold: quantify the potential of variationdue to MND in written Luxembourgish, check the mandatory status of the MND ruleand discuss the arising problems for automatic spoken Luxembourgish processing.
Language Corpus (creation, annotation, etc.)
Topics Endangered languages, Knowledge Discovery/Representation, Corpus (creation, annotation, etc.)
Full paper The Study of Writing Variants in an Under-resourced Language: Some Evidence from Mobile N-Deletion in Luxembourgish
Bibtex @InProceedings{SNOEREN10.258,
  author = {Natalie D. Snoeren, Martine Adda-Decker and Gilles Adda},
  title = {The Study of Writing Variants in an Under-resourced Language: Some Evidence from Mobile N-Deletion in Luxembourgish},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA