Summary of the paper

Title The Nijmegen Corpus of Casual Spanish
Authors Francisco Torreira and Mirjam Ernestus
Abstract This article describes the preparation, recording and orthographic transcription of a new speech corpus, the Nijmegen Corpus of Casual Spanish (NCCSp). The corpus contains around 30 hours of recordings of 52 Madrid Spanish speakers engaged in conversations with friends. Casual speech was elicited during three different parts, which together provided around ninety minutes of speech from every group of speakers. While Parts 1 and 2 did not require participants to perform any specific task, in Part 3 participants negotiated a common answer to general questions about society. Information about how to obtain a copy of the corpus can be found online at http://mirjamernestus.ruhosting.nl/Ernestus/NCCSp
Language Phonetic Databases, Phonology
Topics Corpus (creation, annotation, etc.), Speech resource/database, Phonetic Databases, Phonology
Full paper The Nijmegen Corpus of Casual Spanish
Bibtex @InProceedings{TORREIRA10.271,
  author = {Francisco Torreira and Mirjam Ernestus},
  title = {The Nijmegen Corpus of Casual Spanish},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }