LREC 2010 Proceedings

Summary of the paper

Title	The Nijmegen Corpus of Casual Spanish
Authors	Francisco Torreira and Mirjam Ernestus
Abstract	This article describes the preparation, recording and orthographic transcription of a new speech corpus, the Nijmegen Corpus of Casual Spanish (NCCSp). The corpus contains around 30 hours of recordings of 52 Madrid Spanish speakers engaged in conversations with friends. Casual speech was elicited during three different parts, which together provided around ninety minutes of speech from every group of speakers. While Parts 1 and 2 did not require participants to perform any specific task, in Part 3 participants negotiated a common answer to general questions about society. Information about how to obtain a copy of the corpus can be found online at http://mirjamernestus.ruhosting.nl/Ernestus/NCCSp
Language	Phonetic Databases, Phonology
Topics	Corpus (creation, annotation, etc.), Speech resource/database, Phonetic Databases, Phonology
Full paper	The Nijmegen Corpus of Casual Spanish
Bibtex	@InProceedings{TORREIRA10.271, author = {Francisco Torreira and Mirjam Ernestus}, title = {The Nijmegen Corpus of Casual Spanish}, booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Mike Rosner, Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} }