Summary of the paper

Title The Kachna L1/L2 Picture Replication Corpus
Authors Helena Spilková, Daniel Brenner, Anton Öttl, Pavel Vondřička, Wim van Dommelen and Mirjam Ernestus
Abstract This paper presents the Kachna corpus of spontaneous speech, in which ten Czech and ten Norwegian speakers were recorded both in their native language and in English. The dialogues are elicited using a picture replication task that requires active cooperation and interaction of speakers by asking them to produce a drawing as close to the original as possible. The corpus is appropriate for the study of interactional features and speech reduction phenomena across native and second languages. The combination of productions in non-native English and in speakers’ native language is advantageous for investigation of L2 issues while providing a L1 behaviour reference from all the speakers. The corpus consists of 20 dialogues comprising 12 hours 53 minutes of recording, and was collected in 2008. Preparation of the transcriptions, including a manual orthographic transcription and an automatically generated phonetic transcription, is currently in progress. The phonetic transcriptions are automatically generated by aligning acoustic models with the speech signal on the basis of the orthographic transcriptions and a dictionary of pronunciation variants compiled for the relevant language. Upon completion the corpus will be made available via the European Language Resources Association (ELRA).
Language Speech resource/database
Topics Corpus (creation, annotation, etc.), Dialogue, Speech resource/database
Full paper The Kachna L1/L2 Picture Replication Corpus
Bibtex @InProceedings{SPILKOV10.768,
  author = {Helena Spilková and Daniel Brenner and Anton Öttl and Pavel Vondřička and Wim van Dommelen and Mirjam Ernestus},
  title = {The Kachna L1/L2 Picture Replication Corpus},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }