Summary of the paper

Title Multi-Channel Database of Spontaneous Czech with Synchronization of Channels Recorded by Independent Devices
Authors Petr Pollák and Josef Rajnoha
Abstract This paper describes Czech spontaneous speech database of lectures ondigital signal processing topic collected at Czech TechnicalUniversity in Prague, commonly with the procedure of its recording andannotation. The database contains 21.7 hours of speech material from22 speakers recorded in 4 channels with 3 principally differentmicrophones. The annotation of the database is composed from basictime segmentation, orthographic transcription including marks forspeaker and environmental non-speech events, pronunciation lexicon inSAMPA alphabet, session and speaker information describing recordingconditions, and the documentation. The orthographic transcription with timesegmentation is saved in XML format supported by frequently usedannotation tool Transcriber. In this article, special attention isalso paid to the description of time synchronization of signals recordedby two independent devices: computer based recording platform usingtwo external sound cards and commercial audio recorder EdirolR09. This synchronization is based on cross-correlation analysis withsimple automated selection of suitable short signal subparts. Thecollection and annotation of this database is now complete and itsavailability via ELRA is currently under preparation.
Language Speech Recognition/Understanding
Topics Speech resource/database, Tools, systems, applications, Speech Recognition/Understanding
Full paper Multi-Channel Database of Spontaneous Czech with Synchronization of Channels Recorded by Independent Devices
Bibtex @InProceedings{POLLK10.516,
  author = {Petr Pollák and Josef Rajnoha},
  title = {Multi-Channel Database of Spontaneous Czech with Synchronization of Channels Recorded by Independent Devices},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA