Summary of the paper

Title Dialogue Acts Annotation for NICT Kyoto Tour Dialogue Corpus to Construct Statistical Dialogue Systems
Authors Kiyonori Ohtake, Teruhisa Misu, Chiori Hori, Hideki Kashioka and Satoshi Nakamura
Abstract This paper introduces a new corpus of consulting dialogues designed for training a dialogue manager that can handle consulting dialogues through spontaneous interactions from the tagged dialogue corpus. We have collected more than 150 hours of consulting dialogues in the tourist guidance domain. We are developing the corpus that consists of speech, transcripts, speech act (SA) tags, morphological analysis results, dependency analysis results, and semantic content tags. This paper outlines our taxonomy of dialogue act (DA) annotation that can describe two aspects of an utterance: the communicative function (SA), and the semantic content of the utterance. We provide an overview of the Kyoto tour dialogue corpus and a preliminary analysis using the DA tags. We also show a result of a preliminary experiment for SA tagging via Support Vector Machines (SVMs). We introduce the current states of the corpus development. In addition, we mention the usage of our corpus for the spoken dialogue system that is being developed.
Language Speech resource/database
Topics Dialogue, Corpus (creation, annotation, etc.), Speech resource/database
Full paper Dialogue Acts Annotation for NICT Kyoto Tour Dialogue Corpus to Construct Statistical Dialogue Systems
Bibtex @InProceedings{OHTAKE10.676,
  author = {Kiyonori Ohtake and Teruhisa Misu and Chiori Hori and Hideki Kashioka and Satoshi Nakamura},
  title = {Dialogue Acts Annotation for NICT Kyoto Tour Dialogue Corpus to Construct Statistical Dialogue Systems},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }