Summary of the paper

Title Using Dialogue Corpora to Extend Information Extraction Patterns for Natural Language Understanding of Dialogue
Authors Roberta Catizone, Alexiei Dingli and Robert Gaizauskas
Abstract This paper examines how Natural Language Processing (NLP) resources and online dialogue corpora can be used to extend coverage of Information Extraction (IE) templates in a Spoken Dialogue system. IE templates are used as part of a Natural Language Understanding module for identifying meaning in a user utterance. The use of NLP tools in Dialogue systems is a difficult task given that 1) spoken dialogue is often not well-formed and 2) there is a serious lack of dialogue data. In spite of that, we have devised a method for extending IE patterns using standard NLP tools and available dialogue corpora found on the web. In this paper, we explain our method, which includes using a set of NLP modules developed using GATE (a General Architecture for Text Engineering), as well as a general purpose editing tool that we built to facilitate the IE rule creation process. Lastly, we present directions for future work in this area.
Language Named Entity recognition
Topics Information Extraction, Information Retrieval, Dialogue, Named Entity recognition
Full paper Using Dialogue Corpora to Extend Information Extraction Patterns for Natural Language Understanding of Dialogue
Bibtex @InProceedings{CATIZONE10.818,
  author = {Roberta Catizone and Alexiei Dingli and Robert Gaizauskas},
  title = {Using Dialogue Corpora to Extend Information Extraction Patterns for Natural Language Understanding of Dialogue},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }