Summary of the paper

Title MPC: A Multi-Party Chat Corpus for Modeling Social Phenomena in Discourse
Authors Samira Shaikh, Tomek Strzalkowski, Aaron Broadwell, Jennifer Stromer-Galley, Sarah Taylor and Nick Webb
Abstract In this paper, we describe our experience with collecting and creating an annotated corpus of multi-party online conversations in a chat-room environment. This effort is part of a larger project to develop computational models of social phenomena such as agenda control, influence, and leadership in on-line interactions. Such models will help capture the dialogue dynamics that are essential for developing, among others, realistic human-machine dialogue systems, including autonomous virtual chat agents. In this paper we describe the data collection method used and the characteristics of the initial dataset of English chat. We have devised a multi-tiered collection process in which the subjects start from simple, free-flowing conversations and progress towards more complex and structured interactions. In this paper, we report on the first two stages of this process, which were recently completed. The third, large-scale collection effort is currently being conducted. All English dialogue has been annotated at four levels: communication links, dialogue acts, local topics and meso-topics. Some details of these annotations will be discussed later in this paper, although a full description is impossible within the scope of this article.
Language Acquisition
Topics Corpus (creation, annotation, etc.), Discourse annotation, representation and processing, Acquisition
Full paper MPC: A Multi-Party Chat Corpus for Modeling Social Phenomena in Discourse
Bibtex @InProceedings{SHAIKH10.85,
  author = {Samira Shaikh and Tomek Strzalkowski and Aaron Broadwell and Jennifer Stromer-Galley and Sarah Taylor and Nick Webb},
  title = {MPC: A Multi-Party Chat Corpus for Modeling Social Phenomena in Discourse},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {{Nicoletta Calzolari (Conference Chair)} and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }