Summary of the paper

Title Building Textual Entailment Specialized Data Sets: a Methodology for Isolating Linguistic Phenomena Relevant to Inference
Authors Luisa Bentivogli, Elena Cabrio, Ido Dagan, Danilo Giampiccolo, Medea Lo Leggio and Bernardo Magnini
Abstract This paper proposes a methodology for the creation of specialized data sets for Textual Entailment, made of monothematic Text-Hypothesis pairs (i.e. pairs in which only one linguistic phenomenon relevant to the entailment relation is highlighted and isolated). The expected benefits derive from the intuition that investigating the linguistic phenomena separately, i.e. decomposing the complexity of the TE problem, would yield an improvement in the development of specific strategies to cope with them. The annotation procedure assumes that humans have knowledge about the linguistic phenomena relevant to inference, and a classification of such phenomena both into fine grained and macro categories is suggested. We experimented with the proposed methodology over a sample of pairs taken from the RTE-5 data set, and investigated critical issues arising when entailment, contradiction or unknown pairs are considered. The result is a new resource, which can be profitably used both to advance the comprehension of the linguistic phenomena relevant to entailment judgments and to make a first step towards the creation of large-scale specialized data sets.
Language Semantics
Topics Textual Entailment and Paraphrasing, Corpus (creation, annotation, etc.), Semantics
Full paper Building Textual Entailment Specialized Data Sets: a Methodology for Isolating Linguistic Phenomena Relevant to Inference
Bibtex @InProceedings{BENTIVOGLI10.478,
  author = {Luisa Bentivogli and Elena Cabrio and Ido Dagan and Danilo Giampiccolo and Medea Lo Leggio and Bernardo Magnini},
  title = {Building Textual Entailment Specialized Data Sets: a Methodology for Isolating Linguistic Phenomena Relevant to Inference},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }