Summary of the paper

Title Cross-Corpus Textual Entailment for Sublanguage Analysis in Epidemic Intelligence
Authors Avaré Stewart, Kerstin Denecke and Wolfgand Nejdl
Abstract Textual entailment has been recognized as a generic task that captures majorsemantic inference needs across many natural language processing applications.However, to date, textual entailment has not been considered in a cross-corpussetting, nor for user generated content.Given the emergence of Medicine 2.0, medical blogs are becoming anincreasinglyaccepted source of information. However, given the characteristics of blogs(which tend to be noisy and informal; or containa interspersing of subjective and factual sentences) a potentially largeamount of irrelevant information may be present.Given the potential noise, the overarching problem withrespect to information extraction from social media is achieving the correctlevel of sentence filtering - as opposed to document or blog post level.Specifically for the task of medical intelligence gathering.In this paper, we propose an approach to textual entailment with uses thetext from one source of user generated content (T text) for sentence-levelfiltering within a new and less amenable one (H text), when the underlyingdomain, tasks or semantic information is the same, or overlaps.
Language Corpus (creation, annotation, etc.)
Topics Information Extraction, Information Retrieval, Knowledge Discovery/Representation, Corpus (creation, annotation, etc.)
Full paper Cross-Corpus Textual Entailment for Sublanguage Analysis in Epidemic Intelligence
Bibtex @InProceedings{STEWART10.881,
  author = {Avaré Stewart, Kerstin Denecke and Wolfgand Nejdl},
  title = {Cross-Corpus Textual Entailment for Sublanguage Analysis in Epidemic Intelligence},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA