Summary of the paper

Title Analysing Temporally Annotated Corpora with CAVaT
Authors Leon Derczynski and Robert Gaizauskas
Abstract We present CAVaT, a tool that performs Corpus Analysis and Validation forTimeML. CAVaT is an open source, modular checking utility for statisticalanalysis of features specific to temporally-annotated natural language corpora.It provides reporting, highlights salient links between a variety of generaland time-specific linguistic features, and also validates a temporal annotationto ensure that it is logically consistent and sufficiently annotated. Uniquely,CAVaT provides analysis specific to TimeML-annotated temporal information.TimeML is a standard for annotating temporal information in natural languagetext. In this paper, we present the reporting part of CAVaT, and then itserror-checking ability, including the workings of several novel TimeML documentverification methods. This is followed by the execution of some example tasksusing the tool to show relations between times, events, signals and links. Wealso demonstrate inconsistencies in a TimeML corpus (TimeBank) that have beendetected with CAVaT.
Language Validation of LRs
Topics Corpus (creation, annotation, etc.), Tools, systems, applications, Validation of LRs
Full paper Analysing Temporally Annotated Corpora with CAVaT
Bibtex @InProceedings{DERCZYNSKI10.546,
  author = {Leon Derczynski and Robert Gaizauskas},
  title = {Analysing Temporally Annotated Corpora with CAVaT},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA