Summary of the paper

Title A Flexible Representation of Heterogeneous Annotation Data
Authors Richard Johansson and Alessandro Moschitti
Abstract This paper describes a new flexible representation for the annotation ofcomplex structures of metadata over heterogeneous data collections containingtext and other types of media such as images or audio files. We argue thatexisting frameworks are not suitable for this purpose, most importantly becausethey do not easily generalize to multi-document and multimodal corpora, andbecause they often require the use of particular software frameworks. In thepaper, we define a data model to represent such structured data over multimodalcollections. Furthermore, we define a surface realization of the data structureas a simple and readable XML format. We present two examples of annotationtasks to illustrate how the representation and format work for complexstructures involving multimodal annotation and cross-document links. Therepresentation described here has been used in a large-scale project focusingon the annotation of a wide range of information ― from low-level features tohigh-level semantics ― in a multimodal data collection containing both textand images.
Language Standards for LRs
Topics Multimedia Document Processing, LR Infrastructures and Architectures, Standards for LRs
Full paper A Flexible Representation of Heterogeneous Annotation Data
Bibtex @InProceedings{JOHANSSON10.80,
  author = {Richard Johansson and Alessandro Moschitti},
  title = {A Flexible Representation of Heterogeneous Annotation Data},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA