Summary of the paper

Title Annotation Process Management Revisited
Authors Dain Kaplan, Ryu Iida and Takenobu Tokunaga
Abstract Proper annotation process management is crucial to the construction of corpora,which are in turn indispensable to the data-driven techniques that have come tothe forefront in NLP during the last two decades. It is still common to seead-hoc tools created for a specific annotation project, but it is time thischanged; creation of such tools is labor and time expensive, and is secondaryto corpus creation. In addition, such tools likely lack proper annotationprocess management, increasingly more important as corpora sizes grow in sizeand complexity. This paper first raises a list of ten needs that any generalpurpose annotation system should address moving forward, such as user & rolemanagement, delegation & monitoring of work, diffing & merging annotators’work, versioning of corpora, multilingual support, import/export formatflexibility, and so on. A framework to address these needs is then proposed,and how having proper annotation process management can be beneficial to thecreation and maintenance of corpora explained. The paper then introduces SLATE(Segment and Link-based Annotation Tool Enhanced), the second iteration of aweb-based annotation tool, which is being rewritten to implement the proposedframework.
Language LR Infrastructures and Architectures
Topics Tools, systems, applications, Corpus (creation, annotation, etc.), LR Infrastructures and Architectures
Full paper Annotation Process Management Revisited
Bibtex @InProceedings{KAPLAN10.129,
  author = {Dain Kaplan, Ryu Iida and Takenobu Tokunaga},
  title = {Annotation Process Management Revisited},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA