Title |
Annotation Process Management Revisited |
Authors |
Dain Kaplan, Ryu Iida and Takenobu Tokunaga |
Abstract |
Proper annotation process management is crucial to the construction of corpora,which are in turn indispensable to the data-driven techniques that have come tothe forefront in NLP during the last two decades. It is still common to seead-hoc tools created for a specific annotation project, but it is time thischanged; creation of such tools is labor and time expensive, and is secondaryto corpus creation. In addition, such tools likely lack proper annotationprocess management, increasingly more important as corpora sizes grow in sizeand complexity. This paper first raises a list of ten needs that any generalpurpose annotation system should address moving forward, such as user & rolemanagement, delegation & monitoring of work, diffing & merging annotatorswork, versioning of corpora, multilingual support, import/export formatflexibility, and so on. A framework to address these needs is then proposed,and how having proper annotation process management can be beneficial to thecreation and maintenance of corpora explained. The paper then introduces SLATE(Segment and Link-based Annotation Tool Enhanced), the second iteration of aweb-based annotation tool, which is being rewritten to implement the proposedframework. |
Language |
LR Infrastructures and Architectures |
Topics |
Tools, systems, applications, Corpus (creation, annotation, etc.), LR Infrastructures and Architectures |
Full paper  |
Annotation Process Management Revisited |
Bibtex |
@InProceedings{KAPLAN10.129,
author = {Dain Kaplan, Ryu Iida and Takenobu Tokunaga}, title = {Annotation Process Management Revisited}, booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |