Title |
Towards a Motivated Annotation Schema of Collocation Errors in Learner Corpora |
Authors |
Margarita Alonso Ramos, Leo Wanner, Orsolya Vincze, Gerard Casamayor del Bosque, Nancy Vázquez Veiga, Estela Mosqueira Suárez and Sabela Prieto González |
Abstract |
Collocations play a significant role in second language acquisition. In order to be able to offer efficient support to learners, an NLP-based CALL environment for learning collocations should be based on a representative collocation error annotated learner corpus. However, so far, no theoretically-motivated collocation error tag set is available. Existing learner corpora tag collocation errors simply as lexical errors, which is clearly insufficient given the wide range of different collocation errors that the learners make. In this paper, we present a fine-grained three-dimensional typology of collocation errors that has been derived in an empirical study from the learner corpus CEDEL2 compiled by a team at the Autonomous University of Madrid. The first dimension captures whether the error concerns the collocation as a whole or one of its elements; the second dimension captures the language-oriented error analysis, while the third exemplifies the interpretative error analysis. To facilitate a smooth annotation along this typology, we adapted Knowtator, a flexible off-the-shelf annotation tool implemented as a Protégé plugin. |
Language |
Other |
Topics |
Corpus (creation, annotation, etc.), MultiWord Expressions & Collocations, Other |
Full paper  |
Towards a Motivated Annotation Schema of Collocation Errors in Learner Corpora |
Bibtex |
author = {Margarita Alonso Ramos and Leo Wanner and Orsolya Vincze and Gerard Casamayor del Bosque and Nancy Vázquez Veiga and Estela Mosqueira Suárez and Sabela Prieto González}, title = {Towards a Motivated Annotation Schema of Collocation Errors in Learner Corpora}, booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |