Summary of the paper

Title Building the Basque PropBank
Authors Izaskun Aldezabal, María Jesús Aranzabe, Arantza Díaz de Ilarraza and Ainara Estarrona
Abstract This paper presents the work that has been carried out to annotate semanticroles in the Basque Dependency Treebank (BDT). We will describe the resourceswe have used and the way the annotation of 100 verbs has been done. We decideto follow the model proposed in the PropBank project that has been deployed inother languages, such as Chinese, Spanish, Catalan and Russian. The resourcesused are: an in-house database with syntactic/semantic subcategorization framesfor Basque verbs, an English-Basque verb mapping based on Levin’sclassification and the BDT itself. Detailed guidelines for human taggers havebeen established as a result of this annotation process. In addition, we havecharacterized the information associated to the semantic tag. Besides, andbased on this study, we will define semi-automatic procedures that willfacilitate the task of manual annotation for the rest of the verbs of theTreebank. We have also adapted AbarHitz, a tool used in the construction of theBDT, for the task of annotating semantic roles according to the proposedcharacterization.
Language Tools, systems, applications
Topics Corpus (creation, annotation, etc.), Endangered languages, Tools, systems, applications
Full paper Building the Basque PropBank
Bibtex @InProceedings{ALDEZABAL10.217,
  author = {Izaskun Aldezabal, María Jesús Aranzabe, Arantza Díaz de Ilarraza and Ainara Estarrona},
  title = {Building the Basque PropBank},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA