Summary of the paper

Title Empty Categories in a Hindi Treebank
Authors Archna Bhatia, Rajesh Bhatt, Bhuvana Narasimhan, Martha Palmer, Owen Rambow, Dipti Misra Sharma, Michael Tepper, Ashwini Vaidya and Fei Xia
Abstract We are in the process of creating a multi-representational andmulti-layered treebank for Hindi/Urdu (Palmer et al., 2009), which has threemain layers: dependency structure, predicate-argument structure (PropBank),and phrase structure. This paper discusses an important issue in treebank design which is often neglected: the use of empty categories (ECs). All three levels of representation make use of ECs. We make a high-level distinction between two types of ECs, trace and silent, on the basis of whether they are postulated to mark displacement or not. Each type is furtherrefined into several subtypes based on the underlying linguistic phenomenawhich the ECs are introduced to handle. This paper discusses the stages atwhich we add ECs to the Hindi/Urdu treebank and why. We investigate methodically the different types of ECs and their role in our syntactic and semantic representations. We also examine our decisions whether or notto coindex each type of ECs with other elements in the representation.
Language
Topics Corpus (creation, annotation, etc.)
Full paper Empty Categories in a Hindi Treebank
Bibtex @InProceedings{BHATIA10.561,
  author = {Archna Bhatia, Rajesh Bhatt, Bhuvana Narasimhan, Martha Palmer, Owen Rambow, Dipti Misra Sharma, Michael Tepper, Ashwini Vaidya and Fei Xia},
  title = {Empty Categories in a Hindi Treebank},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA