Title |
Lexical Resources for Noun Compounds in Czech, English and Zulu |
Authors |
Karel Pala, Christiane Fellbaum and Sonja Bosch |
Abstract |
In this paper we discuss noun compounding, a highly generative, productiveprocess, in three distinct languages: Czech, English and Zulu. Derivationalmorphology presents a large grey area between regular, compositional andidiosyncratic, non-compositional word forms. The structural properties ofcompounds in each of the languages are reviewed and contrasted. Whereas Englishcompounds are head-final and thus left-branching, Czech and Zulu compoundsusually consist of a leftmost governing head and a rightmost dependent element.Semantic properties of compounds are discussed with special reference tosemantic relations between compound members which cross-linguistically showuniversal patterns, but idiosyncratic, language specific compounds are alsoidentified. The integration of compounds into lexical resources, and WordNetsin particular, remains a challenge that needs to be considered in terms of thecompounds syntactic idiosyncrasy and semantic compositionality. Experimentswith processing compounds in Czech, English and Zulu are reported and partlyevaluated. The obtained partial lists of the Czech, English and Zulu compoundsare also described. |
Language |
MultiWord Expressions & Collocations |
Topics |
Lexicon, lexical database, Semantics, MultiWord Expressions & Collocations |
Full paper  |
Lexical Resources for Noun Compounds in Czech, English and Zulu |
Bibtex |
@InProceedings{PALA10.883,
author = {Karel Pala, Christiane Fellbaum and Sonja Bosch}, title = {Lexical Resources for Noun Compounds in Czech, English and Zulu}, booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |