Summary of the paper

Title Building an Italian FrameNet through Semi-automatic Corpus Analysis
Authors Alessandro Lenci, Martina Johnson and Gabriella Lapesa
Abstract In this paper, we outline the methodology we adopted to develop a FrameNet for Italian. The main element of novelty with respect to the original FrameNet is represented by the fact that the creation and annotation of Lexical Units is strictly grounded in distributional information (statistical distribution of verbal subcategorization frames, lexical and semantic preferences of each frame) automatically acquired from a large, dependency-parsed corpus. We claim that this approach allows us to overcome some of the shortcomings of the classical lexicographic method used to create FrameNet, by complementing the accuracy of manual annotation with the robustness of data on the global distributional patterns of a verb. In the paper, we describe our method for extracting distributional data from the corpus and the way we used it for the encoding and annotation of LUs. The long-term goal of our project is to create an electronic lexicon for Italian similar to the original English FrameNet. For the moment, we have developed a database of syntactic valences that will be made freely accessible via a web interface. This represents an autonomous resource besides the FrameNet lexicon, of which we have a beginning nucleus consisting of 791 annotated sentences.
Language Lexicon, lexical database
Topics Semantics, Acquisition, Lexicon, lexical database
Full paper Building an Italian FrameNet through Semi-automatic Corpus Analysis
Bibtex @InProceedings{LENCI10.313,
  author = {Alessandro Lenci and Martina Johnson and Gabriella Lapesa},
  title = {Building an Italian FrameNet through Semi-automatic Corpus Analysis},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }