Summary of the paper

Title Work on Spoken (Multimodal) Language Corpora in South Africa
Authors Jens Allwood, Harald Hammarström, Andries Hendrikse, Mtholeni N. Ngcobo, Nozibele Nomdebevana, Laurette Pretorius and Mac van der Merwe
Abstract This paper describes past, ongoing and planned work on the collection andtranscription of spoken language samples for all the South African officiallanguages and as part of this the training of researchers in corpus linguisticresearch skills. More specifically the work has involved (and still involves)establishing an international corpus linguistic network linked to a network hubat a UNISA website and the development of research tools, a corpus researchguide and workbook for multimodal communication and spoken language corpusresearch. As an example of the work we are doing and hope to do more of in thefuture, we present a small pilot study of the influence of English andAfrikaans on the 100 most frequent words in spoken Xhosa as this is evidencedin the corpus of spoken interaction we have gathered so far. Other plannedwork, besides work on spoken language phenomena, involves comparison of spokenand written language and work on communicative body movements (gestures) andtheir relation to speech.
Language Multilinguality
Topics Corpus (creation, annotation, etc.), Discourse annotation, representation and processing, Multilinguality
Full paper Work on Spoken (Multimodal) Language Corpora in South Africa
Bibtex @InProceedings{ALLWOOD10.438,
  author = {Jens Allwood, Harald Hammarström, Andries Hendrikse, Mtholeni N. Ngcobo, Nozibele Nomdebevana, Laurette Pretorius and Mac van der Merwe},
  title = {Work on Spoken (Multimodal) Language Corpora in South Africa},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA