Summary of the paper

Title BabyExp: Constructing a Huge Multimodal Resource to Acquire Commonsense Knowledge Like Children Do
Authors Massimo Poesio, Marco Baroni, Oswald Lanz, Alessandro Lenci, Alexandros Potamianos, Hinrich Schütze, Sabine Schulte im Walde and Luca Surian
Abstract There is by now widespread agreement that the most realistic way to construct the large-scale commonsense knowledge repositories required by natural language and artificial intelligence applications is by letting machines learn such knowledge from large quantities of data, like humans do. A lot of attention has consequently been paid to the development of increasingly sophisticated machine learning algorithms for knowledge extraction. However, the nature of the input that humans are exposed to while learning commonsense knowledge has received much less attention. The BabyExp project is collecting very dense audio and video recordings of the first 3 years of life of a baby. The corpus constructed in this way will be transcribed with automated techniques and made available to the research community. Moreover, techniques to extract commonsense conceptual knowledge incrementally from these multimodal data are also being explored within the project. The current paper describes BabyExp in general, and presents pilot studies on the feasibility of the automated audio and video transcriptions.
Language Corpus (creation, annotation, etc.)
Topics Acquisition, Knowledge Discovery/Representation, Corpus (creation, annotation, etc.)
Full paper BabyExp: Constructing a Huge Multimodal Resource to Acquire Commonsense Knowledge Like Children Do
Bibtex @InProceedings{POESIO10.455,
  author = {Massimo Poesio and Marco Baroni and Oswald Lanz and Alessandro Lenci and Alexandros Potamianos and Hinrich Schütze and Sabine Schulte im Walde and Luca Surian},
  title = {BabyExp: Constructing a Huge Multimodal Resource to Acquire Commonsense Knowledge Like Children Do},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }