LREC 2010 Proceedings

Summary of the paper

Title	Automatic and Human Evaluation Study of a Rule-based and a Statistical Catalan-Spanish Machine Translation Systems
Authors	Marta R. Costa-jussà, Mireia Farrús, José B. Mariño and José A. R. Fonollosa
Abstract	Machine translation systems can be classified into rule-based and corpus-basedapproaches, in terms of their core technology. Since both paradigms havelargely been used during the last years, one of the aims in the researchcommunity is to know how these systems differ in terms of translation quality.To this end, this paper reports a study and comparison of a rule-based and acorpus-based (particularly, statistical) Catalan-Spanish machine translationsystems, both of them freely available in the web.The translation quality analysis is performed under two different domains:journalistic and medical. The systems are evaluated by using standard automaticmeasures, as well as by native human evaluators. Automatic results show thatthe statistical system performs better than the rule-based system. Humanjudgements show that in the Spanish-to-Catalan direction the statistical systemalso performs better than the rule-based system, while in theCatalan-to-Spanish direction is the other way round. Although the statisticalsystem obtains the best automatic scores, its errors tend to be more penalizedby human judgements than the errors of the rule-based system. This can beexplained because statistical errors are usually unexpected and they do notfollow any pattern.
Language	Evaluation methodologies
Topics	Machine Translation, SpeechToSpeech Translation, Web Services, Evaluation methodologies
Full paper	Automatic and Human Evaluation Study of a Rule-based and a Statistical Catalan-Spanish Machine Translation Systems
Bibtex	@InProceedings{RCOSTAJUSS10.47, author = {Marta R. Costa-jussà, Mireia Farrús, José B. Mariño and José A. R. Fonollosa}, title = {Automatic and Human Evaluation Study of a Rule-based and a Statistical Catalan-Spanish Machine Translation Systems}, booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} }