Abstract: This paper digs into the characteristics and challenges of web-based machine translation,and proposes possible solutions.First of all,we look back on the history of machine translation and summarize its methods.Next,we analyze the characteristics of internet bilingual corpora and monolingual corpora as: large scale,with lots of noise,real-time and sometimes sparse.Based on the features described above,we propose the hybrid machine translation method,corpus mining and filtering methods,and distributed computing methods.Furthermore,the pivot language approach is adopted to tackle the data sparseness problem,thus enabling the quick development of multilingual machine translation systems.We then discuss the approach to support the personalization of machine translation via the combination of translation technology and search technology.Finally the applications and products of machine translation technology are presented.
Publication Year: 2011
Publication Date: 2011-01-01
Language: en
Type: article
Access and Citation
AI Researcher Chatbot
Get quick answers to your questions about the article from our AI researcher chatbot