统计机器翻译领域自适应方法比较研究

丁亮; 李颖; 何彦青

文章摘要

丁亮,李颖,何彦青.统计机器翻译领域自适应方法比较研究[J].情报工程,2016,2(4):080-088

统计机器翻译领域自适应方法比较研究

Comparison Study of Domain Adaption Methods in Statistical Machine Translation DING Liang，LI Ying，HE

DOI：10.3772/j.issn.2095-915X.2016.04.010

中文关键词: 统计机器翻译，领域自适应，语料选取，翻译性能改进

英文关键词: Statistical machine translation,domain adaptation,corpus selection,translation performance improvement

基金项目:国家自然科学基金项目：（61303152、 71503240 和 71403257），中国科学技术信息研究所重点工作项目：（ZD2016-05）

作者	单位
丁亮	中国科学技术信息研究所
李颖	中国科学技术信息研究所
何彦青	中国科学技术信息研究所

摘要点击次数: 4824

全文下载次数: 3814

中文摘要:

统计机器翻译常常面临训练数据与待翻译文本领域不一的问题，从而影响了翻译的性能，因此领域自适应一直是研究者关注的课题。本文以传统自适应方法和现行的机器学习方法为框架，介绍了近年来统计机器翻译领域自适应研究的进展。分析了各类研究方法的优缺点并对未来研究做出展望。

英文摘要:

Statistical machine translation（SMT）is often faced with the problem of different domains between the training data set and test data set,which affecting the performance of translation,therefore domain adaptation has been a subject of concern.In this paper,we constructed the domain adaptation research framework-the traditional adaptive methods and the existing machine learning methods,and introduced the research progress in recent years.We analyzed the advantages and disadvantages of each method and made a prospect for the future research.

查看全文查看/发表评论下载PDF阅读器

关闭