English  |  正體中文  |  简体中文  |  Post-Print筆數 : 27 |  Items with full text/Total items : 94986/125531 (76%)
Visitors : 30979810      Online Users : 470
RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version
    政大機構典藏 > 理學院 > 資訊科學系 > 會議論文 >  Item 140.119/98230
    Please use this identifier to cite or link to this item: http://nccur.lib.nccu.edu.tw/handle/140.119/98230

    Title: Statistical approaches to patent translation - Experiments with various settings of training data
    Authors: Tseng, Yuen-Hsien;Liu, Chao-Lin;Tsai, Chia-Chi;Wang, Jui-Ping;Chuang, Yi-Hsuan;Jeng, James
    Contributors: 資科系
    Keywords: Chinese segmentation, language modeling, training corpus
    Date: 2011-12
    Issue Date: 2016-06-22 17:10:06 (UTC+8)
    Abstract: This paper describes our experiments and results in the NTCIR-9 Chinese-to-English Patent Translation Task. A series of open source software were integrated to build a statistical machine translation model for the task. Various Chinese segmentation, additional resources, and training corpus preprocessing were then tried based on this model. As a result, more than 20 experiments were conducted to compare the translation performance. Our current results show that 1) consistent segmentation between the training and testing data is important to maintain the performance; 2) sufficient number of good quality bilingual training sentences is more helpful than additional bilingual dictionaries; and 3) the translation effectiveness in BLEU values doubles as the number of bilingual training sentences at the level of 100,000 doubles.
    Relation: Proceedings of the Ninth NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, Question Answering and Cross-Lingual Information Access - PatentMT (NTCIR 9), 661‒665. Tokyo, Japan, 6-9 December 2011
    Data Type: conference
    Appears in Collections:[資訊科學系] 會議論文

    Files in This Item:

    File Description SizeFormat
    661-665.pdf1384KbAdobe PDF325View/Open

    All items in 政大典藏 are protected by copyright, with all rights reserved.

    社群 sharing

    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - Feedback