Evaluation of terminology translation in instance-based neural MT adaptation

M. Amin Farajian; Nicola Bertoldi; Matteo Negri; Marco Turchi; Marcello Federico

Back

Conference paper

Evaluation of terminology translation in instance-based neural MT adaptation

M. Amin Farajian, Nicola Bertoldi, Matteo Negri, Marco Turchi and Marcello Federico

EAMT 2018 - Proceedings of the 21st Annual Conference of the European Association for Machine Translation

European Association for Machine Translation

2018

Abstract

Computer aided language translation

Corpus-based methods

Domain specific

Generic system

Machine translations

Multiple domains

Technical terms

Training data

Translation quality

Computational linguistics

We address the issues arising when a neural machine translation engine trained on generic data receives requests from a new domain that contains many specific technical terms. Given training data of the new domain, we consider two alternative methods to adapt the generic system: corpus-based and instance-based adaptation. While the first approach is computationally more intensive in generating a domain-customized network, the latter operates more efficiently at translation time and can handle on-the-fly adaptation to multiple domains. Besides evaluating the generic and the adapted networks with conventional translation quality metrics, in this paper we focus on their ability to properly handle domain-specific terms. We show that instance-based adaptation, by fine-tuning the model on-the-fly, is capable to significantly boost the accuracy of translated terms, producing translations of quality comparable to the expensive corpus-based method. © 2018 The authors. This article is licensed under a Creative Commons 3.0 licence, no derivative works, attribution, CC-BY-ND.

Metrics

1 Record Views

Details

Title: Evaluation of terminology translation in instance-based neural MT adaptation
Creators - without role: M. Amin Farajian
Nicola Bertoldi
Matteo Negri
Marco Turchi
Marcello Federico
Publication Details: EAMT 2018 - Proceedings of the 21st Annual Conference of the European Association for Machine Translation
Publisher: European Association for Machine Translation
Grant note: This work has been partially supported by the EC-funded H2020 projects QT21 (grant no. 645452) and ModernMT (grant no. 645487). This work was also supported by The Alan Turing Institute under the EPSRC grant EP/N510129/1 and by a donation of Azure credits by Microsoft.
Identifiers: 9920088509548
Academic Unit: Alan Turing Institute
Language: English
Resource Type: Conference paper
Date published: 2018

Evaluation of terminology translation in instance-based neural MT adaptation

Abstract

Metrics

Details

Alan Turing Institute Social media