Bilingual Word Embedding with Sentence Similarity Constraint for Machine Translation

Bilingual Word Embedding with Sentence Similarity Constraint for Machine Translation
Title:
Bilingual Word Embedding with Sentence Similarity Constraint for Machine Translation
Other Titles:
International Conference on Asian Language Processing (IALP 2017)
Publication URL:
Publication Date:
01 December 2017
Citation:
Abstract:
In this work, we propose a context-based bilingual word embedding framework that leverages the information of large amount of parallel sentence pairs which share the same semantic meaning. Such information is abundantly available but has not been fully utilized in previous work of context-based bilingual word embedding models, which only exploit local contextual information through a short window sequence at the word level. To incorporate such information, we define a sentence similarity matching objective which is enforced as a constraint into the original bilingual word embedding objective. They are jointly optimized to better learn the bilingual word embedding. Experimental results show that the proposed model is superior to previous methods on machine translation quality.
License type:
PublisherCopyrights
Funding Info:
Description:
ISBN:

Files uploaded:

File Size Format Action
final-paper.pdf 322.31 KB PDF Open