Issue #16 – Revisiting synthetic training data for Neural MT

08 Nov18 Issue #16 – Revisiting synthetic training data for Neural MT Author: Dr. Patrik Lambert, Machine Translation Scientist @ Iconic In a previous guest post in this series, Prof. Andy Way explained how to create training data for Neural MT through back-translation. This technique involves translating monolingual data in the target language into the source language to obtain a parallel corpus of “synthetic” source and “authentic” target data – so called back-translation. Andy reported interesting findings whereby, with a few million […]