Chimera: Learning Shared Semantic Space for Speech-to-Text Translation
This is a PyTorch implementation of the “Chimera” paper, Learning Shared Semantic Space for Speech-to-Text Translation (https://arxiv.org/abs/2105.03095, accepted by ACL Findings 2021), which aims to bridge the modality gap by unifying the tasks of MT (textual Machine Translation) and ST (Speech-to-Text Translation). By utilizing an external MT corpus, it achieved new SOTA performance on all 8 language pairs in the MuST-C benchmark. This repository is currently a nightly version and is bug-prone because of code refactoring. […]