Remove Categorization Remove Computational Linguistics Remove Data Scarcity
article thumbnail

FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation

Google Research AI blog

Also, region-unaware MT systems tend to favor whichever variety has more data available online, which disproportionately affects speakers of under-resourced language varieties. However, the vast majority of available training data doesn’t specify what regional variety the translation is in.