OPUS-100

на сайте с December 18, 2022 14:24
Dataset Card for Opus100 Dataset Summary OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side. The corpus covers 100 languages (including English). Selected the languages based on the volume of parallel data available in OPUS.