Hugging Face
YouTube search... ...Google search
HuggingFace NLP Library - Open Parallel Corpus (OPUS)
- Hugging Face dives into machine translation with release of 1,000 models | Khari Johnson - VentureBeat
- Build Your Own Machine Translation Service with Transformers Using the latest Helsinki NLP models available in the Transformers library to create a standardized machine translation service | Kyle Gallatin - Towards Data Science
OPUS is a growing collection of translated texts from the web. In the OPUS project we try to convert and align free online data, to add linguistic annotation, and to provide the community with a publicly available parallel corpus. OPUS is a project undertaken by the University of Helsinki and global partners to gather and open-source a wide variety of language data sets. OPUS is based on open source products and the corpus is also delivered as an open content package. We used several tools to compile the current collection. All pre-processing is done automatically. No manual corrections have been carried out. The OPUS collection is growing! ... OPUS the open parallel corpus