CCAligned

GPTKB entity

Statements (17)
Predicate Object
gptkbp:instanceOf gptkb:dataset
gptkbp:containsLanguagePairs over 100 language pairs
gptkbp:createdBy gptkb:Facebook_AI_Research
gptkbp:dataSource gptkb:Common_Crawl
gptkbp:domain machine translation
gptkbp:format gptkb:text
https://www.w3.org/2000/01/rdf-schema#label CCAligned
gptkbp:license CC BY-SA 4.0
gptkbp:relatedTo gptkb:OPUS
gptkb:WMT
gptkbp:releaseYear 2019
gptkbp:size over 1.9 billion parallel sentences
gptkbp:supportsLanguage high-resource and low-resource languages
gptkbp:url https://opus.nlpl.eu/CCAligned.php
gptkbp:usedFor training machine translation models
gptkbp:bfsParent gptkb:M2M-100
gptkbp:bfsLayer 6