|
gptkbp:instanceOf
|
gptkb:corpus
gptkb:parallel_corpus
|
|
gptkbp:availableOn
|
http://www.statmt.org/europarl/
|
|
gptkbp:contains
|
parliamentary debates
translations
transcriptions
|
|
gptkbp:creator
|
gptkb:Philipp_Koehn
|
|
gptkbp:domain
|
gptkb:politics
|
|
gptkbp:firstReleased
|
2005
|
|
gptkbp:language
|
gptkb:Bulgarian
gptkb:Estonian
gptkb:French
gptkb:German
gptkb:Greek
gptkb:Italian
gptkb:Romanian
gptkb:Spanish
gptkb:Latvian
Czech
Danish
Dutch
English
Finnish
Hungarian
Lithuanian
Polish
Portuguese
Swedish
Slovak
Slovene
|
|
gptkbp:license
|
public domain
|
|
gptkbp:relatedTo
|
gptkb:OpenSubtitles
JRC-Acquis
ParaCrawl
|
|
gptkbp:size
|
over 50 million words per language
|
|
gptkbp:source
|
European Parliament proceedings
|
|
gptkbp:updated
|
EuroParl v7
|
|
gptkbp:usedFor
|
machine translation
natural language processing
multilingual research
|
|
gptkbp:usedIn
|
WMT shared tasks
|
|
gptkbp:bfsParent
|
gptkb:The_Pile
gptkb:The_Pile:_An_800GB_Dataset_of_Diverse_Text_for_Language_Modeling
|
|
gptkbp:bfsLayer
|
8
|
|
https://www.w3.org/2000/01/rdf-schema#label
|
EuroParl
|