Statements (34)
Predicate | Object |
---|---|
gptkbp:instanceOf |
speech representation model
|
gptkbp:author |
gptkb:Xiaodong_Liu
gptkb:Jianfeng_Gao Chengyi Wang Jian Wu Michael Zeng Shujie Liu Xuedong Huang Yao Qian Yu Wu |
gptkbp:availableOn |
gptkb:GitHub
|
gptkbp:basedOn |
transformer architecture
|
gptkbp:citation |
1000+
|
gptkbp:developedBy |
gptkb:Microsoft_Research
|
https://www.w3.org/2000/01/rdf-schema#label |
WavLM
|
gptkbp:input |
raw audio waveform
|
gptkbp:language |
English
|
gptkbp:license |
gptkb:MIT_License
|
gptkbp:notablePublication |
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
|
gptkbp:openSource |
true
|
gptkbp:output |
speech representations
|
gptkbp:publishedIn |
gptkb:NeurIPS_2022
|
gptkbp:relatedTo |
gptkb:Wav2Vec_2.0
gptkb:HuBERT |
gptkbp:releaseYear |
2021
|
gptkbp:trainer |
Libri-Light dataset
VoxPopuli dataset |
gptkbp:usedFor |
speech recognition
emotion recognition speaker diarization speech enhancement speaker verification |
gptkbp:bfsParent |
gptkb:Hugging_Face_models
|
gptkbp:bfsLayer |
7
|