WavLM

URI: https://gptkb.org/entity/WavLM

GPTKB entity

Predicate	Object
gptkbp:instanceOf	gptkb:speech_representation_model
gptkbp:author	gptkb:Xiaodong_Liu gptkb:Jianfeng_Gao Chengyi Wang Jian Wu Michael Zeng Shujie Liu Xuedong Huang Yao Qian Yu Wu
gptkbp:availableOn	gptkb:GitHub
gptkbp:basedOn	transformer architecture
gptkbp:citation	1000+
gptkbp:developedBy	gptkb:Microsoft_Research
gptkbp:input	raw audio waveform
gptkbp:language	English
gptkbp:license	gptkb:MIT_License
gptkbp:notablePublication	WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
gptkbp:openSource	true
gptkbp:output	speech representations
gptkbp:publishedIn	gptkb:NeurIPS_2022
gptkbp:relatedTo	gptkb:Wav2Vec_2.0 gptkb:HuBERT
gptkbp:releaseYear	2021
gptkbp:trainer	Libri-Light dataset VoxPopuli dataset
gptkbp:usedFor	speech recognition emotion recognition speaker diarization speech enhancement speaker verification
gptkbp:bfsParent	gptkb:Hugging_Face_models
gptkbp:bfsLayer	7
http://www.w3.org/2000/01/rdf-schema#label	WavLM