AWQ

GPTKB entity

Statements (17)
Predicate Object
gptkbp:instanceOf large language model
gptkbp:compatibleWith gptkb:horse_race
gptkb:Llama_2
other transformer-based LLMs
gptkbp:developedBy gptkb:Tencent_AI_Lab
gptkbp:enables faster inference
lower memory usage
https://www.w3.org/2000/01/rdf-schema#label AWQ
gptkbp:license Apache 2.0
gptkbp:openSource true
gptkbp:purpose efficient quantization of large language models
gptkbp:releaseYear 2023
gptkbp:repository https://github.com/mit-han-lab/llm-awq
gptkbp:supports int4 quantization
int8 quantization
gptkbp:bfsParent gptkb:Indonesia_AirAsia
gptkbp:bfsLayer 6