Llama.cpp

GPTKB entity

Statements (74)
Predicate Object
gptkbp:instanceOf gptkb:software
gptkbp:creator Georgi Gerganov
gptkbp:feature quantization
GPU support
multi-threading
C API
Python bindings
streaming inference
chat interface
CPU inference
embeddings
gptkbp:firstReleased 2023
gptkbp:hasModel gptkb:bird
gptkb:horse_race
gptkb:GPT-2
gptkb:ChatGLM
gptkb:MPT
gptkb:Llama_2
gptkb:Baichuan
gptkb:Qwen
https://www.w3.org/2000/01/rdf-schema#label Llama.cpp
gptkbp:license gptkb:MIT_License
gptkbp:notableUser gptkb:Chroma
gptkb:LangChain
gptkb:LlamaIndex
gptkb:Haystack
gptkb:Hermes
gptkb:KoboldAI
gptkb:Auto-GPT
gptkb:SuperAGI
Dalai
CrewAI
Flowise
GPT Engineer
GPT4-X-Alpaca
GPT4All
GPT4All-API
GPT4All-CLI
GPT4All-Chat
GPT4All-Community
GPT4All-Discord
GPT4All-Docs
GPT4All-Examples
GPT4All-Extensions
GPT4All-Forum
GPT4All-GitHub
GPT4All-Integrations
GPT4All-Notebooks
GPT4All-Playground
GPT4All-Plugins
GPT4All-Reddit
GPT4All-Server
GPT4All-Telegram
GPT4All-Tools
GPT4All-Twitter
GPT4All-UI
GPT4Free
LM Studio
LocalAI
MLC-LLM
Ollama
Oobabooga
Open Interpreter
Open WebUI
PrivateGPT
Text Generation WebUI
llama-cpp-python
llamafile
gptkbp:platform cross-platform
gptkbp:programmingLanguage gptkb:C++
gptkbp:purpose run LLMs efficiently on CPU
gptkbp:repository https://github.com/ggerganov/llama.cpp
gptkbp:bfsParent gptkb:LLaMA_model_family
gptkbp:bfsLayer 8