Causal language modeling

GPTKB entity

Statements (51)
Predicate Object
gptkbp:instanceOf Language modeling task
gptkbp:application Code generation
Text generation
Dialogue systems
Autoregressive modeling
gptkbp:architecture Unidirectional transformer
gptkbp:contrastsWith gptkb:Masked_language_modeling
Bidirectional language modeling
gptkbp:dataRequirement Large text corpora
gptkbp:directionA Left-to-right
gptkbp:example gptkb:GPT-2
gptkb:GPT-3
gptkb:GPT-4
gptkb:LLaMA
gptkb:BLOOM
gptkb:OPT
https://www.w3.org/2000/01/rdf-schema#label Causal language modeling
gptkbp:input Sequence of tokens
gptkbp:lossFunction Cross-entropy loss
gptkbp:output Probability distribution over vocabulary
Logits for each token
gptkbp:predicts Next token
gptkbp:relatedTo gptkb:Masked_language_modeling
gptkbp:tokenization gptkb:WordPiece
Byte pair encoding
Subword tokenization
gptkbp:trainingObjective Maximize likelihood of next word
gptkbp:usedBy gptkb:Transformer_models
GPT models
gptkbp:usedFor Chatbots
Conversational AI
Paraphrasing
Creative writing
Code completion
Text completion
Autocompletion
Dialogue generation
Email drafting
Machine translation (autoregressive)
Poetry generation
Pretraining large language models
Question answering (generative)
Script generation
Story generation
Summarization (autoregressive)
Text expansion
Text infilling
gptkbp:usedIn Natural language processing
Machine learning
gptkbp:bfsParent gptkb:Language_modeling
gptkbp:bfsLayer 7