LinUCB

URI: https://gptkb.org/entity/LinUCB

GPTKB entity

Predicate	Object
gptkbp:instanceOf	gptkb:algorithm, gptkb:contextual_bandit_algorithm
gptkbp:assumes	linear reward model
gptkbp:basedOn	upper confidence bound principle
gptkbp:category	gptkb:machine_learning, gptkb:reinforcement_learning
gptkbp:input	context vector
gptkbp:introduced	gptkb:John_Langford, gptkb:Robert_E._Schapire, gptkb:Lihong_Li, Wei Chu
gptkbp:introducedIn	2010
gptkbp:output	action selection
gptkbp:publishedIn	Proceedings of the 19th International Conference on World Wide Web
gptkbp:relatedTo	Epsilon-Greedy, Thompson Sampling, UCB1
gptkbp:usedFor	gptkb:multi-armed_bandit_problem, online learning, recommendation systems
gptkbp:bfsParent	gptkb:Contextual_Bandits_with_Linear_Payoff_Functions
gptkbp:bfsLayer	7
http://www.w3.org/2000/01/rdf-schema#label	LinUCB
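
Illustrative sketch (not part of the GPTKB record): the properties above say that LinUCB takes a context vector as input, assumes a linear reward model, applies the upper confidence bound principle, and outputs an action selection. The Python below is a minimal sketch of how those pieces could fit together, assuming a separate linear model per arm with identity ridge regularization. The names LinUCBArm, select_arm, simulate, and the parameter alpha are hypothetical and chosen for illustration. The inverse matrix is maintained with a Sherman-Morrison rank-one update instead of being re-inverted each round; that is an implementation convenience assumed here, not something the record states.

```python
import math
import random


class LinUCBArm:
    """Per-arm state for a linear reward model (hypothetical helper)."""

    def __init__(self, dim):
        # A starts as the identity matrix, so its inverse is the identity too.
        self.A_inv = [[1.0 if i == j else 0.0 for j in range(dim)]
                      for i in range(dim)]
        self.b = [0.0] * dim

    def _a_inv_times(self, v):
        return [sum(a * vj for a, vj in zip(row, v)) for row in self.A_inv]

    def ucb(self, x, alpha):
        """Estimated reward for context x plus an exploration bonus."""
        theta = self._a_inv_times(self.b)          # ridge estimate A^{-1} b
        mean = sum(t * xi for t, xi in zip(theta, x))
        a_inv_x = self._a_inv_times(x)
        variance = sum(xi * v for xi, v in zip(x, a_inv_x))
        return mean + alpha * math.sqrt(max(variance, 0.0))

    def update(self, x, reward):
        """Fold one observation into the model: A += x x^T, b += reward * x."""
        # Sherman-Morrison keeps A^{-1} current without a full inversion.
        a_inv_x = self._a_inv_times(x)
        denom = 1.0 + sum(xi * v for xi, v in zip(x, a_inv_x))
        dim = len(x)
        for i in range(dim):
            for j in range(dim):
                self.A_inv[i][j] -= a_inv_x[i] * a_inv_x[j] / denom
            self.b[i] += reward * x[i]


def select_arm(arms, x, alpha):
    """Return the index of the arm with the largest upper confidence bound."""
    return max(range(len(arms)), key=lambda a: arms[a].ucb(x, alpha))


def simulate(rounds=200, alpha=1.0, seed=0):
    """Toy environment (an assumption): arm k pays 1 only on context e_k."""
    rng = random.Random(seed)
    contexts = [[1.0, 0.0], [0.0, 1.0]]
    arms = [LinUCBArm(dim=2) for _ in contexts]
    for _ in range(rounds):
        k = rng.randrange(len(contexts))
        x = contexts[k]
        chosen = select_arm(arms, x, alpha)
        reward = 1.0 if chosen == k else 0.0
        arms[chosen].update(x, reward)
    return arms


if __name__ == "__main__":
    trained = simulate()
    for x in ([1.0, 0.0], [0.0, 1.0]):
        print(x, "->", select_arm(trained, x, alpha=0.0))
```

In this sketch, a larger alpha widens the confidence bonus and favors exploration, while alpha = 0 reduces the rule to greedy selection on the estimated reward.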