TD learning

GPTKB entity