Temporal Difference learning

GPTKB entity