Prompt
"An illustration of a robot learning to navigate a maze using Q-learning, a model-free reinforcement learning algorithm. The robot is shown in the center of the maze, with a thought bubble displaying a Q-value table. The maze has several paths and rewards, represented by different colors and symbols. In the background, a graph shows the convergence of the Q-learning algorithm to an optimal policy. The style is a futuristic, technical illustration with a mix of digital and robotic elements."