
-
At least 2 NVIDIA GPUs with CUDA 12.8 or newer
至少需要 2 块 NVIDIA GPU,支持 CUDA 12.8 或更高版本
An example of agent output is given below:
下面给出一个代理输出的示例:
From the current observation, let's analyze the situation. The player (P) is at: (4, 0), and the goal (G) is at: (2, 3). There is also a hole (O) at (4, 4). Given this, I can move towards the goal without worrying about slippery tiles right now.
The shortest path from P to G involves moving left (4 steps) followed by moving down (1 step), since going directly would bypass the hole or move us further from the goal. Let's move left first.
Let's take the action ```Left```.