 
一个强化学习 API 标准,包含多样化的参考环境集合
 
Gymnasium 是 OpenAI 的 Gym 库的一个维护分支。 Gymnasium 接口简洁、符合 Python 习惯,能够表示通用的强化学习问题,并为旧版 Gym 环境提供了一个 兼容性包装器
import gymnasium as gym
# Initialise the environment
env = gym.make("LunarLander-v3", render_mode="human")
# Reset the environment to generate the first observation
observation, info = env.reset(seed=42)
for _ in range(1000):
    # this is where you would insert your policy
    action = env.action_space.sample()
    # step (transition) through the environment with the action
    # receiving the next observation, reward and if the episode has terminated or truncated
    observation, reward, terminated, truncated, info = env.step(action)
    # If the episode has ended then we can reset to start a new episode
    if terminated or truncated:
        observation, info = env.reset()
env.close()