
DeepMind Control Suite.

This submodule contains the domains and tasks described in the DeepMind Control Suite tech report.

Quickstart

from dm_control import suite
import numpy as np

# Load one task:
env = suite.load(domain_name="cartpole", task_name="swingup")

# Iterate over a task set:
for domain_name, task_name in suite.BENCHMARKING:
  env = suite.load(domain_name, task_name)

# Step through an episode and print out reward, discount and observation.
action_spec = env.action_spec()
time_step = env.reset()
while not time_step.last():
  action = np.random.uniform(action_spec.minimum,
                             action_spec.maximum,
                             size=action_spec.shape)
  time_step = env.step(action)
  print(time_step.reward, time_step.discount, time_step.observation)
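
The same loop can be wrapped in a helper that accumulates the episode return. The sketch below is illustrative only: `StubEnv` and `StubTimeStep` are hypothetical stand-ins (not part of `dm_control`) that mimic just the `reset()`, `step()` and `last()` calls used above, so the example runs without MuJoCo installed. It assumes, as in the quickstart, that the reward is read only after a call to `step`, since the time step returned by `reset` carries no reward.

```python
class StubTimeStep:
    """Hypothetical stand-in exposing only the fields used in the quickstart."""

    def __init__(self, reward, discount, observation, is_last):
        self.reward = reward
        self.discount = discount
        self.observation = observation
        self._is_last = is_last

    def last(self):
        return self._is_last


class StubEnv:
    """Hypothetical toy environment: pays reward 1.0 per step for `horizon` steps."""

    def __init__(self, horizon=5):
        self._horizon = horizon
        self._t = 0

    def reset(self):
        self._t = 0
        # The first time step of an episode has no reward or discount yet.
        return StubTimeStep(None, None, {"t": 0}, False)

    def step(self, action):
        self._t += 1
        return StubTimeStep(1.0, 1.0, {"t": self._t}, self._t >= self._horizon)


def run_episode(env, policy):
    """Runs one episode and returns (undiscounted return, number of steps)."""
    time_step = env.reset()
    total_reward = 0.0
    num_steps = 0
    while not time_step.last():
        time_step = env.step(policy(time_step.observation))
        total_reward += time_step.reward
        num_steps += 1
    return total_reward, num_steps


# A constant policy is enough for the stub; a real policy would map
# observations to actions that respect env.action_spec().
episode_return, num_steps = run_episode(StubEnv(horizon=5), lambda obs: 0.0)
```

Because `run_episode` touches only `reset`, `step`, `last` and `reward`, it should also accept an environment returned by `suite.load`, given a policy whose actions match the environment's action spec.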

Illustration video

Below is a video montage of solved Control Suite tasks, with reward visualisation enabled.

Quadruped domain [April 2019]

Roughly based on the 'ant' model introduced by Schulman et al. 2015. Main modifications to the body are:

4 DoFs per leg, 1 constraining tendon.
3 actuators per leg: 'yaw', 'lift', 'extend'.
Filtered position actuators with timescale of 100ms.
Sensors include an IMU, force/torque sensors, and rangefinders.
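
To give a feel for what a 100ms filter timescale means, here is a minimal sketch of a first-order low-pass filter applied to a step change in the control signal. It assumes the activation follows act_dot = (ctrl - act) / tau and is integrated with explicit Euler steps. The function name and the 5ms step size are assumptions made for illustration, not values taken from the quadruped model.

```python
def filtered_activation(controls, tau=0.1, dt=0.005, initial=0.0):
    """Euler-integrates act_dot = (ctrl - act) / tau over a control sequence.

    tau is the filter timescale in seconds (100ms here); dt is an assumed
    integration step. Returns the activation after each step.
    """
    act = initial
    trace = []
    for ctrl in controls:
        act += dt * (ctrl - act) / tau
        trace.append(act)
    return trace


# Response to a unit step held for one second (200 steps of 5ms).
trace = filtered_activation([1.0] * 200)
after_one_timescale = trace[19]  # activation after 20 steps = 100ms
```

After one timescale the activation has covered roughly two thirds of the step, and it settles close to the commanded value within about half a second, so abrupt changes in the action are smoothed before they reach the joints.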

Four tasks:

walk and run: self-right the body then move forward at a desired speed.
escape: escape a bowl-shaped random terrain (uses rangefinders).
fetch: go to a moving ball and bring it to a target.

All behaviors in the video below were trained with Abdolmaleki et al.'s MPO.

