All topics / reward engineering

Topic

Skim Read Deep

You are reading the free Skim layer. Read unlocks the synthesis and sources.

Reward engineering

episodes

thinkers

of conversation

books & papers

terms defined

The neighbourhood: reward engineering and the ideas it travels with. Drag to roam, click a star for the episode, click a neighbour to travel.

Drag to roam · scroll to zoom · click a neighbour to travel · click a star for the episode

From foundational to frontier

Climb the spectrum. The most accessible conversations come first.

Start here

ACCESSIBLECOREFRONTIER

Anca Dragan: Human-Robot Interaction and Reward Engineering

1h 38m

03-19-20

Anca Dragan: Human-Robot Interaction and Reward Engineering

Coming soon

The lexicon

Every term the guests lean on, in plain language. Read one in full, or filter to find it.

Goodhart's law

A principle stating that a metric ceases to be useful once it becomes a target.

inverse reinforcement learning

A method where robots infer human preferences by observing behavior to optimize their actions.

What the corpus says

The throughline across every conversation that touches this idea.

Anca Dragan highlights the importance of robots communicating internal states through movement for effective human-robot interaction.

Anca Dragan · Anca Dragan: Human-Robot Interaction and Reward Engineering

Inverse reinforcement learning enables robots to infer human preferences from observed behaviors, optimizing their actions accordingly.

Anca Dragan · Anca Dragan: Human-Robot Interaction and Reward Engineering

Goodhart's law challenges reward function design in AI, as metrics become ineffective once they are targeted.

Anca Dragan · Anca Dragan: Human-Robot Interaction and Reward Engineering

Robots can gather information by influencing human behavior, such as nudging a car to infer driver intent.

Anca Dragan · Anca Dragan: Human-Robot Interaction and Reward Engineering

LiDAR remains a contentious topic in autonomous driving, with differing views on its necessity for innovation.

Anca Dragan · Anca Dragan: Human-Robot Interaction and Reward Engineering

Voices on reward engineering

3 standout quotes from across the corpus.

Go read

2 books and papers cited across these episodes.

For the specialist

What experts find new

2 expert-level takeaways for a specialist reader.

At the frontier

Still unresolved

1 open questions flagged across these conversations.

The thinkers

Who takes this idea on, by how often they return to it.

All guests

Anca Dragan

Professor

Adjacent ideas

autonomous driving1 human-robot interaction1