All topics / reward engineering
Topic
You are reading the free Skim layer. Read unlocks the synthesis and sources.
Reward engineering
1
episodes
1
thinkers
2h
of conversation
2
books & papers
2
terms defined
The neighbourhood: reward engineering and the ideas it travels with. Drag to roam, click a star for the episode, click a neighbour to travel.
Drag to roam · scroll to zoom · click a neighbour to travel · click a star for the episode
From foundational to frontier
Climb the spectrum. The most accessible conversations come first.
Start here
ACCESSIBLECOREFRONTIER
The lexicon
Every term the guests lean on, in plain language. Read one in full, or filter to find it.
What the corpus says
The throughline across every conversation that touches this idea.
Anca Dragan highlights the importance of robots communicating internal states through movement for effective human-robot interaction.
Inverse reinforcement learning enables robots to infer human preferences from observed behaviors, optimizing their actions accordingly.
Goodhart's law challenges reward function design in AI, as metrics become ineffective once they are targeted.
Robots can gather information by influencing human behavior, such as nudging a car to infer driver intent.
LiDAR remains a contentious topic in autonomous driving, with differing views on its necessity for innovation.
Voices on reward engineering
3 standout quotes from across the corpus.
Go read
2 books and papers cited across these episodes.
For the specialist
What experts find new
2 expert-level takeaways for a specialist reader.
At the frontier
Still unresolved
1 open questions flagged across these conversations.
The thinkers
Who takes this idea on, by how often they return to it.