Social behavior for autonomous vehicles

Journal Article (2019)

Author(s)

Wilko Schwarting (Massachusetts Institute of Technology)

Alyssa Pierson (Massachusetts Institute of Technology)

Javier Alonso-Mora (TU Delft - Learning & Autonomous Control)

Sertac Karaman (Massachusetts Institute of Technology)

Daniela Rus (Massachusetts Institute of Technology)

Research Group

Learning & Autonomous Control

DOI related publication

https://doi.org/10.1073/pnas.1820676116

Keywords

Autonomous driving; Game theory; Inverse reinforcement learning; Social compliance; Social Value Orientation

To reference this document use:

https://resolver.tudelft.nl/uuid:5709b745-bfaf-4d94-ab1d-195ba996dba2

Publication Year

2019

Language

English

Issue number

50

Volume number

116

Pages (from-to)

24972-24978

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Deployment of autonomous vehicles on public roads promises increased efficiency and safety. It requires understanding the intent of human drivers and adapting to their driving styles. Autonomous vehicles must also behave in safe and predictable ways without requiring explicit communication. We integrate tools from social psychology into autonomous-vehicle decision making to quantify and predict the social behavior of other drivers and to behave in a socially compliant way. A key component is Social Value Orientation (SVO), which quantifies the degree of an agent’s selfishness or altruism, allowing us to better predict how the agent will interact and cooperate with others. We model interactions between agents as a best-response game wherein each agent negotiates to maximize their own utility. We solve the dynamic game by finding the Nash equilibrium, yielding an online method of predicting multiagent interactions given their SVOs. This approach allows autonomous vehicles to observe human drivers, estimate their SVOs, and generate an autonomous control policy in real time. We demonstrate the capabilities and performance of our algorithm in challenging traffic scenarios: merging lanes and unprotected left turns. We validate our results in simulation and on human driving data from the NGSIM dataset. Our results illustrate how the algorithm’s behavior adapts to social preferences of other drivers. By incorporating SVO, we improve autonomous performance and reduce errors in human trajectory predictions by 25%.
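
The role SVO plays in a best-response game can be illustrated with a toy example. The Python sketch below is not the authors' implementation: the paper solves a continuous dynamic game over trajectories, whereas this sketch uses a hypothetical two-action matrix game with made-up rewards. It assumes the angular SVO weighting, in which an agent's utility is cos(angle) times its own reward plus sin(angle) times the other agent's reward, and it iterates best responses until neither agent wants to deviate, which is a pure-strategy Nash equilibrium of the toy game. The names `svo_utility`, `best_response`, `iterate_best_response`, and `REWARDS` are illustrative only.

```python
import math


def svo_utility(reward_self, reward_other, svo_angle):
    """SVO-weighted utility: angle 0 is egoistic, pi/4 prosocial, pi/2 altruistic."""
    return math.cos(svo_angle) * reward_self + math.sin(svo_angle) * reward_other


# Hypothetical lane-negotiation game. Actions: 0 = ease off, 1 = push ahead.
# REWARDS[a][b] = (reward to car A, reward to car B) when A plays a and B plays b.
REWARDS = [
    [(3.0, 3.0), (0.0, 4.0)],  # A eases off
    [(4.0, 0.0), (1.0, 1.0)],  # A pushes ahead
]


def best_response(player, other_action, svo_angle):
    """Action maximizing the player's SVO-weighted utility given the other's action."""

    def utility(action):
        # Order the joint action as (A's action, B's action) to index REWARDS.
        a, b = (action, other_action) if player == 0 else (other_action, action)
        r_a, r_b = REWARDS[a][b]
        own, other = (r_a, r_b) if player == 0 else (r_b, r_a)
        return svo_utility(own, other, svo_angle)

    return max((0, 1), key=utility)


def iterate_best_response(svo_a, svo_b, start=(1, 1), max_rounds=20):
    """Alternate best responses; a fixed point is a pure-strategy Nash equilibrium."""
    a, b = start
    for _ in range(max_rounds):
        new_a = best_response(0, b, svo_a)
        new_b = best_response(1, new_a, svo_b)
        if (new_a, new_b) == (a, b):
            return a, b  # neither agent gains by deviating
        a, b = new_a, new_b
    return None  # no fixed point found within max_rounds


if __name__ == "__main__":
    print("egoistic pair: ", iterate_best_response(0.0, 0.0))
    print("prosocial pair:", iterate_best_response(math.pi / 4, math.pi / 4))
```

In this toy game, two egoistic agents settle on both pushing ahead, while two prosocial agents settle on both easing off. Knowing the other agent's SVO therefore changes which interaction outcome is predicted.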

Files

24972.full.pdf

(pdf | 1.71 Mb)