Preventing Sociopathic Robots

Received funding for project: April 2023

Project timeline: 2 Years

Type of study: Simulation

Study Status

  • Modeling Design

  • Modeling Benchmarking

  • Manuscript Preparation

  • Publication

About
Leveraging Vulnerability and Homeostasis to Encourage Prosocial Behavior in World-Modeling AI

We will run a variety of simulations to demonstrate how prosocially aligned AI may be emergently bootstrapped via vulnerable engagement (alone and in competition/collaboration with others) with an open-ended Minecraft-like virtual world.

Such environments are well suited to training and testing our systems, which have several planned emergent capacities: (I) abilities for meta-learning and counterfactual modeling; (II) vulnerability; (III) incorporation of biologically inspired inductive biases.

By combining vulnerability with meta-learning and counterfactual modeling that can accommodate complex agent-based relationships across multiple time scales, we will demonstrate how social alignment may emerge as a kind of predictive homeostasis (i.e., allostasis, or “sociostasis”).

https://i0.wp.com/advancedconsciousness.org/wp-content/uploads/2023/05/ChristovMoore_A_vulnerable_homeostatic_Buddhist_robot_learning__f2545e91-bb30-4d7d-8101-52e4b4e70310.jpg?w=891
https://i0.wp.com/advancedconsciousness.org/wp-content/uploads/2023/05/ChristovMoore_A_vulnerable_homeostatic_robot_learning_to_experi_90767625-657d-41b0-b8d7-94604da7e623.jpg?w=891

In close collaboration with other groups dedicated to human-centered AI, we will focus on determining the extent to which human-mimetic design may allow for the development of effective and robustly beneficial artificial intelligences.

We hope to a) determine the conditions that make prosocial or antisocial ‘personalities’ more or less likely to emerge, b) evaluate how these agent traits affect interpersonal interactions, and c) explore the degree to which the resulting social dynamics alter the personalities of our agents.

Outcomes

2022

  • Christov-Moore, L., Reggente, N., Vaccaro, A., Schoeller, F., Pluimer, B., Douglas, P. K., Iacoboni, M., & Kaplan, J. (2022). Are Robots Sociopaths? A Neuroscientific Approach to the Alignment Problem. Organization for Human Brain Mapping, Glasgow, Scotland. [Poster]

2023

  • Moore, C. (2023, October 25-26). Emulating Entheogens for Enhancing Empathy. Compassion 2.0, San Francisco, CA. [Talk] [Supplemental]

Project Contributors

Vyacheslav Kungurtsev, Ph.D.

Research Consultant

Naoto Yoshida, Ph.D.

Research Consultant

Martin Krutský

Research Consultant

Arthur Juliani, Ph.D.

Research Consultant

Gustav Šír, Ph.D.

Research Consultant

Zahra Sheikhbahaee, Ph.D.

Research Consultant

Adam Safron, Ph.D.

Senior Research Scientist

Leonardo Christov-Moore, Ph.D.

Senior Research Scientist

Nicco Reggente, Ph.D.

Principal Investigator

Funded By
