Culturally-attuned AI: Implicit learning of altruistic cultural values through inverse reinforcement learning

Nigini Oliveira; Jasmine Li; Koosha Khalvati; Rodolfo Cortes Barragan; Katharina Reinecke; Andrew N. Meltzoff; Rajesh P. N. Rao

doi:10.1371/journal.pone.0337914

Abstract

Constructing a universal moral code for artificial intelligence (AI) is challenging because human cultures have different values, norms, and social practices. We therefore argue that AI systems should adapt to culture based on observation: Just as a child raised in a particular culture learns the specific values, norms, and behaviors of that culture, an AI system operating in a particular human community could learn them as well. How AI systems might accomplish this from observing and interacting with humans has remained an open question. Here, we propose using inverse reinforcement learning (IRL) as a method for AI agents to acquire culturally relevant values implicitly from humans. We test our approach using an experimental paradigm in which AI agents use IRL to learn different reward functions, which govern the agents’ actions, from variations in the altruistic behavior of human subjects from two cultural groups in an online game requiring real-time decision making. We show that an AI agent learning from a particular human cultural group can acquire the altruistic characteristics reflective of that group’s average behavior, and can generalize to new scenarios requiring altruistic judgments. Our results provide a proof-of-concept demonstration that AI agents can be endowed with the ability to learn culturally typical behaviors and values directly from observing human behavior.
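To make the idea of recovering a reward function from observed behavior concrete, here is a minimal sketch of maximum-likelihood IRL in a one-step setting. It is not the authors' model or task: the abstract describes a real-time online game, whereas this toy assumes a single help-or-not decision, a reward of the form `-cost + w * benefit` with one hypothetical altruism weight `w`, a Boltzmann (softmax) choice rule, and simulated rather than human data. All function names and parameter values are illustrative assumptions.

```python
import math
import random


def p_help(w, benefit, cost, beta=1.0):
    """Probability of helping under a Boltzmann choice rule.

    Assumed reward: helping yields -cost + w * benefit, not helping yields 0.
    """
    return 1.0 / (1.0 + math.exp(-beta * (w * benefit - cost)))


def log_likelihood(w, trials, beta=1.0):
    """Log-likelihood of observed (benefit, cost, helped) trials given w."""
    total = 0.0
    for benefit, cost, helped in trials:
        p = p_help(w, benefit, cost, beta)
        p = min(max(p, 1e-12), 1.0 - 1e-12)  # guard against log(0)
        total += math.log(p if helped else 1.0 - p)
    return total


def fit_altruism_weight(trials, beta=1.0):
    """Grid-search the w in [0, 2] that best explains the observed choices."""
    grid = [i / 100 for i in range(201)]
    return max(grid, key=lambda w: log_likelihood(w, trials, beta))


def simulate_group(true_w, n_trials, rng, beta=1.0):
    """Generate synthetic choices for a group with a given altruism weight."""
    trials = []
    for _ in range(n_trials):
        benefit = rng.uniform(1.0, 5.0)  # benefit to the other player
        cost = rng.uniform(0.5, 3.0)     # cost to the decision maker
        helped = rng.random() < p_help(true_w, benefit, cost, beta)
        trials.append((benefit, cost, helped))
    return trials


rng = random.Random(0)
group_a = simulate_group(true_w=0.3, n_trials=2000, rng=rng)  # hypothetical group
group_b = simulate_group(true_w=0.9, n_trials=2000, rng=rng)  # hypothetical group

w_a = fit_altruism_weight(group_a)
w_b = fit_altruism_weight(group_b)
print(f"recovered altruism weight, group A: {w_a:.2f}")
print(f"recovered altruism weight, group B: {w_b:.2f}")
```

In this sketch, an agent that acts on the recovered weight for a group would reproduce that group's average tendency to help, including for benefit and cost combinations it never observed, which loosely mirrors the generalization claim in the abstract.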

Publication Info

Year: 2025
Type: article
Volume: 20
Issue: 12
Pages: e0337914-e0337914
Citations: 0
Access: Closed

Cite This

APA Style

Nigini Oliveira, Jasmine Li, Koosha Khalvati et al. (2025). Culturally-attuned AI: Implicit learning of altruistic cultural values through inverse reinforcement learning. PLoS ONE, 20(12), e0337914-e0337914. https://doi.org/10.1371/journal.pone.0337914

Identifiers

DOI: 10.1371/journal.pone.0337914