ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Conference Paper Do Large Language Models Have “Emotion Neurons”? Investigating the Existence and Role
Cited 0 time in scopus Download 96 time Share share facebook twitter linkedin kakaostory
Authors
Jaewook Lee, Woojin Lee, Oh-Woog Kwon, Harksoo Kim
Issue Date
2025-07
Citation
Findings of the Association for Computational Linguistics: ACL 2025, pp.15617-15639
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.18653/v1/2025.findings-acl.806
Abstract
This study comprehensively explores whether there actually exist “emotion neurons” within large language models (LLMs) that selectively process and express certain emotions, and what functional role they play. Drawing on the representative emotion theory of the six basic emotions, we focus on six core emotions. Using synthetic dialogue data labeled with emotions, we identified sets of neurons that exhibit consistent activation patterns for each emotion. As a result, we confirmed that principal neurons handling emotion information do indeed exist within the model, forming distinct groups for each emotion, and that their distribution varies with model size and architectural depth. We then validated the functional significance of these emotion neurons by analyzing whether the prediction accuracy for a specific emotion significantly decreases when those neurons are artificially removed. We observed that in some emotions, the accuracy drops sharply upon neuron removal, while in others, the model's performance largely remains intact or even improves, presumably due to overlapping and complementary mechanisms among neurons. Furthermore, by examining how prediction accuracy changes depending on which layer range and at what proportion the emotion neurons are masked, we revealed that emotion information is processed in a multilayered and complex manner within the model.
KSP Keywords
Basic emotions, Prediction accuracy, functional role, language models
This work is distributed under the term of Creative Commons License (CCL)
(CC BY)
CC BY