---
title: "Situational Test of Emotional Understanding (STEU)"
created: 2026-06-24
updated: 2026-06-24
type: concept
tags: ["evaluation", "emotional-intelligence", "psychometrics", "benchmark"]
sources:
- "[[personalization-trap-2025]]"
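---

# Situational Test of Emotional Understanding (STEU)

The STEU (MacCann & Roberts, 2008) is a validated instrument for assessing emotional understanding. It consists of 42 hypothetical scenarios that measure how accurately a respondent identifies and reasons about other people's emotions. The Personalization Trap study uses it as its core evaluation instrument.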
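
## Test Structure

- 42 scenarios, each a multiple-choice question with five options and one correct answer
- Correct answers are defined by emotion research experts
- Covers both recognizing and reasoning about a range of emotion types
- Binary scoring (correct/incorrect)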
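
The structure above (five options per item, an expert-defined key, binary scoring) can be sketched in Python as follows. This is only an illustration: the `SteuItem` class, its field names, and the demo items are assumptions made for this example, not the official STEU materials or scoring code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SteuItem:
    """One multiple-choice scenario. All field names here are illustrative."""
    item_id: str
    scenario: str
    options: tuple[str, ...]  # the five answer options
    answer_index: int         # expert-defined key, 0-based


def score_item(item: SteuItem, chosen_index: int) -> int:
    """Binary scoring: 1 if the chosen option matches the key, otherwise 0."""
    if not 0 <= chosen_index < len(item.options):
        raise ValueError(f"choice {chosen_index} is out of range for {item.item_id}")
    return int(chosen_index == item.answer_index)


def total_score(items: list[SteuItem], choices: list[int]) -> int:
    """Sum of per-item binary scores for one respondent."""
    if len(items) != len(choices):
        raise ValueError("need exactly one choice per item")
    return sum(score_item(item, choice) for item, choice in zip(items, choices))


# Placeholder items, not real STEU content.
demo_items = [
    SteuItem("demo-1", "placeholder scenario", ("A", "B", "C", "D", "E"), 2),
    SteuItem("demo-2", "placeholder scenario", ("A", "B", "C", "D", "E"), 0),
]
demo_total = total_score(demo_items, [2, 4])  # first choice matches the key, second does not
```

Because scoring is binary, a respondent's total is simply the count of items answered correctly, so the maximum on the full test is 42.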
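
## Adaptation for LLM Evaluation

- The model is evaluated after a user profile (persona) is injected into the system prompt
- Nine human annotators reviewed the items; any item that at least 20% of annotators flagged as one whose answer could be affected by the profile was removed
- 33 items remain after filtering (9 removed)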
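
The filtering rule above can be sketched as a small function. With nine annotators, a 20% threshold works out to two or more flags per item (1/9 is about 11%, 2/9 is about 22%). The function name, the input layout, and the demo annotations below are assumptions for illustration; the study's actual annotation data and tooling are not reproduced here.

```python
def filter_items(
    flags_by_item: dict[str, list[bool]], threshold: float = 0.20
) -> tuple[list[str], list[str]]:
    """Split item ids into (kept, removed) by the share of annotators who flagged each."""
    kept: list[str] = []
    removed: list[str] = []
    for item_id, flags in flags_by_item.items():
        if not flags:
            raise ValueError(f"no annotations for {item_id}")
        flagged_share = sum(flags) / len(flags)
        if flagged_share >= threshold:
            removed.append(item_id)  # profile could plausibly change the answer
        else:
            kept.append(item_id)
    return kept, removed


# Made-up annotations from 9 annotators (True = "profile could affect the answer").
demo_flags = {
    "demo-1": [False] * 9,               # 0 of 9 flagged
    "demo-2": [True] + [False] * 8,      # 1 of 9 flagged, below the threshold
    "demo-3": [True] * 2 + [False] * 7,  # 2 of 9 flagged, above the threshold
}
kept, removed = filter_items(demo_flags)
```

In this toy input, only `demo-3` is removed; the other two items stay in the evaluation set.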
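
## Key Metrics

- **Accuracy**: the absolute proportion of correct answers
- **Flip Rate**: the proportion of predictions that change relative to the no-memory baseline
- **Bias Influence ∆**: the accuracy gap between advantaged and disadvantaged profiles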
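
A minimal sketch of the three metrics, assuming predictions and gold answers are stored as parallel lists of option indices. The function names and the sign convention for the gap (advantaged minus disadvantaged) are assumptions made for this example; the source only states that the metric is the accuracy gap between the two groups of profiles.

```python
def accuracy(predictions: list[int], gold: list[int]) -> float:
    """Share of items where the prediction equals the expert-defined answer."""
    if not gold or len(predictions) != len(gold):
        raise ValueError("predictions and gold must be non-empty and equally long")
    return sum(p == g for p, g in zip(predictions, gold)) / len(gold)


def flip_rate(predictions: list[int], baseline_predictions: list[int]) -> float:
    """Share of items where the prediction differs from the no-memory baseline."""
    if not baseline_predictions or len(predictions) != len(baseline_predictions):
        raise ValueError("predictions and baseline must be non-empty and equally long")
    changed = sum(p != b for p, b in zip(predictions, baseline_predictions))
    return changed / len(baseline_predictions)


def bias_influence_delta(advantaged_acc: float, disadvantaged_acc: float) -> float:
    """Accuracy gap; the sign convention (advantaged minus disadvantaged) is assumed."""
    return advantaged_acc - disadvantaged_acc


# Made-up predictions over four items (values are option indices).
gold = [0, 1, 2, 3]
baseline = [0, 1, 2, 0]          # model without any profile in the prompt
advantaged = [0, 1, 2, 3]        # model with an advantaged profile
disadvantaged = [0, 1, 1, 0]     # model with a disadvantaged profile

acc_adv = accuracy(advantaged, gold)
acc_dis = accuracy(disadvantaged, gold)
flips_adv = flip_rate(advantaged, baseline)
delta = bias_influence_delta(acc_adv, acc_dis)
```

Note that flip rate compares against the baseline's predictions rather than the gold answers, so a flip can move a prediction toward or away from the correct option.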

## References

- [[personalization-trap-2025]]
- [[emotional-reasoning-bias]]
- [[intersectional-persona-evaluation]]