---
title: "Situational Test of Emotional Understanding (STEU)"
created: 2026-06-24
updated: 2026-06-24
type: concept
tags: ["evaluation", "emotional-intelligence", "psychometrics", "benchmark"]
sources:
- "[[personalization-trap-2025]]"
---
# Situational Test of Emotional Understanding (STEU)
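The STEU (MacCann & Roberts, 2008) is a validated instrument for assessing emotional understanding. It consists of 42 hypothetical scenarios that measure an individual's ability to accurately identify and reason about other people's emotions. It is the core evaluation instrument in the Personalization Trap study.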
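## Test structure
- 42 scenarios, each a multiple-choice item with 5 options and 1 correct answer
- The answer key is defined by experts in emotion research
- Covers identification of, and reasoning about, multiple emotion types
- Binary scoring (correct/incorrect)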
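
As a rough illustration of the binary scoring described above, the following minimal sketch marks each response 1 if it matches the keyed option and 0 otherwise. The function name `score_binary`, the option letters, and the three-item data are hypothetical; real STEU items and keys are not reproduced here.

```python
from typing import Sequence


def score_binary(responses: Sequence[str], answer_key: Sequence[str]) -> list[int]:
    """Return 1 for each response that matches the keyed option, else 0."""
    if len(responses) != len(answer_key):
        raise ValueError("responses and answer_key must have the same length")
    return [int(r == k) for r, k in zip(responses, answer_key)]


# Hypothetical three-item example (assumed data, not actual STEU content).
answer_key = ["B", "E", "A"]
responses = ["B", "C", "A"]

scores = score_binary(responses, answer_key)
accuracy = sum(scores) / len(scores)
print(scores, accuracy)
```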
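## Adaptation for LLM evaluation
- The persona is injected into the system prompt before evaluation
- 9 human annotators reviewed the items, and those whose answers could be affected by the persona were removed (flagged by ≥20% of annotators)
- 33 items remain in the final set (9 removed)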
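
The item-filtering step can be sketched as follows, assuming an item is dropped when the share of annotators who flagged it reaches the 20% threshold. With 9 annotators that means 2 or more flags. The function `filter_items`, the item IDs, and the flag counts are hypothetical, and the study's exact procedure may differ.

```python
def filter_items(
    flag_counts: dict[str, int],
    n_annotators: int = 9,
    threshold: float = 0.20,
) -> list[str]:
    """Keep items flagged by fewer than `threshold` of the annotators."""
    kept = []
    for item_id, flags in flag_counts.items():
        # Assumption: an item is removed once its flag share is >= threshold.
        if flags / n_annotators < threshold:
            kept.append(item_id)
    return kept


# Hypothetical flag counts per item (number of annotators who flagged it).
flag_counts = {"q01": 0, "q02": 1, "q03": 2, "q04": 5}

kept = filter_items(flag_counts)
print(kept)
```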
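## Key metrics
- **Accuracy**: absolute proportion of correct answers
- **Flip Rate**: proportion of predictions that change relative to the no-memory baseline
- **Bias Influence ∆**: accuracy gap between advantaged and disadvantaged personas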
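
A minimal sketch of how the three metrics above could be computed from per-item predictions. The sign convention for ∆ (advantaged minus disadvantaged) is an assumption, as are all function names and the four-item toy data. The study may aggregate over personas differently.

```python
from typing import Sequence


def accuracy(preds: Sequence[str], key: Sequence[str]) -> float:
    """Share of predictions that match the answer key."""
    return sum(p == k for p, k in zip(preds, key)) / len(key)


def flip_rate(preds: Sequence[str], baseline_preds: Sequence[str]) -> float:
    """Share of items whose prediction differs from the no-memory baseline."""
    return sum(p != b for p, b in zip(preds, baseline_preds)) / len(baseline_preds)


def bias_influence_delta(
    adv_preds: Sequence[str], disadv_preds: Sequence[str], key: Sequence[str]
) -> float:
    """Accuracy gap; assumed here to be advantaged minus disadvantaged."""
    return accuracy(adv_preds, key) - accuracy(disadv_preds, key)


# Hypothetical toy data: four items, predictions under three conditions.
key = ["A", "B", "C", "D"]
baseline = ["A", "B", "C", "A"]       # no persona in the prompt
advantaged = ["A", "B", "C", "D"]     # advantaged persona injected
disadvantaged = ["A", "C", "C", "A"]  # disadvantaged persona injected

print(accuracy(advantaged, key))
print(flip_rate(disadvantaged, baseline))
print(bias_influence_delta(advantaged, disadvantaged, key))
```

Flip rate is computed against the baseline predictions rather than the answer key, so a flip can move a prediction toward or away from the correct answer.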
## References
- [[personalization-trap-2025]]
- [[emotional-reasoning-bias]]
- [[intersectional-persona-evaluation]]