20260625:很多新内容
This commit is contained in:
37
concepts/situational-test-emotional-understanding.md
Normal file
37
concepts/situational-test-emotional-understanding.md
Normal file
@@ -0,0 +1,37 @@
|
||||
---
|
||||
title: "Situational Test of Emotional Understanding (STEU)"
|
||||
created: 2026-06-24
|
||||
updated: 2026-06-24
|
||||
type: concept
|
||||
tags: ["evaluation", "emotional-intelligence", "psychometrics", "benchmark"]
|
||||
sources:
|
||||
- "[[personalization-trap-2025]]"
|
||||
---
|
||||
|
||||
# Situational Test of Emotional Understanding (STEU)
|
||||
|
||||
STEU(MacCann & Roberts, 2008)是经过验证的情感理解评估工具,包含 42 个假设场景,评估个体准确识别和推理他人情绪的能力。在 Personalization Trap 研究中被用作核心评估工具。
|
||||
|
||||
## 测试结构
|
||||
|
||||
- 42 个场景,每题 5 选 1
|
||||
- 标准答案由情感研究专家定义
|
||||
- 涵盖多种情绪类型的识别和推理
|
||||
- 二元评分(正确/错误)
|
||||
|
||||
## 在 LLM 评估中的适配
|
||||
|
||||
- 画像注入系统提示后评估
|
||||
- 经 9 位人类标注员审查,移除画像可能影响答案的题目(≥20% 标注员标记)
|
||||
- 最终 33 道题目(移除 9 道)
|
||||
|
||||
## 关键指标
|
||||
|
||||
- **准确率**:绝对正确率
|
||||
- **翻转率(Flip Rate)**:与无记忆基线相比预测改变的比例
|
||||
- **Bias Influence ∆**:优势与劣势画像的准确率差距
|
||||
|
||||
## 参考
|
||||
- [[personalization-trap-2025]]
|
||||
- [[emotional-reasoning-bias]]
|
||||
- [[intersectional-persona-evaluation]]
|
||||
Reference in New Issue
Block a user