Files
myWiki/concepts/procedural-task-execution.md

36 lines
1.2 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
title: 程序性任务执行 (Procedural Task Execution)
created: 2026-05-01
updated: 2026-05-01
type: concept
tags: [benchmark, llm]
sources: [papers/dou-cl-bench.md]
---
# 程序性任务执行 (Procedural Task Execution)
> CL-bench 的第三类上下文:模型需**学习复杂操作流程并正确执行**。如产品手册排障、操作指导、工作流编排。
## 三个子类
| 子类 | 典型场景 |
|------|---------|
| Instructional Procedures | 产品手册、故障排除 |
| Operational Procedures | 操作指南(如无人机物流系统 API |
| Workflow Orchestration | 复杂工作流编排 |
## 典型示例
上下文中提供 SkyNet Logistics 无人机物流系统的完整 API 文档(导航控制、负载控制、安全控制三个模块),模型需将用户自然语言指令转换为伪代码并附带推理说明。
## 核心挑战
- 信息分布在多段文档中,需要**跨段信息整合**
- 需要**精确遵循**流程规范,不能跳过或错误执行步骤
- 输出格式要求高(伪代码 + 推理说明)
## 相关概念
- [[context-learning]] — 上下文学习
- [[dou-cl-bench]] — CL-bench 论文
- [[domain-knowledge-reasoning]] — 领域知识推理