20260514:增加新内容
This commit is contained in:
35
concepts/procedural-task-execution.md
Normal file
35
concepts/procedural-task-execution.md
Normal file
@@ -0,0 +1,35 @@
|
||||
---
|
||||
title: 程序性任务执行 (Procedural Task Execution)
|
||||
created: 2026-05-01
|
||||
updated: 2026-05-01
|
||||
type: concept
|
||||
tags: [benchmark, llm]
|
||||
sources: [papers/dou-cl-bench.md]
|
||||
---
|
||||
|
||||
# 程序性任务执行 (Procedural Task Execution)
|
||||
|
||||
> CL-bench 的第三类上下文:模型需**学习复杂操作流程并正确执行**。如产品手册排障、操作指导、工作流编排。
|
||||
|
||||
## 三个子类
|
||||
|
||||
| 子类 | 典型场景 |
|
||||
|------|---------|
|
||||
| Instructional Procedures | 产品手册、故障排除 |
|
||||
| Operational Procedures | 操作指南(如无人机物流系统 API) |
|
||||
| Workflow Orchestration | 复杂工作流编排 |
|
||||
|
||||
## 典型示例
|
||||
|
||||
上下文中提供 SkyNet Logistics 无人机物流系统的完整 API 文档(导航控制、负载控制、安全控制三个模块),模型需将用户自然语言指令转换为伪代码并附带推理说明。
|
||||
|
||||
## 核心挑战
|
||||
|
||||
- 信息分布在多段文档中,需要**跨段信息整合**
|
||||
- 需要**精确遵循**流程规范,不能跳过或错误执行步骤
|
||||
- 输出格式要求高(伪代码 + 推理说明)
|
||||
|
||||
## 相关概念
|
||||
- [[context-learning]] — 上下文学习
|
||||
- [[dou-cl-bench]] — CL-bench 论文
|
||||
- [[domain-knowledge-reasoning]] — 领域知识推理
|
||||
Reference in New Issue
Block a user