$ ~/registry/skill/microsoft-apm-skills-docs-grounding-verifier

SKILL

docs-grounding-verifier

Name: docs-grounding-verifier
Author: microsoft

核查回答是否有文档依据，帮助发现未被资料支持的内容与引用问题。

星标

★ 3,263

来源

GitHub

更新于

2026-07-16

// 安全评估低风险

仅提示词，不执行代码
开源可审计
社区验证· 3.3k

正在进行安全审计…

凭证密钥
网络外发
代码执行
数据访问
来源供应链

// 安装

复制安装指令，让 AI 自动完成配置 · 推荐新手

请帮我安装 askskill 上的 "docs-grounding-verifier" 技能：
1. 下载 https://raw.githubusercontent.com/microsoft/apm/main/.apm/skills/docs-grounding-verifier/SKILL.md
2. 保存为 ~/.claude/skills/docs-grounding-verifier/SKILL.md
3. 装好后重载技能，告诉我可以用了

// 下载

下载 SKILL.md机读安装清单 ↗

// 用法示例

检查客服答案是否有依据

输入

请根据我提供的产品帮助文档，核查这段客服回复中的每一项说法是否有明确依据；逐条标注“有依据/无依据/部分依据”，并引用对应文档段落。

预期产出

一份逐条核验结果，说明哪些表述被文档支持、哪些存在依据不足，并附引用位置。

审查总结报告的事实来源

输入

下面是会议纪要和一版总结报告。请检查报告中的结论、数字和行动项是否都能在纪要中找到依据，并列出不准确或推断过度的内容。

预期产出

一份对照审查清单，指出报告中可验证内容、缺失依据内容和需要改写的句子。

验证 RAG 回答质量

输入

给定检索到的文档片段和模型最终回答，请评估回答是否严格基于检索内容；找出幻觉、遗漏的关键证据，以及引用不匹配的地方。

预期产出

一份 grounding 评估结果，包含风险点、证据覆盖情况和改进建议。

// 文档

name: docs-grounding-verifier description: Use this skill to verify CLAIM-LEVEL grounding of a documentation page (or set of pages) against the source code. Activate when you have specific pages to check for factual accuracy -- not when sweeping a whole corpus (use docs-corpus-audit for that) and not when triaging a PR diff (use docs-sync for that). Trigger nouns: "is this doc accurate", "verify the page against the code", "fact-check this section", "any claims that drifted from source", "fact-checking", "grounding audit", "drift hunt", "claim verification". Returns per-claim verdicts (GROUNDED | PARTIAL | CONTRADICTED | UNSUPPORTED) with file:line evidence citations. Catches paragraph-level inaccuracies that page-level audit averages over -- e.g. a paragraph with 5 claims where 4 are grounded and 1 is fabricated. Does NOT modify files (returns advisory only); does NOT re-architect the docs; does NOT triage PRs.

docs-grounding-verifier

CLAIM-LEVEL grounding verification. Adapts the RAGAS faithfulness-eval pattern (proven in RAG literature) to docs/code instead of generated- answers/retrieved-context. Source code is the ground truth; docs paragraphs are the candidate text under audit.

python-architect persona doc-writer persona

Sibling contract

This skill is a SIBLING of docs-corpus-audit and docs-sync. The boundary is load-bearing:

Skill	Trigger	Scope	Granularity
docs-sync	PR opened/synchronized	PR diff only	Page-level
docs-corpus-audit	Maintainer asks for whole-corpus pass	Entire corpus	Page-level
docs-grounding-verifier	Verify specific pages factually	1..N pages	CLAIM-level

docs-corpus-audit invokes this skill in its VERIFY phase on the highest-risk pages of each wave. docs-sync can invoke it on the specific pages in a PR diff. The skill is also runnable standalone.

When to activate

Maintainer says "verify <page> against the code".
An audit wave wants per-claim grounding scores for its highest-risk pages.
A PR review wants to confirm that prose changes are not just plausible but actually consistent with the implementation.
A "fact-check" or "grounding" or "drift hunt" request.

When NOT to activate

Whole-corpus sweep with no specific page list -> use docs-corpus-audit.
PR review with mixed code+docs diff -> use docs-sync.
Editorial / tone review -> use editorial-owner persona directly.

Architecture (PIPELINE-of-PANELS)

PARENT
  -> [Stage 1: EXTRACT claims, fan-out PANEL]
       per page -> LLM extracts atomic factual claims as JSON
       script: scripts/extract-claims.py
  -> [Stage 2: RETRIEVE evidence, deterministic S7]
       per claim -> grep over src/ via keywords + hints
       script: scripts/retrieve-evidence.sh   (NO LLM)
  -> [Stage 3: JUDGE grounding, adversarial A7]
       per (claim, evidence) -> LLM rules GROUNDED|PARTIAL|CONTRADICTED|UNSUPPORTED
       asset: assets/judge-prompt.md
  -> [Stage 4: SYNTHESIZE]
       aggregate ungrounded -> doc-writer for fix
       re-verify after fix (A8 ALIGNMENT LOOP)

Stage 2 is the load-bearing design choice: evidence retrieval is DETERMINISTIC (grep + AST hints), not LLM. The judge in Stage 3 can only rule on evidence it actually receives -- it cannot hallucinate support that the retriever did not find. This is the structural guard against the failure mode "the LLM convinces itself the docs match the code."

Phase 1: SCOPE

Input: list of page paths to verify (1..N). If a risk_class is attached (e.g. "high-stakes"), prefer it; otherwise treat all as equal.

Out-of-scope:

Pages outside docs/src/content/docs/ or

…

查看完整文档 ↗

校验法律类AI输出中的孤立引文与未引用主张，辅助风险审查

—装→

MCP 工具

★1

Citation Verification

为AI代理实时核验事实、引用与来源时效

—装→

MCP 工具

Fact Verification MCP

帮助用户核验事实声明，输出结论、置信度与引用来源，并支持批量检查。

—装→

MCP 工具

★21

DocGuard

用于检测文档漂移并核验 MCP 工具说明与声明一致性

—装→

$ loading_

请帮我安装 askskill 上的 "docs-grounding-verifier" 技能： 1. 下载 https://raw.githubusercontent.com/microsoft/apm/main/.apm/skills/docs-grounding-verifier/SKILL.md 2. 保存为 ~/.claude/skills/docs-grounding-verifier/SKILL.md 3. 装好后重载技能，告诉我可以用了

// 用法示例

检查客服答案是否有依据

输入

请根据我提供的产品帮助文档，核查这段客服回复中的每一项说法是否有明确依据；逐条标注“有依据/无依据/部分依据”，并引用对应文档段落。

预期产出

一份逐条核验结果，说明哪些表述被文档支持、哪些存在依据不足，并附引用位置。

审查总结报告的事实来源

输入

下面是会议纪要和一版总结报告。请检查报告中的结论、数字和行动项是否都能在纪要中找到依据，并列出不准确或推断过度的内容。

预期产出

一份对照审查清单，指出报告中可验证内容、缺失依据内容和需要改写的句子。

验证 RAG 回答质量

输入

给定检索到的文档片段和模型最终回答，请评估回答是否严格基于检索内容；找出幻觉、遗漏的关键证据，以及引用不匹配的地方。

预期产出

一份 grounding 评估结果，包含风险点、证据覆盖情况和改进建议。

// 文档

name: docs-grounding-verifier description: Use this skill to verify CLAIM-LEVEL grounding of a documentation page (or set of pages) against the source code. Activate when you have specific pages to check for factual accuracy -- not when sweeping a whole corpus (use docs-corpus-audit for that) and not when triaging a PR diff (use docs-sync for that). Trigger nouns: "is this doc accurate", "verify the page against the code", "fact-check this section", "any claims that drifted from source", "fact-checking", "grounding audit", "drift hunt", "claim verification". Returns per-claim verdicts (GROUNDED | PARTIAL | CONTRADICTED | UNSUPPORTED) with file:line evidence citations. Catches paragraph-level inaccuracies that page-level audit averages over -- e.g. a paragraph with 5 claims where 4 are grounded and 1 is fabricated. Does NOT modify files (returns advisory only); does NOT re-architect the docs; does NOT triage PRs.

docs-grounding-verifier

python-architect persona doc-writer persona

Sibling contract

This skill is a SIBLING of docs-corpus-audit and docs-sync. The boundary is load-bearing:

Skill	Trigger	Scope	Granularity
docs-sync	PR opened/synchronized	PR diff only	Page-level
docs-corpus-audit	Maintainer asks for whole-corpus pass	Entire corpus	Page-level
docs-grounding-verifier	Verify specific pages factually	1..N pages	CLAIM-level

docs-corpus-audit invokes this skill in its VERIFY phase on the highest-risk pages of each wave. docs-sync can invoke it on the specific pages in a PR diff. The skill is also runnable standalone.

When to activate

Maintainer says "verify <page> against the code".
An audit wave wants per-claim grounding scores for its highest-risk pages.
A PR review wants to confirm that prose changes are not just plausible but actually consistent with the implementation.
A "fact-check" or "grounding" or "drift hunt" request.

When NOT to activate

Whole-corpus sweep with no specific page list -> use docs-corpus-audit.
PR review with mixed code+docs diff -> use docs-sync.
Editorial / tone review -> use editorial-owner persona directly.

Architecture (PIPELINE-of-PANELS)

PARENT
  -> [Stage 1: EXTRACT claims, fan-out PANEL]
       per page -> LLM extracts atomic factual claims as JSON
       script: scripts/extract-claims.py
  -> [Stage 2: RETRIEVE evidence, deterministic S7]
       per claim -> grep over src/ via keywords + hints
       script: scripts/retrieve-evidence.sh   (NO LLM)
  -> [Stage 3: JUDGE grounding, adversarial A7]
       per (claim, evidence) -> LLM rules GROUNDED|PARTIAL|CONTRADICTED|UNSUPPORTED
       asset: assets/judge-prompt.md
  -> [Stage 4: SYNTHESIZE]
       aggregate ungrounded -> doc-writer for fix
       re-verify after fix (A8 ALIGNMENT LOOP)

Phase 1: SCOPE

Input: list of page paths to verify (1..N). If a risk_class is attached (e.g. "high-stakes"), prefer it; otherwise treat all as equal.

Out-of-scope:

Pages outside docs/src/content/docs/ or

…

查看完整文档 ↗