核查回答是否有文档依据,帮助发现未被资料支持的内容与引用问题。
复制安装指令,让 AI 自动完成配置 · 推荐新手
请帮我安装 askskill 上的 "docs-grounding-verifier" 技能: 1. 下载 https://raw.githubusercontent.com/microsoft/apm/main/.apm/skills/docs-grounding-verifier/SKILL.md 2. 保存为 ~/.claude/skills/docs-grounding-verifier/SKILL.md 3. 装好后重载技能,告诉我可以用了
请根据我提供的产品帮助文档,核查这段客服回复中的每一项说法是否有明确依据;逐条标注“有依据/无依据/部分依据”,并引用对应文档段落。
一份逐条核验结果,说明哪些表述被文档支持、哪些存在依据不足,并附引用位置。
下面是会议纪要和一版总结报告。请检查报告中的结论、数字和行动项是否都能在纪要中找到依据,并列出不准确或推断过度的内容。
一份对照审查清单,指出报告中可验证内容、缺失依据内容和需要改写的句子。
给定检索到的文档片段和模型最终回答,请评估回答是否严格基于检索内容;找出幻觉、遗漏的关键证据,以及引用不匹配的地方。
一份 grounding 评估结果,包含风险点、证据覆盖情况和改进建议。
CLAIM-LEVEL grounding verification. Adapts the RAGAS faithfulness-eval pattern (proven in RAG literature) to docs/code instead of generated- answers/retrieved-context. Source code is the ground truth; docs paragraphs are the candidate text under audit.
python-architect persona doc-writer persona
This skill is a SIBLING of docs-corpus-audit and docs-sync. The
boundary is load-bearing:
| Skill | Trigger | Scope | Granularity |
|---|---|---|---|
| docs-sync | PR opened/synchronized | PR diff only | Page-level |
| docs-corpus-audit | Maintainer asks for whole-corpus pass | Entire corpus | Page-level |
| docs-grounding-verifier | Verify specific pages factually | 1..N pages | CLAIM-level |
docs-corpus-audit invokes this skill in its VERIFY phase on the
highest-risk pages of each wave. docs-sync can invoke it on the
specific pages in a PR diff. The skill is also runnable standalone.
docs-corpus-audit.docs-sync.editorial-owner persona directly.PARENT
-> [Stage 1: EXTRACT claims, fan-out PANEL]
per page -> LLM extracts atomic factual claims as JSON
script: scripts/extract-claims.py
-> [Stage 2: RETRIEVE evidence, deterministic S7]
per claim -> grep over src/ via keywords + hints
script: scripts/retrieve-evidence.sh (NO LLM)
-> [Stage 3: JUDGE grounding, adversarial A7]
per (claim, evidence) -> LLM rules GROUNDED|PARTIAL|CONTRADICTED|UNSUPPORTED
asset: assets/judge-prompt.md
-> [Stage 4: SYNTHESIZE]
aggregate ungrounded -> doc-writer for fix
re-verify after fix (A8 ALIGNMENT LOOP)
Stage 2 is the load-bearing design choice: evidence retrieval is DETERMINISTIC (grep + AST hints), not LLM. The judge in Stage 3 can only rule on evidence it actually receives -- it cannot hallucinate support that the retriever did not find. This is the structural guard against the failure mode "the LLM convinces itself the docs match the code."
Input: list of page paths to verify (1..N). If a risk_class is
attached (e.g. "high-stakes"), prefer it; otherwise treat all as equal.
Out-of-scope:
docs/src/content/docs/ or…
批量梳理并分诊 APM 仓库问题,将明确事项推进到可合并 PR。
对涉及 OpenAPM 规范的 PR 进行多专家对抗式审查并给出发布建议。
优化 APM 代码库中的 CLI 输出、日志提示与诊断信息体验
对整个文档语料逐页对照源码审计,定位过时表述并给出精确修复建议。
在每次 PR 变更时评估文档影响并生成可执行的更新建议
为复杂拉取请求发起多专家并行评审,汇总架构、测试、安全与文档建议。
校验模型输出是否可追溯到真实来源,及时标记无依据的引用与陈述。
通过生成—核验—修正循环,提高回答、代码或方案的准确性与可靠性。
为 Claude Code 会话提供系统化校验流程,帮助检查结果正确性与质量。
为 AI 编程助手连接本地可信知识库,基于书籍标准与文档提供有据可依的编码回答。
基于科学文献检索与全文访问,帮助用户获得更可信的研究结论与依据。
基于浏览器页面内容进行问答,帮助快速提取信息与核对细节。