帮助你安全排查 Azure 生产故障,定位应用、容器与消息服务根因
复制安装指令,让 AI 自动完成配置 · 推荐新手
请帮我安装 askskill 上的 "azure-diagnostics" 技能: 1. 下载 https://raw.githubusercontent.com/microsoft/GitHub-Copilot-for-Azure/main/plugin/skills/azure-diagnostics/SKILL.md 2. 保存为 ~/.claude/skills/azure-diagnostics/SKILL.md 3. 装好后重载技能,告诉我可以用了
请帮我排查 Azure App Service 生产环境 CPU 持续过高的问题。请给出安全排查步骤,优先使用 AppLens、Azure Monitor 和资源健康,说明需要查看的指标、日志、可能根因,以及如何缩小范围。
一套安全的生产排障流程,包含关键监控项、日志查询方向、常见根因与后续处理建议。
请协助排查 AKS 中 Pod 一直 Pending 或 CrashLoopBackOff 的问题,并考虑 kube-system 或 CoreDNS 异常、节点 NotReady、镜像拉取失败、健康探针失败等可能性。请按优先级给出检查步骤。
按优先级整理的 AKS 排障清单,覆盖 kubectl 检查点、集群组件状态、事件日志与可能修复方向。
请帮我分析 Azure Service Bus 中消息处理失败的问题,重点关注 AMQP 连接失败、message lock lost、死信队列增加和 SDK 报错。请给出可能根因、应查看的日志和指标,以及排查顺序。
针对消息服务故障的结构化分析,包含关键症状对应的根因假设、监控项和排查步骤。
AUTHORITATIVE GUIDANCE — MANDATORY COMPLIANCE
This document is the official source for debugging and troubleshooting Azure production issues. Follow these instructions to diagnose and resolve common Azure service problems systematically.
Activate this skill when user wants to:
| Service | Common Issues | Reference |
|---|---|---|
| Container Apps | Image pull failures, cold starts, health probes, port mismatches | container-apps/ |
| App Service | High CPU, deployment failures, crashes, slow responses, TLS/custom domains | app-service/ |
| Function Apps | App details, invocation failures, timeouts, binding errors, cold starts, missing app settings | functions/ |
| AKS | Cluster access, nodes, kube-system, scheduling, crash loops, ingress, DNS, upgrades | AKS Troubleshooting |
| Messaging | Event Hubs & Service Bus SDK errors, AMQP failures, message lock, connectivity | Messaging Troubleshooting |
# Check resource health
az resource show --ids RESOURCE_ID
# View activity log
az monitor activity-log list -g RG --max-events 20
# Container Apps logs
az containerapp logs show --name APP -g RG --follow
# Function App logs (query App Insights traces)
az monitor app-insights query --apps APP-INSIGHTS -g RG \
--analytics-query "traces | where timestamp > ago(1h) | order by timestamp desc | take 50"
For AI-powered diagnostics, use:
mcp_azure_mcp_applens
intent: "diagnose issues with <resource-name>"
command: "diagnose"
parameters:
resourceId: "<resource-id>"
Provides:
- Automated issue detection
- Root cause analysis
- Remediation recommendations
For querying logs and metrics:
mcp_azure_mcp_monitor
…
分析并精简 Markdown 内容,降低 token 消耗并提升 AI 处理效率。
帮助排查并解决 Azure Event Hubs 与 Service Bus SDK 的连接、认证和消息处理问题。