mle-workflow — askskill

mle-workflow

Helps teams build reproducible, deployable, and monitorable production-grade machine learning engineering workflows.

Stars

★ 210,546

Source

GitHub

Updated

2026-06-19

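// Security assessment: low risk

Prompt only; does not execute code
Open source and auditable
Community validated · 210.5k

Overall assessment

The skill's materials show it to be a prompt-only description of a machine learning engineering workflow: it requires no secrets, declares no remote endpoints, and shows no capability for local execution or data exfiltration. Because it is open source on GitHub and widely adopted by the community, overall risk is low. The main point to keep an eye on is routine: whether the source stays maintained and the repository contents remain auditable.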

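Credentials and secrets: low risk

The materials explicitly state "Required secrets/environment variables: none", and the README does not ask for an API key, cloud credentials, or tokens. No sign of credential collection, storage, or misuse was found.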

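Network egress: low risk

The system check shows no remote endpoints, and the materials declare no mechanism for sending data to external services. As a prompt-only skill, there is no factual basis for concluding that user data is sent to third parties.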

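Code execution: low risk

The skill is classified as prompt-only. The current materials describe only workflow methodology and applicable scenarios, with no indication that it spawns local processes, runs scripts, invokes system commands, or requests additional execution privileges.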

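Data access: low risk

The README discusses production ML workflow design and does not claim the ability to read or write local files, repositories, databases, or other user resources. No data access capability beyond the scope of documentation and prompts was found.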

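Source and supply chain: low risk

The source is an open-source GitHub repository with very high community adoption (about 210.5k stars), both of which are significant risk-reducing factors. Although no license is declared and the maintenance status is unknown, no high-risk red flags such as closed-source exfiltration, a suspicious origin, or obvious injection were found.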

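Security recommendations

Before adopting the skill, review the repository's current maintenance activity, recent commits, and issue handling.
Check the license information and confirm it is compatible with your organization's internal use and distribution requirements.
If you later install any associated agent or extension capabilities, audit their execution permissions, data access, and outbound network behavior separately.

Audit model: gpt-5.4 · 2026-06-19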

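// Install

Copy the install instructions and let the AI complete the setup · Recommended for beginners

Please help me install the "mle-workflow" skill from askskill:
1. Download https://raw.githubusercontent.com/affaan-m/ECC/main/skills/mle-workflow/SKILL.md
2. Save it as ~/.claude/skills/mle-workflow/SKILL.md
3. Reload skills once it is installed and tell me it is ready to use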

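// Usage examples

Design an end-to-end ML pipeline

Input

Please design a production-grade engineering workflow for a machine learning system that predicts user churn. Cover data contracts, feature processing, reproducible training, model evaluation, deployment, monitoring, and rollback strategy, and give the division of responsibilities and key checkpoints for each stage.

Expected output

A structured ML engineering workflow plan describing the stages, responsibilities, quality gates, and launch safeguards.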
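
As one illustration of the "reproducible training" stage named in the prompt above, the sketch below fingerprints a run's configuration and input data and derives an isolated random generator from the recorded seed. It is a minimal sketch under stated assumptions, not part of the skill: `run_manifest`, `seeded_rng`, the config keys, and the file name are all hypothetical.

```python
import hashlib
import json
import random


def run_manifest(config: dict, data_files: dict[str, bytes]) -> dict:
    """Build a fingerprint that identifies one training run's inputs."""
    # Canonical JSON so that key order does not change the hash.
    canonical = json.dumps(config, sort_keys=True, separators=(",", ":"))
    return {
        "config_sha256": hashlib.sha256(canonical.encode()).hexdigest(),
        "data_sha256": {
            name: hashlib.sha256(blob).hexdigest()
            for name, blob in sorted(data_files.items())
        },
        "seed": config["seed"],  # assumption: the config always carries a seed
    }


def seeded_rng(manifest: dict) -> random.Random:
    """Return an isolated RNG so shuffles and splits repeat exactly."""
    return random.Random(manifest["seed"])
```

Storing the manifest next to the trained artifact gives a reviewer a cheap way to check whether two runs really used the same inputs.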

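Review an existing model release process

Input

Please review our current model release process: data comes from multiple tables, training relies on manual scripts, evaluation looks only at AUC, there is no drift monitoring, and there is no rollback plan. Identify the risks and give productionization recommendations with a prioritized roadmap.

Expected output

A process review report covering the main risks, a gap analysis, and improvement recommendations ordered by priority.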

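Define a monitoring and rollback plan

Input

Please define a monitoring and rollback plan for a deployed recommendation model, covering input data quality, feature drift, prediction distribution, online performance metrics, alert thresholds, trigger conditions, and the automatic rollback and manual intervention procedures.

Expected output

An actionable monitoring and rollback mechanism that helps operations and algorithm teams keep the online model stable.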
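
To make "feature drift" and "alert thresholds" from the prompt above concrete, here is a minimal sketch of one common drift measure, the Population Stability Index (PSI), with a simple trigger. The skill itself does not prescribe PSI. The function names are hypothetical, and the 0.25 threshold is a commonly cited rule of thumb that would need tuning per feature.

```python
import math
from typing import Sequence


def psi(expected: Sequence[float], actual: Sequence[float],
        bins: int = 10, eps: float = 1e-6) -> float:
    """Population Stability Index between a reference and a live sample."""
    lo, hi = min(expected), max(expected)
    width = (hi - lo) / bins or 1.0  # guard against a constant reference

    def shares(values: Sequence[float]) -> list[float]:
        counts = [0] * bins
        for v in values:
            # Clamp out-of-range live values into the edge bins.
            idx = min(max(int((v - lo) / width), 0), bins - 1)
            counts[idx] += 1
        # Floor empty bins at eps so the logarithm stays defined.
        return [max(c / len(values), eps) for c in counts]

    e, a = shares(expected), shares(actual)
    return sum((ai - ei) * math.log(ai / ei) for ei, ai in zip(e, a))


def should_roll_back(score: float, threshold: float = 0.25) -> bool:
    """Hypothetical trigger: flag the deployment when drift exceeds the threshold."""
    return score > threshold
```

In practice a trigger like this would more plausibly page a human or pause a rollout than roll back on its own; which action it takes is a project decision.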

// Documentation

Machine Learning Engineering Workflow

Use this skill to turn model work into a production ML system with clear data contracts, repeatable training, measurable quality gates, deployable artifacts, and operational monitoring.

When to Activate

Planning or reviewing a production ML feature, model refresh, ranking system, recommender, classifier, embedding workflow, or forecasting pipeline
Converting notebook code into a reusable training, evaluation, batch inference, or online inference pipeline
Designing model promotion criteria, offline/online evals, experiment tracking, or rollback paths
Debugging failures caused by data drift, label leakage, stale features, artifact mismatch, or inconsistent training and serving logic
Adding model monitoring, canary rollout, shadow traffic, or post-deploy quality checks
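
One of the activation cases above, model promotion criteria, can be sketched as a small gate function. This is an illustration only: the metric names (`auc`, `p95_latency_ms`), the `slice:` key prefix, and the default thresholds are assumptions and should be replaced with the project's own.

```python
from dataclasses import dataclass, field


@dataclass
class GateDecision:
    promote: bool
    reasons: list[str] = field(default_factory=list)


def promotion_gate(candidate: dict[str, float], baseline: dict[str, float],
                   primary: str = "auc", max_slice_drop: float = 0.01,
                   latency_budget_ms: float = 50.0) -> GateDecision:
    """Compare a candidate against the current baseline on hypothetical metrics."""
    reasons: list[str] = []
    if candidate[primary] < baseline[primary]:
        reasons.append(
            f"{primary} regressed: {candidate[primary]:.3f} < {baseline[primary]:.3f}"
        )
    # Any baseline key prefixed with "slice:" is treated as a per-slice metric.
    for name, base_value in baseline.items():
        if name.startswith("slice:") and base_value - candidate.get(name, 0.0) > max_slice_drop:
            reasons.append(f"{name} dropped by more than {max_slice_drop}")
    # A missing latency measurement counts as over budget.
    if candidate.get("p95_latency_ms", float("inf")) > latency_budget_ms:
        reasons.append("p95 latency over budget")
    return GateDecision(promote=not reasons, reasons=reasons)
```

Returning the reasons alongside the decision keeps a blocked promotion reviewable instead of leaving a bare pass/fail flag.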

Scope Calibration

Use only the lanes that fit the system in front of you. This skill is useful for ranking, search, recommendations, classifiers, forecasting, embeddings, LLM workflows, anomaly detection, and batch analytics, but it should not force one architecture onto all of them.

Do not assume every model has supervised labels, online serving, a feature store, PyTorch, GPUs, human review, A/B tests, or real-time feedback.
Do not add heavyweight MLOps machinery when a data contract, baseline, eval script, and rollback note would make the change reviewable.
Do make assumptions explicit when the project has no labels, has delayed outcomes, or lacks slice definitions, production traffic, or monitoring ownership.
Treat examples as interchangeable scaffolds. Replace metrics, serving mode, data stores, and rollout mechanics with the project-native equivalents.
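
The lightweight option described above (a data contract rather than heavyweight MLOps machinery) might look like the following standard-library sketch. The column names and the `Column` and `violations` helpers are hypothetical and stand in for whatever schema the project actually has.

```python
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class Column:
    name: str
    dtype: type
    nullable: bool = False


# Hypothetical contract for a churn-style training table.
CONTRACT = [
    Column("user_id", str),
    Column("tenure_days", int),
    Column("plan", str),
    Column("churned", int, nullable=True),  # assumption: the label may arrive late
]


def violations(row: dict[str, Any], contract: list[Column] = CONTRACT) -> list[str]:
    """Return human-readable contract violations for one row."""
    problems = []
    for col in contract:
        if col.name not in row:
            problems.append(f"missing column: {col.name}")
        elif row[col.name] is None:
            if not col.nullable:
                problems.append(f"null in non-nullable column: {col.name}")
        elif not isinstance(row[col.name], col.dtype):
            problems.append(f"{col.name}: expected {col.dtype.__name__}")
    return problems
```

A check of this size can run in CI against a sample of the training table, which is often enough to make a data change reviewable.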

Related Skills

Reuse the SWE Surface

SWE surface	MLE use
`product-capability` / `architecture-decision-records`	Turn model work into explicit product contracts and record irreversible data, model, and rollout choices
`repo-scan` / `codebase-onboarding` / `code-tour`	Find existing training, feature, serving, eval, and monitoring paths before introducing a parallel ML stack
`plan` / `feature-dev`	Scope model changes as product capabilities with data, eval, serving, and rollback phases
`tdd-workflow` / `python-testing`	Test feature transforms, split logic, metric calculations, artifact loading, and inference schemas before implementation
`code-reviewer` / `mle-reviewer`	Review code quality plus ML-specific leakage, reproducibility, promotion, and monitoring risks
`build-fix` / `pr-test-analyzer`	Diagnose broken CI, flaky evals, missing fixtures, and environment-specific model or dependency failures
`quality-gate` / `test-coverage`	Require automated evidence for transforms, metrics, inference contracts, promotion gates, and rollback behavior
`eval-harness` / `verification-loop`	Turn offline metrics, slice checks, latency budgets, and rollback drills into repeatable gates
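
The `tdd-workflow` / `python-testing` row above mentions testing split logic before implementation. A minimal sketch of what such tests could look like follows, assuming a hash-based split keyed on an entity id so that the same entity never appears in both train and holdout. The `assign_split` function, its salt, and the tolerance band are illustrative choices, not the skill's API.

```python
import hashlib


def assign_split(entity_id: str, holdout_fraction: float = 0.2, salt: str = "v1") -> str:
    """Deterministically assign an entity to 'train' or 'holdout' by hashing its id."""
    digest = hashlib.sha256(f"{salt}:{entity_id}".encode()).hexdigest()
    bucket = int(digest[:8], 16) / 0x100000000  # maps the first 32 bits to [0, 1)
    return "holdout" if bucket < holdout_fraction else "train"


def test_split_is_stable_across_calls() -> None:
    assert assign_split("user-42") == assign_split("user-42")


def test_split_fraction_is_roughly_respected() -> None:
    ids = [f"user-{i}" for i in range(10_000)]
    share = sum(assign_split(i) == "holdout" for i in ids) / len(ids)
    # Wide tolerance band around the 0.2 target.
    assert 0.17 < share < 0.23
```

Both tests are written as plain functions with bare assertions, so they can be collected by pytest or called directly.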
