$ ~/registry/skill/anthropics-bio-research-skills-single-cell-rna-qc

SKILL

single-cell-rna-qc

Name: single-cell-rna-qc
Author: anthropics

对单细胞RNA测序数据进行质量控制、过滤低质量细胞并生成可视化结果

星标

★ 22,528

来源

GitHub

更新于

2026-07-11

// 安全评估低风险

仅提示词，不执行代码
开源可审计
社区验证· 22.5k

正在进行安全审计…

凭证密钥
网络外发
代码执行
数据访问
来源供应链

// 安装

复制安装指令，让 AI 自动完成配置 · 推荐新手

请帮我安装 askskill 上的 "single-cell-rna-qc" 技能：
1. 下载 https://raw.githubusercontent.com/anthropics/knowledge-work-plugins/main/bio-research/skills/single-cell-rna-qc/SKILL.md
2. 保存为 ~/.claude/skills/single-cell-rna-qc/SKILL.md
3. 装好后重载技能，告诉我可以用了

// 下载

下载 SKILL.md机读安装清单 ↗

// 用法示例

单细胞数据初步质控

输入

请对这个 .h5ad 单细胞RNA测序数据集做质量控制，按照 scverse/scanpy 最佳实践进行 MAD-based 过滤，识别低质量细胞，并输出关键质控图与过滤建议。

预期产出

返回质控指标汇总、低质量细胞过滤结果以及包含基因数、UMI数和线粒体比例等的可视化图表。

评估样本数据质量

输入

帮我评估这个 .h5 文件中的单细胞RNA测序数据质量，检查是否存在异常细胞、低复杂度细胞或高线粒体污染，并说明数据是否适合进入下游分析。

预期产出

给出数据质量诊断结论、异常细胞特征说明，以及是否建议继续做聚类和差异分析的判断。

生成标准化质控报告

输入

请基于这个单细胞RNA测序数据文件生成一份标准化质控报告，包括过滤阈值、保留细胞数量、剔除原因和主要可视化结果，便于我汇报实验数据质量。

预期产出

输出结构化质控报告，清楚展示阈值设置、过滤前后统计对比及主要图表结论。

// 文档

Single-Cell RNA-seq Quality Control

Automated QC workflow for single-cell RNA-seq data following scverse best practices.

When to Use This Skill

Use when users:

Request quality control or QC on single-cell RNA-seq data
Want to filter low-quality cells or assess data quality
Need QC visualizations or metrics
Ask to follow scverse/scanpy best practices
Request MAD-based filtering or outlier detection

Supported input formats:

.h5ad files (AnnData format from scanpy/Python workflows)
.h5 files (10X Genomics Cell Ranger output)

Default recommendation: Use Approach 1 (complete pipeline) unless the user has specific custom requirements or explicitly requests non-standard filtering logic.

Approach 1: Complete QC Pipeline (Recommended for Standard Workflows)

For standard QC following scverse best practices, use the convenience script scripts/qc_analysis.py:

python3 scripts/qc_analysis.py input.h5ad
# or for 10X Genomics .h5 files:
python3 scripts/qc_analysis.py raw_feature_bc_matrix.h5

The script automatically detects the file format and loads it appropriately.

When to use this approach:

Standard QC workflow with adjustable thresholds (all cells filtered the same way)
Batch processing multiple datasets
Quick exploratory analysis
User wants the "just works" solution

Requirements: anndata, scanpy, scipy, matplotlib, seaborn, numpy

Parameters:

Customize filtering thresholds and gene patterns using command-line parameters:

--output-dir - Output directory
--mad-counts, --mad-genes, --mad-mt - MAD thresholds for counts/genes/MT%
--mt-threshold - Hard mitochondrial % cutoff
--min-cells - Gene filtering threshold
--mt-pattern, --ribo-pattern, --hb-pattern - Gene name patterns for different species

Use --help to see current default values.

Outputs:

All files are saved to <input_basename>_qc_results/ directory by default (or to the directory specified by --output-dir):

qc_metrics_before_filtering.png - Pre-filtering visualizations
qc_filtering_thresholds.png - MAD-based threshold overlays
qc_metrics_after_filtering.png - Post-filtering quality metrics
<input_basename>_filtered.h5ad - Clean, filtered dataset ready for downstream analysis
<input_basename>_with_qc.h5ad - Original data with QC annotations preserved

If copying outputs for user access, copy individual files (not the entire directory) so users can preview them directly.

Workflow Steps

The script performs the following steps:

Calculate QC metrics - Count depth, gene detection, mitochondrial/ribosomal/hemoglobin content
Apply MAD-based filtering - Permissive outlier detection using MAD thresholds for counts/genes/MT%
Filter genes - Remove genes detected in few cells
Generate visualizations - Comprehensive before/after plots with threshold overlays

Approach 2: Modular Building Blocks (For Custom Workflows)

For custom analysis workflows or non-standard requirements, use the modular utility functions from scripts/qc_core.py and scripts/qc_plotting.py:

# Run from scripts/ directory, or add scripts/ to sys.path if needed
import anndata as ad
from qc_core import calculate_qc_metrics, detect_outliers_mad, filter_cells
from qc_plotting import plot_qc_distributions  # Only if visualization needed

adata = ad.read_h5ad('input.h5ad')
calculate_qc_metrics(adata, inplace=True)
# ... custom analysis logic here

When to use this approach:

Different workflow needed (skip steps, change order, apply different thresholds to subsets)
Conditional logic (e.g., filter neurons differently than other cells)
Partial execution (only metrics/visualization, no filtering)
Integration with other analysis steps in a larger pipeline
Custom filtering criteria beyond what command-line params support

Available utility functions:

From qc_core.py (core QC operations):

…

查看完整文档 ↗

anthropics装→

// 功能相似

技能

★23k

nextflow-development

运行 nf-core/Nextflow 流水线，完成 RNA-seq、变异检测与 ATAC-seq 数据分析