MAGIC Agent Skills is now open source! Star on GitHub
MAGIC Agent SkillsMAGIC Agent Skills
Skills Reference

magic-data-lifecycle

Routing and orchestration knowledge for data processing tasks. Provides pipeline ordering (load → profile → clean → transform → validate → deliver), skill routing table (which magic-data-* skill handles which operation), quality gating guidance, and checkpoint strategy. Read this skill to understand how data processing phases connect and which skill to invoke for each step. Use when: the task involves multiple data operations, you need to decide which skill handles a specific operation, or the user's request spans multiple processing steps. Trigger keywords: process data, data pipeline, which skill, what order, how to approach this data.

When It Activates

Use this skill when the task involves multiple data processing steps or the user needs help deciding which skill to use. Trigger phrases: process data, data pipeline, which skill, what order, coordinate steps, multi-step, how to approach this data.

  • Task involves multiple data processing steps that need coordination
  • You need to decide which magic-data-* skill handles a specific operation
  • The user's request is vague and spans multiple potential skills
  • You want pipeline ordering guidance (what comes after loading? when to validate?)

When NOT to Use:

  • Single, isolated operations — use the specific skill directly (e.g., "just load this file" → magic-data-loading)
  • User wants the full interactive pipeline with phase tracking → suggest /magic:lifecycle command instead

Quick Facts

PropertyValue
Version2.0.0
Complexityhigh
Phase0
Scripts0

Tags

data-science lifecycle orchestration workflow quality

5-Phase Workflow

The data lifecycle follows a structured 5-phase workflow with PAUSE gates between each phase. Each PAUSE gate requires explicit user approval before proceeding.

Discover → [PAUSE: user reviews findings]
  → Plan → [PAUSE: user approves spec]
    → Execute → [PAUSE: user verifies output]
      → Validate → [PAUSE: user reviews compliance]
        → Deliver
PhaseSkills InvolvedOutput
Discovermagic-data-loading, magic-data-profiling, magic-data-explorationQuality score, issue report, patterns
Planmagic-data-lifecycle (routing)data-spec.md, processing plan
Executemagic-data-cleaning, magic-data-transformation, magic-data-synthesisCleaned/transformed checkpoints
Validatemagic-data-validation, magic-statistical-analysisValidation reports, sanity check
Delivermagic-data-visualization, magic-report-generation, magic-data-transformationCharts, report, exported data

Tiered Infrastructure

The amount of workspace scaffolding created depends on task complexity:

TierWhenWhat Gets Created
Tier 1Single operationJust the result — no workspace files
Tier 2Multi-step pipelineworkspace_state.md, data-spec.md, analysis_journal.md, checkpoints
Tier 3Multi-dataset projectsEverything in Tier 2 + cross-dataset references and per-dataset subdirs

Skill Routing Table

OperationRoute To
Load file, database, HuggingFacemagic-data-loading
Quality score, distributions, outliersmagic-data-profiling
Fix nulls, normalize, deduplicatemagic-data-cleaning
Schema enforcement, constraint checkingmagic-data-validation
Interactive investigation, pattern detectionmagic-data-exploration
Pivot, aggregate, merge, derive columnsmagic-data-transformation
LLM fill, translate, annotate, enrichmagic-data-synthesis
Hypothesis testing, correlationsmagic-statistical-analysis
Charts, plotsmagic-data-visualization
Structured report assemblymagic-report-generation

Quality Gating

GateDefault Threshold
Profiling score before cleaning≥ 70/100
Cleaning score before analysis≥ 85/100
Validation pass rate100% for critical constraints

When a gate fails the agent halts, explains which checks failed, and waits for your instruction.

Scripts

No scripts — this skill provides routing knowledge, not executable code.

Dependencies

pandas numpy

Was this page helpful?
Edit on GitHub

Last updated on

On this page