magic-workspace-init
Initialize a MAGIC data processing workspace: directory scaffolding, Python environment verification, dependency installation, and LLM configuration. Use when starting a new data project or setting up the MAGIC environment for the first time.
When It Activates
Use this skill when setting up a new workspace or installing dependencies. Trigger phrases: setup, initialize, create workspace, install dependencies, set up environment, init project.
- Starting a new data processing project
- Setting up environment for MAGIC data skills for the first time
- User asks to "set up", "initialize", "bootstrap", or "install" the data workspace
- User needs help installing Python packages or DataDesigner
- Need to verify the environment is ready for data processing
When NOT to Use: Workspace already exists and is initialized. Tier 1 quick tasks (e.g., "clean these nulls") do not need full workspace scaffolding.
Quick Facts
| Property | Value |
|---|---|
| Version | 3.1.0 |
| Complexity | low |
| Phase | 0 |
| Scripts | 0 |
Tags
workspace setup init environment bootstrap install
Workspace Directory Structure
workspace/
├── data/
│ ├── input/ # Original input files (never modified)
│ ├── checkpoints/ # Intermediate results (ckpt_NN_*.csv)
│ └── output/ # Final processed data
├── logs/ # Profiling results, validation reports, analysis_journal.md
├── reports/ # Generated reports (Markdown)
├── charts/ # Generated visualizations (PNG, SVG)
└── configs/ # Agent configs, synthesis configsFor Tier 2/3 projects, the workspace also contains:
workspace/
├── workspace_state.md # Phase tracker, quality score, task list
├── specs/
│ └── data-spec.md # Single source of truth for dataset properties
└── logs/
└── analysis_journal.md # Decision log (timestamp, context, options, chosen, rationale)Project Types
| Project Type | Workspace Shape |
|---|---|
| One-off analysis | data/ + reports/ — minimal scaffolding |
| ETL pipeline | Add staging/ + archive/ |
| Multi-dataset | Per-dataset subdirs under data/input/ |
| Existing pipeline (dbt/Airflow) | Use magic-workspace/ subdirectory to coexist |
Environment Setup
Preferred method (uv — fast, no system pollution):
uv venv .venv --python 3.12
source .venv/bin/activate
uv pip install -r requirements.txtStandard venv:
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txtDefault Output Paths by Skill
| Skill | Default Output Path |
|---|---|
| magic-data-loading | data/input/ |
| magic-data-profiling | logs/ |
| magic-data-cleaning | data/checkpoints/ |
| magic-data-transformation | data/checkpoints/ |
| magic-data-validation | logs/ |
| magic-data-synthesis | data/output/ |
| magic-data-visualization | charts/ |
| magic-report-generation | reports/ |
Scripts
No scripts — workspace initialization is performed by the agent directly.
Dependencies
No dependencies — initialization only requires Python stdlib.
Last updated on
Skills Overview
All 12 data science skills at a glance
magic-data-loading
Load and ingest data from any source — files (CSV, TSV, Parquet, JSON, JSONL, Excel), databases (SQLite, PostgreSQL, MySQL via connection string), or remote repositories (HuggingFace Hub datasets). Auto-detects format, encoding, and delimiter for files. Use when a user mentions data, a dataset, a file, a database, a table, records, or any structured data source they want to work with — even vague references like 'I have some data' or 'help me with this dataset'.