magic-statistical-analysis

Computes descriptive statistics, runs hypothesis tests, and analyzes correlations, always communicating the uncertainty of each result. Use it when computing statistics, testing hypotheses, comparing groups, or checking whether correlations are statistically significant.

When It Activates

Use this skill when computing statistics or testing hypotheses. Trigger phrases: statistics, statistical, hypothesis test, t-test, chi-square, correlation, regression, significance, p-value, distribution, balance check.

- You need descriptive statistics with a narrative interpretation
- You need hypothesis testing (group comparisons)
- You need correlation analysis with significance
- You have already run magic-data-profiling or magic-data-cleaning and have not yet started reporting

Results feed naturally into magic-report-generation for structured deliverables, or into magic-data-visualization for charts.

When NOT to Use: Use magic-data-profiling for initial exploration; use magic-data-exploration for pattern discovery.

Quick Facts

| Property | Value |
| --- | --- |
| Version | 2.0.0 |
| Complexity | high |
| Phase | 1 |
| Scripts | 3 |

Scripts

Scriptable Tools (call directly or read + adapt)

| Script | Standard CLI Usage | When to Customize |
| --- | --- | --- |
| `descriptive_stats.py` | `python3 descriptive_stats.py --input data.csv --output stats.json` | `--columns col1,col2` to restrict columns; `--explain` for a verbose narrative; `--auto-checkpoint` for versioned snapshots |
| `hypothesis_test.py` | `python3 hypothesis_test.py --input data.csv --output test.json --group_col region --value_col revenue` | `--group_col` and `--value_col` are effectively required; `--test` to override the auto-selected test; `--explain` for a narrative; `--auto-checkpoint` |
| `correlation_analysis.py` | `python3 correlation_analysis.py --input data.csv --output corr.json` | `--method pearson\|spearman\|kendall` to override the auto-selected method; `--columns` to restrict columns |

# Preview what hypothesis test will be selected
python3 hypothesis_test.py --input data.csv --output test.json \
  --group_col region --value_col revenue --explain
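
To illustrate what "correlation analysis with significance" can look like, here is a minimal sketch that reports a Pearson coefficient together with an approximate p-value and confidence interval. It is not the code of `correlation_analysis.py`: the function name `pearson_with_uncertainty`, the Fisher z approximation, and the sample data are all assumptions chosen so the example runs on the standard library alone. With scipy installed, `scipy.stats.pearsonr` reports a coefficient and p-value directly.

```python
import math
from statistics import NormalDist, fmean


def pearson_with_uncertainty(x, y, confidence=0.95):
    """Pearson r with an approximate two-sided p-value and confidence interval.

    Hypothetical helper: uses the Fisher z-transform, a large-sample
    approximation, rather than an exact t-based test.
    """
    n = len(x)
    if n != len(y) or n < 4:
        raise ValueError("need two equal-length samples with at least 4 points")

    # Pearson correlation coefficient from sums of squares and cross-products.
    mx, my = fmean(x), fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    r = sxy / math.sqrt(sxx * syy)

    # Clamp away from +/-1 so atanh stays finite for perfectly linear data.
    r_safe = max(min(r, 1 - 1e-12), -1 + 1e-12)
    z = math.atanh(r_safe)       # Fisher z-transform of r
    se = 1 / math.sqrt(n - 3)    # approximate standard error of z

    std_normal = NormalDist()
    p_value = 2 * (1 - std_normal.cdf(abs(z) / se))
    crit = std_normal.inv_cdf(0.5 + confidence / 2)
    ci = (math.tanh(z - crit * se), math.tanh(z + crit * se))
    return {"r": r, "p_value": p_value, "ci": ci, "n": n}


# Made-up illustrative data, roughly y = 2x.
result = pearson_with_uncertainty(
    [1, 2, 3, 4, 5, 6, 7, 8],
    [2.1, 3.9, 6.2, 7.8, 10.1, 12.2, 13.8, 16.1],
)
print(result)
```

Returning the interval and sample size next to the coefficient keeps the uncertainty visible to whoever reads the result, which is the point of reporting significance at all.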

Test Selection

The skill auto-selects the appropriate statistical test based on data characteristics:

| Condition | Test |
| --- | --- |
| 2 groups + normal distribution | t-test |
| 2 groups + non-normal | Mann-Whitney U |
| 3+ groups + normal | One-way ANOVA |
| 3+ groups + non-normal | Kruskal-Wallis |
| Both categorical | Chi-square |
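
The selection rules in the table above can be sketched as a small decision function. This is an illustration only: `select_test` and its parameters are hypothetical names, and the sketch assumes normality has already been judged elsewhere (for example with a Shapiro-Wilk test), since the source does not say how `hypothesis_test.py` checks it.

```python
def select_test(n_groups, all_groups_normal, both_categorical=False):
    """Map data characteristics to a test name, following the table above.

    Hypothetical helper; the normality decision is passed in as a flag
    because how it is made is not specified in this document.
    """
    if both_categorical:
        return "Chi-square"
    if n_groups < 2:
        raise ValueError("group comparisons need at least 2 groups")
    if n_groups == 2:
        return "t-test" if all_groups_normal else "Mann-Whitney U"
    return "One-way ANOVA" if all_groups_normal else "Kruskal-Wallis"


# One call per row of the table.
for args in [(2, True), (2, False), (3, True), (4, False)]:
    print(args, "->", select_test(*args))
print("categorical ->", select_test(0, False, both_categorical=True))
```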

Every test result includes an effect size (Cohen's d, eta-squared, rank-biserial, or Cramér's V) alongside the p-value.
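
As a rough illustration of two of these effect sizes, the sketch below computes Cohen's d (with a pooled standard deviation) and Cramér's V (from an uncorrected chi-square statistic) using only the standard library. The function names and sample data are assumptions, and the scripts may use different variants, such as a continuity-corrected chi-square.

```python
import math
from statistics import fmean, variance


def cohens_d(a, b):
    """Standardized mean difference between two samples, pooled-SD variant."""
    na, nb = len(a), len(b)
    pooled_var = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / (na + nb - 2)
    return (fmean(a) - fmean(b)) / math.sqrt(pooled_var)


def cramers_v(table):
    """Cramér's V for a contingency table given as a list of rows of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)

    # Pearson chi-square statistic without continuity correction.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals))
    return math.sqrt(chi2 / (n * (k - 1)))


# Made-up illustrative inputs.
d = cohens_d([2, 4, 6], [1, 3, 5])
v_perfect = cramers_v([[10, 0], [0, 10]])
v_none = cramers_v([[5, 5], [5, 5]])
print(d, v_perfect, v_none)
```

Both measures are unit-free, so they can be reported next to a p-value to show how large a difference or association is, not only whether it is detectable.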

Dependencies

pandas numpy scipy matplotlib seaborn
