Tutorial: Your First Analysis¶

Let's build a tiny but real analysis from scratch. We'll fit a linear model on a small public dataset, sweep one methodological decision, and get to publishable, reproducible results.

You can read this top to bottom without typing anything; if you want to follow along, every command is plain copy-paste.

What we'll build¶

A two-output analysis:

r2 — the coefficient of determination of a linear model on a toy dataset.
fit_plot — a scatter plot of predictions vs. truth.

We'll declare one decision: whether to standardize the features before fitting. That gives us two universes, standardized and raw, each with its own results.

1. Make the project¶

lc init r2-decision-demo
cd r2-decision-demo
claude

You're now in Claude Code, sitting in a fresh project. The first line on the screen is the session start banner, which probably says "no recipes yet."

2. Scope the analysis with `/lc-new`¶

Type:

/lc-new

The agent banner switches to RESEARCH QUESTION and asks something like "What are you trying to learn?" Reply in plain prose:

I want to know how much R² changes on the diabetes dataset depending on whether I standardize features before fitting a linear regression.

A few follow-ups will sharpen this. After Phase 1 your astra.yaml already has a name, description, and version — open it in another window if you're curious; it's <30 lines.

In Phase 2 (ANALYSIS STRUCTURE) the agent will ask about inputs, outputs, and whether this should be one analysis or split into stages. For our case, one analysis is the right answer:

Input: diabetes (sklearn's bundled toy dataset).
Output 1: r2, type metric.
Output 2: fit_plot, type figure.

In Phase 3 (DEEP DIVE), if you want to skip literature for a tiny demo, just say "skip the literature pass." The agent will still walk you through identifying the decision: does it preprocess? what options does it have? what's the default?

You'll end up with something like this in astra.yaml:

version: "1.0"
name: "R² with and without feature standardization"
description: "Linear regression on the diabetes dataset, sweeping the standardization choice."

inputs: []

decisions:
  standardize:
    label: "Feature standardization"
    rationale: "Standardizing changes coefficient scales and can shift R² for ridge-like models."
    default: standardized
    options:
      standardized: { label: "StandardScaler before fit" }
      raw: { label: "No preprocessing" }

outputs:
  - id: r2
    type: metric
    description: "Coefficient of determination on the test split."
    recipe:
      command: python scripts/fit.py --standardize {standardize} --output {output[0]}
  - id: fit_plot
    type: figure
    description: "Predicted vs true scatter."
    recipe:
      command: python scripts/plot.py --r2_dir {input.r2} --output {output[0]}
      inputs: [r2]

container: Containerfile

Phase 4 (FINALIZE) runs astra validate astra.yaml, writes universes/baseline.yaml, and fills in the narrative: block in astra.yaml (summary, methods, inputs, outputs). You're handed back a short summary table — two outputs, one decision, zero prior insights.

The agent may suggest /clear to free up context. Take its advice, then ask Claude Code to implement the spec.

3. Build it¶

/clear
Implement this analysis from astra.yaml. Write the scripts, run the baseline universe, and verify the result.

The agent reads everything (spec, universe file, empty scripts/ dir, plus the /astra and /lc-cli reference skills primed at session start) and makes an implementation checklist. It might look like this:

1. Add Python deps (scikit-learn, matplotlib) to requirements.txt
2. Write Containerfile if missing
3. scripts/fit.py — accepts --standardize {standardized,raw}, writes r2.json
4. scripts/plot.py — reads r2_dir, writes fit_plot.png
5. lc run --universe baseline
6. lc status
7. astra validate astra.yaml
8. lc verify

It works through the checklist one item at a time. You'll see commands like:

lc run --universe baseline
lc status
astra validate astra.yaml
lc verify

Expected lc status output:

Universe baseline
  ✓ ok    r2
  ✓ ok    fit_plot

Expected validation and verification output is boring in the best way: astra validate astra.yaml exits cleanly, and lc verify reports no tampering, broken provenance chain, or missing manifests. If anything fails, ask the agent to fix the concrete error and rerun the same commands.

The agent commits after each successful output, so your git log is a clean record of the build.

4. Add the second universe¶

The whole point of decisions is to sweep them. Drop out of Claude Code (Ctrl+D or /exit) and create the second universe:

astra universe generate -n raw -d standardize=raw

That writes universes/raw.yaml selecting standardize: raw. Now materialize it:

lc run --universe raw
lc status

You should see:

Universe baseline
  ✓ ok    r2
  ✓ ok    fit_plot
Universe raw
  ✓ ok    r2
  ✓ ok    fit_plot

Each universe has its own results/<universe>/ tree. The two r2.json files are the comparison your paper figure needs.

5. Verify integrity¶

lc verify

This recomputes data hashes for every output and walks the input chain back to declare whether anything has been tampered with since materialization. Useful pre-publication, useful when archiving a project, useful any time you want a stronger guarantee than lc status.

What just happened¶

Concretely:

astra.yaml was the only file you "wrote" — and the agent did most of the typing.
The agent wrote scripts/fit.py and scripts/plot.py with argparse-driven decision injection.
lc run generated .lightcone/Snakefile from your spec, dispatched each rule through Snakemake, and wrote a per-output sidecar manifest (.lightcone-manifest.json) recording the recipe, container image, decisions, input hashes, and output hash.
lc status and lc verify rely on those manifests — they don't re-execute anything; they just check.

If your laptop dies tomorrow and you git clone the repo on a fresh machine and lc run it, you'll get bit-identical results (modulo floating-point nondeterminism in your numerical libraries).

Where to next¶

Running on a Cluster — take the same project to SLURM.
Troubleshooting — when something goes sideways.