Skip to main content

Operations that build themselves.
Answer to you.

One prompt assembles the team, the dashboards, the schedules, and the approvals — forging new agents when none fit. You hold the keys for the entire lifecycle.

Lifecycle

Autonomy you can steer.

Every operation has six beats. Every beat is yours.

  1. 01

    Describe

    Write the outcome. One sentence is enough.

    Your control

    Edit, refine, revise.

  2. 02

    Assemble

    The harness picks the data, the agents, the team. Forges new ones when none fit.

    Your control

    Approve the plan before it runs.

  3. 03

    Run

    Agents work in parallel. Glass Box DAG shows every step.

    Your control

    Pause anything, anytime.

  4. 04

    Steer

    Anything an agent ships waits at the approval queue. Reasoning attached.

    Your control

    Approve, revise, or reject.

  5. 05

    Adjust

    Edit the prompt, swap an agent, retune a schedule. The operation re-assembles.

    Your control

    Fork into a variant. Keep both.

  6. 06

    Retire

    Operations end on your call — never on theirs.

    Your control

    Archive with full audit trail.

No surprise sends. No runaway costs. No agent doing something you didn't sign off on.

Self-assembly

Install what fits. Forge what doesn't.

When the harness reads your outcome, it does three things in parallel — finds the data sources you need, picks the agents from your library and the marketplace, and forges new agents on the spot when nothing fits. The plan lands in your queue before a single line of work runs.

Every forged agent is named, prompted, scoped, and given a brief. Every installed one carries its marketplace pedigree. You see the full team — and the data sources behind it — before you approve.

↓ Installed

monitoring_agent

Watches your metrics, fires alerts.

✦ Forged

stripe_metric_watcher

Custom — watches MRR + churn deltas.

↓ Installed

report_generator

Drafts every digest in your voice.

✦ Forged

churn_root_cause

Custom — correlates churn to events.

Glass Box DAG

Every decision, traceable.

While the operation runs, the Glass Box DAG renders every step — the agent picked, the model selected, the tool called, the data read, the artifact produced. Observe → Reason → Act → Verify, four phases per run, narrated line by line.

Click any node to see the full reasoning, the prompt, the tokens spent, the wall-clock time. Replay any run, on any version, forever.

Run · churn-digest · v3

OBSERVE data_source_researcher

Discovered Stripe events, mapped 4 fields.

REASON viz_recommender

Selected line chart with cohort overlay.

ACT widget_config

Generated config, attached to dashboard.

VERIFY chart_qa_agent

Ran eval suite. All checks passed.

Control surfaces

Five levers. Always yours.

Every running operation surfaces the same five controls. Available on every agent, every swarm, every schedule.

Pause

Halt any agent, any swarm, any schedule. Resume when ready.

Edit

Change the prompt, swap the model, retune the tools. The next run picks up the edit.

Fork

Spin a variant off any agent or operation. Keep both. A/B them.

Approve

Every action that crosses an org boundary waits for you. Reasoning attached.

Retire

Operations end on your call. The audit trail stays.

Group chat

Multiple minds. One thread.

Mention two or more agents in the command bar and they round-robin on the question. Debate, disagree, converge. You see every position and the final synthesis.

Useful for the hard calls — pricing decisions, architecture trade-offs, legal red-flag reviews. No single agent has the last word.

@advisory_product @advisory_engineering @advisory_finance

a

advisory_product

Ship the freemium tier. Acquisition lever is too strong to ignore.

a

advisory_engineering

Freemium doubles our infra cost. We need usage caps.

a

advisory_finance

Caps at the 80th percentile. Margin stays above 65%.

g

general

Synthesis: ship freemium with caps. Reconsider at 10k users.

Memory

Agents that remember.

Four layers of memory. Every conversation, every document, every workflow your agents touch — searchable, inspectable, yours.

Working

Per-user preferences, agent stats, recent query patterns. Loaded into every conversation.

Episodic & Semantic

Every conversation distilled into reusable rules and lessons. Vector-indexed and recalled when relevant.

Procedural & RAG Vault

Learned workflows and your private knowledge base. Search any document your org has ever ingested.

Say "always use UTC timestamps" once. It becomes an organization rule — injected into every future conversation, for every team member, permanently. Inspect, edit, or delete any memory at any time.

Eval harness

Every agent graded.

Every agent in the catalog — installed or forged — runs through a five-layer eval suite before it ships. LLM-as-judge graders score every artifact against a rubric. Every change re-runs the suite. Every regression is blocked at the merge.

No black boxes. No trust me.

Spin one up.

Describe the outcome. Keep the controls.