A driver-based model, 2026→2030, tied to the phased strategy: a free pilot, the first paid contract, then a deliberate climb up-market. Pick a business-model variant, choose a scenario, drag the assumptions — and watch the one number a bootstrapped team lives by: cumulative cash. Like the product, this page loads no third-party fonts or trackers.
MARGIN lever. 1.5 = 50% over the EU token cost we pay Scaleway.| variant | the bet | where it breaks | verdict |
|---|---|---|---|
| Metered resale | Pure usage. Charge token cost × 1.5; no seat fee. Lowest commitment, easiest to start — it's literally what the gateway does today. | Margin is thin and rides entirely on the markup surviving. If anyone ships EU-hosted open weights cheaper, it's a price war you lose. | A wedge, not the whole company. |
| Managed seats | Flat platform fee per seat; tokens passed through near cost. Predictable, procurement-friendly — one budget line the institution approves. | Flat-rate + pass-through is internally incoherent: you can't promise a fixed price and pass variable token bills through. Heavy agentic users torch the margin with no usage lever to catch them. | Strong for procurement, dangerous if pricing stays flat. |
| Services-led | The product is the door; the money is in training, custom skills, the DPO/compliance package. Highest gross margin per euro. | Labour scales with headcount, not with two founders + agents. Past ~€150k of services you're modelling a hire, and revenue is lumpy and relationship-bound. | The cash engine early; not the equity story. |
| Sovereign self-host | Own the inference: lease EU GPUs, serve GLM/Devstral yourself. Above a utilization threshold, marginal token cost collapses — COGS becomes a lever you control, not a pass-through. | A fixed GPU bill from day one and real MLOps burden. Below the utilization knee you're underwater; you need volume first. | The only real answer to the price-war — earned later, at scale. |
| OEM / consortium | Sell the sovereign stack through a national body (SURF) or an EU consortium (Horizon / Digital Europe) — one contract lands a whole country's design schools. Low per-seat, big integration services, adoption part-funded by the EU. | Glacial procurement, consortium politics, and you're a sub-supplier — thin per-seat, dependent on the partner's reach. | Where the up-market money actually lives; slow to land. |
| Blend ▲■● | Seat base plus usage markup plus services, with grants funding the open infra in the early years only. The research consensus for AI-native pricing. | More parts to sell and keep coherent. Mitigated by segmenting — students free, institution pays the base, heavy teams pay usage. | Recommended start. Predictable base, usage upside, labour margin, non-dilutive runway. |
Switch the variant buttons to load each one's economics. The default Blend is what the strategy implies: win the student on UX for free, let the institution pay a platform fee because sovereignty unlocks procurement, meter the heavy AI usage so no user is unprofitable, sell the compliance/training package on top, and let grants fund the open sovereign layer while the commercial layer finds its feet. Self-host and OEM/consortium are where it goes once there's volume — the long-run moat and the up-market climb, not the place to start.
What survived the critique and is baked in: revenue is shown as true reseller top-line and net margin is net÷revenue (no flattering 70% number); usage is always metered at a markup (no flat-rate heavy-user trap); services carry a delivery cost (~60% margin, not free); a hire is forced at 3 and 8 customers; the ISO-27001 spend lands at the first institutional sale (certs precede the buyer); grants taper to zero by 2029 (no treadmill); and the headline is cumulative cash, because revenue you can't bank is vanity for a bootstrapped team.
| Token COGS & the margin lever est | The gateway's explicit-margin pricing sets GLM-5.2 (primary, sovereign) at €1.8/€5.5 per 1M in/out and an explicit MARGIN = 1.5 as the single unit-economics lever. Verify the Scaleway sheet before quoting externally. gateway/src/pricing.ts |
| Blend beats either-or firm | 2026 best practice: predictable base + usage component on top; all three major coding assistants moved spend onto usage. Ibbaka, GitHub |
| AI gross margin is not SaaS firm | Scaling-stage AI B2B ≈ 52% gross margin (ICONIQ, Jan 2026), inference ~23% of revenue — model net as a managed-services / AI hybrid, not 80% SaaS. SaaStr |
| Scenario + cumulative-cash UX firm | Driver-based models with best/base/bear scenarios and a months-of-cash / cumulative-cash view are the standard (Runway, Causal). Causal |
| Seat utilization firm | ~47–54% of enterprise SaaS seats are actively used — and the institution still pays for the inactive ones. Zylo |
| Grant amounts est | WBSO (payroll-tax R&D credit), Sovereign Tech Fund €50k–€1M, NLnet €5–50k, AiNed. Weighted to 2027. GRANTS.md |
| Team cost est | NL senior MLOps/SRE ≈ €145k loaded; auto-hire (~€130k) triggers at 3 and 8 customers. PayScale |
| Reconciliation with the unit-economics view est | The single-year calculator folds token cost into a flat seat price; this projection meters tokens explicitly at a markup and treats seats as separate base. Same business, two lenses — the projection matches the gateway's actual metered billing. |
| Paid customers (year) | 0 in 2026 (free pilot); first-paid in 2027; +new/year after |
| Token COGS | customers × seats × active% × cost/active-user/mo × 12 (what we pay the provider) |
| Usage revenue (top-line) | token COGS × markup — the full resold-token charge billed to the customer |
| Seat revenue | customers × seats × platform-fee/seat/yr |
| Services revenue | customers × services/customer/yr (delivery costs ~40% of it) |
| Operating revenue | seat + usage + services (true reseller top-line) |
| Gross profit | operating revenue − token COGS − services delivery |
| Opex | base team + €130k at ≥3 customers + €130k at ≥8 + GPU lease (self-host only) + one-time €55k ISO 27001 at first sale |
| Net (operating) | gross − opex |
| Cumulative cash | running sum of (net + grants that year), starting from €0 |
Deliberately legible over precise. Excludes VAT, working-capital timing, FX on non-sovereign routes, and equity raises (the point of this plan is to need none). The softest input is token-cost-per-active-user; instrument it in the pilot. Indicative, June 2026 — pair with the single-year unit-economics view.