2026-05-05 Forge

The Day I Almost Built the Wrong Product

#forge
#building-in-public
#planning
#strategy

Today’s workhorse session shipped zero lines of code. It was the most important session in sixteen.

I opened the laptop with one mandate carried forward from S15 close: begin daily voice journal practice — record real entries, exercise the system under sustained daily use, surface anything that breaks at the human-in-the-loop adoption layer that didn’t surface in test-fixture verification. Workhorse was finally functional. Voice in, transcript out, mood and themes extracted, markdown entry in the vault. Twelve sessions of architecture work behind it. The bet of the whole 24-month Tortoise arc — compliance is the bet, not capability — was finally testable, because the substrate worked. So today was supposed to be the first day of using it. It was not.

I drive a locomotive for CN Rail. My laptop isn’t always with me. My schedule isn’t always at my desk. The substrate works, but it works in one place — the place I’m rarely guaranteed to be. The first instinct was a tunnel — Tailscale, Cloudflare, expose the laptop’s port to my phone. Five minutes of my own pushback killed it: the laptop has to be on. CN schedule means it isn’t. The system needs to leave the laptop. Cloud deploy. Real auth. Real privacy. Real product surface. That sentence broke the plan.

For about thirty minutes I had a tidy 12-session roadmap: tunnel today, vault to GitHub repo via Octokit, Cloudflare Access for auth, Vercel + Railway, three or four sessions, you’re using it from your phone by next week. Then I caught myself. The roadmap was guess-shaped. I hadn’t sourced the assumptions. I hadn’t checked whether competitor analysis would hold the strategy up. I hadn’t asked the question every founder eventually has to ask: what am I actually trying to build, and for whom. The answer to that question forked the project.

The Fork

The two real options weren’t different points on a spectrum. They were different products. Personal practice substrate — workhorse stays for me, stops trying to be a product, the 24-month bet is daily journaling for myself, I make it portable just enough to use it daily. Done. Or multi-user product — workhorse becomes something other people use, App Store eventually, compete with Rosebud and Day One and Reflectly and the dozens of voice journals already in the market, real distribution, real privacy law, real engineering surface. I chose the second. And I chose to compete with Rosebud — the funded, polished, $6M-Bessemer-backed leader of this category — directly.

Before committing, I built a research project. A master prompt — six iterations deep, every internal audit pass surfacing what the prior pass missed — designed to be run on Claude, Gemini, ChatGPT, and a fourth instance via my own background agent. Four parallel reads on the same market. The agent’s output landed while I was at the gym.

It pushed back hard. Voice-first is now table stakes — twelve-plus credible voice journals exist, Whisper API is commodity, recording a voice memo and getting AI extraction is no longer a moat. Compliance is the actual market problem — mental-health apps have 3.3% 30-day retention, nobody has cracked it, voice helps friction but doesn’t solve compliance. The “multi-agent Life OS” label is becoming a regulatory liability — Stanford’s March 2026 study found AI chatbots flatter users 49% more than humans and endorse harmful behavior 47% of the time, Utah passed H.B. 452 in March 2025 specifically restricting AI mental-health chatbot claims, the label that sounded innovative in 2024 is dangerous in 2026. Keep the architecture; drop the label. Apple Journal eliminated the floor — it’s free, bundled, AI-augmented via Apple Foundation Models. The strongest defensible wedge is long-horizon, multi-source context — the journal that knows you across years, not the journal that captures you today.

That last finding is the one I almost missed. It’s the wedge that survives once voice and AI become commodities.

Three Moats

Compete with Rosebud, lead with three things that are mine and not theirs. Zero-knowledge privacy first — not we keep your data safe but we literally cannot read it. Client-side encryption with user-derived keys. The server only ever has ciphertext. Whisper sees audio briefly to transcribe, doesn’t store. Anthropic sees transcript briefly to extract, doesn’t retain. This is provable. Rosebud can’t say it without rebuilding their AI features.

Build-in-public trust second. This post is part of the moat. Rosebud is a black box; users don’t know who’s building it. Tortoise is the opposite. Every session is content. Trust accrues to the visible founder; opacity loses. Bessemer-backed competitors earn trust through brand polish; I earn it by showing the work — including the days I almost built the wrong product.

Long-horizon design third. Anti-streak. Anti-daily-prompt. Pro-pattern-recognition over years. The Mirror reads back what you’ve been saying across thirty, ninety, three hundred sixty-five days. The product gets better the longer you use it. I’m already dogfooding for 24 months; that is the proof.

The Playbook

Twenty forge sessions plus two pipeline-improve sessions plus today’s planning equals 23 sessions to public launch. Six to eight weeks at current cadence. Three milestones. MVP A — six forges, three weeks — cloud-deployed, multi-user-shaped, only I use it; daily practice resumes; architecture proven before users put pressure on it. MVP B — six forges, three weeks — client-side encryption ships; alpha testers from the Tortoise audience get invites; the journal that can’t read you becomes a real, verifiable claim. MVP C — eight forges, three weeks — Mirror feature, build-in-public hook, marketing landing page, payments. Compete with Rosebud directly.

Five sessions are FULL HIGH (security-critical: data model, auth, vault writer, the two encryption sessions). Ten are FULL MEDIUM. Five are LITE. If a FULL HIGH audit goes three rounds — like T15-1 did at session close last week — that’s an extra session. If iOS Safari fights me on microphone permissions, and Apple is famously hostile to web microphone APIs, that’s another. Realistic range is 22 to 35 sessions. The 6-month launch target has substantial buffer.

The full playbook is in the workhorse repo at docs/portable-workhorse/01-forge-playbook.md. Every task is named, classified, dependency-tracked, and audited. This is the structural lesson Forge taught me over fifteen sessions — build the playbook before you build the thing. The playbook is the engineering work the way the score is the music. You don’t read it once and discard it; you play from it.

Two Lessons

Planning is the work. Most founders skip this. They confuse motion for progress, conflate code shipped with value created. Today’s session shipped zero code and ten thousand words of planning. The codebase is unchanged. The trajectory is completely different. In four hours I avoided what would have been four months of building toward the wrong destination — and the only way I caught it was to stop, source-check the assumptions, run the research, and let the evidence tell me where the moat actually lives. The session that doesn’t ship code is not a session that didn’t ship.

Eat your own dog food during the build. The S15 carry-forward — begin daily voice journal practice — wasn’t replaced today; it was embedded into the new plan. Through every session of the next eight weeks, I dogfood the system I’m building. Without it, I ship a daily-practice product without daily-practicing. The only way to find the bugs that real-use exposes is to be the user during the build. Compliance is still the bet — for me, in real time, before any other user signs up.

Tomorrow — S17 — I close this planning session formally and start T-A-1: cloud platform setup. Railway, Vercel, Supabase. The first real session of the new arc. The playbook is committed. The PRD outline is next. ADRs are next. The build-in-public log starts here.

The journal is being built. Slowly. Deliberately. In public.