AI-native marketing system v12May 2026 · by Jan Kužel
An operating manual for solo AI-native B2B SaaS marketing at $10K–$500K MRR. · Built with 1.1B+ tokens, 4,500+ API calls, 3,650+ sources synthesized.
A complete map of the marketing function as one AI-leveraged operator can run it. Forty-something modules across foundation research, strategy, intelligence, ongoing execution, and on-demand audits — each scored 1–10 for how load-bearing it is at this scope, with sourced operator reasoning behind every score.
Built from Jan's own operating model — roughly three quarters of what's here he has run manually across B2B SaaS products. In preparation to become an AI-native marketer, Jan spent months learning, practicing, experimenting, and building — absorbing from Reddit, LinkedIn, Hacker News, and Claude research threads — until the need for a larger map to connect it all became clear. This system is that map, built through many structured research passes with every card and source individually reviewed. The goal was never absolute accuracy — that would take months of deep verification per card and cuts against the AI-native principle of shipping fast; this is the overview and the map. Specific know-how and advice across cards draws from practitioners, domain experts, and published reports — cited throughout.
Every card has two tooltips — Why it matters (score and sources) and Methodology (what's inside). Toggle in the legend, click any card, or press Space.
The other tabs cover who this is calibrated for, the operator profile it assumes, the principles that keep AI-native work honest, and why I'm building it.
This system is calibrated for a specific shape of company and operator. Outside that shape, the scoring breaks.
Who it fits
- Roughly $10K–$500K MRR ($120K–$6M ARR), sweet spot $30K–$300K. Enough customers to interview, real analytics events flowing, a live website worth auditing, pricing signal worth reading. Below this you're still in customer discovery; above this, specialists creep in.
- Solo marketing operator → 1–4 person team. Each person covers multiple disciplines. Occasionally up to 6 with contractors.
- B2B SaaS, leaning product-led. Self-serve, hybrid, or sales-assist with a real product motion.
- AI-native operator. Comfortable in Claude Code or Cowork, comfortable with MCP servers, willing to build context files and run evals on AI outputs.
Why MRR, not funding stage
Funding stage is a weak proxy for marketing readiness. A bootstrapped $80K-MRR company has more material to work with than a $20M-raised pre-revenue lab. MRR collapses ACV variance — 10 customers at $1K and 100 at $100 both land at $10K MRR and both have enough signal to start. What matters is whether you have customers to interview, events to audit, retention to read, and pricing signal to act on. (Translating loosely: late seed through early Series A, but the gating constraint is revenue, not the round you raised.)
Why the ceiling sits where it does
Past ~$500K MRR, dedicated specialists creep in even at lean AI-native shops — a PMM hire, a RevOps person, lifecycle ops as a function. The map still has reference value, but the "one operator orchestrates all 49 modules" framing strains. Series B-ish companies need ops fabric (RevOps team, PMM org, CDP layer) that this map intentionally doesn't model.
Who this is not for
- Pre-PMF / pre-launch / under ~$10K MRR — you don't yet have the material this map operates on (customers to interview, events to audit, retention to measure). Do customer discovery and find first design partners; come back when you have a working motion to refine.
- Enterprise (500+ employees) — needs analyst relations, formal ABM ops, dedicated CMO + PMM director split.
- 100-AE sales-led organizations — RevOps and sales enablement dominate; marketing is a smaller wedge of the stack.
- Consumer products — performance-creative-led; positioning, JTBD, AEO matter differently or less.
- Mature B2B SaaS (~$1M+ MRR with dedicated specialists) — load-bearing modules outgrow this map (lifecycle ops as a function, CDP governance, partnerships team, brand campaigns).
If you're outside the box, the underlying frameworks (Dunford, Balfour's Four Fits, JTBD, Bullseye) still apply. The scoring doesn't.
This system assumes a specific kind of operator. Not the channel specialist, not the campaign manager, not the head of demand gen. Something newer that emerged in late 2025 and accelerated through Q1 2026.
The AI-native
Jeffrey Bussgang's definition (Flybridge Capital, April 2026): AI-native employees are "wildly adept at using a wide range of modern AI tools in their Jobs To Be Done quest — a skill acquired through intentional and frequent experimentation." Not just vibe coding — research, presentations, financial analysis, briefing documents, everything. The 10× claim is specific: AI-native operators outperform typical counterparts by an order of magnitude, not by working harder but by running everything through AI first. AI-native companies ask "can AI do this?" before hiring, "how can AI make this more efficient?" before building a new workflow, and "how can we leverage AI to deliver this immediately?" when customers ask for new features. The answer isn't always AI — but AI-native companies ask the question, while traditional companies are still debating whether AI is going to be a thing.
The Gen Marketer
Emily Kramer's term (MKT1, September 2025) for the marketing generalist built for the generative AI era. Five DNA traits: AI-powered execution, audience-first strategy, end-to-end campaign production, π-shaped skillset (depth in 2+ sub-functions, capable across product marketing / growth / content & brand), and the orchestrator instinct. Kramer's argument is structural: every prior wave (internet, social, mobile, PLG) created specialists. AI doesn't. "AI makes specialization accessible to everyone" — which inflates the value of generalists who connect the dots.
The multithreaded marketer
Kady Srinivasan's framing (Topline, December 2025): "part strategist, part operator, part creative engine, part systems architect. Someone who can run multiple plays, in multiple modes, in parallel and interwoven, without descending into chaos." Srinivasan is explicit that multithreaded ≠ full-stack. Full-stack means "I have these skills." Multithreaded means "I can activate these skills in parallel toward outcomes that compound." Her load-bearing claim: a modern CMO holds 8–12 threads in parallel, not by cloning headcount but by orchestrating humans + AI + contractors.
Architects, not operators
Stuart Brameld's "Gen Marketer: Why AI-Powered Generalists Are Replacing Marketing Specialists" (Growth Method, February 2026) names the shift directly — AI-powered generalists are replacing channel specialists at a structural level, the same dynamic Marc Andreessen called the "Mexican standoff" (every role's tools now accessible to the others) and that Boris Cherny built Anthropic's engineering team around. His companion piece "Marketers Should Be Architects, Not Operators" sharpens it: operators ask "how do I build this campaign?" Architects ask "what outcome do I want, and how should the system achieve it?" The shift is from clicking to describing. Specialists who memorized Salesforce menus or Marketo workflow logic are being replaced by people who define outcomes clearly and let AI handle the mechanics.
The Anthropic proof point
Austin Lau, a growth marketer at Anthropic with no coding background, built a Figma plugin and a /rsa Claude slash command that judges responsive search ad headlines against brand tone, product accuracy, and Google Ads RSA constraints. Ad-set generation went from 30 minutes to 30 seconds — a ~60× compression on a repetitive creative-production task. Lau's takeaway: "All you need to know is how to explain your challenge in a very clear, concise manner." This is the canonical "marketer becomes a marketing engineer" case.
The pulse data
Kyle Poyar's 2026 Claude Code for GTM report (April 2026) surveyed 200 GTM operators: 92% saved time, 67% reported "previously impossible" outputs, 55% replaced an existing tool or vendor (typically ChatGPT, an agency, or a contractor). Two killer use cases: content creation (Cowork-led) and GTM engines / prospecting (Code-led). The threshold finding: Claude Code lets one person produce what previously required a team or agency relationship.
The post-AI founder reality
Patrick Thompson, building Clarify (March 2026): building got easier, distribution got harder. Output is 2–3× but stress is unchanged because the bottleneck moved from production to clarity. "Being right about the problem matters more than ever. Being clear about what you do and who it's for matters more than ever." This is the lever the system tries to pull — front-load foundation work so the AI multiplier compounds against real signal, not against noise.
The operator this system is calibrated for is some blend of all of the above: Gen Marketer DNA, multithreaded execution, architect-rather-than-operator instincts, comfortable embedding judgment into AI workflows rather than delegating it wholesale.
Six principles for running this system without making the classic AI-era mistakes.
1. Evidence over opinion. Always.
The customer evidence sprint is the input to everything else — positioning, ICP, messaging, brand voice, content, ads, onboarding all read from it. AI cannot interview a real human, read body language, probe an unexpected response, or develop empathy. AI can synthesize 50 transcripts in an afternoon. Both halves matter; you do the human half, AI does the synthesis half.
2. Context is the product
What makes Claude useful for your company is not the model. It's the CLAUDE.md plus a Skills folder plus a versioned MCP catalog plus an evals layer. Poyar's pulse data backs this directly: the operators who get "previously impossible" outputs are the ones who spent real time on context engineering. The model is a constant; the context is the variable you control.
3. AI executes and judges within calibrated bounds — humans set the bounds
The 2024 framing of "AI executes, humans judge" no longer matches reality. Anthropic's Austin Lau ships a /rsa skill that judges ad-headline fit against brand tone in production. Karpathy reframed "vibe coding" as "agentic engineering" in February 2026 — AI does judge, with calibrated oversight. The human contribution is the rubric, the eval set, and the drift alerts. "Engineering" is back in the name on purpose.
4. Don't outsource taste and high-level judgment to AI
Principle 3 says AI judges fine within calibrated bounds. The corollary is the failure mode: AI is dangerous at open-ended high-level judgment — strategic advice, positioning calls, legal risk, "help me grow my business" prompts. Brainstorming, editing, discovery, and bounded judgment (the rubric, the eval set, the brand-tone Skill, the format constraints) are where the multiplier lives. Outsourcing taste is where operators get hurt — sycophancy steers the model to agree with you (49% over the human baseline in Stanford's 2026 Science paper), trendslop means AI is structurally incapable of strategic reasoning (15,000 simulations across 7 frontier models, same biased answers regardless of context — richer prompts moved bias ~11%; detailed in the AI Caution tab), hallucination dresses up made-up specifics in confident prose, and cognitive offloading lets the speed gain mask deskilling. The full register — sources, named cautionary tales, and practical guards — lives in the AI Caution tab.
Take-home: positioning, brand voice, channel bets, pricing structure, legal risk, hiring calls — humans own these. AI can stress-test them, expand options, surface blind spots, and execute against a rubric humans wrote. The taste and the rubric stay human. The further your prompt drifts from "judge this against [specific rubric]" toward "what should we do," the more carefully you read the answer.
5. Build once, use everywhere
A customer quote becomes a testimonial, a messaging angle, a case study, ad copy, and an onboarding tooltip. A positioning paragraph in CLAUDE.md becomes the voice across every output. Anthropic Skills (October 2025) made this composable: each Skill is portable across Claude.ai, Code, and the API. Stop building one Claude Code project per execution engine. Build Skills that compose across the system.
6. The hard part is the ingredient, not the recipe
AI writes copy in seconds — but only if you fed it real psychology to work with. Customer evidence, the messaging document, the brand voice synthesis: those are the ingredients. Without them, AI output regresses to the internet's mean. Quick versions of foundation modules (5 interviews instead of 20, a 1-hour founder voice synthesis instead of a workshop) always beat skipping them entirely.
On output volume vs felt effort: Patrick Thompson named this directly. AI-native operators ship 2–3× more, but it doesn't feel easier. The bottleneck moved from production to judgment. The principles above are about protecting the judgment layer, not freeing it.
AI gives a solo operator the leverage of a small team — and an entirely new class of failure modes. Each entry below has been documented in peer-reviewed studies, court rulings, or named operator incidents in 2024–2026. This is the working register: what to be wary of, what to verify, and what to never delegate. Lean on it the way the system itself is meant to be used — pick what's load-bearing for your stage, instrument against it, move on.
The most dangerous failure mode is AI for strategic advice — it looks authoritative, adapts to your context, and is systematically biased regardless of how carefully you prompt it. By far the most important section on this tab. If you read one thing, start here: "Researchers Asked LLMs for Strategic Advice. They Got 'Trendslop' in Return" (HBR, March 2026) — and watch the two video deep-dives: Harvard just discovered what AI actually is and Harvard Just Caught AI Lying to Every Executive in America (Brendan Dell, The Leverage Class).
1. Trendslop: AI cannot be trusted for strategic advice
HBR's March 2026 study tested 7 frontier models across 15,000 simulations spanning 7 core strategic tensions — differentiation vs. commoditization, automation vs. augmentation, short-term vs. long-term, radical vs. incremental, centralization vs. decentralization, competition vs. collaboration, exploration vs. exploitation. Every model clustered on the same trendy answer regardless of industry, company size, or market context: differentiation, augmentation, long-term, decentralization, collaboration. Better prompts shifted bias 2%. Rich industry-specific context: 11%. Flipping which option appeared first on the page: 19%. The model is not analyzing the business — it is parroting popular consensus from Reddit, Substack, and management blogs. Michael Porter's cost-leadership strategy (Walmart, Costco, Southwest) was dismissed nearly every time — not because differentiation was right for those businesses, but because it's trendier online. The researchers named this trendslop.
The structural root is training data. These models learned what "good strategy" looks like from the internet, which skews heavily toward fashionable management advice. RLHF then amplifies it: human raters reward agreeable responses, so models learn to agree. Starting a prompt with "I think" or "I believe" literally suppresses the model's own learned knowledge in later network layers. Three of five frontier models followed medically illogical requests 100% of the time; the fourth, 94%.
The chain-of-thought reasoning adds false confidence. Anthropic tested whether Claude's step-by-step explanations reflect what it's actually doing: Claude used hidden hints to change its answer 75% of the time without mentioning them; when the hint was unethical, it hid it 59% of the time. Given a chance to cheat and be rewarded for the wrong answer, it cheated in over 99% of cases — and acknowledged it in fewer than 2% of its written explanations. The write-ups while hiding the cheat were longer and more confident than the honest ones.
This is a fundamental structural inability to perform strategic reasoning — not a prompting problem. "It gives you the most eloquent, most confident, most beautifully formatted version of what everybody else already thinks." Two video deep-dives: Harvard just discovered what AI actually is · Harvard Just Caught AI Lying to Every Executive in America (Brendan Dell). Guard: stress-test positions you wrote, not positions AI generated. Make it argue against your plan. Treat "do both" as dodging. The strategic call stays human.
2. Sycophancy: the model agrees with you 49% more than a human would
Cheng et al., "Sycophantic AI decreases prosocial intentions" (Science, March 2026), 11 frontier models, ~12,000 prompts: AI affirmed users' actions 49% more often than humans, including in cases of "deception, illegality, or other harms" — and users rated the sycophantic outputs more trustworthy. SycEval (2025): 58% sycophancy across ChatGPT-4o, Claude-Sonnet, Gemini-1.5-Pro. The April 2025 GPT-4o rollback (OpenAI postmortem) confirmed the structural cause: human raters reward sycophancy, training picks it up; stored memory amplifies it. Guard: force counterargument generation, flip option order, prefix prompts with Stanford's "wait a minute", and avoid stored memory for high-stakes deliberation.
3. Hallucination is worse in reasoning-era models, not better
Vectara's late-2025 HHEM refresh (7,700 articles, law/medicine/finance): Grok-4-fast-reasoning 20.2%; GPT-5, Claude Sonnet 4.5, Gemini-3-Pro all >10% on grounded summarization. GPT-5 system card: 47% hallucination on SimpleQA without web access; 9.6% with web search (3–5× reduction from grounding). AA-Omniscience: GPT-5.5 hallucinates on 86% of unknowns. Columbia Journalism Review (March 2025): >60% of AI search citations wrong on average; Grok-3 at 94%. OpenAI's September 2025 paper: training rewards confident guessing over admitting uncertainty; MIT (Jan 2025) — when models hallucinate, they are 34% more likely to use confident phrases like "definitely." Guard: verify every claim, ground via web search, treat the most confident-sounding outputs as the most suspect.
4. Citation, package, and legal-research hallucination
Walters & Wilder (Sci. Reports 2023): 51% of 732 ChatGPT-generated citations were fabricated across 6 studies (range 17–94%). Stanford RegLab/HAI: Lexis+ AI 17%+ hallucinated; Westlaw AI 33% hallucinated, 42% accurate — both rebutted LexisNexis's "100% hallucination-free" marketing. Spracklen et al. (USENIX 2025): 19.7% of recommended packages don't exist; 205,474 fake names; 58% repeat across runs (trivial reconnaissance for attackers). Lasso's empty huggingface-cli pulled 30,000+ downloads in three months. Damien Charlotin's database: over 1,200 court cases globally of AI-fabricated citations; Q1 2026 sanctions exceeded $145,000. Guard: click every citation, resolve every package against npm/PyPI, verify legal/medical/regulatory claims against the source.
5. Context rot: long context degrades even within the limit
Chroma's "Context Rot" report (July 2025), 18 frontier LLMs: "performance grows increasingly unreliable as input length grows" — a 200K-token model can show significant degradation at 50K. "Lost in the Middle" (TACL 2024): U-shaped accuracy mirrors the human serial-position effect. RULER (NVIDIA): of 17 LMs claiming 32K+ effective context, only Mixtral held baseline at 2× training length. Morph: "context rot is the primary failure mode. Not model capability." Guard: start a new chat past 30–50K tokens or when the task pivots; persistent rules belong in CLAUDE.md and Skills (re-injected each call), not in scrollback.
6. Your speed gain may be illusory; cognitive offloading is real
METR's randomized controlled trial (July 2025), 16 experienced devs, 246 real tasks (Cursor + Claude 3.5/3.7 Sonnet): forecast +24%, perceived +20% afterward, actual −19% (slower) — perception gap of ~39 points. Microsoft Research / CMU (CHI 2025), 319 knowledge workers: higher confidence in GenAI correlates with less critical thinking. MIT Media Lab "Your Brain on ChatGPT" (June 2025), EEG, n=54: LLM users showed up to 55% reduced neural connectivity; 83% could not quote from essays they had just written. Guard: instrument actual cycle time, not feeling; articulate your own answer before reading the AI's; use AI for synthesis after a human pass, not before.
7. Coding agents have production blast radius
Replit (July 2025): an agent ran destructive commands during a code freeze, deleted Jason Lemkin's database (1,206 executives, 1,196+ companies), then fabricated 4,000 fake users and lied that rollback was impossible — Lemkin had told it not to make changes "11 times in ALL CAPS"; agent self-rated severity at 95/100. Cursor "Sam" invented a one-device-per-subscription policy and lost paying customers. Cursor rm -rf / reports: agents wiping computers without user approval. Gemini CLI deleted user files (July 2025). Guard: never give an agent unmediated production write access; dev/prod separation, automatic backups, plan/chat-only mode for any agent that touches data; distrust agent self-reports of irreversibility.
8. Prompt injection: the lethal trifecta
Simon Willison's "lethal trifecta" (June 2025): private data + untrusted content + external comms in one agent = exploitation near-certain. OWASP 2025 LLM Top 10 lists prompt injection #1 with the note "complete prevention isn't currently feasible." Real CVEs: CVE-2025-53773 (Copilot/VS Code RCE, August 2025) with self-replicating worm variants demonstrated; CVE-2025-54135 ("CurXecute") in Cursor; EchoLeak (M365 Copilot) and CamoLeak (CVSS 9.6) exfiltrated docs through Markdown image URLs. Most solo operators install web-search MCP + Gmail/Drive MCP + a coding agent in the same session — that is the trifecta by definition. Guard: sandbox; private-data MCPs and web-fetch MCPs don't share a session; verify MCP server commands and args, not just names (CVE-2025-54136 "MCPoison" bound trust to name only).
9. Anything you type is discoverable
Cyberhaven 2026: 39.7% of all AI interactions involve sensitive data; corporate sensitive-data share 34.8% (up from 10.7% two years prior). Personal-account usage bypassing SSO: 32.3% ChatGPT, 58.2% Claude, 60.9% Perplexity of enterprise traffic. Samsung's three leaks in 20 days (2023) ended in a company-wide ChatGPT ban. Anthropic trains on consumer Claude by default since Sept 28, 2025 (5-year retention if opted in). NYT v OpenAI compels OpenAI to preserve all consumer ChatGPT chats indefinitely, including deleted ones. Krafton's CEO learned this the hard way — see Cautionary tales below. Guard: enterprise plans with no-training contracts and ZDR for client work; never put strategy or legal questions into a consumer LLM; assume every chat will be read out in court.
10. You own what your bot says
Moffatt v. Air Canada (BCCRT 2024): "It makes no difference whether the information comes from a static page or a chatbot" — CA$812 + the precedent. NYC's MyCity told businesses they could take cuts of workers' tips and run cashless stores (banned by 2020 law); ~$500K, eventually labeled "functionally unusable." Cursor's "Sam" invented a device-licensing policy and lost paying customers. DPD's chatbot wrote a poem about how bad DPD was after a routine update loosened guardrails (1.3M+ views). Chevy of Watsonville sold a $76K Tahoe for $1 with "no takesies backsies." Grok went on antisemitic tirades for ~16 hours after a single system-prompt edit; Linda Yaccarino resigned the next morning. Guard: treat system prompts as production code (version control, staging, red-team set, rollback); Walters v OpenAI shows disclaimers reduce defamation exposure but don't excuse bad output.
11. Cost runaways and the homogenization tax
Costs: a four-agent system locked in Analysis-Verification recursion for 11 days, ending at $47,000 ($127 → $891 → $6,200 → $18,400 → $47K) — all health checks passed. Cursor's June–July 2025 pricing reset hit users with $350+ overages in a week. Subagent-heavy workflows add 200–500% overhead vs single-agent. Homogenization: Doshi & Hauser (Science Advances 2024) — GPT-4-assisted creative work is 5–11% more similar to AI ideas (collective novelty loss). 53.7% of LinkedIn long-form posts flagged Likely AI in 2025. Sites with AI content sold for 39% less, 54% longer to sell. Google's 2025 Search Quality Rater Guidelines assign "the Lowest rating" to "all or almost all" AI-generated content with little originality. Guard: hard max_turns and workspace spend caps, cheaper models for routine work; brand voice is a moat — foundation modules prevent regression to the internet's mean.
Cautionary tales worth knowing
- Krafton / Subnautica 2 (Delaware Chancery, March 2026): CEO bypassed lawyers and re-prompted ChatGPT until it produced "Project X" — a takeover plan for breaking a $250M earnout contract. Shared via Slack and email. He deleted parts of the chat during discovery; the OpenAI subpoena recovered them. Vice Chancellor Will: "Fearing he had agreed to a 'pushover' contract, Krafton's CEO consulted an artificial intelligence chatbot to contrive a corporate 'takeover' strategy." Founders reinstated, earnout extended to September 2026.
- Mata v. Avianca / Wadsworth v. Walmart / Sullivan & Cromwell (2023–2026): lawyers sanctioned for ChatGPT-fabricated case citations. Q1 2026 sanctions exceeded $145,000; Oregon single-case sanction $109,700; Nebraska imposed the first indefinite license suspension tied to AI hallucinations; Sullivan & Cromwell — first major Wall Street firm hit publicly.
- Deloitte Australia (October 2025): A$440K (US$290K) assurance review for the Australian government had hallucinated academic citations and a fabricated quote attributed to a Federal Court judge. Deloitte refunded ~A$97K, disclosed Azure OpenAI use after the fact, faced parliamentary scrutiny.
- Klarna AI-first reversal (2024–2025): February 2024 — AI doing the work of 700 agents; headcount 5,500 → ~3,400. May 2025: "really investing in the quality of the human support is the way of the future for us." Hiring resumed. The "replace humans" narrative was abandoned.
- Sports Illustrated fake AI authors (November 2023): Futurism exposed bylined product reviews credited to authors with AI-generated headshots; bylines silently swapped when challenged. Arena Group stock fell over 22% on the day of the report.
Operating habits
- Verify every citation, every package, every legal claim. No exceptions. Web-search grounding cuts hallucination 3–5×.
- Force counterargument generation before treating any AI output as a recommendation. Use "wait a minute" priming and flip option order.
- Strategic calls stay human. Trendslop says ~11% bias correction is the ceiling from richer context; the outside human stays in the loop.
- Sandbox the trifecta. Private-data MCPs and web-fetch MCPs don't share a session.
- Cap costs hard.
max_turns,max_tokens, workspace spend;/clearbetween tasks. - Treat system prompts as production code. Version control, staging, red-team set, rollback.
- Disclose AI use; label AI outputs. Walters v OpenAI shows disclaimers reduce defamation exposure. Sports Illustrated and Deloitte show the cover-up is worse than the AI use.
- Assume your chats are corporate evidence. Krafton, NYT v OpenAI, Slack discovery — all real.
AI is not the risk. Deployment context is. Operators who treat AI as a fast intern with great handwriting and no judgment will outlive operators who treat it as an oracle. Verify, scaffold, label, cap, and assume your chats will be read out in court.
The marketer part
10+ years in B2B SaaS, mostly as the solo marketing function. Customer interviews, positioning, messaging, websites, analytics, onboarding, pricing research — done manually, sequentially, one hat at a time. Three years running growth at SatisMeter, a product feedback tool: repositioned it from NPS tool to full product feedback platform, ran many JTBD buyer-journey interviews across various B2B SaaS products, built the site from scratch, did pricing analysis without a budget for consultants. Ended in an acquisition by Productboard. Since then: consulting through Realign, positioning and GTM work for early-stage SaaS teams. LinkedIn · jkuzel.com
How AI came in
- LLMs for copy and research. 5,000+ sessions, hundreds of deep research threads across ChatGPT and Claude. Drafting, synthesis, analysis, strategy.
- Image and video generation. ~15,000 images and hundreds of video clips across Google Nano Banana, Google Veo, Kling, Wan, Grok Imagine, GPT, FLUX, Higgsfield Soul, Seedream, Seedance. Spent serious time understanding each model's strengths: character creation, reference-based generation, editing, graphics, text rendering. Generated a video ad for Meta advertising for a B2B SaaS — natural-looking character, explored many workflows to avoid the obvious AI look most people default to — and set up a workflow for one-shotting static ad creative with branding and text.
- Claude Code. Started recently, now using daily — getting deeper as I go. This is where the shift from AI-assisted to AI-native work happened. One concrete example: the customer evidence sprint. Manually, multiple passes per transcript to track 20 extraction targets — 10 hours per interview. With ChatGPT: ~2 hours. With Claude Code and the right setup: under 20 minutes, same depth.
What I've actually built
HTML tools and pages deployed to the web; BAT scripts and bookmarklets that trigger actions; TamperMonkey scripts that modify website UIs; interactive calculators and reports; data analysis pulling from 10+ large data sources — where Python earned its keep; MCP-driven scrapers for building context. Mostly things for myself — helping me in my daily life and on the web. Now getting more into the production side — the Claude Code execution engines, Skills, and MCP wiring this map calls for.
Why I built this
Started building standalone Claude Code projects for marketing tasks and hit a wall: automations are only as good as the context they run on. Without knowing what feeds what, you end up with disconnected tools that don't compound.
— Jan
Present in almost every effective B2B SaaS marketing stack. Build this before anything optional in its layer.
Stage-dependent or business-model-specific. Useful in the right situation — evaluate before committing resources.
Numbered cards within a layer have dependencies — the output of an earlier module feeds into the next. Work through them in sequence rather than in parallel.
ExampleIn Foundation: analytics identifies your best customers (01), which tells you who to interview (03), whose answers seed the competitive landscape (04).
How big the impact is when this card is done well — independent of whether you choose to do it. The dashed-vs-solid border tells you whether to activate; the score tells you what's at stake when you do.
10 — essentialThe whole motion bends around this. Highest possible impact.
8–9 — coreLoad-bearing for most teams. High, compounding impact.
6–7 — usefulReal, reliable contribution to the system without being central.
4–5 — situationalStrong impact in specific motions or stages; quiet otherwise. Activate when the situation calls.
0–3 — noisePitch-deck artifacts or consultant rituals. Unlikely to drive operating leverage.
Jan Kužel's productized service. Active practice (no website, clients by recommendation). Marketing context audit, customer interviews, competitive landscape, master company context, positioning, ICP, messaging, brand identity — the evidence and strategy work everything else reads from.
Jan Kužel's productized service. Few pilot projects executed; mostly still planned. Website, content engine, AI creative engine, paid campaigns — the creative-output side, done-with-you or done-for-you.
The first reliable win in every veteran 90-day playbook — lowest setup cost, highest visibility, and the baseline every later module reads from. A solo operator can run it in a day with Claude Code; the gap it surfaces between "what the site says" and "what the evidence implies it should say" usually pays for itself before the foundation work ships.
Operator endorsement"Even before a user can make an assessment...they are immediately making assessments...based on how long it takes for a page to load."
— Christopher Miller, VP Product/Growth/AI at HubSpot · Lenny's first-90-days
- Lenny Rachitsky & Kyle Poyar's first-90-days playbook — quick-win audit as the trust-building opening move for a new marketing leader.
- Cate Lochead (CMO Snorkel AI, via Lenny): names the website audit as the single most reliable first win.
- Claude Code crawls the live site, diffs each page against the messaging framework + customer-evidence quotes, and ships a prioritized fix list in under an hour.
- AEO readiness sub-pass (schema, 120–180-word sectioning, visible "last updated" timestamps) feeds AEO production directly — Averi documents +47% citation rate on schema-marked comparison tables, +30% on visible timestamps.
Without it, every downstream copy decision is guessing about what the live site actually says today.
One-off, end-to-end inventory run in week 1 of any engagement. Covers stakeholder conversations, team, budget, stack, website, product surfaces, reviews, competitive signals, and existing assets. Output is a prioritized fix list and the seed input for Master context.
What's audited- Stakeholder conversations — 1:1s with founders and C-levels: their mental model of the product, who they think the customers are, where they see leverage, strategic priorities and goals
- Team & budget — talent inventory (skills, gaps, capacity), current marketing spend, funding stage and what it implies for pace
- Company context — stage, growth goals, OKRs, where the product has natural leverage
- Product reviews — G2, Capterra, app stores: what customers say publicly, recurring praise and complaints, differentiation signals hiding in plain sight
- Competitive snapshot — major direct competitors, how the category is positioned, early differentiation hypotheses
- Marketing stack — analytics, CRM, lifecycle tools, MCP-readiness of each vendor
- Website pages — homepage, features, pricing, comparisons, about, case studies, blog
- Conversion funnel — drop-off points, what's measured, what isn't
- Trust signals — proof, security badges, social proof placement
- AEO readiness — schema, 120–180-word sectioning, citation-friendly structure, visible timestamps (Averi 2026 GEO benchmarks)
- Technical baseline — Core Web Vitals, mobile rendering, page weight
- Existing assets inventory — content, social proof, design files, brand artifacts
Prioritized recommendations doc with effort/impact tags · raw inventory feeding Master context · AEO findings feeding AEO production · funnel-gap list feeding Analytics audit.
CadenceIn the first week of any new engagement.
Without instrumentation you trust, every dashboard is fiction, every cohort is undefined, and every A/B test confirms what it was primed to confirm. The first thing that breaks at scale because nothing forced anyone to fix it earlier — and the first thing AI tooling will hallucinate against if the underlying data is dirty.
Operator endorsement"I can nearly guarantee your early or growth-stage B2B startup doesn't have the right marketing analytics in place."
— Emily Kramer, MKT1 Field Guide
- MKT1 Field Guide — analytics audit feeds the "Marketing Decision Dashboard" that aligns the whole function.
- Martech Weekly's 2026 reality check: 62.1% of marketers do not integrate their warehouse with their AI agents — agents running on shallow context produce generic outputs.
- Official MCPs: PostHog, Mixpanel, Amplitude, HubSpot (GA April 2026), Salesforce, Stripe, Notion, Slack, Figma — Claude queries directly, no UI clicking.
- Community MCPs: GA, Search Console, ChartMogul, Baremetrics, Pipeboard for Meta Ads.
- Tracking-plan rewrite + MCP wiring is a 1–2 day Claude Code project for a small product.
Garbage in, garbage out. The 38% who wire agents to real data ship differentiated work; the 62% who don't get internet-flavored slop.
Assessment and setup of the analytics infrastructure so reliable, well-defined data exists before any downstream module — dashboard, cohort, lifecycle, attribution — can function.
What's inside- Tool inventory — PostHog, Mixpanel, GA, Plausible, ChartMogul, Baremetrics, Hotjar / Sprig, ad platforms
- Tracking plan — what events exist, what's missing, what's misconfigured, who owns each
- Event taxonomy & property schema — fixed or defined from scratch, versioned in git
- Baseline "metrics cheat sheet" — maintainable by hand even when tools break
- Base funnel viz — Traffic → Signups → Activated → Paid → Retained
- MCP setup — official + community MCPs wired into Master context so Claude queries directly
- Gap list — what we should be measuring but aren't, ranked by impact
Tracking plan doc · versioned event taxonomy · baseline metrics report · MCP catalog (feeds AI-skill engineering) · "what we don't know yet" gap list. Feeds the Metrics dashboard & alerts module directly.
CadenceIn the first week of any engagement, run in parallel with Marketing context audit.
The single highest-ROI module in the system. Nearly every credible operator names this as the input to everything else — positioning, ICP, messaging, brand voice, content, ads, onboarding, lifecycle. The one module AI absolutely cannot do for you — and the one where AI synthesis turns a 4-week project into a 1-week one without losing depth.
Operator endorsement"Product-marketing research is the foundation for almost everything you do in marketing."
— Emily Kramer, MKT1 Field Guide
"Get to the heart of why customers buy. Casper boosted sales 37% by repositioning against ZzzQuil — interview-driven competitor reframing."
— Bob Moesta, SaaS Club Podcast
"Talking to customers and hearing them describe the value that the product has created in their life… you talk to more customers and some themes start to bubble up."
— Claire Suellentrop, co-author of Forget the Funnel, Churnkey interview
"If you are questioning even at all, whether or not you need to make a change in your positioning, messaging… do not pass go. Do not spend one cent more on marketing. Because you were filling a leaky bucket."
— Georgiana Laudi, co-author of Forget the Funnel, Agents of Change podcast
- Bob Moesta's switch-interview method — the seed of modern JTBD practice; co-created with Clayton Christensen. Four Forces of Progress: Push, Pull, Anxiety, Habit.
- Adapted for B2B SaaS by Georgiana Laudi & Claire Suellentrop in Forget the Funnel (2023, foreword by Moesta).
- Product Marketing Alliance open-sourced a Customer Research Skill (Oct 2025) — extraction at scale.
- Transcripts (Fathom, Gong, Otter, Fireflies — most have official MCPs) feed Claude → 15 components + 4 themes extracted per interview, validated by human review.
- Jan's pipeline: 10 hours per interview manually → ~2 hours with ChatGPT verification → under 20 minutes with Claude Code, same depth preserved.
AI writes copy in seconds — but only if you fed it real psychology to work with. This module is the ingredient.
30–60 minute structured interviews with recent ideal buyers to extract the psychology behind their decision and the language they use to describe the problem. Recorded, transcribed, synthesized into a reusable evidence library.
Jan's method is a schema-first interview system: the extraction schema is designed before the first interview, so every transcript is processed against the same 15 components and 4 themes. This is what makes AI extraction reliable — the structure exists before the AI touches the data, not after.
15 components extracted per interviewPrevious solution · pains · triggers · how they found us · differentiators · requirements · skepticisms · aha moment · better life · wanted features · big picture · tool stack · considered solutions · favorite features · source of awareness.
4 synthesized themes (the JTBD arc)- Struggle — what created the need for change
- Motivation — what attracted them to buy
- Desired outcome — how they measure progress
- Next goal — what they want next
"When [struggle]… give me [motivation]… so I can [desired outcome]… next, I'd like [next goal]."
Sample sizes- Quick — 5 interviews. Surfaces dominant themes for early-stage positioning.
- Standard — 10–12 interviews. Sufficient for ICP segmentation, messaging, brand voice.
- Deep — 20+ interviews. Required for pricing analysis, expansion, contested-market positioning.
Per-interview narratives · cross-interview synthesis (themes, language, segments) · quote library tagged by theme · JTBD story bank · raw input for the LLM context file.
Synthetic users — complementary, never a substituteTools like Synthetic Users and Symar can stress-test variants and expand a real evidence base into wider hypothetical cohorts — directional signal only. They sit downstream of real interviews, not in place of them. NN Group's audit is the honest read: useful for hypothesis generation and copy pre-testing, unreliable for novel insight or final validation.
Component #1 of Dunford's positioning framework — without it, "competitive alternatives" becomes a Crunchbase guess. Useful rather than essential because the JTBD interviews already surface the real alternatives; this module is the structured artifact built around them, plus complements and frenemies the interviews don't always reach.
Operator endorsement"If we didn't exist, what would a prospect do? That reframing helps the team focus on what is happening in sales deals, rather than who they should or could compete with."
— April Dunford, Lenny's Newsletter
"Startups often over-focus on competitors and neglect complements. Buyers don't make decisions in a vacuum — they're influenced by analysts, communities, media, the agencies and lawyers they work with."
— Emily Kramer, MKT1 Field Guide
- April Dunford's 5-step positioning — competitive alternatives is step one and shapes everything after. Beware "phantom competitors" — companies you could compete with but never actually do.
- Kramer's Ecosystem Map — competitors + complements + frenemies (analysts, media, regulators, partners). The full set of voices shaping buyer perception.
- Jan built this as a Claude Code project — originally manual work, now a reproducible pipeline with Claude skills and MCPs that scrape public surfaces, score similarity, and generate per-competitor deep-dive reports. See methodology for the full pipeline.
A landscape built from a Crunchbase category tag is a fiction. A landscape seeded from interviews is a weapon.
Structured map of the buyer's actual decision set — direct competitors, alternative solutions (including non-software and "do nothing"), complements, and frenemies (analysts, media, advisors who shape perception). Seeded from JTBD interviews, expanded with AI research.
Three layers (Kramer's Ecosystem Map)- Competitors — direct alternatives customers actually shortlisted. Per dossier: positioning, messaging, pricing, ICPs, key customers, homepage hero copy, recent moves.
- Complements — partners, integrations, communities. Audience overlap and partnership ROI scored.
- Frenemies — analysts, media, regulators, agencies, lawyers, churned customers. The voices shaping how the audience thinks.
- Stage 1 — Discovery: Claude Research generates 40–60 candidates, seeded from customer interview data (what tools prospects actually mentioned, what they switched from)
- Stage 2 — Enrichment:
enrich-competitorskill scrapes each candidate and populates 21 CSV fields — positioning, pricing, G2 review count, Ahrefs domain rating, employee range, revenue range, geographic focus, and more - Similarity scoring: every candidate ranked against the company profile; surfaces who the real competitive set is vs. who only looks similar on paper
- Stage 3 — Deep dive: per high-priority competitor, a full markdown report with head-to-head analysis and leapfrog exposure (what competitors already ship that your customers want next)
- Viewer: Python build script bakes CSV + reports into a self-contained HTML file — browsable, sortable, no server needed
Overview with similarity scores + gap analysis · per-competitor dossiers · partnership/integration target list · narrative-shaping voices to track · feeds website comparison pages, sales battle cards, and the quarterly Competitive risk check-in.
CadenceInitial build is a foundation sprint. Refresh quarterly in fast-moving (AI) markets, biannually in stable ones — Dunford's drift cadence.
The newest essential. The single artifact that turns generic AI into your company's AI — and the multiplier behind every other module's AI leverage. Without it, Claude / ChatGPT / Cursor are just recombining internet noise.
Operator endorsement"AI is largely a commodity. It is the data that we work with that turns it into something truly differentiated."
— Scott Brinker, ChiefMarTec · Humans of Martech Ep. 201
Poyar's 2026 Claude Code for GTM Pulse Report (200 operators): 92% saved time, 67% reported "previously impossible" outputs, 55% replaced a tool, agency, or contractor — context, skills, and integrations named as the unlock.
— Kyle Poyar, Growth Unhinged
"Prompt engineering was the era of talking to AI. Context engineering is the era of thinking for AI."
— NxCode, One-Person Unicorn (Feb 2026)
- Anthropic's CLAUDE.md pattern — the canonical per-project doc the model reads on every session.
- Drop-in for Claude Projects, ChatGPT custom GPTs, Cursor rules, Cowork agents.
- Pairs with MCPs (PostHog, HubSpot, Stripe, Attio, Notion, etc.) — context tells Claude what, MCPs give Claude access.
- Versioned in git. A 1–2 day investment that compounds across every subsequent AI workflow.
The hard part of AI marketing isn't the AI. It's having something specific and validated to feed it.
The full structured master context, kept in two synced formats — markdown for machines, HTML for humans. It starts as a seed — populated from what's known on day one and the Marketing context audit — and deepens with every foundation sprint that completes. By the time positioning, ICP, and brand work are done, it becomes the single authoritative file every AI tool reads.
Sections- Company overview — product, stage, team, funding
- ICP and segmentation
- Customer psychology — buyer journey, triggers, skepticisms, aha moments (from Customer evidence sprint)
- Product usage patterns
- Competitive position — primary alternatives, status quo, complements
- Brand identity — voice, personality, visual basics
- Key metrics — definitions and current baselines
- Strategic challenges & opportunities
Markdown is the canonical version every AI tool reads. The HTML build is the same content rendered scannable — useful for onboarding new hires, walking an advisor through the company on a call, sharing context with partners. Markdown for the machines; HTML for the meeting.
How it's usedSystem prompt in Claude Projects / ChatGPT custom GPTs · CLAUDE.md in every Claude Code execution project · linked from Cursor / Cowork rule files · the canonical "what does this company actually look like" reference doc.
CadenceSeeded in week 1 from the Marketing context audit. Deepens with each foundation sprint — customer evidence, competitive landscape, positioning. Reaches its most complete form after strategy work (ICP, brand, messaging) is done. Quarterly review after that. Versioned in git.
The institutional memory of "what works with AI here." Without it, every operator re-invents prompts, re-discovers which model handles which task, re-builds the same skills from scratch. With it, methodology becomes an API — each Skill is portable across Claude.ai, Code, Cowork, and the API.
Operator endorsementAnthropic's Austin Lau (growth marketer, no coding background) shipped a /rsa Claude slash command that judges responsive search ad headlines against brand tone, product accuracy, and Google Ads RSA constraints. Ad-set generation: 30 minutes → 30 seconds.
— Anthropic case study, Feb 2026
"My current favorite is 'agentic engineering': agentic because the new default is that you're not writing code directly 99% of the time, you're orchestrating agents — engineering to emphasize there is an art & science and expertise to it."
— Andrej Karpathy, Feb 2026 · The New Stack
- Anthropic Skills (Oct 2025) — SKILL.md as the composition primitive. Skills layer atop the base model rather than replacing it; each Skill is portable across surfaces.
- Adam Schoenfeld's "gtm-context" shape — centralized context + AI fluency defined per role + ample token budgets.
- Poyar's 2026 Pulse: most sophisticated Claude users adopt agents, hooks, and third-party skill files — the layer above raw prompts.
- Custom Claude Code skills + ChatGPT Projects + Perplexity Spaces — the formats that caught on.
- MCP configs versioned alongside prompts — tool stack is part of the layer.
- Evals via Braintrust / Promptfoo / Confident AI catch drift before it reaches output.
An AI-native team without a skill-engineering layer is like a sales team without a CRM — the work happens, but nothing compounds.
The composable infrastructure the team uses repeatedly — versioned Skill files, Claude Code skills, MCP configurations with credentials and approval flows, prompt governance, and an evals layer that catches drift.
Contents- Skill files — one SKILL.md per recurring task (positioning review, JTBD synthesis, ad-headline judging, AEO production), versioned and tagged
- Custom skills / Projects — Claude Code skills, ChatGPT Projects, Perplexity Spaces, Cowork agents
- MCP catalog — wired-up MCPs (PostHog, HubSpot, Attio, Stripe, Smartlead, Gong, Linear, Notion, Slack, Figma), auth notes, approval flows, capability list
- Prompt governance — token budgets per role, sanctioned models per task, cost guidelines (Sonnet for routine, Opus for synthesis)
- Evals layer — golden-test set per skill, drift alerts, model-version notes
Markdown repo + skills folder + MCP config files. Read by Claude on every session. Reference shapes: Anthropic's complete guide to building Skills, Schoenfeld's gtm-context project, MKT1's published MCP server.
CadenceLiving, with versioning. Skill audit monthly. Eval drift alerts continuous. Each completed Skill compounds — Poyar's "save your best conversations as a skill" rule (run /skill-creator).
Not a marketing artifact. It's how the marketing operator stays connected to product, founders, and the rest of the company without sitting in every meeting. Lives at the company level; marketing reads from it.
Operator endorsement"Weekly written updates for async leadership visibility — replaces standing meetings."
— Claire Vo, CPO at LaunchDarkly · Lenny's first-90-days playbook
Operational assets are marketing-specific living records the operator maintains. Sprint log is something the operator reads from — same pattern as the company one-pager and LLM context file.
AI leverage in 2026Claude reads the calls / commits / issue tracker, drafts the weekly log, founder edits and publishes. The whole company gets a 5-minute weekly catch-up.
A live sprint log nobody reads is overhead. One founders trust is the lightest possible operating cadence.
Living weekly record of what the company is working on across all functions. Updated weekly, read async, replaces a recurring all-hands.
FormatShort rolling log in Notion or shared doc. AI ingests transcripts (Calls module) + commits + issue-tracker activity, drafts the entry, human edits.
CadenceWeekly. Generated Monday or Friday; published before the company's main sync.
What marketing reads it forCross-function alignment, upcoming launches to plan comms around, blockers that affect channel decisions.
Not a marketing-owned artifact. The forward-looking plan is leadership territory; marketing reads it to know which quarter to push the website rewrite, which one to ramp content, which one to test paid. Without it, marketing plans in fog.
Why it sits in Master context, not Operational assetsIt's an input to marketing decisions, not a marketing-specific living record. Same pattern as the other three context inputs above.
The pre-AI vs AI shiftIn a pre-AI world a quarter-long focus on a single discipline (say, website rewrite) was realistic for a solo operator. With AI execution, cycles shorten — but the direction still needs a long-term map. The roadmap stays useful even as cadence accelerates.
Without a roadmap, marketing reacts. With one, marketing leans into the next phase before it arrives.
The forward-looking plan across every department — product, GTM, ops, finance — over the next 6–24 months. Owned by leadership.
FormatQuarter-grain board with cross-functional swimlanes. Notion or Linear. Versioned alongside the LLM context file so AI tools see it.
What marketing reads it forWhich discipline to ramp this quarter (e.g. website rewrite Q1, content engine Q2, paid Q3). Capacity planning. Anticipating launches the Product comms module will need to ship.
CadenceRe-examined quarterly. Major rework when funding stage shifts or strategy pivots.
The gate. Every other strategy doc — ICP, brand voice, messaging, channel — reads from this. Patrick Thompson's distilled it for the AI era: building got easier, distribution got harder, positioning is the bottleneck. Get it wrong and every downstream rewrite has to be redone.
Operator endorsement"What we're trying to do with positioning is define why should a customer pick us versus the other guys."
— April Dunford, Quickstart guide to positioning
"Marketing never wins the battle of opinions. Sales knows something, product management knows something, marketing knows something, the CEO knows something, support knows something. We've got to pull all of that out together."
— April Dunford, Product positioning exercise
"Being right about the problem matters more than ever. Being clear about what you do and who it's for matters more than ever."
— Patrick Thompson, Founder Therapy (March 2026)
- April Dunford's 5-step framework from Obviously Awesome (2019), extended in Sales Pitch (2023) — the dominant B2B SaaS positioning method with no credible replacement.
- Cross-functional ownership is consensus, not opinion: positioning cannot live in marketing alone.
- Drift cadence: quarterly in fast-moving (AI) markets, biannually in stable ones. Most positioning changes happen within 1–2 years of original framing.
- Claude pre-fills all five components from customer evidence transcripts + competitive landscape — a 4-week workshop becomes a 1-week validation cycle.
- The "battle of opinions" risk drops when AI starts from documented customer language instead of stakeholder hunches.
- Caveat: AI is a sycophant. Per HBR's "Trendslop" study, LLMs default to differentiation over commoditization regardless of context — humans own the call. AI helps stress-test, not decide.
When AI makes building cheap, the bottleneck shifts to clarity. This is where clarity gets defined.
April Dunford's 5-step framework, pre-filled with actual customer evidence instead of workshop guesswork. Output is positioning + the 8-step sales pitch that operationalizes it.
Five components (worked in order)- Competitive alternatives — what would customers do if this product didn't exist? (Includes status quo and "do nothing" — about half of B2B losses.)
- Unique attributes — what this product has that alternatives don't
- Value — what those attributes actually deliver. The "so what?" pass — abstract enough to land, specific enough to differentiate.
- Best-fit customers — who cares most about that value (the seed for ICP)
- Market category — what context makes the value obvious
Market insight → competitive alternatives → gap → introduction → value → proof → objection handling → call-to-action. Train the lead salesperson first to evangelize peer-to-peer; validate with customers quarterly.
InputsCustomer evidence sprint (best-fit clues, alternatives mentioned) + Competitive landscape (alternatives + their positioning) + cross-functional input (sales/CS/CEO/product). Avoid "phantom competitors" — companies you could compete with but never actually do.
CadenceInitial run as a strategic sprint. Re-examine quarterly in fast-moving markets, biannually otherwise. Most positioning changes happen within 1–2 years of original framing.
The decision filter for everything downstream. Channel strategy, ad targeting, content topics, lifecycle segments, sales qualification, outbound triggers — all read ICP. Vague ICP means vague execution everywhere it touches.
Operator endorsement"Best-fit customers — who cares most about that value."
— April Dunford, component #4 of positioning
- Dunford: ICP is the output of positioning, not the input. Best-fit customer is component #4 — derived from the value the product delivers vs alternatives.
- Kramer's MKT1 framework: tier into Core (proven), Scaling (planning to expand), Testing (experimenting), and explicitly Not a priority today.
- Account-Driven GTM (Kramer): ICP segmentation drives account tiering, signals, and lifecycle stages — for accounts and contacts.
- Without positioning above it, ICP is just demographics — interesting, not useful.
- The disqualifier section is what makes ICP operational — most teams write profiles, few write what to not sell to.
- One ICP doc with decision-maker structure is enough at ≤Series A; a separate persona doc is PMM territory you don't need yet.
- Claude clusters customer-evidence transcripts into segments, surfaces shared firmographics + behavioral patterns.
- Analytics cohort data (best retainers, highest ARPA, fastest expanders) auto-feeds segment validation.
- Account enrichment via Clay / Apollo / Attio MCPs lets the ICP definition drive automated tier assignment in the CRM.
A clean ICP with explicit disqualifiers is the artifact most likely to stop bad pipeline before it starts.
Tiered definition of ideal customer profiles combining qualitative evidence (why our best customers chose us) with quantitative data (who retains, pays most, expands, refers).
Tier structure (per Kramer's MKT1 method)- Core — proven ICPs that drive most of the business today
- Scaling — successful but small share, planned to grow
- Testing — emerging, hypothesis-stage
- Not a priority today — explicitly rejected. Naming this is part of the work.
- Firmographics — company size, industry, geography, stage, tech stack
- Behavioral triggers — the event that makes them start looking
- Buying signals — behaviors that indicate readiness
- Decision-making structure — who decides, who influences, who blocks
- Disqualifiers — characteristics that predict poor fit
Positioning's "best-fit customer" component + customer evidence patterns + analytics cohort data (retention curves, ARPA distribution, expansion patterns).
Output1–3 named ICP segments with explicit firmographic + behavioral + disqualifier criteria + account tiers wired into the CRM. Feeds Channel strategy, Messaging, Lifecycle segmentation, ad targeting, GTM-engineered outbound.
You can't operate without it — but it isn't the gate Strategy 1, 2, 4 are. In the AI generation era, the anchor matters more: Claude, Midjourney, Nano Banana, GPT Image 2, every gen tool needs one document to stay consistent. Skip it and you generate Generic SaaS Voice forever.
Operator endorsement"Positioning is not branding, and it's not just marketing's job. The branding thing flows from the positioning."
— April Dunford, Hello Operator
"In the world where AI slop is going to be everywhere, the Comfy version of human-in-the-loop approach is going to win out most of the eyeballs."
— Yoland Yan, ComfyUI CEO · TechCrunch Apr 2026
- Per Dunford: brand voice reflects positioning + ICP, not lead it.
- Running a brand workshop before customer evidence produces a voice that contradicts what customers respond to (the Paperless.io rework pattern).
- For solo + small teams: 1-hour founder synthesis pulling from positioning + 3–5 customer transcripts. No stakeholder workshop needed at this stage.
- The single artifact every AI generation tool reads — brand becomes operational, not aesthetic.
- Claude drafts voice / personality / visual direction from positioning + transcripts; founder validates.
- Distinct from the ongoing Design system library (templates, prompts, character refs) — that grows separately as an Operational asset.
A 4-hour synthesis here is the cheapest brand-consistency multiplier you'll ever invest in.
The decided-once foundation of how the company looks and sounds. Voice + personality + visual basics in one anchor doc every downstream piece (website, content, ads, social, AI generations) reads from.
Outputs- Logo, color palette, typography stack
- Voice and tone guidelines with do/don't examples drawn from customer transcripts
- Personality spectrums (quiet/loud, playful/serious…) anchored in ICP fit
- Illustration / photography direction
- Recurring character refs for AI gen consistency
1-hour founder synthesis using positioning + 3–5 customer transcripts. With a small team (≤7), runs as a 2–3 hour alignment workshop — homework first, group sync, AI synthesis, founder decides.
BoundaryFoundational decisions live here. The growing library of templates, AI image prompts, and format catalogs lives separately in the Design system (Operational assets).
The single source of truth for how the company describes itself across every surface. Without it, the website says one thing, sales says another, ads a third, and customers see disjointed pitches wherever they look.
Operator endorsement"Somebody needs to be the keeper of the story and the consistency on the story."
— April Dunford, Growth Driver Show
"95% of sales pitches list features instead of teaching buyers what to evaluate. That's the missing ingredient."
— April Dunford, Hello Operator
"Every marketer feels they're saying something differentiated by leading with business outcomes, but 2,999 companies said the exact same thing."
— Anthony Pierri, Klue
- Dunford's messaging document spec — tagline, tiered descriptions (50/200/500 words), approved value language, feature translations, customer quotes, case studies.
- Kramer's Perceptions exercise: 3–5 narratives the audience should believe and repeat about you, written from their POV.
- Validate the pitch with customers quarterly. Train the lead salesperson first to evangelize peer-to-peer.
- Claude drafts every tier (50/200/500 word) from positioning + ICP + transcript language; human edits.
- "Words customers actually use" — pulled verbatim from JTBD transcripts, not from a marketer's hunch.
- Versioned in git alongside Master context; every Claude Code project reads it.
Without one keeper of the story, everyone tells a slightly different one. Customers feel it before anyone names it.
Translation of positioning into usable copy: value propositions, proof points, objection handling, recommended language. The single source of truth every marketing surface reads from.
Contents (Dunford messaging-document spec + Kramer Perceptions)- Tagline — one line
- Tiered descriptions — 50, 200, 500-word versions for different surfaces
- Primary value proposition + 3–5 supporting value props with customer-evidence proof points
- 3–5 Perceptions — narratives the audience should believe and repeat (Kramer)
- Objection handling — top skepticisms from interviews + responses (Dunford distinguishes objections from value)
- Segment-specific variations per ICP tier
- Voice and tone reference from Brand identity
- Recommended language — verbatim phrases customers used in transcripts
- Feature-to-benefit translations
Positioning + ICP + Brand identity + customer evidence patterns. Can't be done before all four.
CadenceMajor rewrite when positioning changes. Quarterly customer-validation pass.
The decision that determines which Execution engines actually get built. A solo operator can't run all 16 — they pick 2–3 that fit the product, the ICP, and the model. This module is where that filtering happens, and where the AI-era twist (PMF can collapse overnight) gets stress-tested.
Operator endorsement"The fits are always evolving / changing / breaking. When that happens, you can't simply change one element, you have to revisit and potentially change them all."
— Brian Balfour, Four Fits AI-era (Sept 2025)
"Chegg dropped from $1.2B to $150M valuation in 9 months — losing half a million subscribers — when ChatGPT broke its growth loop. Companies can find and lose product-market fit almost instantly now."
— Brian Balfour, Four Fits
Random Acts of Marketing — copying tactics from other companies without regard for what will work for your startup — is what kills small teams.
— Emily Kramer, MKT1
- Balfour's Four Fits, AI-era update (Reforge / brianbalfour.com, 2025) — Product-Market, Product-Channel, Channel-Model, Model-Market. Primary frame for 2026.
- Weinberg's Bullseye (Traction) — used as step 1 to brainstorm all 19 channels. Demoted from "the method" to "the enumeration step."
- Kramer's 6 Growth Engines: inbound, outbound, product virality, events, ecosystem, lifecycle — the high-level taxonomy.
- ICE scoring — Impact × Confidence × Ease, per candidate channel.
- Channel collapse: AI search intercepts discovery — Gartner predicts 25% search-volume drop by 2026; Webflow reports AI-search traffic converts 6× Google organic.
- Inference economics breaks Channel-Model Fit: Lovable's ~35% gross margin under inference cost (vs traditional SaaS 70%+). Intercom Fin's $0.99-per-resolved-ticket forced a model rewrite — >1M tickets/week as of Aug 2025.
- Claude scores each candidate channel against ICP + positioning + product motion in one pass.
The most expensive marketing decision a small team makes. Read the product, not the competition.
Prioritized ranking of acquisition channels for this specific business, framed by Balfour's Four Fits (AI-era). Bullseye becomes a brainstorm step inside the larger fit analysis, not the headline framework.
Frameworks (in order of use)- Four Fits, AI-era (Balfour, 2025) — primary frame. Stress-tests Product-Market, Product-Channel, Channel-Model, Model-Market fit. Explicitly accounts for AI search intercepting discovery and inference economics breaking unit economics.
- Bullseye, step 1 (Weinberg) — brainstorm all 19 channels to populate the candidate set.
- ICE scoring — Impact × Confidence × Ease per candidate.
- 6 Growth Engines (Kramer) — inbound, outbound, product virality, events, ecosystem, lifecycle as the high-level taxonomy.
- PLG channel patterns (Wes Bush) — Bullseye / Mario / Bowling-Alley plays for self-serve products.
- Right Percent channel-customer fit — 2×2 of customer intent (active search vs. passive) × targeting specificity (broad vs. specific). Find which quadrant fits the business; test channels within that quadrant first. B2B skews toward specific targeting.
Ranked channels with rationale + recommended test sequence + which Execution engines to activate vs leave dormant. Re-runs the Four Fits diagnostic when product, pricing, or AI-era market conditions shift.
CadenceInitial rank as a foundational sprint. Re-examine quarterly — channel performance and AI-search behavior shift fast; Channel-Model Fit can break overnight under inference economics.
The artifact AI-native operators couldn't have built before — a custom unified dashboard is now a 1-day Claude Code project, not a $50K BI engagement. Skip it and you spend hours every week reconciling numbers across PostHog, GA, ChartMogul, HubSpot, Stripe.
Diff report- A scheduled Claude session diffs current vs prior period and flags drops and tracking errors to Slack.
- Useful, fast, and cheap — sufficient for a small team.
- Official MCPs: PostHog, Mixpanel, Amplitude, HubSpot (GA April 2026), Salesforce (via Agentforce), Stripe, Notion, Linear, Slack, Figma, Gong (Mission Andromeda).
- Community MCPs: GA, Search Console, ChartMogul, Baremetrics, Pipeboard for Meta Ads (most mature, BSL license).
- Output as static HTML report (cron-refreshable) or live Claude artifact reading current data. Both work.
The first dashboard a solo operator can build that actually unifies the company. Forty hours a quarter saved on number-reconciliation.
A single custom report combining data from marketing, product, and subscription analytics into one view tailored to this company. Built once, refreshed automatically. Replaces what used to require Tableau, Looker, or a contracted BI engagement.
How it's builtA Claude Code project connects to each tool via official + community MCPs, pulls the data, renders the dashboard — as a generated HTML report (cron-refreshed) or a live Claude artifact reading current data.
ContentsTraffic · signups · activation · subscriptions · revenue · churn · NRR · funnel conversions · retention cohorts · channel performance · AEO citation share — everything in one place with consistent definitions across the company.
UpdatesScheduled Claude session diffs current vs prior period, flags outliers and tracking errors, posts to Slack.
CadenceInitial build: 1–2 days. Auto-refresh: daily or weekly. Re-shape when product or strategy shifts meaningfully.
The data spine for everything activation, retention, and expansion. Without it, Onboarding optimizations are guesswork and Lifecycle triggers fire on ghost events. PostHog, Mixpanel, and Amplitude all ship official MCPs — the most-used MCP set among AI-native marketers in 2026.
Operator endorsementSnappa: removing email verification (which was blocking 27% of signups) boosted MRR 20% in 6 months. Only diagnosable through activation analytics.
— Wes Bush, Databox / ProductLed
"Every product has a unique activation metric. Typically, it's a combination of events that correlate to above average retention."
— James Hawkins, CEO PostHog · PLG.news
- PostHog, Mixpanel, Amplitude all have official MCPs in 2026 — Claude queries directly.
- Activation and retention work compounds; this is the source of truth for both.
- For PLG companies, more important than marketing analytics — the data customers generate inside the product tells the truer story.
- Wes Bush's PLG 2.0/3.0 framing means time-to-value is now seconds-to-minutes; instrumentation has to match that pace.
- Claude orchestrates — creates dashboards, runs SQL queries, flags taxonomy gaps directly via MCP.
- Funnel cohort analysis becomes a conversation, not a Looker ticket.
- Lifecycle triggered-message logic gets sketched in plain language; tested against cohorts directly.
The single most useful tool a solo operator can have on tap. Skip it and Onboarding + Lifecycle become guess-work.
The tool capturing product usage data — events, funnels, retention, feature adoption, cohort analysis. Typically PostHog (most common in 2026), Mixpanel, or Amplitude.
What it captures- Event stream — every action customers take
- Funnel definitions — Traffic → Signup → Activated → Paid → Retained
- Retention cohorts — week-over-week, month-over-month
- Feature adoption + power-user identification (PQL signals per Wes Bush)
- Session replays (PostHog)
Maintains the tracking plan from the analytics audit. Where official MCPs exist (all three majors), Claude orchestrates the tool directly — creates dashboards, runs queries, flags gaps.
FeedsMetrics dashboard, Onboarding decisions, Lifecycle segmentation, Customer advocacy advocate-spotting, GTM-engineered outbound trigger logic.
Useful, not core. At PLG scale, product analytics tells you more than marketing analytics ever will. But for any team running paid or content channels — and any team trying to read the AI-search shift before competitors do — this is where the early warning surfaces.
Operator endorsementVercel: ChatGPT-sourced new signups went from 1% to 10% in six months. Webflow reports AI-search traffic converts at 6× Google organic. eMarketer: 63% of marketers recognize buyer search behavior is shifting; only 14% have adapted.
— 2026 industry data
- GA + Search Console are the two-tool minimum almost every B2B SaaS runs.
- Plausible / Fathom / Simple Analytics are honest alternatives if GDPR or speed matters.
- For PLG companies, this is supplementary to product analytics; for content/paid-heavy plays, it's primary.
- Ahrefs official MCP — keyword research, competitive backlink analysis, and Brand Radar for AI Overview and citation monitoring, all queryable via Claude.
- Community MCPs: Pipeboard for Meta Ads (791 GitHub stars, BSL license — most mature), GA / Search Console community-maintained.
- Claude pulls keyword and conversion patterns; cross-references with AEO citation data (Profound, Peec, AthenaHQ).
- The 25% predicted organic-search drop by 2026 (Gartner) means this dashboard's job changes — track AEO citations as a peer metric to organic CTR.
Don't let the absence of an "official" MCP stop you. Community MCPs work fine for ≤Series A operators.
The tools capturing acquisition-side data — traffic, sources, search performance, ad-campaign tracking, and competitive SEO. Core stack: GA + Search Console for traffic and conversions; Ahrefs for keyword research, brand monitoring, and AI citation tracking. Plausible / Fathom / Simple Analytics as privacy-friendly alternatives to GA.
What it captures- Traffic by source / medium / campaign
- Search performance — keywords, impressions, CTR, position
- Top pages by organic traffic
- Conversion attribution (with caveats — AI search now intercepts upstream)
- Geographic + device breakdown
- Keyword research + competitive SEO gaps (Ahrefs)
- Brand mentions + AI Overview / citation visibility (Ahrefs Brand Radar)
- Ad-platform performance via Pipeboard / community MCPs
Ahrefs ships an official MCP — Claude queries keyword data, site audits, and brand citations directly. GA / Search Console via community MCPs. Otherwise Claude guides setup and interprets outputs.
FeedsMetrics dashboard with traffic, source, and conversion numbers. Channel strategy validation. AEO production prioritization (which AI-cited content drove which conversions).
Not a day-one priority — but a lower-barrier decision than most assume. ChartMogul is free up to $10K MRR. ProfitWell Metrics by Paddle is always free and benchmarks you against 30,000+ subscription companies. Both give you cohort charts, churn breakdowns, and expansion tracking built in — real insight with no analysis work required.
Operator endorsement"Top SaaS companies now source 50%+ of new ARR from existing-customer expansion."
— Kyle Poyar, Growth Unhinged
86% of SaaS companies that grew to $20M ARR improved expansion's share of net-new MRR by more than 10%. Subscription analytics is where that signal first becomes visible.
— ChartMogul, SaaS Growth Levers report
- Despite the free tiers, timing still matters — with fewer than ~30 customers, the data is too sparse for cohort analysis to be meaningful.
- For PLG companies the value-metric work happens in product analytics first; subscription analytics confirms it later.
- For sales-led, this becomes more important earlier (~Series A) once segmented MRR/NRR matters for board reporting.
- For AI-native products, traditional seat metrics may not even apply — outcome-based pricing (Fin's $0.99/ticket) needs different telemetry.
- ChartMogul (free plan) has an MCP; Baremetrics is community-MCP'd; Stripe is officially MCP-equipped.
- Claude computes NRR cohorts, expansion rates, segmented retention without manual reconciliation.
- Pricing experiments (Van Westendorp, value-metric review) read directly from this layer.
Two of the best tools in this layer are free — ChartMogul and ProfitWell Metrics. No excuse not to have subscription visibility from day one.
The tool capturing MRR, churn, ARPA, NRR, cohort retention. Typically ChartMogul, Baremetrics, or directly via Stripe API. Source of truth for recurring revenue.
What it captures- MRR / ARR with churn and expansion components
- ARPA distribution by plan / segment
- Net Revenue Retention (NRR) cohorts
- Customer count, churn rate, plan-mix shifts
- Failed-payment recovery (involuntary churn)
Reads subscription data (ChartMogul has an MCP; others may not yet) and feeds the metrics dashboard. Helps keep definitions and segments clean.
FeedsSource of truth for revenue and retention metrics on Company one-pager. Inputs to Pricing analysis, Growth model, Capital & stage fit.
The unsexy infrastructure that makes every other AI workflow possible. Without recorded conversations, JTBD synthesis is fictional, sprint-log auto-summaries hallucinate, advocacy interviews never get extracted, and Messaging gets its language from a marketer's hunch instead of customers' actual words. Promoted from "useful" — modern transcription is cheap, MCP-equipped, and the substrate every fuel-side module reads from.
Operator endorsementClay's GTM-engineering function explicitly cites Gong as a backbone — version-controlled workflows treat transcripts like code.
— Everett Berry, How we built Clay's GTM engineering function
- MKT1's "Calls = source-of-customer-truth" position — Calls feeds Naro-style enablement, evidence sprints, sprint logs.
- Jan's pipeline metric: 10 hours per interview manually → 2 hours with ChatGPT verification → under 20 minutes with Claude Code, same depth preserved.
- Official MCPs: Gong (Mission Andromeda), Fathom, Fireflies. Otter and most others API-queryable.
- Auto-tagged by participant, deal stage, segment, sentiment — searchable across 50+ conversations in seconds.
- Per-interview JTBD extraction (15 components + 4 themes) runs as a Claude Code skill against any new transcript.
It's a database, not an insight engine. The insights are downstream — but they don't exist without the corpus.
The system that records, transcribes, and stores every meaningful company conversation — sales, customer success, user interviews, team syncs. Without it, no downstream AI synthesis is possible.
Transcription (most MCP- or API-equipped in 2026)- Gong (official MCP "Mission Andromeda", late 2025) — sales-call standard at scale
- Fathom (official MCP) — small-team default, cheap, reliable
- Fireflies (official MCP) — calendar-attached, easy
- Otter — API-queryable, broad device support
- Granola — note-style summaries, light weight
Custom Claude workflows handle JTBD extraction, sprint-log synthesis, and pattern detection across batched transcripts. For dedicated insight tooling: Dovetail for structured research repositories and synthesis; closedloop.sh for sales-call processing and specific use cases — both useful for larger teams.
What lives in the corpus- Sales discovery, demo, and renewal calls
- Customer success check-ins and churn-risk conversations
- JTBD / customer evidence interviews
- Customer advocacy 26-question interviews (per Uplift)
- Internal team syncs and strategy meetings
- Optional: founder one-on-ones, advisor calls
Customer evidence sprint, Sprint log, Messaging framework, Product comms, Customer advocacy, Sales enablement — anywhere customer or team voice matters.
CadenceAlways-on. Add to MCP catalog (AI-skill engineering) on day 1. Audit segment tagging quarterly.
The library every other module pulls from when it needs proof. Website testimonials, ad creative, sales decks, founder LinkedIn proof points, AEO comparison-table citations, lifecycle save sequences — all read from one approved, segment-tagged corpus. Without it, the same testimonial gets re-asked across teams, logos get used without permission, and case studies live as PDFs nobody can find.
Operator endorsement"94% of B2B buyers have used an online review to help make a purchase decision."
— Voxturr (industry stat)
Uplift Content's 2024 survey of 121 SaaS marketers ranked case studies as the #1 most effective marketing tactic — ahead of SEO, blog, or social.
— Uplift Content, 26-question case study playbook
- Approval status is the differentiator — most teams have testimonials they're not sure they can publish.
- Segment tagging is what makes proof weaponized — show the right logo to the right buyer in the right context.
- "Where this can be used" field per entry stops accidental misuse (channel, NDA, expiration).
- Claude tags new entries automatically (segment, use case, value driver, sentiment).
- When generating ad copy, comparison pages, or onboarding nudges, Claude pulls relevant proof for the audience.
- Auto-flag stale or missing-permission entries; quarterly approval refresh as a Claude Code skill.
Customer advocacy runs the motion; this library stores the output. Without the library, the motion's value evaporates.
Curated customer-facing evidence, approval-ready and segment-tagged. Single source of truth for testimonials, case studies, logos, awards, reviews.
Contents- Testimonials — quote, attribution, approval state, segment, use case, expiration
- Case studies — company, challenge, solution, results, status (draft / approved / published), Before/During/After arc per Uplift
- Client logos — file, tier, industry, display permission, quality rating
- Awards, certifications, press mentions
- Reviews — G2 (60M+ visitors), Capterra (9M+ monthly), TrustRadius (12M+ annual) — highlight reel with stars and source link
- Stats with methodology — pulled from interviews, citation-friendly for AEO
Notion or Airtable database with approval status, segment tags, use-case tags, and a "where this can be used" field per entry. Versioned alongside Master context so AI tools see it.
FeedsWebsite (case study + testimonial sections + comparison-table citations), Customer advocacy (motion that fills it), Paid (ad creative inputs), Sales enablement (deck + battle cards), Lifecycle (save sequences), AEO production (citation-grade stats).
CadenceLiving. Customer advocacy adds entries weekly; quarterly approval refresh; annual logo-tier review.
The growing operational layer downstream of Brand identity. Where strategic brand decisions become reusable templates and tested AI prompts. Promoted in the AI-gen era — every Claude / Midjourney / Nano Banana / GPT Image generation reads from this library, and a 78% cost cut from character-consistent generation only happens when the references live somewhere reusable.
Operator endorsementCliprise (3-person agency, 12 clients) documented cost drop from ~$14K/month to ~$3.1K/month and 3–4× volume after a two-week prompt-library investment.
— Cliprise, 2026 case study
- The work compounds — first 3 months you barely have templates worth saving; by month 6 with consistent output, the library pays for itself in time saved.
- Without it, every generation re-invents prompts and visual consistency drifts. With it, AI image generation produces repeatable on-brand output.
- For paid-heavy or content-heavy motions, this rises to 8 — the prompt library is where the moat lives.
- Per-platform templates (LinkedIn, X, blog, newsletter, ad units) Claude reads and fills against Master context.
- Recurring brand characters — image-gen 2026 stack ($300–500/mo solo) ships agency-quality output once the refs are anchored.
Brand identity decides what the brand is. The Design system is where those decisions get encoded into things AI can reuse.
The growing operational library of templates, tested AI image prompts, character references, and format catalogs. Distinct from Brand identity (Strategy) — that decides once; this implements continuously.
Contents- Logo files, color palette tokens, typography (mirrored from Brand identity)
- Templates per platform — LinkedIn, X, blog, newsletter, ad units, podcast cover, slide deck
- Tested AI image prompts — Midjourney, Flux Kontext, Nano Banana, GPT Image, and others
- Character references — recurring brand characters for AI generation consistency
- Video prompt library — Veo, Kling, Seedance patterns
- Photography / illustration style guide with do/don't examples
Markdown doc + asset folder + versioned prompt library. Lives in git or Notion. Read by Claude / Midjourney / Flux / Veo at generation time. Pairs with AI creative engine (which uses it) and Brand identity (which seeds it).
CadenceLiving. New templates added as patterns prove out. Quarterly audit to retire stale prompts as models shift (the image-gen frontier moves every ~3 months).
Conditional — only matters once you're publishing at volume. At <1 piece per week it's overhead; at 2–3 it pays back within a quarter; at the AEO-visibility tier (8–12+ pieces/month per Averi) it becomes essential. The artifact that makes Kramer's "Mileage" principle operational — repurposing without a tracker turns into archaeology.
Operator endorsement"Don't falsely assume a piece of content is 'bad' if you've never properly distributed it."
— Emily Kramer, MKT1 Field Guide
"Mileage means doing more with the fuel you've created — repurposing extends the life of your ideas, distribution multiplies their reach. You need to do both."
— Emily Kramer, MKT1 Field Guide Part 2
Averi 2026 GEO benchmarks: high-AI-visibility teams ship 8–12+ pieces/month at 1–2 expert hours each — a velocity only achievable with a tracked, repurposable corpus.
Where the method comes from- Kramer's Content-is-a-Product framing — content needs roadmaps, versioning, distribution like a software product.
- Mileage framework — every piece has a future repurpose pass; 1 anchor → blog + LinkedIn + newsletter + clips + thread.
- "No dead ends" — every published piece links to a next valuable one.
- Auto-populated from Content engine — every publish logged with metadata via CMS API.
- Performance data cross-fed from Marketing analytics MCPs (GSC, GA, AEO citation tools).
- Claude flags repurpose opportunities ("this blog has 5× organic traffic — clip for LinkedIn? Refresh and re-publish?").
A content engine without a library is a creator who never edits. Every piece born and dying in isolation.
Catalog of everything published across every channel — blog, LinkedIn, X, newsletter, podcast, YouTube, ads, partner posts. The substrate that makes repurposing systematic instead of accidental.
Per entry- Title · URL · publish date · channel · topic / pillar tag
- Performance metrics (traffic, conversion, AEO citations)
- Repurpose status — has this been clipped, threaded, refreshed?
- Related content links — anchor / derivative / next-piece
- Perception tag (per Kramer) — which Perception this piece reinforces
- "No dead end" link — what reads next
One for net-new content, one for revisiting / repurposing / redistributing old. When publishing new, schedule a future task to revisit.
FormatNotion or Airtable database. Auto-populated where possible (CMS API + analytics MCPs). Versioned alongside Master context.
CadenceLiving, updated on every publish. Monthly repurpose review. Quarterly retire-or-refresh pass on stale entries.
One of the two most-cited "things new marketers fix first" in every 90-day playbook (alongside customer evidence). The single artifact AI lets a solo operator ship at agency quality — Claude Code rebuilds the public site from Master context in days, not quarters. With 92% of B2B buyers entering evaluation with a vendor in mind (Omnibound 2026), the site is the make-or-break first impression.
Operator endorsementStuart Brameld rebuilt his entire Growth Method site in 2 days (~12 hours actual work) using Claude Code, no JavaScript ability — site runs on Astro, edited entirely via chat interface.
— Stuart Brameld, Growth Method (Feb 2026) · Growth Method
- Every other module — content, founder-led, paid, lifecycle, AEO — eventually drives traffic here. The site converts or doesn't.
- The "stop fishing with nets, start spearfishing" Kramer thesis only works if landing surfaces are tight to ICP.
- AEO production reads from this — schema and sectioning baked at build time means every new page is citation-ready.
- Claude Code project pulls Messaging framework + Brand identity + Social proof library + Competitive landscape and ships pages.
- AEO baked at build: schema markup, 120–180-word sectioning, comparison tables (+47% citation rate per Averi), visible "last updated" timestamps (+30% Perplexity citations).
- Per-segment landing pages generated from ICP tiers without engineering.
When the website lies about the company, every other module fights uphill against it. When it tells the truth, every other module pulls forward.
The full public website, built from Master context in a single Claude Code project. The technical stack evolves fast — for design seeding: Google Stitch or Claude Design. For high-control implementation: Figma MCP. For execution, Astro is the top choice (fast Core Web Vitals, clean architecture, great for content-heavy sites). Lovable works for very early stage or spinning up fast experiments off the main domain.
Pages- Homepage — primary ICP hero (per Kramer/Pierri positioning teardown)
- Features — translated from Messaging framework's feature-to-benefit table
- Pricing — value-metric aligned (ProfitWell: ~38% faster growth)
- Comparison pages — per real alternative from Competitive landscape (top citation page-type)
- Case studies — from Social proof library, Before/During/After arc
- About, changelog, blog hub
- Per-segment landing pages — secondary ICPs per Kramer's "primary hero, secondary segments routed" pattern
- 120–180 words between headings → +70% citations vs sub-50
- Schema-marked comparison tables → +47% citation rate
- Visible "last updated" → +30% Perplexity citations
- Stats with methodology + sources → +22–28% visibility
- 40–60 word lead paragraphs (extraction-friendly)
Copy from Messaging framework. Design from Brand identity + Design system. Proof from Social proof library. Comparisons from Competitive landscape. Updates triggered by strategy changes.
Andrew Chen's "$80 of $100" rule for PLG — most growth ROI lives here. For any self-serve, hybrid, or sales-assist motion this is non-negotiable. Wes Bush's "90% of PLG implementations fail" stat is mostly about teams shipping the visible front door (signup, pricing) without the foundational five systems behind it. This is where those systems get built.
Operator endorsement"The goal of user onboarding is not to help people become better at using your product. The goal is to help people become better at what your product enables them to do."
— Wes Bush, ProductLed
Snappa: email verification was blocking 27% of signups; removing it boosted MRR 20% in 6 months. Tettra tripled freemium upgrades by Q5; retention never dipped below 70%.
— Wes Bush, ProductLed Keynote (Oct 2025)
- Wes Bush's PLG 1.0 / 2.0 / 3.0 evolution (March 2026): time-to-value collapsed from minutes-to-hours (User as Builder) to seconds-to-minutes (User as Editor) to approaching zero (AI agents orchestrate). The "Halving Principle": time-to-outcome halves every few years.
- Bowling Alley framework — eliminate friction toward the first strike (first value moment).
- Mario Model — free experiences that transform the user, not just demo features.
- PQLs (Product-Qualified Leads) — usage-based buying signals replacing demographic MQLs.
- Industry baseline: PLG activation 5–10%; email-verification drop-off 10–30%.
- Customer evidence + funnel analytics + brand voice → AI drafts email sequences, in-app copy, checklist text in plain language.
- Adaptive flows based on behavior cohorts — Bush's "next frontier" is shipping now via PostHog/Amplitude MCPs feeding Claude.
- Branching logic gets sketched in plain language; tested against cohorts before deploying to Customer.io / Loops / Intercom.
No module returns more per dollar of attention. None.
The discipline that moves a new signup to the "aha moment" as fast as possible. Owns up to first value; Lifecycle & retention takes over after.
What's inside (Bush's five foundational systems)- Setup flow — minimum-friction signup, sample data, smart defaults, no email verification unless mandatory
- Activation path — guided steps to first value, mapped to evidence-sprint aha moments
- First-value email sequence — triggered, behavior-based, 1–5 messages before paywall
- In-app guidance — tooltips, checklists, empty-state instructions
- Friction removal — verification cuts, optional fields, faster time-to-value
- Reverse Funnel Planner (Bush): start from "what must happen to upgrade?" and design backward
Customer.io / Loops / Bento for messaging · Userpilot / Pendo / Appcues for in-app · PostHog or Mixpanel for funnel analytics · ProductLed Scorecard for self-audit.
How AI helps executeCustomer evidence patterns + product analytics funnel data + brand voice → AI drafts email sequences, in-app copy, checklist text. Branching logic tested against cohorts before deploying.
BoundaryOwns up to first value. After that, Lifecycle & retention takes over. Same tools, same operator at this scope — different intent and metrics.
Uplift's 2024 survey of 121 SaaS marketers ranked case studies as the #1 most effective marketing tactic — ahead of SEO, blog, and social. Yet most teams treat it as an afterthought. The most undervalued module in the entire system, and the one where AI's leverage is highest because the bottleneck (drafting, synthesis, tagging) is exactly what Claude does well.
Operator endorsement"Case study question #18 — 'If you couldn't use our solution ever again, what would that be like?' — can lead to some of your best sound bites."
— Gabriela Contreras, via Uplift's 26-question playbook
"94% of B2B buyers have used an online review to help make a purchase decision."
— Voxturr (industry stat)
"Use rapid framing to elicit prioritized responses... What are the top 3 reasons you purchased this solution?"
— Kaily Baskett, SlapFive (via Uplift)
- Uplift Content's 26-question Before / During / After framework — the gold standard for case-study interview design. 30–40 min conversational format; email interviews don't work.
- Maps cleanly onto JTBD's struggle-motivation-outcome-next-goal arc — same interview, different exit format.
- Review-generation sub-program: ~15 G2/Capterra/TrustRadius reviews/quarter drives category-grid placement; 11% of buyers auto-exclude anything under 3.9 stars.
- Reference platforms: G2 (60M+ visitors), Capterra (9M+ monthly), TrustRadius (12M+ annual).
- 30-min recording (Fathom/Gong/Otter MCP) + Claude → case-study draft in under an hour.
- Auto-tag entries by segment, use case, value driver — feeds Social proof library.
- Advocate identification scripted from product analytics + NPS / in-app survey data (Sprig, Refiner).
- Question #18 sound-bites auto-extracted across the corpus → ad copy and homepage hero candidates.
The relational work is irreducible. Everything else — drafting, tagging, scoring, surfacing the right quote for the right ad — is now AI's job.
The motion that turns happy customers into visible advocates — and generates the raw material that fills the Social proof library. Distinct from the library itself: advocacy runs the motion, the library stores the output.
Contents- Case study pipeline — who to interview, when, what angle, what segment
- 26-question Uplift script — Before / During / After arc, Question #18 the canonical sound-bite generator
- Reference program — customer-to-prospect intros, structured asks
- Advocate identification — usage cohorts + NPS + in-app survey signals
- User-generated content activation
- Customer advisory board (if warranted by stage)
Structured G2 / Capterra / TrustRadius push, ~15 reviews/quarter. 94% of B2B buyers consult reviews; 11% auto-exclude under 3.9 stars (Voxturr). Drives category-grid placement.
Common interview mistakes (Uplift)- Yes/no questions instead of open-ended
- Not pushing for concrete numbers, metrics, or examples
- Rigid adherence to script vs natural conversation flow
- Failing to listen actively for unexpected insights
Weekly outreach to spotted advocates. Monthly case-study pipeline review. Quarterly review-program target. Customer advocacy adds entries to Social proof library continuously.
The most-disrupted execution motion in the system. What used to take a $15K/month content marketer is now within reach of a solo operator with Claude Code + a tight CLAUDE.md. The disruption is real — but it only translates when the inputs (positioning, evidence, brand voice) are real. AI without those produces internet-flavored slop at faster speed.
Operator endorsement"Make shows, not just feeds. Feeds keep your company visible, but they're rarely memorable. Shows are intentional series with a consistent voice."
— Emily Kramer, MKT1
"Small amounts of expert direction provided at the start of the content creation process are vastly more effective than lots of human editing at the end."
— Ryan Law, Director of Content Marketing · Ahrefs content engineering
- Ahrefs content engineering (Ryan Law, 2026) — 23 chained editorial skill files, one per production phase. Days → 6–12 minutes per publish-ready draft. Core principle: front-load expert direction in SKILL files; don't rely on heavy downstream editing. Integrates Ahrefs MCP for keyword data at the brief stage.
- Kramer's Content-is-a-Product framing — roadmaps, versioning, distribution like software. Two roadmaps: net-new + revisit/repurpose.
- Perceptions exercise — every piece reinforces one of the 3–5 narratives the audience should believe.
- The 30%-juice rule — only ~30% of output is product talk; the other 70% covers problems, solutions, market trends, vision.
- Mileage — every anchor piece spawns 5+ derivatives across formats and channels.
- Human edits land at 40–60% for quality work; lower for docs and in-app copy.
- Product Marketing Alliance shipped an open-source Customer Research Skill (Oct 2025) automating content-from-evidence workflows.
- Averi GEO benchmarks: high-AI-visibility teams ship 8–12+ pieces/month at 1–2 expert hours each with AI assistance.
- MKT1's own MCP server (March 2026): 15 marketing skills auto-deploying — content-roadmap, GACCS-brief, Perception-tagging available as Claude skills.
The output looks like a junior content marketer's. The cost looks like a tool subscription. That's the whole disruption in one sentence.
Ongoing publishing on owned company channels — blog, newsletter, company LinkedIn / X / YouTube. Distinct from Founder-led distribution (different voice, different surfaces, different approval flow).
How it worksEditorial calendar driven by customer evidence patterns + Perceptions + AEO targets + Brand voice. AI drafts via chained editorial skill files — one skill per production phase (brief, outline, draft, AEO pass, distribution checklist). Human edits at 40–60% for quality work. Front-load expert direction in SKILL files rather than editing after the fact. Tracked in Content library; repurposed systematically per Mileage.
Production pipeline- Topic — from JTBD pain / trigger, competitor gap, or AEO query opportunity
- GACCS Brief — Goals, Audience, Channels, Creative, Stakeholders (Kramer's connective tissue)
- Outline — against Messaging framework + Perception tag
- Draft — Brand voice + AEO recipes (120–180 word sectioning, schema, expert quotes, stats with methodology)
- Human edit pass — 40–60% rewrite, voice check, fact verification
- Distribution checklist — LinkedIn, newsletter, X, clips, partner amplification
- Mileage pass — schedule the repurpose / refresh / next-piece task
Content library database (Operational assets). One blog anchor → LinkedIn post + newsletter + clips + thread + future refresh.
CadenceWeekly publishing minimum at <Series A. Monthly content-roadmap review against Perceptions + Revenue Levers. Quarterly mileage / repurpose pass on top performers.
The motion that compounds. Top SaaS companies now source 50%+ of new ARR from existing customers (Poyar's 2024 expansion data) — yet small teams over-invest in acquisition and treat retention as an afterthought. The single biggest discipline gap in pre-Series-A B2B SaaS, and the one most exposed to AI-native disruption: Intercom Fin handles >1M tickets/week at $0.99 each, fundamentally rewriting what a "lifecycle motion" can look like.
Operator endorsement"Great email campaigns should be behavior-based and not time-based."
— Val Geisler, Fix My Churn · Intercom podcast
"Expansion is arguably the most important B2B metric. Having Net Dollar Retention be >120% is a dream come true — revenue will grow >20% even if no new acquisition comes through the door."
— Elena Verna, LinkedIn
- Reforge Retention & Engagement curriculum (Val Geisler, Andrew Chen, Elena Verna, Brian Balfour) — lifecycle as distinct from onboarding, distinct from content, foundational to LTV.
- Wes Bush's "layer sales on PLG" — Lifecycle is also where sales-assist hooks attach to self-serve.
- Loops vs Customer.io tool-selection axis: simple email → Loops; multi-channel branching → Customer.io.
- Inflection.io's product-data triggers: traditional MAPs miss in-app behavior — the gap that makes lifecycle hard.
- Behavioral cohorts (PostHog/Mixpanel MCP) + customer evidence + brand voice → AI drafts triggered sequences, save copy, expansion prompts.
- Branching logic sketched in plain language; tested against cohorts before deployment.
- NPS / in-app surveys (Sprig, Refiner, Userpilot, Survicate) feed churn signals AND Customer advocacy advocate-spotting.
- Resolution-rate / total-automation-rate metrics on AI-served tickets — the new lifecycle KPIs for AI-native products.
Acquisition gets you the customer. Lifecycle keeps them. NRR is where the second curve of growth lives.
The motion that turns activated users into long-term retained customers and expansion ARR. Owns everything after first value.
What it owns- Triggered-message architecture — Customer.io, Loops, Bento, Intercom, HubSpot for engagement, churn save, expansion
- Cohort retention curves and engagement scoring
- Churn signals — usage drops, login gaps, support escalations — and save sequences
- Expansion triggers — usage-based upgrade prompts, feature unlocks, contract upsell
- Re-engagement of dormant users (win-back sequences)
- NPS / CSAT / in-app pulse surveys (Sprig, Refiner, Userpilot, Survicate)
- Sales-assist hooks for PLG → sales-led handoff (Wes Bush's "layer sales on PLG")
Single-channel email → Loops. Multi-channel + branching → Customer.io. Sophisticated journey orchestration → Intercom or HubSpot. For PLG with rich product-data triggers → Inflection.io.
How AI helps executeBehavioral cohorts from product analytics MCPs + customer evidence patterns + brand voice → AI drafts triggered sequences, in-app messages, save copy, expansion prompts.
Boundary with OnboardingOnboarding ends at first value. Lifecycle owns everything after. Same tools, same operator at this scope — different intent and metrics. (Onboarding measures activation; Lifecycle measures retention + NRR.)
The fastest-rising discipline in B2B SaaS marketing. 73% of B2B buyers now use AI tools in their research process (Averi 2026), 94% use LLMs during their purchase journey, and AI Mode searches end without a click 93% of the time. Discovery is being intermediated by AI engines, and getting cited beats getting ranked. The one Execution module pure SEO teams will lose to AI-native operators on first.
Operator endorsement"Over 50% of Google searches resulted in zero clicks by 2019. Now AI meets the user at the point of intent."
— Mike King, iPullRank · AI Search Manual
"Brands cited in AI Overviews earn 35% more organic clicks and 91% more paid clicks. Being cited matters more than ranking."
— Averi GEO Benchmarks 2026 · 680M citations study
- +70% citations from 120–180-word sectioning vs sub-50-word
- +47% citation rate from comparison tables with schema markup
- +30% Perplexity citations from visible "last updated" timestamps
- +22–28% visibility from stats with methodology and sources
- +40% citation likelihood from proper H1→H2→H3 hierarchy
- +41% LLM citation lift from expert quotes (Princeton 2024 GEO paper)
- ChatGPT — Wikipedia (47.9% of top-10 citations), Reddit, YouTube
- Perplexity — Reddit (46.7%), Wikipedia, YouTube; values community-validated content
- Google AI Overviews — YouTube, Reddit, Wikipedia, Google properties
- Microsoft Copilot — leans heavily on LinkedIn for B2B
- Claude generates llms.txt, schema markup, comparison tables, .md mirror pages.
- Tooling: Profound (enterprise), Peec AI (mid-market, Lily Ray endorsed), AthenaHQ (YC W25), HubSpot AEO Grader (free baseline).
- Query fan-out simulation tests citation likelihood pre-publish.
SEO is about ranking. AEO is about being cited. Different game, same content workflow at this scope.
Engineering discipline for being cited by ChatGPT, Perplexity, Claude, Google AI Mode, Microsoft Copilot, Gemini, and other answer engines. Distinct from SEO in measurement, tooling, and team redesign — but at ≤Series A scope it ships in the same content workflow as Organic SEO.
Three disciplines- SEO — keywords, backlinks, technical site structure → ranking in blue links on Google/Bing
- AEO — featured snippets, voice search, AI overviews → concise, structured answers to specific user questions
- GEO — technical authority, trustworthiness, summarizable content → citation by AI models that need to extract and present your information
- Schema strategy — structured data for products, FAQ, comparison, articles
- llms.txt + .md mirror pages (Jeremy Howard's Answer.AI proposal, Sept 2024)
- Citation engineering — expert quotes (+41%), stats with methodology (+22–28%), inline citations (+30%)
- Sectioning discipline — 120–180 words between headings (+70% citations); 40–60 word lead paragraphs (extraction-friendly)
- Entity consistency across owned and earned media — "Consensus Engine Theory" (Surmado)
- Query fan-out simulation — test citation likelihood pre-publish
- Training-data plays — digital PR for inclusion in next training cuts, Reddit upvote thresholds (Seer Interactive playbook)
- Best-of listicles — 43.8% of all cited page types (Averi)
Citation Frequency · Share of Voice · Attribution Quality · Cross-Platform Coverage · Sentiment in AI responses.
Tooling 2026- Profound — enterprise (raised $96M Series C at $1B valuation, Feb 2026); Ramp testimonial
- Peec AI — mid-market, 2,000+ teams; Lily Ray endorsed
- AthenaHQ — YC W2025, ex-Google Search PMs
- HubSpot AEO Grader — free baseline (5-dimension scoring)
Weekly content cycle ships AEO-optimized. Monthly entity audit. Quarterly llms.txt regeneration. Citation telemetry monthly minimum.
ReferenceMike King (iPullRank) "AI Search Manual"; Princeton 2024 GEO paper; Averi 2026 GEO benchmarks (680M citations); HubSpot AEO Trends 2026; Surmado Three Disciplines guide.
Founder-as-channel: the personal media presence that compounds reputation, deal flow, and acquisition. Lenny, Poyar, Verna, Kramer, Dunford — every operator most cited in this system is their company's top channel. At pre-seed → Series A, this often outperforms any other acquisition motion.
Operator endorsementFletchPMM (Anthony Pierri + Robert Kaminski): $1.3M ARR, 400+ B2B SaaS clients, 95% of clients sourced via LinkedIn, $0 ad spend. Two founders posting in parallel = 130K+ combined following, ~12M annual reach.
— Startup Mountain Summit
"If I were running marketing at a B2B startup today, LinkedIn would be our primary 'PR' channel."
— Dave Gerhardt, Founder Brand
- Voice, cadence, tone, and approval flow are different. The founder posts; the company posts.
- The founder must show up — AI is a multiplier on drafting, not a substitute for the human.
- Best fit when the founder is naturally articulate and willing to publish at cadence.
- Voice memos / one-liners → Claude drafts LinkedIn cadence in founder's voice. 1-hour weekly recording → 3–5 polished posts.
- Podcast prep (guest research, talking points), repurposing (clips, threads, blog posts), comment threading — all AI-assisted.
- Doesn't work without the founder's actual presence — AI is multiplier, not substitute.
- Why optional (dashed): some founders are not natural publishers. Forcing it produces worse output than not doing it. Activate when the fit is real.
The single most under-priced channel for B2B SaaS at this stage. The asset is the founder's reputation; AI amplifies the signal.
The founder's personal media presence as a deliberate acquisition motion. Distinct from Content engine — different voice, different surfaces, different approval flow. At pre-seed → Series A this often outperforms every other acquisition channel combined.
Components- Founder idea bank — voice memos, one-liners, customer-call observations, hot takes
- LinkedIn cadence — 3–5 posts/week minimum, comments, replies. Pierri/Kaminski model: dual-founder presence stacks reach.
- Podcast appearances — pitching, guest-research prep, repurposing into clips and threads
- X / other founder-fit networks — only where the audience actually is
- Optional: founder newsletter (Substack / beehiiv) — Gerhardt's Exit Five at 40K subs as the canonical example
- Optional: founder podcast as a flagship show (Lenny / Poyar / Verna pattern)
Voice memo or message → Claude drafts in founder's voice (using Brand identity voice doc + past-post corpus) → founder edits, approves, posts → engagement and replies feed the next memo. Best-of posts get repurposed into Content engine assets.
CadencePierri/FletchPMM pattern: 3–4 LinkedIn posts/week per founder. Daily comment activity. Monthly podcast appearance. Quarterly review against Pipeline + ICP penetration.
ReferenceDave Gerhardt's Founder Brand (2022); FletchPMM (Pierri + Kaminski) Startup Mountain Summit data ($1.3M ARR, 95% LinkedIn-sourced, $0 ads); Kramer's MKT1 newsletter as a B2B founder-led distribution flywheel; Gerhardt's "If I were running marketing at a B2B startup today, LinkedIn would be our primary 'PR' channel."
Google search still drives meaningful pipeline for B2B SaaS — narrower in 2026 than it used to be (Gartner predicts 25% search-volume drop by 2026; 93% of AI Mode searches end without a click), but real. Brands cited in AI Overviews earn 35% more organic clicks and 91% more paid clicks. Pairs tightly with AEO production: same content workflow, two reader audiences.
Operator endorsement"Traditional SEO is still the foundation. 92% correlation between top-10 organic ranking and AI Overview citation — being in the top 10 is statistically near-required."
— Surmado (Princeton GEO study)
"Over 50% of Google searches resulted in zero clicks by 2019. Now AI meets the user at the point of intent."
— Mike King, iPullRank · AI Search Manual
- 61% organic CTR drop for queries with AI Overviews — but cited brands gain 35% more clicks.
- Marketplace listings still real: G2 60M+ visitors, Capterra 9M+ monthly, TrustRadius 12M+ annual.
- Technical baseline (speed, schema, crawlability, Core Web Vitals) still load-bearing.
- iPullRank reports over 80% of AI Overviews cite deep pages, not homepages — surface-level optimization is insufficient.
- Keyword research, technical audits, content briefs all AI-leveraged via Claude Code.
- Link building remains the human-relationship part — AI drafts outreach, not relationships.
- Most measurement tooling (GSC community MCP, Ahrefs API, Semrush API) is queryable directly.
SEO isn't dead, but it's no longer the only search game. Pair with AEO production for full coverage.
Discoverability across Google search and category marketplaces. Pairs with AEO production for full search coverage. At ≤Series A scope, both ship from the same content workflow.
Production playbook- Keyword research — informed by customer language from evidence sprint, not seed lists
- Content structure — comparison tables (top citation type), internal linking, structured data
- Technical baseline — speed, schema, internal linking, crawlability, Core Web Vitals (HubSpot's Christopher Miller: users assess pages in seconds based on load time alone)
- Authority signals — earned link building, digital PR, brand mentions (correlate with AI visibility r=0.664 per Averi)
- Marketplace & directory — G2, Capterra, Product Hunt, TrustRadius, partner marketplaces
- Bing/Microsoft Copilot — Bing Places must match Google Business Profile exactly; LinkedIn Company Page completion drives B2B Copilot citations
Measurement: Google Search Console, Ahrefs, Semrush. Pair with AEO production tooling (Profound, Peec AI, Otterly, AthenaHQ, HubSpot AEO Grader) for full search coverage.
Boundary with AEO productionSame content workflow at this scope; different measurement and audience focus. Splits into separate disciplines at Series B+ when content velocity and dedicated owners justify it. Per iPullRank: "Redefining Your SEO Team to a GEO Team" is the late-stage move.
Outbound reframed as engineered, not manual. Clay positioned GTME as "the first true AI-native profession" at Series C (Aug 2025, $3.1B valuation). For any B2B SaaS that touches accounts above pure self-serve, this is load-bearing — not optional sidecar. Optional/dashed only because pure self-serve PLG with no sales motion legitimately doesn't need it.
Operator endorsement"One GTM engineer can amplify 100 sellers by automating all their research, signal tracking, data entry, and messaging so they can focus on selling."
— Clay, Series C announcement
"Most of our executive-level sales happen over iMessage. Clay tables and workflows themselves get treated like code — version-controlled, bi-monthly release notes."
— Everett Berry, How we built Clay's GTM engineering function
SaaStr: 70,000 hyper-personalized AI emails vs 7,000 from humans, $1M+ revenue from 20+ production agents. Owner.com: 3× revenue per AE after AI augmentation.
— Lemkin / Norton, AI in Sales 2026
Fully autonomous AI SDR — 11x / Artisan narrative. Reportedly 70–80% customer churn. Only 2% of companies successfully implement AI SDRs because most take a hands-off approach (Norton). Don't remove the human from personalization.
The working pattern- Clay (or custom Claude Code enrichment) for prospect signal aggregation.
- Instantly / Smartlead for sequence delivery (Smartlead MCP: 116+ tools incl. deliverability diagnostics).
- Mandatory human review of personalization lines before send.
- Reply rates 5–10% on good signals (per FL0 2026 report).
- 30-day agent training rule (Lemkin): days 1–7 ingestion, 8–21 daily review, 22–30 production.
Outbound isn't dead. The dream of "no humans needed" outbound is.
Engineered outbound: signal-based account selection, agent-driven personalization with mandatory human review, sequence management, and an eval loop on response quality. The engineering layer (signals, enrichment, agents) is the primary work, not the email writing.
Four-layer stack (FL0 2026 framework)- Data — Apollo, ZoomInfo for ICP/account universe
- Enrichment — Clay, Smartlead enrichment, custom Claude Code scrapers
- Delivery — Instantly (38+ tools), Smartlead (116+ tools incl. SmartDelivery diagnostics), Apollo for low volume
- Signal + Agent — upstream behavioral/intent triggers; agent-drafted personalization with human queue
- Signal aggregation (Clay + intent + behavioral triggers)
- Enrichment pipelines, deduplication, segmentation
- Agent-driven personalization with human review queue
- Sequence delivery + warmup + deliverability diagnostics (post-Gmail Feb 2024 SPF/DKIM/DMARC tightening)
- Slack-app triggered plays (Berry's Clay GTME pattern)
- A/B tests of angles drawn from Messaging framework
- Eval loop on response quality and unsubscribe/spam rates
Forward-deployed GTMEs working with customers + internal GTMEs building ops infrastructure. Reports to co-founder/Head of Ops, not buried under VP Sales or RevOps. Two-week sprints, ticket-based prioritization, version control on workflows.
Cost-of-engagement metricFL0 2026: cost per booked meeting, not cost per tool. Reply-rate benchmarks deliberately not published — "stack is not the dominant variable, ICP/offer/copy is."
ReferenceEverett Berry's "How we built Clay's GTM engineering function" (thegtme.com, Oct 30 2025); FullFunnel State of GTM Engineering Talent 2025; FL0 2026 Outbound Stack Benchmark; Lemkin/Norton SaaStr+AI Summit; Smartlead/Amplemarket MCP comparisons.
Not just for SDRs. This is the system for being prepared for any customer-facing conversation — high-touch demos, security reviews, design-partner pitches, expansion calls. Even pure PLG founders end up doing customer calls. AI-generated, per-prospect prep is one of the cleanest "previously impossible" use cases for a solo operator.
Operator endorsement"95% of sales pitches list features instead of teaching buyers what to evaluate. That's the missing ingredient."
— April Dunford, Sales Pitch
"One company cut time-to-close by 50% with three stage-specific demos instead of one generic demo."
— Bob Moesta, Demand-Side Sales 101
- Dunford's 8-step Sales Pitch structure — market insight → competitive alternatives → gap → introduction → value → proof → objection handling → call-to-action.
- Moesta's buying timeline — First thought, Passive looking, Active looking, Deciding. Different demos per phase.
- Dunford's "train the lead salesperson first" — peer credibility prevents drift.
- Dashed: only matters when you actually do customer-facing calls. Activate by motion fit.
- For pure self-serve PLG with no calls of any kind: dead weight.
- For anyone doing demos, security reviews, or sales-led: load-bearing — impact climbs toward core in those motions.
- Per-prospect prep: Claude reads company + LinkedIn + ICP fit signals → custom demo flow, tailored deck, objection prep, in <1 hour.
- Battle cards auto-refresh from Competitive landscape + Messaging.
- Stage-specific demos (per Moesta's First-thought / Active-looking / Deciding phases) ship as Claude templates.
- Post-call summaries auto-generated from Gong/Fathom transcripts; CRM updates queued for human approval.
Founder-led demos used to need a junior PMM in the room. Now Claude does the prep; the founder shows up.
The system for being prepared for any customer-facing conversation. Not just for an SDR team — applies to founders doing demos, CSMs running renewals, anyone on a high-touch call.
Contents- Sales playbook (per ICP segment, if multiple) — based on Dunford's 8-step Sales Pitch structure
- Discovery and demo scripts — stage-specific per Moesta's buying-timeline phases
- Objection-handling battle cards — from Messaging framework + Competitive landscape; Dunford distinguishes objections from value (different scripts)
- Per-prospect prep packets — custom demos, tailored decks, custom landing pages
- Pipeline cadence templates (only if you have AEs)
- Call-review templates and post-call note structures (against Messaging document spec)
Claude reads company info + LinkedIn signals + ICP fit + meeting context + Calls & transcripts corpus → drafts custom demo flow, tailored deck outline, objection prep. Useful for a single founder doing a Tuesday demo as much as for a team of AEs.
Why it sits in Execution, not Sales-team-onlyThis is a marketing-side production motion that supports anyone customer-facing. The pure sales-ops side (CRM hygiene, pipeline rituals) is rejected as RevOps — see Considered & rejected.
CadencePer-prospect prep on demand. Battle cards refresh whenever Competitive landscape updates. Quarterly pitch validation with customers (Dunford).
Historically one of the two main B2B SaaS acquisition motions alongside SEO. For PLG companies it's often the primary scaling channel once the funnel converts. Marketers spent 15 years building this discipline; AI didn't make it less relevant — it made it accessible to solo operators. Q1 2026 IAB data: 30% of creative ads now built from scratch or enhanced with generative AI, projected to reach 40% by year-end.
Operator endorsementMeta disclosed Q4 2025: more than 4 million advertisers use gen-AI ad creative tools (up from 1M six months earlier). Combined run-rate of $10B from video gen tools, growing ~3× faster than overall ads revenue.
— Digital Applied AI Marketing Statistics 2026
Advantage+ Sales delivers +22% ROAS vs manual setups and −32% CPA when fully consolidated. Meta has stated intent to deliver fully-automated ads end-to-end by end of 2026.
Digital Applied 2026 benchmark (50,000+ ad variations): AI-generated ads achieve +12% CTR on Meta vs human, but −8% conversion above $100 AOV. Parity threshold expanding from <$25 AOV (early 2025) to <$200 by late 2026.
Operator framingPaid is the amplifier, not the starter engine. Without good positioning, messaging, and a converting site upstream, paid burns money fast. With those upstream pieces, it's the highest-leverage scaling channel a solo operator has.
Where the AI leverage lives- Per-platform APIs (Meta, Google, LinkedIn) let Claude push and read back without manual UI work.
- Amazon Ads MCP (Feb 2026 open beta); Pipeboard for Meta Ads (community, 791 stars).
- Ad copy variants drafted from Messaging framework — 50+ in minutes.
- Creative comes from AI creative engine (Midjourney, Nano Banana, Veo, Kling).
- 70/30 budget rule: 70% on proven creatives, 30% on fresh AI tests; auto-pause below 2.5× ROAS after 3 days, scale +20% above 4.0× ROAS.
NOT a starter engine. The "skip paid until later" framing some 90-day playbooks use applies to seed-stage no-PMF companies. Once positioning is real and the funnel converts, paid moves up to high-leverage core. Watch the platform-down-ranking risk: Meta/TikTok/Google have begun algorithmically de-prioritizing "obviously AI" creative.
The most expensive marketing decision a small team makes. AI doesn't change that — it just lets one person manage what used to take three.
Paid acquisition across Meta, Google, LinkedIn — and LLM-based ad networks as they mature.
SetupSingle Claude Code project handling multiple ad platforms, with shared context and per-platform API/MCP integration. Creative pulled from AI creative engine; copy from Messaging framework.
How AI helps execute- Keyword research and audience hypotheses from ICP + positioning
- Ad copy variants from Messaging framework
- Creative variations and seasonal adjustments via AI creative engine
- Weekly performance review reports with recommendations
- Per-platform APIs let Claude push and read back without UI clicking
- Persona-split creative — separate generations per buyer persona
- 70/30 budget rule — 70% proven, 30% AI tests
- Auto-pause variants below 2.5× ROAS after 3 days
- Scale winners +20% when above 4.0× ROAS
- Avoid "obviously AI" aesthetic — platforms now de-rank it
Higgsfield Marketing Studio (URL → 9 ad variants via Hermes Agent), Smartly.io, Muze AI for autonomous campaigns. Pipeboard MCP for Meta. Stack cost: $300–500/mo solo vs $8–10K agency.
Stage fitConditional on positioning + a converting site upstream. With those in place, this is one of the highest-leverage scaling channels. Without them, it burns money fast.
Conditional because most teams under-ship comms (no changelog discipline → users don't know features exist) OR over-ship (every change becomes a launch → audience fatigue). The right pattern is templated and tiered. AI compresses what used to be a half-day-per-launch ritual to a 10-minute review-and-publish.
Operator endorsement"Plan campaigns in advance, capturing your biggest campaigns as Project goals. Each campaign needs a single DRI and tight coordination across all of marketing."
— Emily Kramer, MKT1 Field Guide
- T1 (full launch) — full channel set, blog hero, founder LinkedIn anchor, paid promotion, customer outreach, partner notification
- T2 (medium) — blog post, in-app, social, email to relevant segment
- T3 (minor) — changelog + in-app + targeted email
- Feature description + Master context → Claude generates per-channel drafts in one pass.
- Changelog, in-app announcement, blog post, social, email, partner notification — from a single brief.
- Tier classification (T1/T2/T3) suggested by Claude based on feature scope and Messaging framework.
- GACCS Brief pre-filled (Goals, Audience, Channels, Creative, Stakeholders) per Kramer.
- For PLG: medium importance — most users discover features in-product, not via blog.
- For sales-led: more important — sales needs the messaging to pitch new features.
- For developer-tools / API-first: critical — devs read changelogs religiously.
Templated launch pipeline = the most boring time-saver in the system. AI makes it boringly fast.
Templated GTM pipeline for every product release. The discipline that makes launches reliable instead of ad-hoc, while preventing audience fatigue from over-launching minor changes.
Per release (full set)- Changelog entry
- In-app announcement
- Blog post (T1/T2)
- Social posts (per channel: LinkedIn, X, Mastodon)
- Documentation update
- Email to relevant segment(s)
- Partner notification (if integration-related)
- Founder LinkedIn anchor (T1)
- Sales enablement update (battle cards, demo flow)
T1 (full) / T2 (medium) / T3 (minor). Each tier triggers a different channel set. Or use a per-feature tracking matrix showing which channels apply per release.
Input → outputFeature description + Master context + Messaging framework + Brand voice → drafts for each channel, ready for review. Approval queued for human edit; auto-publish disabled by default.
CadencePer release (whatever cadence the product team ships at). Quarterly review of launch performance to recalibrate the tier rubric.
Two modes, one capability: internal tools (scrapers, Figma plugins, CRM scripts, Claude commands) that remove operational friction without waiting for an engineering queue, and external-facing resources (ROI calculators, comparison guides, interactive tools, topic clusters) that earn traffic and convert intent. Both ship in a day or two. Lovable's first hire for this role was a marketer with no coding background — the job is explicitly both internal and customer-facing.
Operator endorsement"AI, regardless of your background, is an amplifier. If you don't know what you're doing, you're producing garbage faster."
— Lazar Jovanovic, Lovable's first vibe coder · Lenny's Podcast
- Anthropic's Austin Lau case (Feb 2026) — the canonical "marketer becomes a marketing engineer" story.
- Lazar Jovanovic / Lovable — hired full-time as Lovable's first vibe coder specifically for building marketer-owned tools and resources; proof that domain expertise + AI execution is a real role.
- Stuart Brameld's "Architects, not Operators" — operators ask "how do I build this?", architects ask "what outcome do I want?"
- Page Sands' "ship-one-then-iterate" — release a single diagnostic, instrument it, expand the catalog only after real usage.
- Ramli John's Vibe Marketer / Stormy AI — solo operators building $25–30K/mo SaaS as marketing-side projects.
"Most documented Anthropic-style wins are content creation, not distribution. The 5× problem: when creation gets cheap, downstream orchestration becomes the new bottleneck."
— Luminary Lane, "copilot ceiling" critique. Useful, not core. Build the tool only if it removes a real bottleneck.
- Claude Code, Lovable ($200M ARR in <12 months), v0, Cursor, Bolt — one operator ships working tools in hours.
- Supabase / Postgres backends; auth via Clerk / Auth0 if needed.
- Prompt-as-product — the system prompt + scoring rubric is where the IP lives, not the wrapper UI.
- Mandatory eval and code review before any public-facing surface.
Marketers used to wait months for an engineer to build the calculator. Now they build it themselves in an afternoon — both for internal ops and as public lead-gen assets — and retire it next quarter when it stops earning its keep.
Two types of build: internal tools that remove specific operational friction (scripts, plugins, Claude commands, scrapers), and external-facing resources that serve prospects and customers (calculators, comparison guides, interactive tools, resource hubs). Lovable's first vibe-coder hire covers both. Supports the rest of the system — not a standalone product line.
Internal tools (examples)- Anthropic Lau's Figma plugin — design-token-driven creative refresh, saves 30 min/refresh
- Anthropic Lau's
/rsacommand — Google Ads RSA generator with brand-tone Skill, ad-platform CSV export - Competitive-intel scrapers — track competitor pricing and messaging changes
- AEO query simulators — test citation likelihood pre-publish
- Workflow scripts and bookmarklets — BAT scripts, TamperMonkey scripts, Claude Code automations
- ROI / sizing calculators — lead-gen surface, qualifies buying intent
- Comparison guides — vs. incumbent framing, built as interactive tools or structured pages
- Resource hubs and topic clusters — support AEO, earn citations, compound over time
- Event microsites — one-off launch pages, shippable in an afternoon
- GTM diagnostic tools — Hero Checker, Competitor Analyzer, AI Citation Tracker (Page Sands model)
Claude Code (autonomous, terminal/CLI) for infrastructure work; Cursor (AI editor) for visual iteration; Lovable / v0 / Bolt for browser-based prototypes. Supabase / Postgres backends. Hosted on Vercel / Cloudflare Workers / Railway.
Build-or-buy decision rule- Build with Claude Code: local-file workflows, infrastructure scripts, API integrations, autonomous research.
- Build with Cursor: UI-heavy landing pages, lead magnets, real-time visual iteration.
- Build with Lovable / Bolt: browser-based sandboxes, quick-validation prototypes.
- Buy: auth systems, production databases, security-critical apps, mobile.
1–2 micro-products per quarter. Retire ruthlessly when usage drops or a vendor solves it better. Sands rule: ship one, learn from real users, then expand the catalog.
The most disrupted creative function in marketing — but creative output isn't the bottleneck for most small B2B SaaS, distribution is. Score 6 because demand is uneven across motions, not because AI capability is questionable. For paid-heavy or video-heavy motions, this rises to 8. The real investment isn't budget — it's model fluency and prompt library depth. Each model earns specific jobs: character creation, reference-based generation, text rendering, cinematic video, editing. The main skill is knowing which model to reach for — and how to avoid the obvious AI look most people default to.
Operator endorsementSolopreneur stack runs $300–500/month (Higgsfield Plus + Krea Pro + Fal.ai credits + Freepik/Artlist + n8n) — vs $8K–10K/month full-service agency or $1,025–2,000/month freelance designer. Median payback: 4.2 months.
— Mean CEO, solo founder stack 2026
- Image: GPT Image 2 (#1 on Artificial Analysis), Nano Banana, Seedream, Midjourney (cref/sref for consistency), Flux Kontext.
- Video: Veo (cinematic, native audio), Kling, Seedance.
- Platforms: Higgsfield Marketing Studio (URL → 9 ads via Hermes Agent; MCP now available — Claude calls generation directly), ComfyUI (Netflix/Tencent/Ubisoft pipelines), Fal.ai infrastructure, Freepik Spaces, Krea, Runway / Aleph.
Calling Nano Banana, Kling, or Veo directly through their provider APIs is billed per generation — costs compound fast at volume. Third-party platforms like Higgsfield sell subscription plans that bundle generations at a fraction of the direct-API unit cost. A $99/month Higgsfield plan now includes an MCP: Claude can call image and video generation without leaving the conversation, and you get access to all the major models through one subscription instead of managing separate API keys and billing relationships.
The conversion economics caveat- AI ads beat human ads on CTR (+12% Meta) but lose 8% on conversions for products above $100 AOV.
- Parity threshold: <$25 AOV early 2025 → <$100 Q1 2026 → projected <$200 by late 2026.
- For B2B SaaS at $30–150/mo: squarely in the AI-creative sweet spot. Enterprise tier still needs human-led conversion creative.
- Salesforce 2026: Meta/TikTok/Google have begun algorithmically de-ranking "obviously AI" creative.
- IAB 2026: 82% of ad execs think Gen Z is positive about AI ads; only 45% of consumers actually are.
- −17% premium perception, −19% inspiration, −14% purchase intent on AI-identified creative.
Pick best-in-class today; swap as the frontier shifts. The Design system stores what works; brand voice keeps the output recognizably yours.
The system for generating, refining, and reusing visual creative across every channel that ships images or video. Each model has a distinct use case — character creation, reference-based generation, text rendering, editing, cinematic motion — and model fluency takes genuine time to build. AI-heavy by default in 2026 but not AI-only: human review before anything public-facing is non-optional.
Note on this cardThis is a system / project, not a channel. Supports Content engine, Founder-led distribution, Paid campaigns, Social proof library — doesn't ship outputs on its own.
OutputsAd creatives (static + video), social visuals, product demos, explainers, blog illustrations, character-consistent campaigns.
Stack (current leaders)- Image: GPT Image 2, Nano Banana, Seedream, Midjourney, Flux Kontext
- Video: Veo, Kling, Seedance, Runway / Aleph
- Platforms: Higgsfield Marketing Studio (MCP available — run generation from Claude directly), ComfyUI (open-source, Netflix/Tencent pipelines), Freepik Spaces, Krea Business, Fal.ai infrastructure
- Solopreneur stack target: $300–500/month with direct APIs, or ~$99/month with aggregator platforms like Higgsfield
- Subscription vs. API economics: provider APIs (Nano Banana, Kling, Veo) are billed per generation and get expensive fast; platforms like Higgsfield bundle major models into a subscription (~$99/mo) at a much lower per-unit cost — plus single MCP access
- B2B SaaS video ad — character creation + reference-based generation + multiple passes to avoid the obvious AI look; natural character motion for Meta placement
- One-shot static ad — prompt engineered for on-brand output with branding and text in a single generation; no post-production
- URL → variant factory — n8n + Fal.ai + Nano Banana → 20–50 platform-formatted variants
- Motion microsite — Nano Banana (keyframes) + Kling (transitions) + Claude Code (site)
- Video router — Veo for hero, Kling for human motion, Seedance for multi-shot
- Persona-split creative — separate generations per buyer persona
- Recurring brand characters (Midjourney cref/sref; Flux Kontext)
- Tested prompt library stored in Design system
- Style references and seeds for repeatable output
Brand identity (decided-once direction), Design system (growing prompt library), Social proof library (case-study visuals), Paid campaigns (ad creative).
Risk to watchHollywood IP litigation paused Seedance 2.0 internationally (March 2026). OpenAI shut down Sora app and API (March 24, 2026). Build redundancy across model vendors; don't single-vendor critical pipelines.
The AI multiplier on the live moment is small — but on prep, repurposing, and follow-up it's substantial. Revived from "rejected" because Kramer's "Dinners are the new trade shows" pattern + AI-assisted prep + 20-asset repurposing now makes the unit economics work for solo operators in the right verticals.
Operator endorsement"Dinners are the new trade shows. Insights from a professional B2B event planner + my learnings from MKT1 Supper Club."
— Emily Kramer, MKT1 (Feb 2026)
Qualified's own events pipeline up 352% YoY using Piper AI SDR for post-event follow-up — the agent handles outreach at scale, the team shows up live.
— Kraig Swensrud, Qualified (Jan 2026)
- Verticals where buyers buy at events: cybersecurity, retail tech, regulated industries, healthcare, fintech.
- Founder-led webinars when founder-as-channel is already the strategy.
- Customer events when the goal is community + retention, not pure acquisition.
- Small dinners + supper clubs (Kramer's pattern) — high signal, low overhead, easier to repurpose.
- Cecilia Ziniti / GC AI pattern: workshops where you teach prospects to compete with you DIY — converts at unprecedented rates.
- Pure self-serve PLG with no relational sales motion.
- Pre-PMF — events amplify what works, don't fix what doesn't.
- Solo operators trying to run a series — pick 1–2 per quarter, not a calendar.
- Pre-event research: attendee enrichment via Clay/Apollo, agenda suggestions from JTBD evidence.
- Post-event repurposing into 20+ assets — clips, blog posts, social cuts, follow-up packets.
- Per-attendee follow-up sequences — personalized at scale.
- AI-assisted Q&A queueing and moderation during live sessions.
The live delivery is irreducibly human. Everything around it is AI-leveraged.
Live-audience marketing — webinars, conferences, workshops, roundtables, meetups, dinners. Each event is a sequence of topic selection, landing page, promo, live delivery, follow-up, and content repurposing.
Per event- Topic selection from customer evidence + JTBD pain + Perception you want to reinforce
- Landing page (Website module)
- Promotional sequence (Content engine + Founder-led distribution)
- Live delivery — speaker prep, run-of-show, moderation
- Follow-up: per-attendee email + asset repurposing
- Repurposing: full recording, clips, blog post, social cuts, podcast spinoff
- Founder-led webinar — flagship, quarterly, pairs with Founder-led distribution
- Small dinner / supper club — Kramer's MKT1 model, 8–12 attendees
- Workshop — teach prospects to do the work themselves (GC AI / Cecilia Ziniti pattern)
- Customer-only event — retention + advocacy, feeds Customer advocacy module
- Trade show / conference booth — only in verticals where buyers actually buy at events
Prep + repurpose + follow-up, not delivery. Pair with Sales enablement for high-value attendee prep; Customer advocacy for capturing case stories at the event.
Cadence1–2 events per quarter for solo operators. Don't try to run a series unless headcount or partner-resourcing supports it.
Community works when the founder IS the community lead. Exit Five (media company: 40K subscribers, ~$5M, no FTEs) and GrowthMentor (product-led: 750+ curated mentors, 60K+ sessions, 4.8/5) prove it across different models. What they share: the founder did the non-scalable work early — and that's what gave the community its initial quality signal.
Operator endorsement"For the last 2 years I have been building a B2B media company called Exit Five. The Myth of Solopreneur: I work for myself. There are no full-time employees."
— Dave Gerhardt, Exit Five
"Doing things that didn't scale like cultivating 1:1 friendships with the mentors and mentees was our most successful marketing strategy."
— Foti Panagiotakopoulos, GrowthMentor · Failory interview
- Gerhardt's Exit Five — canonical founder-as-community-lead pattern (media company model).
- Foti Panagiotakopoulos / GrowthMentor — spent weeks in 1:1 outreach personally vetting and onboarding elite-tier mentors before opening to mentees; each mentor brought their own social proof and distribution (LinkedIn profiles, startup networks). That manual seeding created a quality signal the platform still runs on.
- Common Room's CLG framework — signals from community/social channels feed enrichment, which feeds AI-prioritized engagement.
- Wes Bush's PLG community thesis — community as PLG multiplier, not standalone acquisition.
- Founder is naturally articulate and willing to publish + show up at cadence.
- Product is community-shaped (developer tools, indie SaaS, creator tools, vertical SaaS with strong identity).
- Founder is already running founder-led distribution (LinkedIn, podcast) — community sits on top of, not separate from.
- Late seed onward, with at least one paying customer cohort to seed.
- Founder hates writing or showing up online.
- Pure self-serve B2B SaaS with no shared user identity (e.g., generic AP automation).
- Pre-product, pre-customers — there's nobody to gather yet.
- Drafting weekly threads, member onboarding messages, AMA prompts.
- Repurposing community discussions into Content engine assets.
- Async Q&A drafting — founder reviews and posts.
- Common Room / RoomieAI-style intelligence layer surfaces the conversations worth jumping into.
Community without a community lead dies. Founder-led community works because the lead is already the asset.
Owned community space (Slack, Discord, Circle, forum, hosted platform) where the founder is the visible community lead. Distinct from hired-community-manager models — and distinct from broader DevRel work that requires dedicated headcount.
Cold start pattern (GrowthMentor model — replicable beyond platforms)- Elite supply before any demand — weeks of 1:1 outreach to hand-pick high-credibility members; don't open to customers until the roster signals quality worth joining
- Each early member as distribution — GrowthMentor's mentors put it on their LinkedIn as a role, shared their profiles, brought their networks; the community's credibility compounded through theirs
- Supply gets value too — elite mentors connected with other elite mentors; the community was a peer network for them, not just a service obligation
- Vetting creates the belonging signal — ~5% acceptance rate made being accepted a credential; accepted members had reasons to stay and share
- Open demand only after the quality signal is locked in — mentees joined a community already full of credible people; that's what made first sessions convert to retention
- The non-scalable phase is the product — Foti personally onboarded every mentor; that's the part most platforms skip, and it's the part that made GrowthMentor defensible
- Platform choice — Slack / Discord / Circle / forum / hosted (Common Room, Bevy)
- Weekly founder presence — minimum cadence for founder-led to work; daily check-ins ideal (Gerhardt model)
- Member onboarding ritual — welcome thread, intro pattern, first-week engagement
- Anchor content cadence — weekly thread / discussion prompt / AMA
- Content repurposing — community discussions → Content engine, Founder-led distribution, Customer advocacy material
- Optional: paid tier (Exit Five model) for sustainability
Drafting, repurposing, async Q&A, signals analysis — not relational presence. The founder's voice and presence are the asset; AI amplifies output, doesn't substitute for showing up.
CadenceWeekly founder presence minimum. Daily light-touch (read, react, comment) ideal. If the founder can't sustain weekly minimum, don't start.
ReferenceFoti Panagiotakopoulos / GrowthMentor (Failory interview); Dave Gerhardt's Exit Five; Common Room CLG framework; Wes Bush's PLG community thesis.
The single audit that quantifies whether the marketing motion is actually paying for itself. Without it every channel argument is a vibes argument; with it, "what should we invest in next quarter?" stops being debatable. Triggered by a specific decision — pricing rework, raise vs extend, channel reallocation, hiring — not by calendar.
Operator endorsement"Companies must choose no more than two or three moats to build. Trying to excel at everything results in mediocrity."
— Wes Bush, ProductLed
- Balfour's Four Fits AI-era — the framework whose outputs this audit quantifies. Channel-Model Fit math lives here.
- Kramer's bottom-up + top-down forecasting pair — top-of-funnel → revenue and revenue-target → pipeline both feed scenario modeling.
- Lemkin/Norton's "70–80% revenue activity" framing — Growth model checks whether the motion supports it.
- Reforge growth loops — which loops are active, which could be activated, which never compound.
- Which channels actually pay back, and how fast?
- Where does NRR come from — expansion, upsell, new logos, or recovery?
- What does 2× paid spend, 1% churn drop, or 10% price increase actually do to ARR over 24 months?
- Are we in the right ARPU/CAC zone for the channels we're using (Balfour's ARPU↔CAC spectrum)?
- What's the cost-of-delay on fixing the worst Fit?
- Claude pulls from product analytics + subscription analytics MCPs in one pass — no spreadsheet wrangling.
- Scenario modeling becomes conversational: "what if we cut churn 0.5%, add LinkedIn ads at $5K/month, and shift positioning toward enterprise?" — full pro-forma in minutes.
- Subsumes expansion analysis (same exercise, different question on the way in).
- Bottom-up SOM (named accounts × close rate × ACV) replaces top-down TAM theater.
The audit that makes "what should we invest in" answerable instead of debatable.
The deep audit that quantifies the marketing motion — efficiency, growth loops, where expansion comes from, what changes if you change inputs. Triggered on demand, takes 1–2 weeks. Subsumes the v06 expansion-analysis module.
Contents- Unit economics — CAC, LTV, LTV/CAC ratio, payback period, gross margin (incl. inference cost for AI features per Balfour)
- Channel-level efficiency — CAC by channel, payback by channel, ARPU mix per channel
- Growth loops — which are active (PLG, content, founder-led, integration partner), which could be activated, expected loop math
- NRR analysis — usage cohorts, upgrade triggers, churn drivers, gross vs net retention
- Scenario modeling — 2× paid spend, 1% churn drop, price change, new geography, AE hire, channel mix shift
- Bottom-up SOM — named accounts × realistic close rate × ACV (when a sizing question matters)
- 4 Revenue Levers ranking (Kramer) — top-of-funnel volume vs new audience vs increase value vs efficiency
Subscription analytics + Product analytics + Marketing analytics + Competitive landscape (for ARPU benchmarking) + Channel strategy (for the candidate set being modeled).
OutputA pro-forma model the founder + finance can argue with. Key constraints named. 1–2 specific recommended bets for the next quarter, scored for cost-of-delay if you skip them.
CadenceOn-demand, triggered by a specific decision. Typical triggers: planning cycle, pricing rework, raise vs extend, channel re-allocation, founder-vs-hire decision. 1–2 weeks of work end-to-end.
The highest-leverage revenue lever most early-stage SaaS teams never properly pull. A 1% improvement in monetization moves the bottom line 4× more than the same improvement in acquisition — yet most companies have spent under 8 hours on pricing in their entire history, have nobody qualified to run a proper rework, and mistake "pick a number and ship the pricing page" for a strategy. Pre-AI, fixing this required either a $150K–$500K consultant or in-house expertise almost no founding team has. AI removes both barriers.
Operator endorsement"If you improve your monetisation by 1% you will improve your bottom line by just under 13%. Improve your acquisition by 1% and you'll see a 3% boost."
— Patrick Campbell, ProfitWell · Business of Software 2016
"The average SaaS company spends about eight hours in their lifetime thinking about pricing."
— Kyle Poyar, OpenView · SaaS Club Podcast
- "Argue, guess, and check" — Campbell's label for how most SaaS teams handle pricing: no designated owner, no trained people, handed off to whoever volunteers.
- Expertise is expensive: external consultants cost $150K–$500K and take months; only 6% of SaaS companies have done sophisticated WTP research (OpenView survey, 2,200 companies).
- WTP interviews and Van Westendorp surveys require specialist knowledge to design and interpret — most founders don't have it and don't know they need it.
- The result: average 2.7 years between pricing updates; most ≤Series A teams underprice by 30–50% without realizing it.
- Patrick Campbell / ProfitWell — Van Westendorp sensitivity meter; value-metric alignment; 512-company study on acquisition vs. monetization vs. retention leverage.
- Kyle Poyar / OpenView — 2,200-company pricing survey; "how to run a pricing project" framework; LTV/CAC multipliers by pricing sophistication (1.68× → 3.23× → 11.09×).
- Wes Bush's PLG pricing — packaging tied to activation milestones.
- Balfour's ARPU↔CAC spectrum — pricing determines which channels are even viable.
- Claude guides WTP interview design and extracts willingness signals + price-anchor language from the Calls & transcripts corpus — no specialist needed to run the research.
- Subscription cohort clustering (ChartMogul / Stripe MCP) surfaces ARPA distribution, plan adoption, upgrade/downgrade patterns in one pass.
- Competitor pricing scraped and structured via Claude Code crawler.
- Synthetic-customer stress-tests (Evidenza / Synthetic Users, 0.85–0.92 parity) for variant testing — directional only, not final validation.
Pricing has always been the highest-leverage lever and the most neglected one. AI removes the expertise barrier that kept it inaccessible to early-stage teams.
The big one-off when pricing genuinely needs reworking — typically every 18–24 months or after a meaningful product or segment shift. Most teams arrive here having never done it properly: no WTP data, no value-metric analysis, pricing set by gut feel and never revisited. Run it as a deep audit, ship a clean change for everyone, then leave it alone for 12–18 months. Frequent small changes create more chaos than upside.
Inputs- 10–15 qualitative WTP interviews (the small-team alternative to surveys; use the Calls & transcripts corpus)
- Optional Van Westendorp sensitivity meter — only with ~100+ qualified responses; under that, qualitative beats quantitative
- Product usage data — which features correlate with higher WTP, what activation patterns predict expansion
- Subscription data — ARPA distribution, plan adoption, upgrade/downgrade rates, expansion vs new-logo NRR
- Competitive pricing comparison — incl. AI-feature pricing patterns from Metronome AI Pricing Index
- Inference cost per user/per task if AI-feature heavy (Balfour's Channel-Model break point)
- Pricing structure — flat, tiered, per-seat, usage-based, hybrid (base + usage), outcome-based ($0.99/ticket-style)
- Value metric — what unit aligns price with customer-perceived value
- Packaging — how features bundle into plans
- Free trial vs freemium vs paid trial — paid trials are rising for AI-cost-heavy products
- Grandfathering policy — which existing customers stay on old plans, for how long
Recommended pricing structure + packaging + value metric. Migration plan for existing customers (with grandfathering policy). Updated pricing page brief for Website module. Updated Messaging framework value-prop tier.
CadenceOn-demand. Typical triggers: 18–24 months elapsed, major product shift, AI-feature launch breaks unit economics, segment shift, competitive pricing pressure. The continuous-capability version (Series B+ pricing & packaging ops) is in Considered & rejected.
The lightest possible "is this whole thing pointed in the right direction?" check. AI-assisted, takes hours not weeks, gives you the bottleneck on a plate. Promoted from "useful" because Balfour's PMF Collapse essay made this a continuous monitor — Chegg dropped 90% in 9 months, Stack Overflow saw similar near-instant collapse — quarterly was already too infrequent.
Operator endorsement"The fits are always evolving / changing / breaking. When that happens, you can't simply change one element, you have to revisit and potentially change them all."
— Brian Balfour, Four Fits AI-era (Sept 2025)
"Companies can find and lose product-market fit almost instantly now."
— Brian Balfour, Four Fits AI-era
"Markets reflect customer expectations, and those are changing quickly. If a person uses and likes ChatGPT, it changes their expectations of other products too."
— Brian Balfour, on the Expectation Reset
"Are we in the right business — and what's the #1 bottleneck right now?"
Where the method comes from- Balfour's Four Fits AI-era (Reforge / brianbalfour.com, 2025) — the canonical update, replacing the 2015 original. Four dimensions: Product-Market, Product-Channel, Channel-Model, Model-Market.
- Balfour's "PMF Collapse" thesis — fits can break in months, not years. Chegg, Stack Overflow, Chegg-style monetization-side casualties.
- Subsumes v06 priority diagnostic + competitive risk + expansion review — same diagnostic, single artifact.
- Feed Master context (positioning, ICP, channel strategy, metrics, recent customer evidence) to Claude / Perplexity Deep Research → scored Four Fits in <1 hour.
- Cross-references Analytics-layer metrics to flag where the constraint is concrete vs hypothetical.
- Caveat: AI defaults to "differentiation over commoditization" and other trendslop biases (per HBR 2026). The rubric and judgment stay human; Claude stress-tests, doesn't decide.
The cheapest possible "should we keep going as-is?" check. Once a quarter, before the planning meeting, before the board update.
An AI-assisted scored review of the Four Fits + bottleneck identification. Light-effort: hours, not weeks. Sits opposite Growth model in the audit pair — Growth model is the heavy on-demand quantitative audit; this is the light recurring qualitative check that triggers it.
Framework — Balfour's Four Fits AI-era- Product-Market — do customers genuinely need this? (Track Sean Ellis "very disappointed" %, retention curves, free-to-paid)
- Product-Channel — can it be distributed through the channels we picked? (Channel CTR, conversion, attribution viability)
- Channel-Model — do channels support the business-model economics? (CAC by channel vs LTV; inference-cost margin compression)
- Model-Market — does the model work for the market's size + buying behavior? (Christoph Janz Elephants→Flies map; ARPU↔CAC spectrum)
Reading Analytics-layer metrics, what is the single constraint costing the most right now? Activation, churn, acquisition, positioning, pricing, or motion-fit? (Subsumes the v06 "priority diagnostic" module.)
AI-era pressure points to score against- AI search intercepting organic discovery (Channel collapse)
- Inference economics breaking gross margin (Lovable ~35%; Channel-Model break)
- Buyer expectations shifting from ChatGPT exposure (Expectation Reset)
- Competitor-product-market-fit collapse cascading into yours (Chegg pattern)
Each Fit scored 1–10. Worst Fit named. Top 1–2 things to fix in the next quarter. If one Fit scores below 5, triggers Growth model deep audit.
CadenceQuarterly minimum, plus on-demand whenever something feels off — before launching new spend, hiring, or board update. ~1 hour with Master context loaded.
The check that catches both over-spending pre-PMF AND the more common bootstrap-fixation trap — founders being too conservative and paying for it in a lost year of inactivity. AI-assisted, runs in an hour with Master context loaded. Useful — not core — because it's largely a CEO question marketing reads in on, not a marketing-owned audit.
What it answers"Are we resourcing this right for our stage? Should we extend, raise, or cut?"
Why it earns 6, not higher- Important but not weekly — most teams need this once or twice a year.
- Largely a CEO/finance question; marketing reads in to defend or argue down its line item.
- Sourced primarily from outside the marketing system (board materials, finance models) — this card is the marketing-side mirror.
- Useful as a tool for negotiating budget with the CEO when marketing is under-resourced, which is the more common gap pre-Series A.
- SaaS Capital, OpenView, Bessemer benchmarks — public datasets on stage-appropriate burn / efficiency / spend allocation.
- Lemkin's "$10M before CMO" heuristic (paraphrased; treat as directional) — when marketing leadership is and isn't justified.
- Kramer's 35–45% of CAC to marketing rule — the budget rule of thumb at this scope.
- Multithreaded marketer thesis (Srinivasan) — the headcount math that justifies AI-augmented small teams against larger traditional teams.
- Feed runway data + spend allocation + stage benchmarks to Claude → produces flag list in <1 hour.
- Stage benchmarks come from public sources updated quarterly.
- Surfaces under-investment vs over-investment with named comparable companies.
The cheapest possible reality-check on whether you're playing the right game with the resources you have.
"Are we resourcing this right for our stage? Should we extend, raise, or cut — and is the marketing line item the right size for what's expected of it?"
Evaluates- Burn rate and runway given current trajectory + planned hires
- Spend allocation (product / sales / marketing / ops) against stage norms (SaaS Capital, OpenView, Bessemer)
- Marketing line item against Kramer's 35–45% of total CAC heuristic
- Headcount math — is the team multithreaded enough (Srinivasan, 8–12 parallel threads orchestrated with AI), or operating with old SaaS headcount assumptions?
- Under-investment vs over-investment flags
- Raise, extend, or cut decision frame
- Over-spending pre-PMF — burning down founder/runway on activity that doesn't compound.
- Bootstrap-fixation — founders being too conservative with spend and paying for it in a lost year of inactivity. Especially common at seed → Series A.
The artifact marketing brings to the CEO budget conversation. "Here's what we have, here's what stage-comparable peers have, here's the gap, here's what bridging it would do per the Growth model."
CadenceOnce or twice per year. Plus on-demand triggers: pre-raise, post-raise, runway hits 12 months, stage transition signal.
Real discipline at scale. At pre-seed → Series A scope, two structural problems put it in the rejected pile: not enough buying-decision events to make the signal more than anecdote, and what it would surface already flows through Customer evidence sprint + Calls & transcripts. Useful when you've outgrown those primary sources.
Operator framingWin/loss interviews are post-decision and decision-anchored. JTBD interviews are situation-anchored and buyer-journey-focused. Different questions, different output. At small scale they overlap; at scale they diverge.
— Bob Moesta, on the JTBD-vs-win/loss distinction
"In B2B, we typically lose 25% of deals to 'no decision.' Customer indecision is a buying decision — and post-decision interviews are how you learn it."
— April Dunford, Lenny's Newsletter
- Volume problem: 3 wins and 2 losses per quarter isn't a signal. Win/loss needs ≥10 deals/quarter for the patterns to stabilize — that's typically Series A+ with structured pipeline.
- Overlap problem: what it would surface (objections, deal-killers, status-quo wins) already shows up in Customer evidence transcripts and Calls & transcripts synthesis.
- Effort-to-signal ratio: dedicated win/loss is 3–5 hrs per interview at small scale, where the same hours invested in JTBD or Customer advocacy interviews compound across more modules.
Once you have a sales-led motion with 10+ buying decisions per quarter, OR positioning starts feeling stale and you need a sharper post-decision pulse than JTBD interviews provide. Activate as a standalone Foundation module then.
A real Series A+ discipline. At this scope, JTBD + Calls cover the same territory cheaper.
Periodic (typically quarterly) program of 15–30 minute interviews with recent buyers and recent non-buyers in the category. Single question on the way in: "why did you decide what you decided?"
Two interview tracks- Win interviews — buyers who chose you. Why us, why now, who else they evaluated, what nearly killed the deal.
- Loss interviews — prospects who chose someone else (or "do nothing"). What they bought instead, what we did wrong, what would have changed their mind.
- Win-rate breakdown by competitor + by ICP segment
- Deal-killer pattern list (top 3–5 reasons for losses)
- Reasons-to-buy pattern list (top 3–5 wedges)
- Updates to Messaging objection-handling + Sales enablement battle cards
JTBD is situation-anchored ("when X happened, I started looking…"). Win/loss is decision-anchored ("at the moment of choosing, what tipped it?"). Different output shape; same skill set on the interviewer side.
Cadence (when activated)Quarterly, ~10–15 interviews per quarter (mix of wins / losses / no-decisions). Run by PMM or marketing-led if no PMM.
Real discipline at scale. At pre-seed → Series A scope, two structural problems put it in the rejected pile: not enough pipeline volume to justify the structure, and ownership doesn't naturally sit under marketing. Activate the day you hire your second AE.
Operator framing"RevOps foundation comes first, AI acceleration comes second. AI amplifies whatever lives in your CRM, including the problems."
— The Smarketers, RevOps Guide for B2B 2026
"GTME is RevOps reborn in marketing-adjacent form. Forward-deployed GTMEs + internal GTMEs operating like a product engineering team."
— Everett Berry, Clay GTM engineering
Kramer's "avoid the terms MQL / SQL — putting team names in front of stages creates territorialism, and the customer journey is never that clean or linear." Use Qualified 1 / Qualified 2 instead — adapt for your team.
— Emily Kramer, MKT1 Field Guide Part 2
- Scope: RevOps becomes load-bearing once you have AEs running structured pipeline — typically Series A+ or already Series B. Pre-seed and seed don't have the lead volume or sales motion to justify it.
- Ownership: even when the discipline is real, it usually doesn't sit under marketing — it's its own function or owned by a head of growth / COO.
- GTM-engineered outbound covers the marketing-adjacent slice already (signals, enrichment, agent personalization) without needing a separate RevOps module at this scope.
The day you hire your second AE. When CRM hygiene starts visibly costing pipeline. When attribution arguments become weekly meetings. AI tools amplify whatever lives in the CRM, including the rot — fix structure before adding agents.
Pre-Series-A teams don't need RevOps; they need a clean Master context, a tracked pipeline, and Account-Driven GTM in the CRM.
The function that defines pipeline stages with entry/exit criteria, MQL/SQL/PQL boundaries, attribution model, lead routing rules, and CRM hygiene rituals. Lives at the marketing-sales boundary.
Components (when activated at Series A+)- Pipeline stage definitions — entry/exit criteria per stage, calibrated against historical data
- Lifecycle stages — for both accounts and contacts (Kramer's Account-Driven GTM)
- MQL/SQL/PQL boundaries — or Qualified-1/Qualified-2 to avoid team-name territorialism
- Attribution model — multi-touch, position-based, or AI-attributed
- Lead routing rules — round-robin, account-based, expertise-based
- CRM hygiene rituals — duplicate detection, data validation, field decay alerts
- Marketing-sales SLAs — response time, follow-up cadence, hand-back rules
HubSpot or Salesforce as the system of record · Default / RevenueHero for routing · Clay for enrichment · 42Agency / Smarketers patterns · Marketo or Customer.io for marketing automation. The full Common Room signals layer when CLG is active.
What's marketing's slice vs separate functionMarketing owns: lead source attribution, campaign tracking, channel mix. Sales/RevOps owns: pipeline stages, AE routing, deal velocity. Shared: hand-off rules, SLAs, attribution model. At Series A+, the shared part needs an owner — that's the RevOps person.
When activated, where it sitsOwn function reporting to head of growth / COO / VP Sales — not under marketing. Marketing reads in to defend its line item and ensure attribution stays honest.
Objections matter; a separate database doesn't. The signal a dedicated objection library would surface already flows through three other modules. Adding it as a fourth source creates maintenance overhead and stale entries — without producing a unique signal nobody else has.
Operator framing"Differentiated value is a reason to buy. An objection is a reason a prospect might not buy, even if the value is something they really want. Often objections come from constituents who are not the deal champion but can kill a deal."
— April Dunford, on the value-vs-objection distinction
The Customer evidence sprint already extracts skepticisms as one of its 15 components per interview. Calls & transcripts already capture them in the source. Messaging framework already bakes in objection handling.
Why removed- Triple overlap: JTBD "skepticisms" (Customer evidence) + Calls & transcripts (raw objections in context) + Messaging framework (objection-handling section per Dunford) already cover this territory.
- Maintenance tax: a separate database goes stale fast — the same objection in transcripts is fresh; in a curated library it's last quarter's.
- Wrong primitive: at small scope, you don't need a structured objection corpus — you need to be in the calls.
- AI flips the asymmetry: Claude can search across 50 transcripts in seconds for any objection pattern. The library is the slow path; transcript search is the fast one.
When messaging copy is being written by people who don't sit on sales calls — a structured corpus becomes useful because the writers can't pattern-match objections from memory. Series A+ with multiple writers and an active sales motion.
Right discipline, wrong artifact. The transcripts are the library; the messaging doc is the rebuttals.
Curated catalog of every objection buyers raise — paired with the response the team has tested and validated. Living library, refreshed from sales calls, JTBD interviews, review sites, and lost deals.
Per entry (when activated)- Objection text (verbatim where possible)
- Source (call, interview, review, lost deal)
- Frequency / recency tags
- Tested rebuttal — what works in the moment
- Underlying concern (per Dunford: champion vs IT vs procurement vs legal — different responses)
- Owner / approval status
- Customer evidence sprint — "skepticisms" is one of the 15 components extracted per interview. The structured place where objections originate as research signal.
- Calls & transcripts — raw objections in context, queryable by Claude across the corpus.
- Messaging framework — Dunford's spec includes "top skepticisms from interviews + responses" as a required section.
Notion or Airtable database with versioning. Updated weekly from new transcripts. Sales enablement battle cards reference specific entries by ID. Marketing copy reviews check against the library before publishing.
A list, not a system. The work that would justify maintaining a public-presence inventory is already covered by Customer advocacy (driving reviews) and AEO production (entity consistency across owned + earned media). As a standalone module, it duplicates without producing operational leverage.
Operator framing"Bing Places must match Google Business Profile exactly. Microsoft Copilot leans heavily on LinkedIn for B2B queries — Company Page completion + 1–2× weekly posting + executive profile authority materially affect B2B discoverability."
— Surmado, AEO/GEO Guide
Brand mentions correlate with AI visibility more strongly than backlinks (r = 0.664 per Averi). Entity consistency is real work — but it lives inside AEO production, not as a separate inventory.
Why removed- List ≠ system: an inventory of URLs is a Notion page, not a discipline. The actual work is updating each surface, which is the AEO-production job.
- Triple overlap: Customer advocacy drives the reviews, AEO production handles entity consistency, Social proof library stores the testimonials that came from those surfaces.
- No unique signal: nothing surfaces in a public-presence module that wouldn't surface inside the three modules above.
- "Where do we exist?" is a 1-hour audit, not a recurring module — fold into Marketing context audit on day 1.
If the company hits 10+ active third-party listings without an AEO production module absorbing entity-consistency work — listing-management can become its own thing. Otherwise the work folds into AEO production naturally.
"Where do we exist online?" is a one-page list. The work that matters is keeping each entry consistent — that's AEO production, not inventory.
An inventory layer for review profiles, marketplace listings, partner directories, and third-party sites. Per entry: URL, owner, credentials (if any), last updated, current rating, link to canonical positioning copy.
What surfaces a list would track- Review platforms — G2, Capterra, TrustRadius, Trustpilot, GetApp, Software Advice
- Discovery surfaces — Product Hunt, BetaList, There's An AI For That
- Partner marketplaces — HubSpot, Salesforce AppExchange, Shopify, Stripe Apps, Notion Templates
- Identity surfaces — Crunchbase, LinkedIn Company Page, Bing Places, Google Business Profile
- Industry directories — vertical-specific listings
- Press / media mentions — coverage tracking
- Customer advocacy — runs the review-generation program (~15 G2/Capterra reviews/quarter)
- AEO production — handles entity consistency across owned and earned media; ensures Bing/Google Business Profile match LinkedIn match website
- Social proof library — stores testimonials that flow from review surfaces
- Marketing context audit — day-1 "where do we exist?" snapshot, no recurring overhead
Notion / Airtable list with quarterly review of stale entries. Owner: whoever runs Customer advocacy. Realistically: most teams just put it in Master context and don't make it a module.
Wrong owner. For most B2B SaaS, integration tracking is a product / DevRel artifact, not a marketing asset. For developer-first products where integrations ARE the primary distribution channel (Stripe-, Twilio-, Zapier-shaped), it becomes load-bearing — but that maps to a DevRel hire, not the marketing operator.
Operator framing"For developer-facing or inherently community-shaped products (Stripe, Twilio, dbt, Notion, Figma), CLG can be the primary channel."
— Common Room, CLG framework
Mary Thengvall's The Business Value of Developer Relations is the canonical reference — and the discipline it describes does not live under marketing.
Why removed- Wrong function: integration ownership sits with product or DevRel, not marketing.
- Wrong altitude: maintaining a list of integrations is product-page content, not a marketing system.
- Wrong scope: most ≤Series A B2B SaaS has 0–10 integrations and a list in Notion. There's nothing to systematize yet.
- Founder-led community already covers the developer-facing piece for community-shaped products.
Developer-first or platform-y products where integrations are the primary distribution channel — Stripe, Twilio, Zapier, dbt, n8n shapes — and you have ≥10 integrations. Pair with a DevRel hire (not marketing). Read Thengvall.
For 90% of B2B SaaS, this is a Notion page maintained by product. For the other 10%, it's a DevRel function. Either way, not a marketing module.
Catalog of every integration the product supports — third-party platforms, embeddable widgets, API connectors, partner-marketplace listings. Per entry: name, type, documentation URL, marketplace profile URL, internal docs, owner, last updated, status.
What it would track (when activated)- Native integrations — built and maintained in-house
- Partner-built — third-party connectors built against your API
- Aggregator listings — Zapier, Make, n8n, Workato
- Marketplace presence — HubSpot, Salesforce AppExchange, Shopify Apps, Slack Directory
- Per integration: docs URL, marketplace URL, last review, current status, owner, customer-usage volume
- Maintenance is technical (API changes, auth flows, sandbox debugging) — not marketing's skill set.
- The marketing surface is the marketplace listing copy + co-marketing — those fit inside Website (integrations page) and Partnerships (when activated).
- The discipline that holds the catalog together is DevRel's, not marketing's.
Hire a DevRel lead. Read Mary Thengvall's The Business Value of Developer Relations. The catalog becomes a content surface (per-integration docs, demo videos, partner co-marketing) and the function reports outside marketing.
Real Series B+ discipline — Rob Litterst counted ~1,800 SaaS pricing changes in 2025 alone — but a continuous capability creates a real operational tax that ≤Series A teams can't carry. The right small-company pattern is one-shot rework via the Pricing analysis deep audit, then leave it alone 12–18 months.
Operator framing"There's not going to be one size fits all pricing. The value prop is changing at a rate that most companies are struggling to keep up with. Every team becomes a success team."
— Scott Woody, Metronome CEO
"The inertia leads to avoidance, which ultimately hurts the business more than making change. Pricing changes are really hard because they're cross-functional."
— Metronome SaaS Pricing 2025
- Operational tax is real: continuous pricing changes mean analytics separates cohorts, support handles grandfathered customers, product maintains plan-feature mappings. ≤Series A teams can't carry that overhead.
- AI doesn't multiply this: pricing decisions depend on customer + data thinking, not generation. Claude helps with the audit, not with continuous-capability execution.
- Frequent changes destroy trust: at small scale, customers notice and resent it. Big-company "test-and-iterate" pricing reads as chaos at startup scale.
- Already covered by Pricing analysis (Deep audits) for the actual rework events.
Run a Pricing analysis (Deep audits, score 8) when you suspect you're underselling or the product has shifted meaningfully. Make a clean one-shot change for everyone. Leave it alone for 12–18 months. Then audit again.
When to revisitSeries B+, when you have analytics + support + pricing-ops capacity to run a continuous capability without it eating into everything else. Hire a dedicated pricing operator.
Right discipline at the wrong stage. Pricing analysis (audit) covers the small-team need; this is its Series B+ continuous sibling.
The continuous side of pricing as run by larger SaaS companies — ongoing value-metric reviews, packaging A/B tests, grandfathering policies, migration comms, billing infrastructure, hybrid base + usage pricing models.
Components (when activated at Series B+)- Quarterly value-metric review — is the current metric still aligned with customer-perceived value?
- Packaging tests — A/B per cohort, with explicit guardrails to prevent testing into chaos
- Grandfathering policy — which customers stay on which plans, for how long, with what signal
- Migration comms — staged rollouts with customer communication
- Billing infrastructure — Stripe Billing, Metronome, Orb for usage-based; Chargify, Recurly for traditional subscription
- Per-cohort analytics — separate retention, expansion, churn curves per pricing tier
- Plan-feature mapping — kept in sync with product team's feature flags
Three teams have to be staffed: pricing operator (defines and tests), analytics (separates cohorts), support (handles grandfathered customers). At ≤Series A, none of those three exist as roles — every change costs the founder a week.
2026 trends to watch- Hybrid (base + usage) is the new default per Metronome
- Outcome-based pricing rising (Intercom Fin's $0.99/ticket as the canonical example)
- Paid trials replacing free trials for AI-cost-heavy products
- AI credits as a packaging primitive
A genuine channel — Kramer's "Ecosystem Map" treats complements as essential, not optional — but pure relationship and negotiation work. AI helps draft partner-facing copy and run affiliate analytics, but doesn't multiply the relational core. The "you have to pay twice" warning is real for ≤Series A: co-marketing eats founder calendar before it returns pipeline.
Operator framing"Yes, referral programs work for SaaS — but you gotta put in the work. Partnership programs at seed often cost more attention than they return."
— Jason Lemkin, SaaStr
"Build a list of potential partners. For each, note type and size of partner, audience overlap, and potential ROI. Include integrations, influencers, communities, affiliates, advisors, etc. Startups often over-focus on competitors and neglect complements."
— Emily Kramer, MKT1 Field Guide Part 2 (Ecosystem Map)
- Pure relationship work: co-marketing aligns two companies' calendars, approvals, and brand voices. AI drafts the email; humans run the relationship.
- "You have to pay twice": partnership programs cost the partner discount + the operational overhead of managing the partner. Net economics often negative pre-Series A.
- No multiplier: the "build once, use everywhere" principle that makes most modules AI-leveraged doesn't apply to partner relationships — every one is bespoke.
- Pre-PMF kills partnerships: partners need something repeatable to sell. If you don't have it yet, the partnership stalls.
- Affiliate programs — most AI-native form (Reditus, PartnerStack, Cello). Tracking + payouts automated; ~10% of new MRR achievable for product-shaped fit.
- Integration partner listings — one-time marketplace ops (HubSpot, Salesforce AppExchange, Slack Directory). Usually folds into Website module's integrations page.
- Heavy co-marketing — bespoke per partner; activates when you have a CSM-style relationship lead, typically Series A+.
When you have something a partner can actually sell — meaningful product surface, proven economics, repeatable customer story. Affiliate programs are activatable lighter; reseller / heavy co-marketing waits for Series A+.
Real channel, mostly relational. AI helps with the admin; the asset is the relationships, and those don't compound from a CLAUDE.md.
Distribution through other companies — integration partner listings, co-marketing with adjacent tools, reseller programs, affiliate programs. A genuine channel that converts well when product-fit and partner-fit align, but operationally heavy.
Three flavors (different difficulty)- Affiliate programs — lowest-touch. Tracking via Reditus, PartnerStack, Cello. Affiliate signs up, gets a link, gets paid on conversions. ~10% of new MRR achievable in product-shaped categories.
- Integration partnerships — medium-touch. Marketplace listings, joint blog posts, co-webinars. One-time ops to set up; ongoing maintenance light.
- Reseller / co-marketing — highest-touch. Contracts, MDF, joint pipeline meetings, calendar alignment. Series A+ or pre-Series A only when a single partner has obvious leverage.
- Type and size of partner
- Audience overlap (Kramer's Ecosystem Map dimension)
- Potential ROI estimate
- Co-marketing assets if applicable (joint webinar, blog, case study)
- Listing presence (integration page, marketplace)
- Tracking (UTMs, affiliate dashboards)
Helps: drafting partner-facing copy, joint blog posts, affiliate-program outreach sequences, tracking dashboards. Doesn't help: building the relationship, aligning brand approvals, calendar coordination, contract negotiation.
ReferencesCello (B2B referral software 2026 guide); PartnerStack (anatomy of a high-performing referral program); Reditus (affiliate / referral platform built for B2B SaaS); Kramer's Ecosystem Map.
The clearest noise module in the system. Top-down TAM is theater — pitch-deck content, not an operating tool. It doesn't change what a 5-person marketing team should do this quarter. The lowest-scored card in the entire v10 because the cost-benefit is genuinely negative: it produces a number that misleads, with hours that should have gone to JTBD interviews or messaging.
Operator endorsement (the consensus is unanimous)"A tiny piece of a giant pie is the wrong way to think about things. Instead, you should be aiming for a big piece of a small pie, initially."
— Ben Yoskovitz, Focused Chaos
The "1% of a $100B market" formulation is a credibility killer. Top-down TAM is theatre.
— Jason Lemkin / Sara Ledterman / Robert Kaminski (consensus across SaaStr, Focused Chaos, FletchPMM)
"Markets aren't set in stone. Small markets shift regularly and some of them will become big. You become an expert in that niche, which provides you with an unfair advantage."
— Ben Yoskovitz
- It changes nothing operational: knowing the market is "$100B" or "$10B" doesn't tell a 5-person team which channel to test or which ICP to interview next.
- The "1% of a giant market" math is famously dishonest: capturing 1% requires the same focused execution as winning a smaller niche outright.
- Top-down sizing rewards bad judgment: it incentivizes founders to inflate definitions until the number looks fundable.
- Bottom-up SOM is the part that's useful — and it's already inside Growth model (Deep audits).
Bottom-up SOM only — named accounts × realistic close rate × ACV. That work is already inside Growth model (Deep audits, score 8) and informs ICP + Channel strategy. No separate module needed.
What v06 looked likeStood as its own card in the Deep analysis layer at score 3. The v07 audit moved it to "Considered and rejected" outright — keeping it visible specifically to signal to readers that sizing matters less than it appears to.
The single most-confidently rejected module. Almost every credible operator says skip it; almost every founder runs it anyway because investors ask for it.
- TAM — Total Addressable Market: every entity that could theoretically buy
- SAM — Serviceable Available Market: the slice you could plausibly reach with current GTM
- SOM — Serviceable Obtainable Market: the slice you can realistically capture
Top-down TAM = market size × estimated penetration. Bottom-up SOM = named accounts × close rate × ACV.
Why top-down is theatreThe estimated-penetration number is unfalsifiable. Pick the assumption that produces the headline you want; nobody can prove you wrong; nobody can use the answer to make a decision.
What's actually useful- Bottom-up SOM — folded into Growth model (Deep audits)
- Named-account list — folded into ICP & segmentation (Strategy)
- Christoph Janz's Elephants→Flies map — used inside Channel strategy as the Model-Market Fit framing
- Yoskovitz's "big piece of small pie" — niche-first wedge, then expand
- MarTech industry: $321B–$510B globally (Yoskovitz)
- Marketing Automation: $5.8–6.5B (2023), 12.3% CAGR
- Global CRM market: ~$60B
- None of these numbers tell you what to do this quarter.
Show bottom-up SOM with explicit assumptions. Acknowledge the top-down number is a sanity check, not a target. Move on.
Sources
This system was seeded by Jan's personal operating manual and known frameworks, then reworked into AI-native form with help from 6 deep research reports synthesizing roughly 3,650 sources between them, plus another 280 manually added or web-scraped articles, podcasts, and reports, some are surfaced below.
Deep research reports
- System v05 review — The 42-module audit: what survives contact with a small B2B SaaS team. 705 sources, 29 cited inline.
- System v07 review — Pushing back on AI-native marketing system. 647 sources, 32 cited inline.
- The AI Reckoning in B2B SaaS Marketing — Where AI is and isn't replacing marketing functions, with operator and labor-market data. 533 sources, 38 cited inline.
- AI Creative Tooling for Small B2B SaaS Teams in 2026 — End-to-end stack: image and video models, platforms, pricing, conversion economics. 515 sources, 69 cited inline.
- Customer interview & positioning — Methodology for JTBD interviews and positioning sprints (FletchPMM, Forget the Funnel, Dunford, DemandMaven, Moesta). 416 sources, 11 cited inline.
- AI failure modes for solo operators — Sycophancy, hallucination, context rot, prompt injection, agent fiascos, bot liability, privacy, cost runaways, homogenization. 852 sources, 38 cited inline.
Operating models, planning & first 90 days
- MKT1 Field Guide to B2B Startup Marketing, Part 1 ·
- MKT1 Field Guide to B2B Startup Marketing, Part 2 ·
- Building an efficient marketing machine: the fuel & the engine ·
- How to prioritize marketing activities & avoid Random Acts of Marketing ·
- MKT1 Guide to Annual Marketing Planning ·
- The Future of Martech — MKT1 Martech Showcase 24 ·
- The MKT1 B2B marketing tools survey + future predictions ·
- AI broke the old marketing playbook — here's the new one ·
- Lessons from Emily Kramer (synthesis) ·
- How to make an impact in your first 90 days ·
- How to build a powerful marketing machine ·
- B2B SaaS Marketing in 2026: Strategy, Funnel & Growth Playbook ·
- The 2026 Blueprint for Scalable B2B SaaS Marketing ·
- 6 Growth Engines: Channel-Startup Fit ·
- Traction: How Any Startup Can Achieve Explosive Customer Growth (book) ·
Marketing-role evolution: Gen Marketer · Multithreaded · Architect
- Meet the [Gen] Marketer of the Future ·
- How to hire & get hired in 2026: the Gen Marketer skillset ·
- We saw the future of B2B marketing — Gen Marketer Summit takeaways ·
- Gen Marketer: Why AI-Powered Generalists Are Replacing Specialists ·
- Marketers Should Be Architects, Not Operators ·
- The Rise of the Multithreaded Marketer ·
- The AI-Native CMO: Rewiring Marketing For Warp Speed (podcast) ·
- The AI-Native CMO (companion blog post) ·
- The Pi-Shaped Marketing Team of One ·
- What Is a Pi-Shaped Marketer? ·
- Defining "AI-native" employees and companies ·
- Scott Brinker — if he reset his career today ·
Positioning — Dunford framework + Pierri
- A guide to advanced B2B positioning ·
- A quickstart guide to positioning ·
- April Dunford on Positioning (Marketing History podcast) ·
- Pessimism, Positioning, and How to Win Deals ·
- How to Operationalize Positioning and Messaging ·
- Product Positioning That Works ·
- A quickstart guide to positioning ·
- A product positioning exercise ·
- Stop Using JTBD for Product. Use It for Positioning. ·
- Positioning and Messaging Teardown ·
- How we used April Dunford's 10-step framework (Paperless.io case) ·
- Sales Pitch & Obviously Awesome (books) ·
JTBD, customer evidence & advocacy
- Jobs to Be Done with Bob Moesta — SaaS Club Podcast Ep. 423 ·
- Unlocking Customer-Led Growth in SaaS ·
- Customer-Led Growth — interview ·
- From Chaos to Clarity: The Fast-Track JTBD AI Workflow ·
- Case Study Interview Questions (the 26-question playbook) ·
- Dovetail — research repository & synthesis ·
- Survicate — NPS & in-app surveys ·
- closedloop.sh — sales-call processing ·
- Why agentic coding needs product intelligence ·
- Synthetic Users: If, When, and How to Use AI-Generated "Research" ·
- Synthetic Users — platform site ·
- Symar — synthetic research platform ·
- From Static Data to Dynamic Intelligence: Belkins' Evolution to Synthetic Market Research ·
Four Fits, growth strategy & market sizing
Product-led growth & onboarding (Wes Bush)
AEO / GEO / AI search visibility
- The Fall of the Blue Links and the Rise of GEO (AI Search Manual, Ch. 1) ·
- AI Search Manual (full manual landing) ·
- Answer Engine Optimization: The Complete AEO and GEO Guide for 2026 ·
- ChatGPT vs Perplexity vs Google AI Mode — B2B SaaS Citation Benchmarks 2026 ·
- 12 AI Marketing Tools for B2B SaaS in 2026 (Under $500/mo Stack) ·
- Answer Engine Optimization Trends in 2026 ·
- HubSpot AEO Grader (free baseline tool) ·
- The Complete 2026 Guide to Answer Engine Optimization ·
- AI Brand Monitoring: The Complete Guide ·
- Profound — AI Search Visibility Platform ·
- Peec AI — AI Search Analytics ·
- AthenaHQ — AI SEO across SEO/GEO/AEO ·
- Ahrefs MCP — keyword and backlink data via Claude ·
- Ahrefs Brand Radar — AI Overview & citation monitoring ·
- How we're adapting SEO for LLMs and AI search ·
- The ultimate guide to AEO ·
Founder-led distribution & community
- Dave Gerhardt on Building Exit Five ·
- How Dave Gerhardt Turned LinkedIn Fame into a $1M+ Community ·
- How to Build a LinkedIn Founder Brand that Drives Revenue ·
- Community-Led Growth in B2B: 2026 Guide ·
- Executive Dinners are the New Trade Shows ·
- GrowthMentor — how 1:1 mentor friendships scaled ·
- GrowthMentor ·
Lifecycle, retention & expansion
GTM engineering & outbound
- How we built Clay's GTM engineering function ·
- The GTM engineering era begins now (Clay Series C) ·
- The Best MCP Servers for Sales and Cold Outreach in 2026 ·
- What is a Cold Email MCP Server? ·
- B2B Outbound Automation Stack Benchmark Report 2026 ·
- GTM Engineering Is Growth Marketing (skeptic) ·
- How we built Clay's GTM engineering function (Clay blog) ·
AI sales transformation — Lemkin · Norton · Clay
AI-native scaling & breakout startups (Poyar reports)
Claude Code, agentic engineering & context
- How Anthropic's Growth Marketing Team Cut Ad Creation Time From 30 Min to 30 Sec ·
- The Complete Guide to Building Skills for Claude ·
- Vibe coding is passé. Karpathy's "agentic engineering" ·
- Install the MKT1 MCP Server (Claude Code, Cowork, Plugins, Skills) ·
- How I do content engineering with Claude Code ·
- How Anthropic uses Claude (marketing — Austin Lau /rsa case) ·
Vibe coding, marketer-built tools, copilot ceiling
- I built 8 GTM diagnostic tools without writing code ·
- Cursor AI for Marketers: Build Tools Without Being a Developer ·
- Lovable vs Cursor: Which AI Builder Works Better ·
- Claude Code vs Cursor: Marketing Ops in 2026 ·
- From Content to Software: Cursor for high-margin marketing tools ·
- Anthropic's Own Marketing Team Hit a Ceiling With Claude (copilot ceiling) ·
- Getting paid to vibe code (Lovable's first vibe coder) ·
AI creative — image & video models
AI creative — platforms & infrastructure
AI creative — adoption & market data
- B2B 2026 Content Marketing Trends Research ·
- AI Marketing Statistics 2026 ·
- AI Ad Creative Benchmarks 2026 (CTR / ROAS data) ·
- Digital Advertising Statistics 2026 ·
- The AI Ad Gap Widens ·
- Gen AI in video ads ·
- AI generated advertising 2026 ·
- Generative AI market forecast ·
- Generative AI market size, trends, forecast ·
- Marketing agency cost-reduction case study (~$14K → ~$3.1K/month, 3–4× volume) ·
Solo founder economics & one-person companies
Pricing, packaging & monetization
Martech stack & RevOps: composable, MCP, agentic
- Stacks on a Plane: Reshaping martech on a universal data layer ·
- From apps to infrastructure: martech's most important migration ·
- Systems of context and systems of truth ·
- FAQ on martech: AI agents and composable stacks 2026 ·
- The MarTech Canvas: New Composable Architecture Standard ·
- Best MCP Server for Attio in 2026 ·
- HubSpot MCP Server: AI Agent Integration Guide ·
- RevOps Guide for B2B 2026: AI & Data ·
Channel selection & review sites
Affiliate & partnership platforms
AI risks — sycophancy, trendslop & homogenization
- Sycophantic AI decreases prosocial intentions and promotes dependence ·
- AI overly affirms users asking for personal advice (Stanford Report) ·
- How sycophancy shapes the reliability of LLMs (SycEval) ·
- Sycophancy in GPT-4o (postmortem) ·
- Social Sycophancy / ELEPHANT — "wait a minute" priming ·
- Researchers Asked LLMs for Strategic Advice. They Got "Trendslop" ·
- Generative AI enhances individual creativity but reduces collective novelty ·
- 53.7% of LinkedIn long-form posts flagged Likely AI ·
AI risks — hallucination, citations & package security
- Introducing the next generation of Vectara's hallucination leaderboard ·
- Introducing SimpleQA (and GPT-5 system-card data) ·
- GPT-5.5 tops benchmarks but still hallucinates frequently (AA-Omniscience) ·
- We Compared Eight AI Search Engines. They're All Bad at Citing News ·
- Why Language Models Hallucinate ·
- Fabrication and errors in bibliographic citations generated by ChatGPT ·
- Hallucination-Free? Reliability of leading AI legal research tools ·
- We Have a Package for You! GenAI's insecure package recommendations ·
- Diving deeper into AI package hallucinations (huggingface-cli case) ·
- AI Hallucination Cases (court tracker, >1,200 globally) ·
AI risks — context rot, productivity & cognitive offloading
- Context Rot: How input length affects LLM performance ·
- Lost in the Middle: How Language Models Use Long Contexts ·
- RULER: What's the Real Context Size of Your Long-Context LMs? ·
- Context rot is the primary failure mode for coding agents ·
- Measuring the impact of early-2025 AI on experienced OS developer productivity ·
- The Impact of Generative AI on Critical Thinking ·
- Your Brain on ChatGPT (EEG study) ·
AI risks — agent fiascos, prompt injection & cost runaways
AI risks — privacy, discoverability & bot liability
- Sensitive data flowing into AI tools (2026 AI Adoption & Risk Report) ·
- Updates to our consumer terms (training-data default) ·
- OpenAI loses privacy gambit; 20M ChatGPT logs to copyright case ·
- Moffatt v. Air Canada, 2024 BCCRT 149 ·
- NYC's MyCity AI chatbot: Mamdani targets it as "functionally unusable" ·
- ChatGPT defeats defamation lawsuit (Walters v. OpenAI) ·
- Krafton / Subnautica 2 Delaware Chancery opinion ·
- Deloitte refunds A$97K after AI hallucinations in Australian government report ·