╭──────────────────────────────────────────────────────────╮
    │ WEBFLOW ANALYZER LINEAGE                                │
    │                                                          │
    │ Jan        Feb         Mar                Apr            │
    │ detect  →  extract  →  govern         →  productize     │
    │ experiment   MCP         policy +         reviewer +     │
    │ origin        server      review ops       creator help  │
    │                                                          │
    │ The system got stronger as its boundaries got clearer.   │
    ╰──────────────────────────────────────────────────────────╯
research

Webflow Analyzer Lineage: From Detection to Governed Review

This experiment reconstructs the Webflow analyzer lineage across January to April 2026. What begins as plagiarism detection expands into browser-backed MCP review, policy snapshots, remote reviewer operations, and finally creator-facing validation and autofill flows. The result is a system story, not a feature list.

Flow model

The analyzer story is a lineage, not a feature list.

Detection, extraction, governed review, and creator assistance become one system when the boundary history is visible.

Detect

Plagiarism analysis starts the lineage.

Extract

MCP turns evidence into tools.

Govern

Policy and review ops add boundaries.

Productize

Creator assistance inherits the system.

╔══════════════════════════════════════════════════════════════════╗
║  WEBFLOW ANALYZER LINEAGE                         APR 2026       ║
║  ────────────────────────────────────────────────────────────    ║
║                                                                  ║
║  Jan 31         Feb 6          Mar 27           Apr 21-23        ║
║  Experiment  →  MCP Server  →  Review Ops   →  Creator Autofill ║
║  detection      extraction      queue +         validation +     ║
║  problem        + Designer      feedback        submission UX     ║
║                  state                                            ║
║                                                                  ║
║  The arc is not "better crawling."                               ║
║  The arc is "review became a system."                            ║
╚══════════════════════════════════════════════════════════════════╝

Hypothesis

If the Webflow analyzer history is read as a sequence of system decisions instead of a pile of features, a clear pattern appears:

the project moved from narrow analysis to governed review and then to creator-facing workflow assistance.

That is the lineage worth documenting.

Why This Experiment Exists

The repository already contains strong analyzer artifacts:

  • the original plagiarism-detection experiment
  • the analyzer MCP package
  • the policy-grounded review architecture paper
  • the validator and submission surfaces

What it did not yet have was one artifact that explains how those pieces fit together historically.

This experiment fills that gap by reading the git history itself as evidence.

Timeline

Phase 1: Start with a narrow analysis problem

On January 31, 2026, the repository added:

  • packages/io/content/experiments/webflow-plagiarism-detection.md

That experiment framed the problem as template similarity and originality detection:

  • fingerprint large sets of templates
  • compare structure and semantics efficiently
  • lower review cost
  • expose the analysis through MCP tools

This phase matters because it established the first durable insight:

Webflow review work benefits from structured tooling, not only ad hoc human inspection.

But it was still a narrow view of the task. It solved a detection problem, not a whole review workflow.

Phase 2: Turn analysis into browser-backed infrastructure

On February 6, 2026, the repository added:

  • packages/webflow-site-analyzer-mcp/

This is the moment the work stopped being only a research experiment and became infrastructure.

The new package introduced:

  • browser-backed extraction
  • published-site checks
  • Webflow Designer metadata extraction
  • observability hooks
  • versioned scripts
  • a self-improvement loop

The key change was conceptual:

the analyzer no longer treated "the website" as a single truth surface.

It started distinguishing between:

  • published pages as runtime evidence
  • Designer state as authoring evidence

That distinction is what made later review automation possible.

Phase 3: Review becomes policy-grounded

On February 18, 2026, the repository added the canonical policy context pipeline.

This introduced a third truth surface:

  • external review policy

From this point forward, the system was no longer only asking:

"What is on the site?"

It could also ask:

"What policy version are we reviewing against?"

Then on March 2, 2026, the repository shows a cluster of review-oriented work:

  • checklist coverage analysis
  • human-vs-automated boundary analysis
  • Archipro review runs
  • MCP coverage validation

This phase hardened the core idea:

Review is not just extraction. Review is evidence joined to policy, with explicit manual boundaries.

Phase 4: Operators need workflows, not demos

On March 27, 2026, the repository added remote deployment and template review operations work:

  • queued jobs
  • remote execution
  • fallback paths
  • Airtable feedback loops
  • broader production checklist coverage

This is the point where the analyzer became an operating surface for reviewers.

The system now had production-shaped concerns:

  • bounded concurrency
  • progress phases
  • domain fallback
  • review job state
  • feedback collection

That is a different class of software from a local analyzer demo.

Phase 5: Public articulation and distribution

On April 13, 2026, two things happened that matter together.

First, the repo published:

  • packages/io/content/papers/analyzer-mcp-review-architecture.md

That paper gave the architecture a public thesis:

the analyzer is a policy-grounded review system, not merely a crawler.

Second, the same date brought major analyzer hardening and distribution work:

  • more checklist coverage
  • Worker deployment
  • MCP hub connectivity
  • workflow prompts and tool exposure
  • template name validation

This was the distribution moment:

the analyzer was no longer only local infrastructure. It became a networked review service.

Phase 6: The analyzer moves upstream into submission

The most important late-stage shift happened on April 21-23, 2026.

The repository added analyzer autofill and validation flows to:

  • apps/webflow-dashboard-cloud
  • apps/marketplace-template-submission-cloud

Alongside that, the repository also carried a creator-facing validator and analyzer branch:

  • packages/webflow-template-validation
  • packages/webflow-template-analyzer

This phase is easy to underread if you only look at individual commits.

What actually happened is that analyzer capability moved upstream in the workflow:

  • from reviewer inspection
  • to creator pre-checks
  • to autofill of submission data
  • to screenshot packaging
  • to clearer validation summaries
  • to submission success handling

The system started helping creators before formal review.

That is productization, not just feature growth.

What the History Shows

The history does not describe one static tool.

It describes an analyzer family with different trust boundaries:

  1. Reviewer-side MCP analysis

    • deep evidence gathering
    • policy-linked review output
    • manual states preserved
  2. Creator-side validation

    • bounded checks
    • actionable guidance
    • lower ambiguity before handoff
  3. Creator-side autofill

    • extracted facts become filled fields
    • screenshots become downloadable assets
    • the submission form becomes easier to complete

That split is not duplication. It is the workflow maturing.

The Core Insight

The Webflow analyzer became strategically interesting when it stopped being "a smart crawler."

It became interesting when it learned to respect three boundaries:

  • evidence boundaries
  • policy boundaries
  • role boundaries

Those boundaries are what allowed one lineage to support both reviewers and creators without pretending those users need the same thing.

What This Proves

This lineage review supports five claims:

  1. The analyzer story is coherent when read across January to April 2026, not commit-by-commit.
  2. Policy ingestion was a turning point, not a small add-on.
  3. Remote operations and queued review jobs marked the transition from demo to system.
  4. Productization happened when analyzer output was translated into creator-facing validation and autofill.
  5. The strongest framing for the series is governed review, not generic site analysis.

What This Does Not Prove

This experiment does not prove:

  • that every analyzer package should collapse into one deploy surface
  • that reviewer and creator tools should share the same UI or permission boundary
  • that automation should replace final reviewer judgment

The history points in the opposite direction:

the system gets stronger as its boundaries become clearer.

Reproducibility

To reconstruct the same lineage, review these commits in order:

  • 52878ccf — original experiment
  • bc8b5a9e — analyzer MCP package
  • c574e212 — policy context pipeline
  • a7758624 plus the 2026-03-02 review-analysis cluster
  • 1fa19860 — remote review ops
  • 963f0487 — architecture paper
  • 919aefb0 — analyzer hardening and distribution
  • 7033af53 — validator import
  • 95921a7e and c41e2988 — dashboard/submission autofill

Primary files:

  • packages/io/content/experiments/webflow-plagiarism-detection.md
  • packages/webflow-site-analyzer-mcp/README.md
  • packages/io/content/papers/analyzer-mcp-review-architecture.md
  • packages/webflow-template-validation/README.md
  • packages/webflow-template-analyzer/cloudflare/README.md
  • apps/marketplace-template-submission-cloud/README.md

Conclusion

The git history tells a stronger story than "we kept improving an analyzer."

It shows a system learning what review actually requires:

  • more than one truth surface
  • explicit policy provenance
  • preserved manual states
  • operational execution paths
  • and finally, creator-facing workflow help

That is why the next series should treat the analyzer as a lineage:

from detection, to governed review, to submission copilot.