SEO & AI Engine Optimization Framework · May 2026

Information Gain (Google Patent US 11,995,114 B2): the value of original measurement

Q: What is the Google Information Gain patent number and when was it granted?

The patent is US 11,995,114 B2, titled "Information Gain Score for Documents in a Document Repository." It was granted in May 2024 but filed years earlier, reflecting a system likely operational in Google's algorithms long before patent grant. The core insight: when many documents cover the same topic, Google can measure how much each one adds beyond the others, then prefer high-gain documents over low-gain ones that merely repeat available information.

Q: What categories of Information Gain can content contribute?

The framework defines ten categories: original research (the strongest), first-hand experience, synthesis across sources, contrarian analysis, specific examples and case studies, updated information, edge cases and failure documentation, connecting topics usually treated separately, methodology and process detail, and quantification of previously qualitative topics. Each article should contribute at least one type; articles that contribute nothing should not be published.

Q: How do I test whether my content actually contributes Information Gain?

Use the framework's validation tests. The Top 10 Test: read the top 10 results for your query and identify what your article adds beyond all of them. The Removal Test: if your article vanished, would readers notice a loss of specific knowledge? The Citation Test: over time, do other sites, AI engines, or journalists cite it? Also verify originality with Copyscape or Originality.ai (30%+ similarity is a red flag).

Q: Where does Information Gain matter most in 2026 search?

Three contexts. Standard search rankings, where Google uses it as a tiebreaker and increasingly a primary signal between competing pages. AI Overview source selection, where ChatGPT, Perplexity, Claude, Gemini, and Google's AI Overviews favor sources adding novel information to the synthesis. And featured snippet selection, where it influences which page wins position 0. Aggregation alone cannot win in any of these.

Q: What schema should I use for original research and first-hand experience articles?

For original research, use ScholarlyArticle schema with a mainEntity Dataset (including name, description, creator, and DataDownload distribution with a CSV contentUrl) to formalize the contribution. For first-hand experience, use Article schema with an isBasedOn CreativeWork describing the experience, for example "First-hand experience implementing this on {NUMBER} {CONTEXT}." Add a "What This Article Adds" callout and a "Cite This Research" block to make the gain legible and citable.

By Joseph W. Anady — Founder & Lead Engineer, ThatDevPro (BA Computer Engineering, MA Cybersecurity) · Updated May 2026

Information gain is a score, described in a Google patent granted in 2024, for how much new information a page adds beyond what a user has already seen. Practically, it rewards content that contributes original data, analysis, or first-hand experience and penalizes pages that just rehash what already ranks.

Google's Patented Signal for "What New Does This Content Add" — Originality, Novelty, and Differentiation

A comprehensive installation and audit reference for understanding Information Gain — the patented Google signal that measures how much novel, unique, or original information a page contributes beyond what's already available — and structurally engineering content to score high on this signal. This document is dual-purpose: installation manual and audit document.

Cross-stack implementation note: the code samples in this framework are written in plain HTML for clarity. For React, Vue, Svelte, Next.js, Nuxt, SvelteKit, Astro, Hugo, 11ty, Remix, WordPress, Shopify, and Webflow equivalents of every pattern below, see framework-cross-stack-implementation.md. For pure client-rendered SPAs (no SSR/SSG) see framework-react.md. For Tailwind-specific concerns (purge, dynamic classes, dark-mode CLS, focus accessibility) see framework-tailwind.md.

Quick answer

Information Gain is a score, described in Google patent US 11,995,114 B2 ("Information Gain Score for Documents in a Document Repository," granted May 2024), for how much new information a page adds beyond what a user has already seen. It rewards original data, analysis, or first-hand experience and penalizes pages that rehash what already ranks. To score high, contribute something only you can: original research, testing, contrarian analysis, or specific case examples.

1. Document Purpose & How to Use This Document

1.1 What This Document Is

This is the canonical reference for implementing Information Gain principles into content strategy. Information Gain is the principle (and patent) behind one of Google's most consequential ranking signals in 2026: the system that prefers content adding genuine novelty over content that re-states what's already widely available. Information Gain operates implicitly in nearly every algorithm core update of the past three years and explicitly in AI Overviews source selection.

This document specifies how to identify the Information Gain a page should contribute, how to structurally engineer that contribution into the content, and how to audit existing content for Information Gain — including the specific content elements, schema, and signals that demonstrate genuine novelty to Google's systems.

1.2 Three Operating Modes

Mode A — Install Mode: Building Information Gain into content strategy and individual articles. Follow Sections 2 → 14.

Mode B — Audit Mode: Evaluating existing content for Information Gain contribution. Skip to Section 11.

Mode C — Hybrid Mode: Audit then install for failing items.

1.3 How Claude Code CLI Should Consume This Document

Read Section 2 — collect client variables
Read Section 3 — understand the Information Gain patent and what Google measures
Apply Section 4 — categorize types of Information Gain a site can contribute
Install Sections 5-9 — per-article and site-wide patterns
Validate — Section 11
Generate report — Section 14

1.4 Conflict Resolution Rules

Conflict	Rule
Existing aggregator content with no original insight	Either deeply re-engineer with original contribution or remove. Aggregator content fails Information Gain by definition.
Existing content with original insight buried	Surface the original contribution prominently.
Existing content where original claims aren't substantiated	Either substantiate or soften claims. Unsubstantiated novelty fails.
Multiple articles competing on same topic	Consolidate to a single Information-Gain-rich page.

1.5 Required Tools

Google Search Console — for understanding what queries pages rank for and where Information Gain matters
Top 10 SERP analysis tools (Surfer, Frase, Clearscope) — to understand what's already covered for target queries
Original research tools — for conducting genuine new research (surveys, data analysis, testing)
Note-taking and observation tools — for capturing first-hand experience that becomes Information Gain
Plagiarism / similarity tools (Copyscape, Originality.ai) — to verify content isn't accidentally too similar to existing sources

2. Client Variables Intake

# ============================================
# INFORMATION GAIN FRAMEWORK CLIENT VARIABLES
# ============================================

# --- Business Identity (REQUIRED) ---
business_name: ""
primary_domain: ""
business_industry: ""

# --- Original Contribution Capacity (REQUIRED) ---
team_size: 0
domain_expertise_areas: []           # Topics where the team has genuine deep expertise
years_of_first_hand_experience: {}   # {topic: years}
data_collection_capabilities: []     # Surveys, analytics, testing labs, etc.
proprietary_data_sources: []         # Internal data the site has unique access to
unique_perspectives: []              # Angles or viewpoints distinctive to this site

# --- Content Inventory Information Gain Audit (REQUIRED for audit) ---
total_published_articles: 0
articles_with_clear_original_contribution: 0
articles_with_original_research: 0
articles_with_first_hand_experience: 0
articles_that_are_purely_aggregated: 0
articles_with_no_demonstrable_information_gain: 0

# --- Original Research Infrastructure (RECOMMENDED) ---
conducts_original_surveys: false
publishes_original_data_studies: false
performs_original_product_testing: false
operates_original_research_lab: false
has_unique_dataset_access: false
publishes_industry_benchmarks: false

# --- Differentiation Strategy (REQUIRED) ---
primary_differentiation: ""          # The one thing this site does that competitors don't
secondary_differentiations: []
content_uniqueness_check_in_workflow: false  # Pre-publish check for Information Gain
abandons_topics_without_clear_contribution: false  # Discipline to NOT publish on topics where you can't contribute novelty

# --- Citation & Attribution Strategy (RECOMMENDED) ---
gets_cited_by_other_authoritative_sources: false
ai_engines_cite_site_for_topics: []
research_pages_with_external_citations: 0
quoted_in_industry_publications: false

# --- Expertise Demonstration (REQUIRED) ---
has_credentialed_experts: false
publishes_first_hand_methodology_descriptions: false
documents_failures_and_edge_cases: false
shares_specific_case_data_anonymized: false

3. What Information Gain Is

Information Gain is the principle that documents adding novel, unique, or original information beyond what's already known about a topic should rank higher than documents merely repeating widely-available information. The principle was formalized in Google patent US 11,995,114 B2 — "Information Gain Score for Documents in a Document Repository" — granted in May 2024 but filed years earlier and reflecting a system likely operational in Google's algorithms long before patent grant.

The core insight of the patent: when many documents cover the same topic, Google can measure how much each document adds beyond the others. Documents with high Information Gain scores contribute novelty — original data, unique analysis, first-hand experience, contrarian findings, specific examples not available elsewhere. Documents with low Information Gain scores essentially repeat what other documents already say. The ranking system can then prefer high-gain documents over low-gain ones.

This isn't merely abstract theory. The patent describes practical mechanisms:

Comparison against established sources — the system compares each candidate document against a corpus of existing documents on the topic
Embedding similarity analysis — semantically similar content is detected even when wording differs
Specific contribution detection — the system identifies where a document adds new entities, claims, data points, or relationships not present in other sources
Temporal scoring — earlier-published novel content scores higher than later content covering the same novelty
Source authority weighting — novelty from authoritative sources weights higher than novelty from unestablished sources

In 2026, Information Gain operates as one of the most consequential ranking signals — particularly in three contexts:

1. Standard search rankings — When evaluating multiple competing pages for a query, Google uses Information Gain as a tiebreaker (and increasingly, primary signal) for which page deserves higher placement.

2. AI Overview source selection — When ChatGPT, Perplexity, Claude, and especially Gemini and Google's own AI Overviews choose sources to cite, Information Gain heavily influences which sources earn citation. AI engines look for sources adding novel information to the synthesis.

3. Featured snippet selection — Information Gain influences which page wins position 0.

The practical implication: in 2026, sites cannot win on aggregation. A "best 10 X tools" article that just describes what each tool's website already says won't rank — there are dozens like it, and none add Information Gain. To rank, content must contribute something. Original testing. Unique data. First-hand experience. Contrarian analysis. Specific case examples. Anything that, if removed, would represent an actual loss to the topic's collective knowledge.

This is uncomfortable for many SEO strategies built around "create comprehensive content on high-volume keywords" — comprehensiveness alone doesn't create Information Gain. Information Gain comes from the specific contributions a creator can make based on what they've genuinely done, observed, tested, or learned that others haven't.

The Information Gain framework, paired with HCS (framework-hcs.md) and the AI Citations framework (framework-aicitations.md), defines the new content strategy required to win in 2026 search.

4. Categories of Information Gain

Information Gain isn't a single thing — it's a category of contributions content can make. Categorize what types of gain your site can authentically provide.

4.1 Original Research

The strongest Information Gain comes from original research — surveys, experiments, data analysis, testing — that produces findings not available elsewhere.

Examples:

Annual industry surveys with hundreds or thousands of respondents
Lab testing of products with documented methodology
Analysis of proprietary data the site has unique access to
Experimental designs documented and replicated
Long-term studies of phenomena over time

When original research is the Information Gain, the page should:

Document methodology in detail
Provide raw data downloadable
Use Dataset schema
Cite limitations honestly
Be the canonical reference for the findings

4.2 First-Hand Experience

Detailed first-hand experience is Information Gain that competitors can't replicate without doing the same work.

Examples:

"I implemented this on 130 client sites and here's what failed"
"I worked at this company for 8 years and here's how the process actually worked"
"I lived in this region for a decade and here are the considerations I never see in tourist guides"
"I've practiced this medical specialty for 15 years and here are the patterns I've observed"
"I built this tool and here's why the obvious approach doesn't work"

When first-hand experience is the Information Gain, the page should:

Use first-person voice consistently
Include specific dates, numbers, names (anonymized where appropriate)
Document failure cases as well as successes
Include original photos/screenshots/artifacts
Be authored by the person with the experience (not ghost-written without disclosure)

4.3 Synthesis Across Sources

When multiple sources are widely-available individually but no one has connected them, synthesizing is Information Gain.

Examples:

"Here's what FDA guidance, peer-reviewed studies, and patient advocate organizations all say — and where they conflict"
"Combining what economists, historians, and demographers have said about this topic produces this pattern"
"Looking across these 5 industry reports, the meta-finding nobody has stated is..."

When synthesis is the Information Gain, the page should:

Cite all source materials
Make the synthesis itself the contribution (not just summarize each source)
Note where sources agree and disagree
Identify the meta-pattern that emerges from synthesis

4.4 Contrarian Analysis

When consensus exists on a topic but the consensus is wrong (or partially wrong), well-reasoned contrarian analysis is high-value Information Gain.

Examples:

"The conventional wisdom that X always works — here's when it doesn't"
"Most articles say Y is the best approach — based on testing, here's why Z is better"
"The industry treats P as obvious — but the data shows P is overstated"

When contrarian analysis is the Information Gain, the page should:

Acknowledge the prevailing view fairly before disagreeing
Provide specific evidence for the contrarian position
Be authored by someone with credible standing to disagree
Be open about limitations of the contrarian view
Avoid contrarianism for its own sake (must be genuinely informed, not provocative)

4.5 Specific Examples and Case Studies

Generic content covers the abstract; specific case examples are Information Gain.

Examples:

"I worked with [Client] who had [specific situation], we did [specific approach], and here are the specific results"
"Here's exactly what happened on [date] when we tried [approach]"
"This company implemented X, here are the actual numbers from their P&L 6 months later"

When specific examples are the Information Gain, the page should:

Anonymize where required (and explain why)
Include actual data, screenshots, artifacts
Document the specific steps taken
Acknowledge what was unique about this case vs. general application
Include enough detail that readers can identify whether their situation is similar

4.6 Updated Information

When existing content on a topic is outdated, providing current information is Information Gain — but only if the update is substantive.

Examples:

"Most articles say X tax rate; the rate changed in 2026, the new rate is Y"
"Industry-standard guidance was published in 2022; here's how the field has evolved through 2026"
"Software version 14 introduced changes that older articles don't reflect"

When updated information is the Information Gain, the page should:

Date-stamp specifically
Compare current state to prior state explicitly
Explain what changed and why
Acknowledge sources of the older guidance
Be maintained going forward (otherwise the update becomes outdated)

4.7 Edge Cases and Failure Documentation

Generic content covers the happy path; documenting edge cases and failures is Information Gain.

Examples:

"Here are the 7 ways this approach fails that no one warns you about"
"Standard advice says X works — except in these specific scenarios"
"Common pitfalls that cost me weeks before I figured them out"

When edge cases are the Information Gain, the page should:

Document specific scenarios where standard guidance breaks
Explain why the breakdown happens
Provide alternative approaches for the edge cases
Distinguish between rare edge cases and common pitfalls
Update as new edge cases are discovered

4.8 Connecting Topics

Sometimes Information Gain comes from connecting topics that are usually treated separately.

Examples:

"How [tax law change] interacts with [estate planning approach]"
"What [technology trend] means for [unrelated industry]"
"The connection between [historical pattern] and [current event]"

When topic connection is the Information Gain, the page should:

Establish each topic adequately (or link to authoritative sources for each)
Make the connection itself the central contribution
Provide evidence that the connection is real, not just speculative
Acknowledge limitations of the connection

4.9 Methodology and Process Detail

Generic content describes outcomes; documenting methodology in detail is Information Gain.

Examples:

"The exact prompt I used and why each phrase matters"
"Step-by-step process I follow, including the specific tools and configurations"
"The decision framework I use, with the specific criteria I weigh"

When methodology is the Information Gain, the page should:

Be specific enough that readers can replicate
Include configuration details, tool versions, settings
Document the reasoning behind each step
Show the artifacts of the process (screenshots, files, output)

4.10 Quantification of Previously Qualitative Topics

When industry discussion of a topic has been qualitative, quantifying it is Information Gain.

Examples:

"How long does X actually take? I tracked 50 instances and here are the numbers"
"Industry talks about Y as 'expensive' — here's the actual cost data across 30 cases"
"The question of Z gets debated qualitatively — here are the measurable outcomes"

When quantification is the Information Gain, the page should:

Document data collection methodology
Provide raw or summary data tables
Acknowledge sample limitations
Compare to existing qualitative claims

5. Per-Article Information Gain Implementation

Every article should contribute at least one type of Information Gain. Articles that contribute nothing should not be published.

5.1 Pre-Publish Information Gain Check

Before publishing any article, answer:

What does this article add that's not already in the top 10 results for the target query?
If this article didn't exist, what specific knowledge would the world be missing?
Which Information Gain category (Section 4) does this contribute to?
What specific evidence supports the contribution?
Could a competitor easily replicate this contribution? If yes, why is ours better?

If all answers are weak, the article is not ready to publish. Either strengthen the contribution or kill the article.

5.2 Information Gain Markers in Article Structure

Every article should make its Information Gain contribution visible. Suggested structural pattern:

<article class="information-gain-article">
  <!-- Standard header -->
  <header>
    <h1>{{TITLE_REFLECTING_THE_CONTRIBUTION}}</h1>
    {{BYLINE}}
    {{DATES}}
  </header>

  <!-- Information Gain marker — what this article adds -->
  <aside class="info-gain-marker" aria-label="What this article adds">
    <h2>What This Article Adds</h2>
    <p>{{ONE_PARAGRAPH_DESCRIPTION_OF_THE_GAIN_CONTRIBUTION}}</p>
    <ul>
      <li>{{SPECIFIC_CONTRIBUTION_1}}</li>
      <li>{{SPECIFIC_CONTRIBUTION_2}}</li>
      <li>{{SPECIFIC_CONTRIBUTION_3}}</li>
    </ul>
    <p>This article is based on: {{SOURCE_OF_GAIN_E.G._original_testing_OR_first_hand_experience_OR_proprietary_data}}.</p>
  </aside>

  <!-- Standard introduction -->
  <section class="introduction">
    {{INTRODUCTION_THAT_FRAMES_THE_TOPIC}}
  </section>

  <!-- Body content where the Information Gain lives -->
  <section class="content-body">
    {{SUBSTANTIVE_CONTENT_WITH_GAIN_CONTRIBUTION_THROUGHOUT}}
  </section>

  <!-- Methodology section if research-based -->
  {{IF_ORIGINAL_RESEARCH}}
  <section class="methodology">
    <h2>How I {{TESTED/RESEARCHED/STUDIED}} This</h2>
    {{DETAILED_METHODOLOGY}}
  </section>
  {{/IF}}

  <!-- Limitations section — honest about what's not covered -->
  <section class="limitations">
    <h2>Limitations</h2>
    <p>{{HONEST_ACKNOWLEDGMENT_OF_LIMITATIONS_OR_WHAT_THIS_DOESNT_COVER}}</p>
  </section>

  <!-- Standard footer -->
  <footer>
    {{REFERENCES}}
    {{AUTHOR_BOX}}
  </footer>
</article>

The "What This Article Adds" callout is critical — it makes the contribution legible to both human readers and AI systems evaluating the page.

5.3 Schema for Information Gain Content

For original research articles, use Dataset and ScholarlyArticle schemas to formalize the contribution:

<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "ScholarlyArticle",
  "headline": "{{TITLE}}",
  "author": {"@id": "https://{{PRIMARY_DOMAIN}}/authors/{{AUTHOR_SLUG}}/#person"},
  "datePublished": "{{PUBLISHED_DATE}}",
  "dateModified": "{{UPDATED_DATE}}",
  "publisher": {"@id": "https://{{PRIMARY_DOMAIN}}/#organization"},
  "about": {"@type": "Thing", "name": "{{PRIMARY_TOPIC}}"},
  "abstract": "{{2_SENTENCE_ABSTRACT}}",
  "creditText": "{{CREDIT_LINE}}",
  "isPartOf": {"@id": "https://{{PRIMARY_DOMAIN}}/#website"},
  "mainEntity": {
    "@type": "Dataset",
    "name": "{{DATASET_NAME}}",
    "description": "{{DATASET_DESCRIPTION}}",
    "creator": {"@id": "https://{{PRIMARY_DOMAIN}}/authors/{{AUTHOR_SLUG}}/#person"},
    "datePublished": "{{PUBLISHED_DATE}}",
    "license": "https://creativecommons.org/licenses/by/4.0/",
    "distribution": [
      {
        "@type": "DataDownload",
        "encodingFormat": "text/csv",
        "contentUrl": "{{CSV_DOWNLOAD_URL}}"
      }
    ]
  }
}
</script>

For first-hand experience articles, use Article schema with specific properties demonstrating the experience:

<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "Article",
  "headline": "{{TITLE}}",
  "author": {"@id": "https://{{PRIMARY_DOMAIN}}/authors/{{AUTHOR_SLUG}}/#person"},
  "datePublished": "{{PUBLISHED_DATE}}",
  "publisher": {"@id": "https://{{PRIMARY_DOMAIN}}/#organization"},
  "about": {"@type": "Thing", "name": "{{TOPIC}}"},
  "isBasedOn": {
    "@type": "CreativeWork",
    "description": "First-hand experience implementing this on {{NUMBER}} {{CONTEXT}}"
  }
}
</script>

5.4 Citation-Worthy Format

When the Information Gain is research or analysis worthy of citation, format the article to be citable:

<aside class="cite-this-research">
  <h3>Cite This Research</h3>
  <p>If you reference this work, please cite it as:</p>
  <pre id="citation-apa">{{LASTNAME}}, {{FIRST_INITIAL}}. ({{YEAR}}). {{TITLE}}. {{BUSINESS_NAME}}. {{URL}}</pre>
  <button onclick="navigator.clipboard.writeText(document.getElementById('citation-apa').textContent)">Copy citation</button>
</aside>

Citation-friendly content gets cited more, which compounds Information Gain (high-citation content has higher Information Gain weight in Google's evaluation).

5.5 Methodology Documentation

Original research articles must document methodology in detail. Template:

<section class="methodology">
  <h2>Methodology</h2>

  <h3>Research Question</h3>
  <p>{{SPECIFIC_QUESTION_THE_RESEARCH_ADDRESSED}}</p>

  <h3>Approach</h3>
  <p>{{HOW_THE_RESEARCH_WAS_CONDUCTED}}</p>

  <h3>Sample / Data Source</h3>
  <p>{{DESCRIPTION_OF_DATA_INCLUDING_SAMPLE_SIZE_AND_SOURCE}}</p>

  <h3>Time Period</h3>
  <p>{{WHEN_DATA_WAS_COLLECTED}}</p>

  <h3>Analysis Method</h3>
  <p>{{HOW_DATA_WAS_ANALYZED}}</p>

  <h3>Limitations</h3>
  <ul>
    <li>{{LIMITATION_1}}</li>
    <li>{{LIMITATION_2}}</li>
  </ul>

  <h3>Replication</h3>
  <p>{{HOW_OTHERS_COULD_REPLICATE_THIS_RESEARCH}}</p>
</section>

Detailed methodology demonstrates rigor, supports Information Gain claims, and provides material for citation by other researchers.

5.6 Failure and Edge Case Documentation

For experience-based articles, dedicate a section to what didn't work or edge cases:

<section class="failures-and-edge-cases">
  <h2>What Doesn't Work (or Where Standard Advice Fails)</h2>

  <h3>{{FAILURE_PATTERN_1}}</h3>
  <p>{{SPECIFIC_DESCRIPTION_INCLUDING_WHEN_AND_WHY}}</p>
  <p><strong>What to do instead:</strong> {{ALTERNATIVE_APPROACH}}</p>

  <h3>{{FAILURE_PATTERN_2}}</h3>
  <p>{{SPECIFIC_DESCRIPTION_INCLUDING_WHEN_AND_WHY}}</p>
  <p><strong>What to do instead:</strong> {{ALTERNATIVE_APPROACH}}</p>
</section>

This section is uniquely powerful for Information Gain because it requires actual experience that others can't easily replicate.

6. Site-Wide Information Gain Strategy

Beyond per-article work, the site needs an overall Information Gain posture.

6.1 Phase 1: Audit for Information Gain Distribution

Audit all existing content. For each article, classify:

Original contribution clear — Article makes a specific, identifiable Information Gain contribution
Contribution exists but buried — Original insight is present but not surfaced
No clear contribution — Article is essentially aggregated/derivative
Outdated even relative to derivative content — Article was once derivative; now it's stale derivative

Goal distribution:

60%+ articles with clear original contribution
<10% articles with no clear contribution

If existing distribution is worse, plan remediation: surface buried contribution, deepen weak articles, remove or consolidate articles with no contribution.

6.2 Phase 2: Develop Information Gain Capacity

Audit the site's capacity to produce Information Gain regularly:

Are credentialed experts producing content based on first-hand experience?
Is original research being conducted?
Does the team have access to proprietary data?
Are unique perspectives being articulated?
Is there a routine for collecting first-hand observations?

Common capacity-building actions:

For service businesses: Document client work as case studies. Aggregate patterns across clients into industry insights.

For SaaS companies: Use product analytics (with privacy consideration) to publish industry benchmarks.

For media sites: Conduct surveys, original investigations, and primary-source reporting.

For e-commerce: Test products genuinely. Document pros and cons honestly. Track customer outcomes.

For solo creators: Document personal experience in detail. Run experiments. Document failures.

6.3 Phase 3: Original Research Cadence

Establish a cadence for publishing original research:

Annual industry survey (timed for peak interest in your industry)
Quarterly data study from proprietary or aggregated public data
Monthly case study from client/customer work (anonymized)
Ongoing first-hand experience documentation

Publishing original research at cadence builds expectation, attracts citations, and demonstrates ongoing Information Gain capacity.

6.4 Phase 4: Differentiation Strategy

Identify what your site does that others don't:

Unique data access: What data do you have that competitors don't?
Unique experience: What have you done that competitors haven't?
Unique perspective: What angle do you take that competitors don't?
Unique methodology: How do you approach topics differently?
Unique audience: Who do you serve that others miss?
Unique combination: What expertise areas combine in this site uniquely?

Document differentiation publicly:

About page describes what makes the site distinctive
Topical hub pages explain the unique perspective taken on each pillar
Author bios articulate unique experience and credentials

6.5 Phase 5: Topic Discipline

The hardest discipline: don't publish on topics where you can't contribute Information Gain.

Maintain a "topics we don't cover" list publicly:

<section class="topics-we-dont-cover">
  <h2>Topics We Don't Cover</h2>
  <p>We cover {{TOPICS}} deeply because that's where we have genuine expertise and original perspective. We don't cover:</p>
  <ul>
    <li><strong>{{ADJACENT_TOPIC_1}}</strong> — for that, we recommend {{TRUSTED_SOURCE_1}}</li>
    <li><strong>{{ADJACENT_TOPIC_2}}</strong> — for that, we recommend {{TRUSTED_SOURCE_2}}</li>
  </ul>
</section>

This section signals editorial integrity and prevents diluting the site's Information Gain by publishing weak content on adjacent topics.

6.6 Phase 6: Citation-Earning Strategy

Earn citations by being cite-worthy:

Publish data others want to reference
Format claims to be quotable
Include "cite this" buttons on research pages
Reach out to relevant journalists when research is published
Build relationships with other authoritative sources who cite each other
Make it easy to cite (clear data, downloadable, properly formatted)

When earned media cites the site, it amplifies Information Gain signal — citation is external validation that the contribution is novel and valuable.

7. Information Gain Validation

How to test whether content actually contributes Information Gain.

7.1 The Top 10 Test

For an article on a target query:

Search Google for the target query in incognito mode
Read the top 10 results
Note what each result covers
Identify what your article adds beyond all 10

Specific questions:

Does your article have specific data, examples, or insights none of the top 10 have?
Does your article cover edge cases none of the top 10 cover?
Does your article take a perspective none of the top 10 take?
Could a reader who's read all 10 still benefit from yours?

If you can articulate the unique contribution clearly, the article passes Information Gain.

If you can't articulate it, either don't publish, or rework until you can.

7.2 The Removal Test

Imagine your article is removed from the internet entirely.

Would readers seeking this topic notice the loss?
Is there any specific knowledge that would now be unavailable?
Would competitors miss something they currently can't get elsewhere?

If the answer to all is "no," the article doesn't contribute Information Gain.

7.3 The Citation Test

Over time, does the article earn citations?

Are other sites linking to it as a reference?
Are AI engines citing it?
Are journalists quoting from it?
Are practitioners referencing it?

Citation-earning content has clear Information Gain. Content that doesn't earn citations either failed to articulate its gain (fixable) or didn't have gain to begin with (not fixable through SEO tactics).

7.4 Originality Verification

Use plagiarism/similarity detection (Copyscape, Originality.ai) to verify content isn't accidentally too similar to existing sources. If 30%+ similarity to any existing source on a topic appears, that's a red flag for Information Gain.

But: low similarity scores don't prove Information Gain. Content can be unique in wording while contributing nothing new. Information Gain is about substantive contribution, not just word uniqueness.

7.5 Self-Honest Evaluation

Ask the author/team honestly:

Why did we publish this article?
What did we know that others didn't?
Would we have published this if Google didn't exist?
Do we have evidence supporting our specific claims?

If answers are weak, the article likely doesn't contribute Information Gain regardless of word count or formatting.

8. Common Mistakes & Anti-Patterns

8.1 Aggregator Articles

Anti-pattern: "10 best X tools" articles that just describe what each tool's website says.

Why it fails: No Information Gain. Anyone could write this from public information.

Fix: Actually use the tools. Document specific use cases. Compare on dimensions the tool websites don't address. Provide testing data.

8.2 SEO-Optimized Without Substance

Anti-pattern: Articles structured perfectly for SEO (length, keywords, headers) but adding nothing substantively new.

Why it fails: Google's Information Gain evaluation sees through structural optimization without substantive contribution.

Fix: Substance first, structure second. If you can't articulate the contribution, no structural optimization saves the article.

8.3 Generic First-Person Without Specifics

Anti-pattern: "In my experience, X is best" without any specific data, dates, examples, or detail.

Why it fails: Generic experience claims don't function as Information Gain. They're indistinguishable from fabricated experience.

Fix: Specific dates, specific cases, specific data, specific methodology. Detail demonstrates real experience.

8.4 Synthetic Surveys

Anti-pattern: "We surveyed 1000 marketers" — but the survey was poorly designed, run on a small biased panel, or fabricated entirely.

Why it fails: Once detected, ruins credibility and does long-term damage.

Fix: Real research with real methodology. Document sample acquisition. Acknowledge limitations. Quality > quantity in research.

8.5 Stale Original Research

Anti-pattern: Publishing one piece of original research, then never updating it as the topic evolves.

Why it fails: Original research becomes derivative as time passes. Without updates, the Information Gain decays.

Fix: Refresh research annually or as the topic warrants. Make clear what's been updated and when.

8.6 Duplicate Internal Coverage

Anti-pattern: Multiple articles on the same site covering the same topic with slight variations, none of them strong contributors.

Why it fails: Self-cannibalization. Each article dilutes the others. None achieves strong Information Gain.

Fix: Consolidate to one strong article per topic. 301 redirect duplicates.

8.7 Information Gain Buried

Anti-pattern: Article does have original contribution, but it's buried in section 7 of an 8-section article.

Why it fails: Both human readers and Google's algorithms may not surface the contribution. Information Gain isn't just present — it must be visible.

Fix: Surface the contribution prominently. Lead with what's new. Use callouts.

8.8 Citation Without Substance

Anti-pattern: Articles that cite many sources but don't synthesize or contribute. Just an annotated bibliography.

Why it fails: Citing existing work doesn't contribute Information Gain unless you're synthesizing or building on it.

Fix: Make synthesis or original contribution the primary content. Citations support the contribution, don't replace it.

8.9 AI-Generated "Insights"

Anti-pattern: Asking AI to generate "novel insights" on a topic. AI can't have novel insights — only synthesize existing material.

Why it fails: AI-generated "insights" tend to be obvious patterns dressed up as novelty. Detection is improving. The content adds nothing genuinely new.

Fix: Use AI for research assistance, drafting, editing — not for the actual Information Gain contribution. The contribution must come from human experience, observation, or research.

8.10 Over-Hedging Original Claims

Anti-pattern: Making original claims so heavily hedged ("This might suggest, in some cases, possibly...") that the claim becomes meaningless.

Why it fails: Strong claims without evidence are bad; over-hedged claims are also bad. The hedge undermines the contribution.

Fix: Make specific, supportable claims with appropriate caveats. "Based on testing 47 instances over 6 months, we found X happens in approximately 70% of cases when Y conditions apply."

9. Information Gain Stack-Specific Notes

9.1 WordPress

Custom fields per article: "Information Gain Type," "Source of Gain," "Specific Contribution"
Editorial workflow requires Information Gain identification before publish
Plugin to surface "What this article adds" callout consistently
Custom post type for original research that includes downloadable data

9.2 Next.js / Astro / Hugo

TypeScript schema for content frontmatter requiring Information Gain fields
Pre-publish validation that Information Gain markers are present
Component for "What this article adds" callout
Dataset schema generated from research article frontmatter

9.3 Universal Pattern

Regardless of stack:

Define Information Gain types in CMS taxonomy
Require Information Gain articulation before publishing
Surface Information Gain markers in templates
Track citations and references over time
Audit periodically for distribution of Information Gain types

10. Cross-Reference to the 14-Tier Framework

Information Gain implementation touches:

Tier 2 CBO — Content Brief Optimization should require Information Gain identification at brief stage
Tier 2 ETO — E-E-A-T Trust Optimization is foundational; Experience pillar is Information Gain through first-hand involvement
Tier 3 LLMO — LLM Optimization rewards Information Gain via citation worthiness
Tier 3 SGA — SearchGPT Optimization specifically rewards Information Gain
Tier 4 TLO — Thought Leadership Optimization is Information Gain at scale
Tier 6 RSO — Research/Studies Optimization is original research as Information Gain
Tier 6 OCO — Original Content Optimization
Tier 7 OPO — Original Perspective Optimization

Information Gain is the unifying principle across multiple tiers. When the site struggles with Information Gain, multiple tiers will show weaknesses simultaneously.

11. Audit Mode

11.1 Per-Article Audit

For sample articles, score:

#	Criterion	Pass/Fail
IG1	Article has clearly identifiable Information Gain contribution
IG2	Information Gain is surfaced prominently (not buried)
IG3	Specific evidence supports the contribution
IG4	Contribution category is identifiable (research/experience/synthesis/etc.)
IG5	Article passes the "Top 10 Test"
IG6	Article passes the "Removal Test"
IG7	Methodology documented if research-based
IG8	First-person specifics if experience-based
IG9	Contribution couldn't be easily replicated by competitors
IG10	Article has earned citations or shows citation potential

Score per article: 10. World-class IG article: 9+/10.

11.2 Site-Wide Audit

#	Criterion	Pass/Fail
IGS1	60%+ of articles have clear Information Gain contribution
IGS2	<10% of articles are pure aggregators with no contribution
IGS3	Site has documented Information Gain capacity (research, experience, data access)
IGS4	Original research published on regular cadence
IGS5	Differentiation strategy documented and visible
IGS6	Topical discipline maintained (don't publish where can't contribute)
IGS7	Citation infrastructure in place (cite-this buttons, formal citations)
IGS8	"What this article adds" markers used consistently
IGS9	Citation-worthy content earns external citations
IGS10	Pre-publish Information Gain check is part of editorial workflow

Site score: 10. World-class IG site: 9+/10.

12. Maintenance Schedule

12.1 Weekly

Review newly published articles for Information Gain markers
Track new external citations to existing research

12.2 Monthly

Audit one piece of older research for refresh need
Identify articles where IG is buried; surface it
Plan next original research project

12.3 Quarterly

Site-wide IG distribution audit
Citation tracking review
Research cadence review — is original research being published as planned?
Topical discipline review — are we publishing on topics where we contribute IG?

12.4 Annually

Comprehensive IG framework audit
Strategic review: what new IG capacity should we build?
Refresh original research that's now 12+ months old
Update differentiation strategy based on competitive evolution

13. Implementation/Audit Report Templates

13.1 IG Implementation Report Template

# Information Gain Framework Implementation Report

**Site**: {{BUSINESS_NAME}}
**Implementation Date**: {{TODAY}}

## Summary
- Articles audited: {{COUNT}}
- Articles with clear IG contribution: {{PERCENTAGE}}%
- Articles requiring IG enhancement: {{COUNT}}
- Articles removed/consolidated due to no IG: {{COUNT}}

## IG Capacity Assessment
{{CAPACITY_FOR_RESEARCH_EXPERIENCE_DATA_ETC}}

## IG Cadence Established
{{RESEARCH_PUBLICATION_SCHEDULE}}

## Differentiation Strategy Documented
{{LIST_OF_DIFFERENTIATIONS}}

## Pre-Publish Workflow Updated
{{WORKFLOW_DESCRIPTION}}

## Sign-Off

13.2 IG Audit Report Template

# Information Gain Audit Report

**Site**: {{BUSINESS_NAME}}
**Audit Date**: {{TODAY}}

## Executive Summary
{{SUMMARY}}

## Site IG Score
{{X}}/10

## Per-Article Sample Scores
{{TABLE_OF_SAMPLED_ARTICLES_WITH_SCORES}}

## IG Distribution
- Original research: {{COUNT}} articles
- First-hand experience: {{COUNT}} articles
- Synthesis: {{COUNT}} articles
- Contrarian analysis: {{COUNT}} articles
- Specific case studies: {{COUNT}} articles
- Aggregator (no IG): {{COUNT}} articles

## Critical Findings
{{LIST}}

## Recommended Remediation
{{PRIORITIZED_LIST}}

## Sign-Off

End of Framework Document

Document version: 1.0 Last updated: 2026-04-29 Maintained by: ThatDeveloperGuy

Information Gain is the principle that distinguishes content that earns its rankings from content that just occupies them. In 2026, with the volume of available content multiplied by AI, only content that adds genuine novelty earns sustainable visibility. The frameworks for E-E-A-T, HCS, YMYL, and SQRG all converge on Information Gain — they are different angles on the same fundamental requirement: contribute something only you can contribute.

Companion documents:

framework-eeat.md
framework-ymyl.md
framework-hcs.md
framework-sqrg.md
framework-coreupdates.md
framework-entitysalience.md
framework-knowledgegraph.md
framework-aicitations.md

Frequently asked questions

What is the Google Information Gain patent number and when was it granted?

The patent is US 11,995,114 B2, titled "Information Gain Score for Documents in a Document Repository." It was granted in May 2024 but filed years earlier, reflecting a system likely operational in Google's algorithms long before patent grant. The core insight: when many documents cover the same topic, Google can measure how much each one adds beyond the others, then prefer high-gain documents over low-gain ones that merely repeat available information.

What categories of Information Gain can content contribute?

The framework defines ten categories: original research (the strongest), first-hand experience, synthesis across sources, contrarian analysis, specific examples and case studies, updated information, edge cases and failure documentation, connecting topics usually treated separately, methodology and process detail, and quantification of previously qualitative topics. Each article should contribute at least one type; articles that contribute nothing should not be published.

How do I test whether my content actually contributes Information Gain?

Use the framework's validation tests. The Top 10 Test: read the top 10 results for your query and identify what your article adds beyond all of them. The Removal Test: if your article vanished, would readers notice a loss of specific knowledge? The Citation Test: over time, do other sites, AI engines, or journalists cite it? Also verify originality with Copyscape or Originality.ai (30%+ similarity is a red flag).

Where does Information Gain matter most in 2026 search?

Three contexts. Standard search rankings, where Google uses it as a tiebreaker and increasingly a primary signal between competing pages. AI Overview source selection, where ChatGPT, Perplexity, Claude, Gemini, and Google's AI Overviews favor sources adding novel information to the synthesis. And featured snippet selection, where it influences which page wins position 0. Aggregation alone cannot win in any of these.

What schema should I use for original research and first-hand experience articles?

For original research, use ScholarlyArticle schema with a mainEntity Dataset (including name, description, creator, and DataDownload distribution with a CSV contentUrl) to formalize the contribution. For first-hand experience, use Article schema with an isBasedOn CreativeWork describing the experience, for example "First-hand experience implementing this on {NUMBER} {CONTEXT}." Add a "What This Article Adds" callout and a "Cite This Research" block to make the gain legible and citable.

Want this framework implemented on your site?

ThatDevPro ships these frameworks as productized services. SDVOSB-certified veteran owned. Cassville, Missouri.

See Engine Optimization service ›