Information Gain (Google Patent US 11,995,114 B2): the value of original measurement
A comprehensive installation and audit reference for understanding Information Gain — the patented Google signal that measures how much novel, unique, or original information a page contributes…
Google's Patented Signal for "What New Does This Content Add" — Originality, Novelty, and Differentiation
A comprehensive installation and audit reference for understanding Information Gain — the patented Google signal that measures how much novel, unique, or original information a page contributes beyond what's already available — and structurally engineering content to score high on this signal. This document is dual-purpose: installation manual and audit document.
Cross-stack implementation note: the code samples in this framework are written in plain HTML for clarity. For React, Vue, Svelte, Next.js, Nuxt, SvelteKit, Astro, Hugo, 11ty, Remix, WordPress, Shopify, and Webflow equivalents of every pattern below, see
framework-cross-stack-implementation.md. For pure client-rendered SPAs (no SSR/SSG) seeframework-react.md. For Tailwind-specific concerns (purge, dynamic classes, dark-mode CLS, focus accessibility) seeframework-tailwind.md.
1. Document Purpose & How to Use This Document
1.1 What This Document Is
This is the canonical reference for implementing Information Gain principles into content strategy. Information Gain is the principle (and patent) behind one of Google's most consequential ranking signals in 2026: the system that prefers content adding genuine novelty over content that re-states what's already widely available. Information Gain operates implicitly in nearly every algorithm core update of the past three years and explicitly in AI Overviews source selection.
This document specifies how to identify the Information Gain a page should contribute, how to structurally engineer that contribution into the content, and how to audit existing content for Information Gain — including the specific content elements, schema, and signals that demonstrate genuine novelty to Google's systems.
1.2 Three Operating Modes
Mode A — Install Mode: Building Information Gain into content strategy and individual articles. Follow Sections 2 → 14.
Mode B — Audit Mode: Evaluating existing content for Information Gain contribution. Skip to Section 11.
Mode C — Hybrid Mode: Audit then install for failing items.
1.3 How Claude Code CLI Should Consume This Document
- Read Section 2 — collect client variables
- Read Section 3 — understand the Information Gain patent and what Google measures
- Apply Section 4 — categorize types of Information Gain a site can contribute
- Install Sections 5-9 — per-article and site-wide patterns
- Validate — Section 11
- Generate report — Section 14
1.4 Conflict Resolution Rules
| Conflict | Rule |
|---|---|
| Existing aggregator content with no original insight | Either deeply re-engineer with original contribution or remove. Aggregator content fails Information Gain by definition. |
| Existing content with original insight buried | Surface the original contribution prominently. |
| Existing content where original claims aren't substantiated | Either substantiate or soften claims. Unsubstantiated novelty fails. |
| Multiple articles competing on same topic | Consolidate to a single Information-Gain-rich page. |
1.5 Required Tools
- Google Search Console — for understanding what queries pages rank for and where Information Gain matters
- Top 10 SERP analysis tools (Surfer, Frase, Clearscope) — to understand what's already covered for target queries
- Original research tools — for conducting genuine new research (surveys, data analysis, testing)
- Note-taking and observation tools — for capturing first-hand experience that becomes Information Gain
- Plagiarism / similarity tools (Copyscape, Originality.ai) — to verify content isn't accidentally too similar to existing sources
2. Client Variables Intake
# ============================================
# INFORMATION GAIN FRAMEWORK CLIENT VARIABLES
# ============================================
# --- Business Identity (REQUIRED) ---
business_name: ""
primary_domain: ""
business_industry: ""
# --- Original Contribution Capacity (REQUIRED) ---
team_size: 0
domain_expertise_areas: [] # Topics where the team has genuine deep expertise
years_of_first_hand_experience: {} # {topic: years}
data_collection_capabilities: [] # Surveys, analytics, testing labs, etc.
proprietary_data_sources: [] # Internal data the site has unique access to
unique_perspectives: [] # Angles or viewpoints distinctive to this site
# --- Content Inventory Information Gain Audit (REQUIRED for audit) ---
total_published_articles: 0
articles_with_clear_original_contribution: 0
articles_with_original_research: 0
articles_with_first_hand_experience: 0
articles_that_are_purely_aggregated: 0
articles_with_no_demonstrable_information_gain: 0
# --- Original Research Infrastructure (RECOMMENDED) ---
conducts_original_surveys: false
publishes_original_data_studies: false
performs_original_product_testing: false
operates_original_research_lab: false
has_unique_dataset_access: false
publishes_industry_benchmarks: false
# --- Differentiation Strategy (REQUIRED) ---
primary_differentiation: "" # The one thing this site does that competitors don't
secondary_differentiations: []
content_uniqueness_check_in_workflow: false # Pre-publish check for Information Gain
abandons_topics_without_clear_contribution: false # Discipline to NOT publish on topics where you can't contribute novelty
# --- Citation & Attribution Strategy (RECOMMENDED) ---
gets_cited_by_other_authoritative_sources: false
ai_engines_cite_site_for_topics: []
research_pages_with_external_citations: 0
quoted_in_industry_publications: false
# --- Expertise Demonstration (REQUIRED) ---
has_credentialed_experts: false
publishes_first_hand_methodology_descriptions: false
documents_failures_and_edge_cases: false
shares_specific_case_data_anonymized: false
3. What Information Gain Is
Information Gain is the principle that documents adding novel, unique, or original information beyond what's already known about a topic should rank higher than documents merely repeating widely-available information. The principle was formalized in Google patent US 11,995,114 B2 — "Information Gain Score for Documents in a Document Repository" — granted in May 2024 but filed years earlier and reflecting a system likely operational in Google's algorithms long before patent grant.
The core insight of the patent: when many documents cover the same topic, Google can measure how much each document adds beyond the others. Documents with high Information Gain scores contribute novelty — original data, unique analysis, first-hand experience, contrarian findings, specific examples not available elsewhere. Documents with low Information Gain scores essentially repeat what other documents already say. The ranking system can then prefer high-gain documents over low-gain ones.
This isn't merely abstract theory. The patent describes practical mechanisms:
- Comparison against established sources — the system compares each candidate document against a corpus of existing documents on the topic
- Embedding similarity analysis — semantically similar content is detected even when wording differs
- Specific contribution detection — the system identifies where a document adds new entities, claims, data points, or relationships not present in other sources
- Temporal scoring — earlier-published novel content scores higher than later content covering the same novelty
- Source authority weighting — novelty from authoritative sources weights higher than novelty from unestablished sources
In 2026, Information Gain operates as one of the most consequential ranking signals — particularly in three contexts:
1. Standard search rankings — When evaluating multiple competing pages for a query, Google uses Information Gain as a tiebreaker (and increasingly, primary signal) for which page deserves higher placement.
2. AI Overview source selection — When ChatGPT, Perplexity, Claude, and especially Gemini and Google's own AI Overviews choose sources to cite, Information Gain heavily influences which sources earn citation. AI engines look for sources adding novel information to the synthesis.
3. Featured snippet selection — Information Gain influences which page wins position 0.
The practical implication: in 2026, sites cannot win on aggregation. A "best 10 X tools" article that just describes what each tool's website already says won't rank — there are dozens like it, and none add Information Gain. To rank, content must contribute something. Original testing. Unique data. First-hand experience. Contrarian analysis. Specific case examples. Anything that, if removed, would represent an actual loss to the topic's collective knowledge.
This is uncomfortable for many SEO strategies built around "create comprehensive content on high-volume keywords" — comprehensiveness alone doesn't create Information Gain. Information Gain comes from the specific contributions a creator can make based on what they've genuinely done, observed, tested, or learned that others haven't.
The Information Gain framework, paired with HCS (framework-hcs.md) and the AI Citations framework (framework-aicitations.md), defines the new content strategy required to win in 2026 search.
4. Categories of Information Gain
Information Gain isn't a single thing — it's a category of contributions content can make. Categorize what types of gain your site can authentically provide.
4.1 Original Research
The strongest Information Gain comes from original research — surveys, experiments, data analysis, testing — that produces findings not available elsewhere.
Examples:
- Annual industry surveys with hundreds or thousands of respondents
- Lab testing of products with documented methodology
- Analysis of proprietary data the site has unique access to
- Experimental designs documented and replicated
- Long-term studies of phenomena over time
When original research is the Information Gain, the page should:
- Document methodology in detail
- Provide raw data downloadable
- Use Dataset schema
- Cite limitations honestly
- Be the canonical reference for the findings
4.2 First-Hand Experience
Detailed first-hand experience is Information Gain that competitors can't replicate without doing the same work.
Examples:
- "I implemented this on 130 client sites and here's what failed"
- "I worked at this company for 8 years and here's how the process actually worked"
- "I lived in this region for a decade and here are the considerations I never see in tourist guides"
- "I've practiced this medical specialty for 15 years and here are the patterns I've observed"
- "I built this tool and here's why the obvious approach doesn't work"
When first-hand experience is the Information Gain, the page should:
- Use first-person voice consistently
- Include specific dates, numbers, names (anonymized where appropriate)
- Document failure cases as well as successes
- Include original photos/screenshots/artifacts
- Be authored by the person with the experience (not ghost-written without disclosure)
4.3 Synthesis Across Sources
When multiple sources are widely-available individually but no one has connected them, synthesizing is Information Gain.
Examples:
- "Here's what FDA guidance, peer-reviewed studies, and patient advocate organizations all say — and where they conflict"
- "Combining what economists, historians, and demographers have said about this topic produces this pattern"
- "Looking across these 5 industry reports, the meta-finding nobody has stated is..."
When synthesis is the Information Gain, the page should:
- Cite all source materials
- Make the synthesis itself the contribution (not just summarize each source)
- Note where sources agree and disagree
- Identify the meta-pattern that emerges from synthesis
4.4 Contrarian Analysis
When consensus exists on a topic but the consensus is wrong (or partially wrong), well-reasoned contrarian analysis is high-value Information Gain.
Examples:
- "The conventional wisdom that X always works — here's when it doesn't"
- "Most articles say Y is the best approach — based on testing, here's why Z is better"
- "The industry treats P as obvious — but the data shows P is overstated"
When contrarian analysis is the Information Gain, the page should:
- Acknowledge the prevailing view fairly before disagreeing
- Provide specific evidence for the contrarian position
- Be authored by someone with credible standing to disagree
- Be open about limitations of the contrarian view
- Avoid contrarianism for its own sake (must be genuinely informed, not provocative)
4.5 Specific Examples and Case Studies
Generic content covers the abstract; specific case examples are Information Gain.
Examples:
- "I worked with [Client] who had [specific situation], we did [specific approach], and here are the specific results"
- "Here's exactly what happened on [date] when we tried [approach]"
- "This company implemented X, here are the actual numbers from their P&L 6 months later"
When specific examples are the Information Gain, the page should:
- Anonymize where required (and explain why)
- Include actual data, screenshots, artifacts
- Document the specific steps taken
- Acknowledge what was unique about this case vs. general application
- Include enough detail that readers can identify whether their situation is similar
4.6 Updated Information
When existing content on a topic is outdated, providing current information is Information Gain — but only if the update is substantive.
Examples:
- "Most articles say X tax rate; the rate changed in 2026, the new rate is Y"
- "Industry-standard guidance was published in 2022; here's how the field has evolved through 2026"
- "Software version 14 introduced changes that older articles don't reflect"
When updated information is the Information Gain, the page should:
- Date-stamp specifically
- Compare current state to prior state explicitly
- Explain what changed and why
- Acknowledge sources of the older guidance
- Be maintained going forward (otherwise the update becomes outdated)
4.7 Edge Cases and Failure Documentation
Generic content covers the happy path; documenting edge cases and failures is Information Gain.
Examples:
- "Here are the 7 ways this approach fails that no one warns you about"
- "Standard advice says X works — except in these specific scenarios"
- "Common pitfalls that cost me weeks before I figured them out"
When edge cases are the Information Gain, the page should:
- Document specific scenarios where standard guidance breaks
- Explain why the breakdown happens
- Provide alternative approaches for the edge cases
- Distinguish between rare edge cases and common pitfalls
- Update as new edge cases are discovered
4.8 Connecting Topics
Sometimes Information Gain comes from connecting topics that are usually treated separately.
Examples:
- "How [tax law change] interacts with [estate planning approach]"
- "What [technology trend] means for [unrelated industry]"
- "The connection between [historical pattern] and [current event]"
When topic connection is the Information Gain, the page should:
- Establish each topic adequately (or link to authoritative sources for each)
- Make the connection itself the central contribution
- Provide evidence that the connection is real, not just speculative
- Acknowledge limitations of the connection
4.9 Methodology and Process Detail
Generic content describes outcomes; documenting methodology in detail is Information Gain.
Examples:
- "The exact prompt I used and why each phrase matters"
- "Step-by-step process I follow, including the specific tools and configurations"
- "The decision framework I use, with the specific criteria I weigh"
When methodology is the Information Gain, the page should:
- Be specific enough that readers can replicate
- Include configuration details, tool versions, settings
- Document the reasoning behind each step
- Show the artifacts of the process (screenshots, files, output)
4.10 Quantification of Previously Qualitative Topics
When industry discussion of a topic has been qualitative, quantifying it is Information Gain.
Examples:
- "How long does X actually take? I tracked 50 instances and here are the numbers"
- "Industry talks about Y as 'expensive' — here's the actual cost data across 30 cases"
- "The question of Z gets debated qualitatively — here are the measurable outcomes"
When quantification is the Information Gain, the page should:
- Document data collection methodology
- Provide raw or summary data tables
- Acknowledge sample limitations
- Compare to existing qualitative claims
5. Per-Article Information Gain Implementation
Every article should contribute at least one type of Information Gain. Articles that contribute nothing should not be published.
5.1 Pre-Publish Information Gain Check
Before publishing any article, answer:
- What does this article add that's not already in the top 10 results for the target query?
- If this article didn't exist, what specific knowledge would the world be missing?
- Which Information Gain category (Section 4) does this contribute to?
- What specific evidence supports the contribution?
- Could a competitor easily replicate this contribution? If yes, why is ours better?
If all answers are weak, the article is not ready to publish. Either strengthen the contribution or kill the article.
5.2 Information Gain Markers in Article Structure
Every article should make its Information Gain contribution visible. Suggested structural pattern:
<article class="information-gain-article">
<!-- Standard header -->
<header>
<h1>{{TITLE_REFLECTING_THE_CONTRIBUTION}}</h1>
{{BYLINE}}
{{DATES}}
</header>
<!-- Information Gain marker — what this article adds -->
<aside class="info-gain-marker" aria-label="What this article adds">
<h2>What This Article Adds</h2>
<p>{{ONE_PARAGRAPH_DESCRIPTION_OF_THE_GAIN_CONTRIBUTION}}</p>
<ul>
<li>{{SPECIFIC_CONTRIBUTION_1}}</li>
<li>{{SPECIFIC_CONTRIBUTION_2}}</li>
<li>{{SPECIFIC_CONTRIBUTION_3}}</li>
</ul>
<p>This article is based on: {{SOURCE_OF_GAIN_E.G._original_testing_OR_first_hand_experience_OR_proprietary_data}}.</p>
</aside>
<!-- Standard introduction -->
<section class="introduction">
{{INTRODUCTION_THAT_FRAMES_THE_TOPIC}}
</section>
<!-- Body content where the Information Gain lives -->
<section class="content-body">
{{SUBSTANTIVE_CONTENT_WITH_GAIN_CONTRIBUTION_THROUGHOUT}}
</section>
<!-- Methodology section if research-based -->
{{IF_ORIGINAL_RESEARCH}}
<section class="methodology">
<h2>How I {{TESTED/RESEARCHED/STUDIED}} This</h2>
{{DETAILED_METHODOLOGY}}
</section>
{{/IF}}
<!-- Limitations section — honest about what's not covered -->
<section class="limitations">
<h2>Limitations</h2>
<p>{{HONEST_ACKNOWLEDGMENT_OF_LIMITATIONS_OR_WHAT_THIS_DOESNT_COVER}}</p>
</section>
<!-- Standard footer -->
<footer>
{{REFERENCES}}
{{AUTHOR_BOX}}
</footer>
</article>
The "What This Article Adds" callout is critical — it makes the contribution legible to both human readers and AI systems evaluating the page.
5.3 Schema for Information Gain Content
For original research articles, use Dataset and ScholarlyArticle schemas to formalize the contribution:
<script type="application/ld+json">
{
"@context": "https://schema.org",
"@type": "ScholarlyArticle",
"headline": "{{TITLE}}",
"author": {"@id": "https://{{PRIMARY_DOMAIN}}/authors/{{AUTHOR_SLUG}}/#person"},
"datePublished": "{{PUBLISHED_DATE}}",
"dateModified": "{{UPDATED_DATE}}",
"publisher": {"@id": "https://{{PRIMARY_DOMAIN}}/#organization"},
"about": {"@type": "Thing", "name": "{{PRIMARY_TOPIC}}"},
"abstract": "{{2_SENTENCE_ABSTRACT}}",
"creditText": "{{CREDIT_LINE}}",
"isPartOf": {"@id": "https://{{PRIMARY_DOMAIN}}/#website"},
"mainEntity": {
"@type": "Dataset",
"name": "{{DATASET_NAME}}",
"description": "{{DATASET_DESCRIPTION}}",
"creator": {"@id": "https://{{PRIMARY_DOMAIN}}/authors/{{AUTHOR_SLUG}}/#person"},
"datePublished": "{{PUBLISHED_DATE}}",
"license": "https://creativecommons.org/licenses/by/4.0/",
"distribution": [
{
"@type": "DataDownload",
"encodingFormat": "text/csv",
"contentUrl": "{{CSV_DOWNLOAD_URL}}"
}
]
}
}
</script>
For first-hand experience articles, use Article schema with specific properties demonstrating the experience:
<script type="application/ld+json">
{
"@context": "https://schema.org",
"@type": "Article",
"headline": "{{TITLE}}",
"author": {"@id": "https://{{PRIMARY_DOMAIN}}/authors/{{AUTHOR_SLUG}}/#person"},
"datePublished": "{{PUBLISHED_DATE}}",
"publisher": {"@id": "https://{{PRIMARY_DOMAIN}}/#organization"},
"about": {"@type": "Thing", "name": "{{TOPIC}}"},
"isBasedOn": {
"@type": "CreativeWork",
"description": "First-hand experience implementing this on {{NUMBER}} {{CONTEXT}}"
}
}
</script>
5.4 Citation-Worthy Format
When the Information Gain is research or analysis worthy of citation, format the article to be citable:
<aside class="cite-this-research">
<h3>Cite This Research</h3>
<p>If you reference this work, please cite it as:</p>
<pre id="citation-apa">{{LASTNAME}}, {{FIRST_INITIAL}}. ({{YEAR}}). {{TITLE}}. {{BUSINESS_NAME}}. {{URL}}</pre>
<button onclick="navigator.clipboard.writeText(document.getElementById('citation-apa').textContent)">Copy citation</button>
</aside>
Citation-friendly content gets cited more, which compounds Information Gain (high-citation content has higher Information Gain weight in Google's evaluation).
5.5 Methodology Documentation
Original research articles must document methodology in detail. Template:
<section class="methodology">
<h2>Methodology</h2>
<h3>Research Question</h3>
<p>{{SPECIFIC_QUESTION_THE_RESEARCH_ADDRESSED}}</p>
<h3>Approach</h3>
<p>{{HOW_THE_RESEARCH_WAS_CONDUCTED}}</p>
<h3>Sample / Data Source</h3>
<p>{{DESCRIPTION_OF_DATA_INCLUDING_SAMPLE_SIZE_AND_SOURCE}}</p>
<h3>Time Period</h3>
<p>{{WHEN_DATA_WAS_COLLECTED}}</p>
<h3>Analysis Method</h3>
<p>{{HOW_DATA_WAS_ANALYZED}}</p>
<h3>Limitations</h3>
<ul>
<li>{{LIMITATION_1}}</li>
<li>{{LIMITATION_2}}</li>
</ul>
<h3>Replication</h3>
<p>{{HOW_OTHERS_COULD_REPLICATE_THIS_RESEARCH}}</p>
</section>
Detailed methodology demonstrates rigor, supports Information Gain claims, and provides material for citation by other researchers.
5.6 Failure and Edge Case Documentation
For experience-based articles, dedicate a section to what didn't work or edge cases:
<section class="failures-and-edge-cases">
<h2>What Doesn't Work (or Where Standard Advice Fails)</h2>
<h3>{{FAILURE_PATTERN_1}}</h3>
<p>{{SPECIFIC_DESCRIPTION_INCLUDING_WHEN_AND_WHY}}</p>
<p><strong>What to do instead:</strong> {{ALTERNATIVE_APPROACH}}</p>
<h3>{{FAILURE_PATTERN_2}}</h3>
<p>{{SPECIFIC_DESCRIPTION_INCLUDING_WHEN_AND_WHY}}</p>
<p><strong>What to do instead:</strong> {{ALTERNATIVE_APPROACH}}</p>
</section>
This section is uniquely powerful for Information Gain because it requires actual experience that others can't easily replicate.
6. Site-Wide Information Gain Strategy
Beyond per-article work, the site needs an overall Information Gain posture.
6.1 Phase 1: Audit for Information Gain Distribution
Audit all existing content. For each article, classify:
- Original contribution clear — Article makes a specific, identifiable Information Gain contribution
- Contribution exists but buried — Original insight is present but not surfaced
- No clear contribution — Article is essentially aggregated/derivative
- Outdated even relative to derivative content — Article was once derivative; now it's stale derivative
Goal distribution:
- 60%+ articles with clear original contribution
- <10% articles with no clear contribution
If existing distribution is worse, plan remediation: surface buried contribution, deepen weak articles, remove or consolidate articles with no contribution.
6.2 Phase 2: Develop Information Gain Capacity
Audit the site's capacity to produce Information Gain regularly:
- Are credentialed experts producing content based on first-hand experience?
- Is original research being conducted?
- Does the team have access to proprietary data?
- Are unique perspectives being articulated?
- Is there a routine for collecting first-hand observations?
Common capacity-building actions:
For service businesses: Document client work as case studies. Aggregate patterns across clients into industry insights.
For SaaS companies: Use product analytics (with privacy consideration) to publish industry benchmarks.
For media sites: Conduct surveys, original investigations, and primary-source reporting.
For e-commerce: Test products genuinely. Document pros and cons honestly. Track customer outcomes.
For solo creators: Document personal experience in detail. Run experiments. Document failures.
6.3 Phase 3: Original Research Cadence
Establish a cadence for publishing original research:
- Annual industry survey (timed for peak interest in your industry)
- Quarterly data study from proprietary or aggregated public data
- Monthly case study from client/customer work (anonymized)
- Ongoing first-hand experience documentation
Publishing original research at cadence builds expectation, attracts citations, and demonstrates ongoing Information Gain capacity.
6.4 Phase 4: Differentiation Strategy
Identify what your site does that others don't:
- Unique data access: What data do you have that competitors don't?
- Unique experience: What have you done that competitors haven't?
- Unique perspective: What angle do you take that competitors don't?
- Unique methodology: How do you approach topics differently?
- Unique audience: Who do you serve that others miss?
- Unique combination: What expertise areas combine in this site uniquely?
Document differentiation publicly:
- About page describes what makes the site distinctive
- Topical hub pages explain the unique perspective taken on each pillar
- Author bios articulate unique experience and credentials
6.5 Phase 5: Topic Discipline
The hardest discipline: don't publish on topics where you can't contribute Information Gain.
Maintain a "topics we don't cover" list publicly:
<section class="topics-we-dont-cover">
<h2>Topics We Don't Cover</h2>
<p>We cover {{TOPICS}} deeply because that's where we have genuine expertise and original perspective. We don't cover:</p>
<ul>
<li><strong>{{ADJACENT_TOPIC_1}}</strong> — for that, we recommend {{TRUSTED_SOURCE_1}}</li>
<li><strong>{{ADJACENT_TOPIC_2}}</strong> — for that, we recommend {{TRUSTED_SOURCE_2}}</li>
</ul>
</section>
This section signals editorial integrity and prevents diluting the site's Information Gain by publishing weak content on adjacent topics.
6.6 Phase 6: Citation-Earning Strategy
Earn citations by being cite-worthy:
- Publish data others want to reference
- Format claims to be quotable
- Include "cite this" buttons on research pages
- Reach out to relevant journalists when research is published
- Build relationships with other authoritative sources who cite each other
- Make it easy to cite (clear data, downloadable, properly formatted)
When earned media cites the site, it amplifies Information Gain signal — citation is external validation that the contribution is novel and valuable.
7. Information Gain Validation
How to test whether content actually contributes Information Gain.
7.1 The Top 10 Test
For an article on a target query:
- Search Google for the target query in incognito mode
- Read the top 10 results
- Note what each result covers
- Identify what your article adds beyond all 10
Specific questions:
- Does your article have specific data, examples, or insights none of the top 10 have?
- Does your article cover edge cases none of the top 10 cover?
- Does your article take a perspective none of the top 10 take?
- Could a reader who's read all 10 still benefit from yours?
If you can articulate the unique contribution clearly, the article passes Information Gain.
If you can't articulate it, either don't publish, or rework until you can.
7.2 The Removal Test
Imagine your article is removed from the internet entirely.
- Would readers seeking this topic notice the loss?
- Is there any specific knowledge that would now be unavailable?
- Would competitors miss something they currently can't get elsewhere?
If the answer to all is "no," the article doesn't contribute Information Gain.
7.3 The Citation Test
Over time, does the article earn citations?
- Are other sites linking to it as a reference?
- Are AI engines citing it?
- Are journalists quoting from it?
- Are practitioners referencing it?
Citation-earning content has clear Information Gain. Content that doesn't earn citations either failed to articulate its gain (fixable) or didn't have gain to begin with (not fixable through SEO tactics).
7.4 Originality Verification
Use plagiarism/similarity detection (Copyscape, Originality.ai) to verify content isn't accidentally too similar to existing sources. If 30%+ similarity to any existing source on a topic appears, that's a red flag for Information Gain.
But: low similarity scores don't prove Information Gain. Content can be unique in wording while contributing nothing new. Information Gain is about substantive contribution, not just word uniqueness.
7.5 Self-Honest Evaluation
Ask the author/team honestly:
- Why did we publish this article?
- What did we know that others didn't?
- Would we have published this if Google didn't exist?
- Do we have evidence supporting our specific claims?
If answers are weak, the article likely doesn't contribute Information Gain regardless of word count or formatting.
8. Common Mistakes & Anti-Patterns
8.1 Aggregator Articles
Anti-pattern: "10 best X tools" articles that just describe what each tool's website says.
Why it fails: No Information Gain. Anyone could write this from public information.
Fix: Actually use the tools. Document specific use cases. Compare on dimensions the tool websites don't address. Provide testing data.
8.2 SEO-Optimized Without Substance
Anti-pattern: Articles structured perfectly for SEO (length, keywords, headers) but adding nothing substantively new.
Why it fails: Google's Information Gain evaluation sees through structural optimization without substantive contribution.
Fix: Substance first, structure second. If you can't articulate the contribution, no structural optimization saves the article.
8.3 Generic First-Person Without Specifics
Anti-pattern: "In my experience, X is best" without any specific data, dates, examples, or detail.
Why it fails: Generic experience claims don't function as Information Gain. They're indistinguishable from fabricated experience.
Fix: Specific dates, specific cases, specific data, specific methodology. Detail demonstrates real experience.
8.4 Synthetic Surveys
Anti-pattern: "We surveyed 1000 marketers" — but the survey was poorly designed, run on a small biased panel, or fabricated entirely.
Why it fails: Once detected, ruins credibility and does long-term damage.
Fix: Real research with real methodology. Document sample acquisition. Acknowledge limitations. Quality > quantity in research.
8.5 Stale Original Research
Anti-pattern: Publishing one piece of original research, then never updating it as the topic evolves.
Why it fails: Original research becomes derivative as time passes. Without updates, the Information Gain decays.
Fix: Refresh research annually or as the topic warrants. Make clear what's been updated and when.
8.6 Duplicate Internal Coverage
Anti-pattern: Multiple articles on the same site covering the same topic with slight variations, none of them strong contributors.
Why it fails: Self-cannibalization. Each article dilutes the others. None achieves strong Information Gain.
Fix: Consolidate to one strong article per topic. 301 redirect duplicates.
8.7 Information Gain Buried
Anti-pattern: Article does have original contribution, but it's buried in section 7 of an 8-section article.
Why it fails: Both human readers and Google's algorithms may not surface the contribution. Information Gain isn't just present — it must be visible.
Fix: Surface the contribution prominently. Lead with what's new. Use callouts.
8.8 Citation Without Substance
Anti-pattern: Articles that cite many sources but don't synthesize or contribute. Just an annotated bibliography.
Why it fails: Citing existing work doesn't contribute Information Gain unless you're synthesizing or building on it.
Fix: Make synthesis or original contribution the primary content. Citations support the contribution, don't replace it.
8.9 AI-Generated "Insights"
Anti-pattern: Asking AI to generate "novel insights" on a topic. AI can't have novel insights — only synthesize existing material.
Why it fails: AI-generated "insights" tend to be obvious patterns dressed up as novelty. Detection is improving. The content adds nothing genuinely new.
Fix: Use AI for research assistance, drafting, editing — not for the actual Information Gain contribution. The contribution must come from human experience, observation, or research.
8.10 Over-Hedging Original Claims
Anti-pattern: Making original claims so heavily hedged ("This might suggest, in some cases, possibly...") that the claim becomes meaningless.
Why it fails: Strong claims without evidence are bad; over-hedged claims are also bad. The hedge undermines the contribution.
Fix: Make specific, supportable claims with appropriate caveats. "Based on testing 47 instances over 6 months, we found X happens in approximately 70% of cases when Y conditions apply."
9. Information Gain Stack-Specific Notes
9.1 WordPress
- Custom fields per article: "Information Gain Type," "Source of Gain," "Specific Contribution"
- Editorial workflow requires Information Gain identification before publish
- Plugin to surface "What this article adds" callout consistently
- Custom post type for original research that includes downloadable data
9.2 Next.js / Astro / Hugo
- TypeScript schema for content frontmatter requiring Information Gain fields
- Pre-publish validation that Information Gain markers are present
- Component for "What this article adds" callout
- Dataset schema generated from research article frontmatter
9.3 Universal Pattern
Regardless of stack:
- Define Information Gain types in CMS taxonomy
- Require Information Gain articulation before publishing
- Surface Information Gain markers in templates
- Track citations and references over time
- Audit periodically for distribution of Information Gain types
10. Cross-Reference to the 14-Tier Framework
Information Gain implementation touches:
- Tier 2 CBO — Content Brief Optimization should require Information Gain identification at brief stage
- Tier 2 ETO — E-E-A-T Trust Optimization is foundational; Experience pillar is Information Gain through first-hand involvement
- Tier 3 LLMO — LLM Optimization rewards Information Gain via citation worthiness
- Tier 3 SGA — SearchGPT Optimization specifically rewards Information Gain
- Tier 4 TLO — Thought Leadership Optimization is Information Gain at scale
- Tier 6 RSO — Research/Studies Optimization is original research as Information Gain
- Tier 6 OCO — Original Content Optimization
- Tier 7 OPO — Original Perspective Optimization
Information Gain is the unifying principle across multiple tiers. When the site struggles with Information Gain, multiple tiers will show weaknesses simultaneously.
11. Audit Mode
11.1 Per-Article Audit
For sample articles, score:
| # | Criterion | Pass/Fail |
|---|---|---|
| IG1 | Article has clearly identifiable Information Gain contribution | |
| IG2 | Information Gain is surfaced prominently (not buried) | |
| IG3 | Specific evidence supports the contribution | |
| IG4 | Contribution category is identifiable (research/experience/synthesis/etc.) | |
| IG5 | Article passes the "Top 10 Test" | |
| IG6 | Article passes the "Removal Test" | |
| IG7 | Methodology documented if research-based | |
| IG8 | First-person specifics if experience-based | |
| IG9 | Contribution couldn't be easily replicated by competitors | |
| IG10 | Article has earned citations or shows citation potential |
Score per article: 10. World-class IG article: 9+/10.
11.2 Site-Wide Audit
| # | Criterion | Pass/Fail |
|---|---|---|
| IGS1 | 60%+ of articles have clear Information Gain contribution | |
| IGS2 | <10% of articles are pure aggregators with no contribution | |
| IGS3 | Site has documented Information Gain capacity (research, experience, data access) | |
| IGS4 | Original research published on regular cadence | |
| IGS5 | Differentiation strategy documented and visible | |
| IGS6 | Topical discipline maintained (don't publish where can't contribute) | |
| IGS7 | Citation infrastructure in place (cite-this buttons, formal citations) | |
| IGS8 | "What this article adds" markers used consistently | |
| IGS9 | Citation-worthy content earns external citations | |
| IGS10 | Pre-publish Information Gain check is part of editorial workflow |
Site score: 10. World-class IG site: 9+/10.
12. Maintenance Schedule
12.1 Weekly
- Review newly published articles for Information Gain markers
- Track new external citations to existing research
12.2 Monthly
- Audit one piece of older research for refresh need
- Identify articles where IG is buried; surface it
- Plan next original research project
12.3 Quarterly
- Site-wide IG distribution audit
- Citation tracking review
- Research cadence review — is original research being published as planned?
- Topical discipline review — are we publishing on topics where we contribute IG?
12.4 Annually
- Comprehensive IG framework audit
- Strategic review: what new IG capacity should we build?
- Refresh original research that's now 12+ months old
- Update differentiation strategy based on competitive evolution
13. Implementation/Audit Report Templates
13.1 IG Implementation Report Template
# Information Gain Framework Implementation Report
**Site**: {{BUSINESS_NAME}}
**Implementation Date**: {{TODAY}}
## Summary
- Articles audited: {{COUNT}}
- Articles with clear IG contribution: {{PERCENTAGE}}%
- Articles requiring IG enhancement: {{COUNT}}
- Articles removed/consolidated due to no IG: {{COUNT}}
## IG Capacity Assessment
{{CAPACITY_FOR_RESEARCH_EXPERIENCE_DATA_ETC}}
## IG Cadence Established
{{RESEARCH_PUBLICATION_SCHEDULE}}
## Differentiation Strategy Documented
{{LIST_OF_DIFFERENTIATIONS}}
## Pre-Publish Workflow Updated
{{WORKFLOW_DESCRIPTION}}
## Sign-Off
13.2 IG Audit Report Template
# Information Gain Audit Report
**Site**: {{BUSINESS_NAME}}
**Audit Date**: {{TODAY}}
## Executive Summary
{{SUMMARY}}
## Site IG Score
{{X}}/10
## Per-Article Sample Scores
{{TABLE_OF_SAMPLED_ARTICLES_WITH_SCORES}}
## IG Distribution
- Original research: {{COUNT}} articles
- First-hand experience: {{COUNT}} articles
- Synthesis: {{COUNT}} articles
- Contrarian analysis: {{COUNT}} articles
- Specific case studies: {{COUNT}} articles
- Aggregator (no IG): {{COUNT}} articles
## Critical Findings
{{LIST}}
## Recommended Remediation
{{PRIORITIZED_LIST}}
## Sign-Off
End of Framework Document
Document version: 1.0 Last updated: 2026-04-29 Maintained by: ThatDeveloperGuy
Information Gain is the principle that distinguishes content that earns its rankings from content that just occupies them. In 2026, with the volume of available content multiplied by AI, only content that adds genuine novelty earns sustainable visibility. The frameworks for E-E-A-T, HCS, YMYL, and SQRG all converge on Information Gain — they are different angles on the same fundamental requirement: contribute something only you can contribute.
Companion documents:
framework-eeat.mdframework-ymyl.mdframework-hcs.mdframework-sqrg.mdframework-coreupdates.mdframework-entitysalience.mdframework-knowledgegraph.mdframework-aicitations.md
Want this framework implemented on your site?
ThatDevPro ships these frameworks as productized services. SDVOSB-certified veteran owned. Cassville, Missouri.
See Engine Optimization service ›