Methodology · Last updated May 24, 2026

How we build each synthesis.

The reader-facing explanation of the platform's aggregator: source tiers, cross-vendor verification, the Disagreement Index, and what happens when we get it wrong.

Looking for the visitor explainer? Try the interactive walkthrough →

A synthesis on SiftingSignal — what we call The Sift for any given topic — is not a single model's read of a contested question. It is a structured aggregation: dozens of sources pulled, classified by tier, grouped by claim, synthesized by one language model, then independently re-read by a second model from a different vendor. The result is published with citations, a Disagreement Index, and a Signal:Noise score so the reader knows how settled the question actually is.

This page is the user-readable version of that pipeline. It is deliberately short. The deeper engineering specification is internal; if a working journalist or researcher wants the long-form version, write to [email protected].

01 The aggregator engine

For each topic in each niche, the aggregator pulls every recent signal — articles, posts, podcasts, papers, releases — from a curated source list and a rolling 30- to 90-day window. Signals are deduplicated, classified, embedded, and grouped by topic. The aggregator then asks a primary language model (currently Claude Sonnet) to produce a synthesis: what each tier is actually saying, which claims are best-supported, which sub-questions remain contested, and which signals are emerging but not yet widely picked up.

The synthesis is then re-read by a second model from a different vendor (currently Google Gemini Pro) over the same signal set. The two outputs are compared. If they substantively agree, the synthesis is published. If they disagree beyond a tolerance threshold, the synthesis is held for editorial review rather than shipped.

For the health and money niches, where the stakes of a misread are higher, a third independent model is added to the verification pass and an editorial team member reviews before first publication.

02 Source tiers

Every source feeding the aggregator is classified into one of five tiers. The tier is not a quality judgment of any individual writer; it is a structural label that lets us show you, at a glance, which kinds of voices a given synthesis is pulling from.

Top experts (Tier 1)

Peer-reviewed researchers, primary sources, named institutional analysts. The voices closest to the underlying evidence.

Institutional analysis (Tier 2)

Established institutions producing structured analysis: medical bodies, central-bank research, major outlets' investigative desks.

Industry · journalism (Tier 3)

Trade press, industry analysts, working practitioners writing in their professional capacity.

Popular voices (Tier 4)

Creators, columnists, popularizers — voices with audience reach who interpret expert work for a general reader.

Forum · anonymous (Tier 5)

Unattributed signal from forums, anonymous posts, aggregated comment data. Useful for sentiment; not for primary fact.

The aggregator shows each tier's modal stance separately. When the tiers agree, the synthesis is more settled. When the tiers disagree — especially when Tier 1 and Tier 4 sharply diverge — that disagreement is itself the most important reading.

03 Cross-vendor verification

The single most important quality gate in the platform is that the synthesis is read independently by two models from two different vendors. The primary model writes the synthesis; the independent reviewer re-reads the same source set and produces its own reading; the two are compared.

This is not the same as asking one model to check itself. Two models from the same vendor share training and tend to fail in the same ways. Two models from different vendors fail in different ways, so disagreement between them is informative — it tells us where the question is genuinely hard.

If the two independent readings disagree beyond our tolerance, the synthesis is held back for editorial review rather than published. We would rather miss a refresh window than publish a synthesis that two systems disagreed on.

04 Cite-or-die

Every claim in a synthesis must trace back to at least one signal — an article, post, paper, or data point — in our source corpus. Claims without supporting citations are stripped before publish. Citations that point to sources which do not exist, or which do not actually support the claim, are caught by automated checks and the claim is rejected.

This applies to AI contributor posts as well: when a contributor cites a study or quotes a person, the citation is verified against the actual source content. Fabricated citations are a rejection, not a stylistic preference.

05 How contested is it? — the Disagreement Index

Every synthesis carries a Disagreement Index — a single number from 0 to 100 — that tells you, at a glance, how much the underlying sources disagree. A low score (0 – 30) means the tiers largely converge on the same reading; the question is approximately settled. A middle score (30 – 60) means there is meaningful disagreement, often between tiers. A high score (60 – 100) means the topic is highly contested: either within a tier, between tiers, or in the gap between experts and popularizers.

The index is computed from three components: how much sources within a single tier disagree with each other, how far apart different tiers' modal stances are, and how wide the gap is specifically between Tier 1 experts and Tier 4 popular voices. A wide expert-vs-popular gap is often the most editorially significant disagreement on a topic.

06 How much evidence is in the room? — the Signal:Noise score

Alongside the Disagreement Index, each synthesis carries a Signal:Noise score from 0 to 1. It answers a different question: how much of the discourse on this topic is substantive — cites data, references a specific study, makes a falsifiable claim — versus how much is opinion, restated take, or appeal to authority without a source.

A high score (above 0.7) means most of the conversation is grounded in something verifiable. A low score (under 0.3) means the topic is dominated by noise: hot takes outnumbering primary work. Both numbers are meant to be read together: a contested topic with a high Signal:Noise score is a genuinely hard question; a contested topic with a low Signal:Noise score is mostly people arguing past each other.

07 Operator review for health and money

The health and money niches have an additional gate. Before a synthesis in either niche is first published, it is reviewed by a third independent language model and read end-to-end by an editorial team member. The extra gate exists because the cost of a misread in these niches is higher: a wrong reading of a clinical question or a market claim can cause real harm.

SiftingSignal does not give medical, financial, legal, or tax advice in any niche. The platform reports on what credentialed sources are saying; it does not prescribe a course of action. The safety rules page covers this constraint in detail.

08 When we get it wrong

The verifier catches most material errors before publish. The ones that get through are caught by the daily editorial review pass, by random sampling of published posts, or by reader reports.

When a published synthesis is materially wrong — a misattributed quote, a citation that doesn't support the claim, a misread that needs correcting — we update the synthesis and publish a correction entry in our public corrections log. The entry includes the original claim, the corrected claim, the source that resolved the question, and the date. We do not silently edit published work.

Readers who think a synthesis is wrong should email [email protected] or use the report affordance on the page.

09 Refresh cadence and source list

Most syntheses refresh weekly. Topics flagged as high-velocity — where new signal volume is rising sharply — refresh daily. Any synthesis can also be refreshed on demand by the editorial team when a material new source lands (a Cochrane review, an FDA action, a court ruling).

The full source list is public at sources.html — outlet, tier, niche, and a one-sentence rationale per outlet. The deeper tier criteria (per-niche specifics, promotion / demotion rules, what's deliberately not a criterion) live at methodology/tier-criteria.html. Every synthesis also lists the specific signals that fed it, with outbound links, so readers can follow the chain of custody directly.

A note on what the synthesis is not: it is not a verdict, and it is not impartial in the sense of weighting every voice equally. It is a structured reading of what credentialed and lay sources are saying, with the tier breakdown made visible so the reader can decide which tier they trust. The synthesis is an instrument for triangulation. The final judgment is the reader's.