AI search

The agent review-trust standard: how AI shoppers decide which ratings to believe

AI shopping agents do not take a star rating at face value. They check whether each review is verified, dated and attributable, then weight the score accordingly. Here is the signal stack and the schema that earns trust.

Rohin AggarwalCo-founder · Idukki.io · June 25, 2026 · 11 min read

We ran the same prompt across three stores in a live test: "find me a well-reviewed waterproof jacket under fifty pounds." All three showed a confident 4.8 on the product page. The agent quoted one of them and silently discounted the other two. The difference was not the score. It was whether each review came with a verified-buyer flag, a real date and a name the agent could point back to.

An AI shopping agent treats a star rating as a claim to be checked, not a fact to be repeated. Before it cites "4.8 from 2,300 reviews", it looks for three things on each underlying review: a verified-buyer signal, a publish date, and an attributable author. Reviews that pass get full weight; reviews that look unverified, undated or anonymous get discounted, and a rating built entirely from those gets quietly skipped.

That is the whole game now. Your visible average is the easy part. The harder, more valuable part is making each review machine-checkable so an agent will repeat your number instead of someone else's.

In this article

~0%
of shoppers worldwide read reviews before buying
Representative range from Bazaarvoice / PowerReviews shopper studies
Verified
purchase badges are the single most-trusted review signal cited by buyers
PowerReviews consumer survey findings
4.2-4.7
is the rating sweet spot buyers (and agents) trust more than a flat 5.0
Representative pattern reported across PowerReviews / Spiegel research
0
tolerance: the FTC fake-reviews rule allows civil penalties per violation
US FTC rule on fake and deceptive reviews, in force 2024

Trust is concentrated in a few checkable signals. Agents lean on the same ones humans do, only literally.

What signals does an agent actually check on a review?

An agent does not "feel" that a review is credible. It parses structured evidence. The signals that consistently carry weight are the ones a machine can read without guessing, and the ones a human shopper has been trained to look for too.

Verified-buyer flag: a marker that the author actually purchased the item. This is the closest thing to a trust primitive in reviews.
Date: a real datePublished. An undated review could be from yesterday or from three product versions ago, so engines treat undated reviews as low-confidence.
Attributable author: a name or stable handle the engine can tie the review to, rather than "Anonymous" repeated 2,000 times.
Volume plus recency: a credible average needs enough reviews to be stable, and recent ones to show the product is still good today.
Photo and video evidence: a review with media attached is harder to fake and reads as first-hand, so it lifts the trust of the whole rating.

The reason this maps so cleanly to AI is that engines were trained on Google's own review-snippet guidelines and Schema.org's Review type, both of which expect exactly these fields. Get the data structured the way the spec asks, and the agent does not have to infer anything. We covered the broader case for this in why verified reviews read as evidence to AI engines.

How agents weight verified vs unverified review signals

Verified buyer + date + author + photo
Full weight
Verified buyer + date, no media
82%
Dated + attributable, not verified
55%
Undated or anonymous
28%
No structured data, scraped average only
12%

Representative weighting pattern: how much trust an agent assigns to a review depending on which checks it passes. Illustrative, not a published score.

Why do unverified and synthetic reviews get penalised?

There are two separate reasons to care, and they compound. One is that engines discount weak signals on quality grounds: a rating they cannot verify is a rating they will not stake a recommendation on. The other is regulatory. The US FTC fake-reviews rule, in force since 2024, makes buying, selling or hosting fake reviews and inflated ratings a finable offence, and engines have every incentive to avoid quoting numbers that could be deceptive.

So a wall of suspicious five-star reviews is worse than fewer honest ones. A flat 5.0 with no negatives, no dates and no names is a pattern that both shoppers and models read as manufactured. The defensible position is a believable average (commonly the 4.2 to 4.7 band) built from reviews that each carry their own evidence.

What does the verification pipeline look like end to end?

Making review data agent-trusted is a pipeline, not a plugin. The work happens between collecting the review and exposing it to a crawler, and each stage adds a signal the agent can later check.

The review-trust pipeline

01
Collect
Gather reviews and UGC from your store, review platforms (Google Reviews, Trustpilot, Feefo, TripAdvisor) and social, with the source attached to each item.
Source kept
02
Verify buyer
Match the review to an order or a permissioned creator so a verified-purchase flag can be set truthfully, not assumed.
Purchase matched
03
Attribute + date
Capture a real author and datePublished, and for UGC secure rights so the name and media can be quoted legally.
Author + date
04
Mark up
Emit Review and AggregateRating JSON-LD with the verified flag, author and date populated from real data.
Schema valid
05
Expose to crawlers
Serve the structured data so GPTBot, Googlebot and agent crawlers can read it, and surface it in the agent feed.
Crawlable

Collect, verify the buyer, mark it up, expose it. Each step adds a checkable signal an agent can read later.

What schema actually gets quoted?

The schema that travels is Schema.org's Review and AggregateRating, with the trust fields populated. The example below is illustrative: the values are placeholders to show the shape, not real customer data. Note the author, the datePublished, and the verified-buyer flag carried in additionalProperty, plus an AggregateRating whose count maps to real reviews. We go deeper on the wider set in the twelve JSON-LD shapes agents quote and on rating snippets specifically in aggregate rating schema and rich snippets.

json

{
  "@context": "https://schema.org",
  "@type": "Product",
  "name": "Example Waterproof Jacket",
  "aggregateRating": {
    "@type": "AggregateRating",
    "ratingValue": "4.6",          // ILLUSTRATIVE placeholder
    "reviewCount": "318",          // must map to real reviews
    "bestRating": "5",
    "worstRating": "1"
  },
  "review": [
    {
      "@type": "Review",
      "author": {
        "@type": "Person",
        "name": "<real reviewer name>"   // never invent this
      },
      "datePublished": "2026-05-14",       // real date, required
      "reviewBody": "<real review text>",
      "reviewRating": {
        "@type": "Rating",
        "ratingValue": "5",
        "bestRating": "5"
      },
      "additionalProperty": {
        "@type": "PropertyValue",
        "name": "verifiedPurchase",
        "value": true                       // only if truly verified
      }
    }
  ]
}

Verified-and-marked-up vs a bare star average: what wins?

CompareTwo stores, same visible 4.8

1Agent-trusted

Verified + structured

Each review carries a verified flag, a date and an author, exposed as Review + AggregateRating JSON-LD.

Wins at

Agent quotes the rating directly
Recency and volume are readable
Photo evidence lifts trust
Defensible under the FTC rule

Struggles with

Requires order-matching to set verified honestly
Needs rights for UGC attribution

Quotedcited in the agent answer

2Face value

Bare scraped average

A 4.8 rendered in the page, no structured data, no per-review verification, undated.

Wins at

Looks fine to a human at a glance
Trivial to display

Struggles with

Agent cannot verify it, so it discounts it
No recency or evidence signal
Indistinguishable from rating spam
Risk of being skipped entirely

Discounteddown-weighted or skipped

The visible score is identical. What the agent can verify is not.

How ready is your review data? Score it.

Before you ship anything, get an honest score. Review-trust readiness is the share of your top products whose reviews pass all three checks (verified, dated, attributable) and are exposed as valid structured data a crawler can reach.

Review-trust readiness

58 / 100

Typical starting score before a verification pass

At risk (0-40)
Partial (40-70)
Agent-trusted (70-100)

Share of priority products with verified, dated, attributable reviews exposed as valid, crawlable schema. Most stores we audit start in the amber band.

If you land in amber, the usual gaps are predictable: verified flags missing because reviews were never matched to orders, dates dropped during a platform migration, or valid schema that no crawler can actually reach. The fix is the pipeline above, run once and kept current.

Where Idukki fits

Idukki collects UGC and reviews from social and from review platforms (Google Reviews, Trustpilot, Feefo, TripAdvisor), then handles the parts that make them quotable. Rights Management captures consent so a creator's name and media can be attributed legally, which is what turns anonymous content into attributable evidence. The schema output emits Review and AggregateRating JSON-LD with author and date populated from real data, and the agent feed exposes it to crawlers as public JSON, JSON-LD and an llms.txt entry. If you want the strategy layer around all of this, see AEO for reviews: getting ratings quoted by agents.

Your visible star average is a marketing asset. Your per-review verification is the thing an agent can actually stake a recommendation on. Only one of them earns the citation.
Rohin Aggarwal, Co-founder, Idukki

How do you measure whether it worked?

Review counts are an input, not a result. The result is citations: how often an engine quotes your rating, and which products earn the mention. Track three things and you will know if the verification pass paid off.

1Citation rate: how many priority products get their rating quoted by an agent or answer engine, sampled with the same prompts each week.
2Verified coverage: the share of reviews carrying a true verified-purchase flag, since that is the lever that moves the most weight.
3Schema validity and crawl reach: percentage of products with error-free Review + AggregateRating markup that a crawler can actually fetch.

Sources

1Schema.org: Review type · Canonical fields: author, datePublished, reviewRating.
2Schema.org: AggregateRating type · ratingValue, reviewCount, bestRating.
3Google Search Central: review snippet guidelines · What structured review data Google expects and how it is used.
4US FTC: rule on fake and deceptive reviews · Bans buying, selling and hosting fake reviews and inflated ratings.
5PowerReviews: consumer reviews research · Verified-purchase trust and the rating sweet spot.
6Bazaarvoice: shopper experience research · Review-reading behaviour and trust signals.

Written by

Rohin Aggarwal

Co-founder · Idukki.io

A builder. In the long way of saying it.

Day job: SAP architect, the unglamorous backbone software that runs UK government and Fortune 500s, mostly used while people are complaining about it. The brief, simplified: make the systems behind those services feel less like punishment for the people running them.

Night job, and most weekends: co-founded Idukki.io in 2022, building UGC, shoppable video and reviews for DTC brands from a kitchen table in Egham. The Venn diagram of those two communities is, on a good day, approximately one person.

Writes here when he has an opinion he can defend with numbers. Still shipping. Still nervous before each release.

Coding since '99
Worked in 9+ countries
London-based, mostly
Vegetarian, no exceptions
Girl-dad
Friend group's IT dept
Opinions about font rendering

More by Rohin inLinkedIn

#Review schema#Verified buyers#Answer engines

The agent review-trust standard: how AI shoppers decide which ratings to believe

What signals does an agent actually check on a review?

Why do unverified and synthetic reviews get penalised?

What does the verification pipeline look like end to end?

Collect

Verify buyer

Attribute + date

Mark up

Expose to crawlers

What schema actually gets quoted?

Verified-and-marked-up vs a bare star average: what wins?

Verified + structured

Bare scraped average

How ready is your review data? Score it.

Where Idukki fits

How do you measure whether it worked?

Sources

More from Rohin Aggarwal

How to run a UGC competition that fills your gallery, online and in-store

Why we built the Conversational PDP

PDP before and after UGC: what actually changes on the page

Same data model. Every surface a shopper meets.