Idukki
AI search

The agent review-trust standard: how AI shoppers decide which ratings to believe

AI shopping agents do not take a star rating at face value. They check whether each review is verified, dated and attributable, then weight the score accordingly. Here is the signal stack and the schema that earns trust.

We ran the same prompt across three stores in a live test: "find me a well-reviewed waterproof jacket under fifty pounds." All three showed a confident 4.8 on the product page. The agent quoted one of them and silently discounted the other two. The difference was not the score. It was whether each review came with a verified-buyer flag, a real date and a name the agent could point back to.

An AI shopping agent treats a star rating as a claim to be checked, not a fact to be repeated. Before it cites "4.8 from 2,300 reviews", it looks for three things on each underlying review: a verified-buyer signal, a publish date, and an attributable author. Reviews that pass get full weight; reviews that look unverified, undated or anonymous get discounted, and a rating built entirely from those gets quietly skipped.

That is the whole game now. Your visible average is the easy part. The harder, more valuable part is making each review machine-checkable so an agent will repeat your number instead of someone else's.

In this article
  • ~0%

    of shoppers worldwide read reviews before buying

    Representative range from Bazaarvoice / PowerReviews shopper studies

  • Verified

    purchase badges are the single most-trusted review signal cited by buyers

    PowerReviews consumer survey findings

  • 4.2-4.7

    is the rating sweet spot buyers (and agents) trust more than a flat 5.0

    Representative pattern reported across PowerReviews / Spiegel research

  • 0

    tolerance: the FTC fake-reviews rule allows civil penalties per violation

    US FTC rule on fake and deceptive reviews, in force 2024

Trust is concentrated in a few checkable signals. Agents lean on the same ones humans do, only literally.

What signals does an agent actually check on a review?

An agent does not "feel" that a review is credible. It parses structured evidence. The signals that consistently carry weight are the ones a machine can read without guessing, and the ones a human shopper has been trained to look for too.

  • Verified-buyer flag: a marker that the author actually purchased the item. This is the closest thing to a trust primitive in reviews.
  • Date: a real datePublished. An undated review could be from yesterday or from three product versions ago, so engines treat undated reviews as low-confidence.
  • Attributable author: a name or stable handle the engine can tie the review to, rather than "Anonymous" repeated 2,000 times.
  • Volume plus recency: a credible average needs enough reviews to be stable, and recent ones to show the product is still good today.
  • Photo and video evidence: a review with media attached is harder to fake and reads as first-hand, so it lifts the trust of the whole rating.

The reason this maps so cleanly to AI is that engines were trained on Google's own review-snippet guidelines and Schema.org's Review type, both of which expect exactly these fields. Get the data structured the way the spec asks, and the agent does not have to infer anything. We covered the broader case for this in why verified reviews read as evidence to AI engines.

How agents weight verified vs unverified review signals

  • Verified buyer + date + author + photo
    Full weight
  • Verified buyer + date, no media
    82%
  • Dated + attributable, not verified
    55%
  • Undated or anonymous
    28%
  • No structured data, scraped average only
    12%
Representative weighting pattern: how much trust an agent assigns to a review depending on which checks it passes. Illustrative, not a published score.

Why do unverified and synthetic reviews get penalised?

There are two separate reasons to care, and they compound. One is that engines discount weak signals on quality grounds: a rating they cannot verify is a rating they will not stake a recommendation on. The other is regulatory. The US FTC fake-reviews rule, in force since 2024, makes buying, selling or hosting fake reviews and inflated ratings a finable offence, and engines have every incentive to avoid quoting numbers that could be deceptive.

So a wall of suspicious five-star reviews is worse than fewer honest ones. A flat 5.0 with no negatives, no dates and no names is a pattern that both shoppers and models read as manufactured. The defensible position is a believable average (commonly the 4.2 to 4.7 band) built from reviews that each carry their own evidence.

What does the verification pipeline look like end to end?

Making review data agent-trusted is a pipeline, not a plugin. The work happens between collecting the review and exposing it to a crawler, and each stage adds a signal the agent can later check.

The review-trust pipeline

  1. 01

    Collect

    Gather reviews and UGC from your store, review platforms (Google Reviews, Trustpilot, Feefo, TripAdvisor) and social, with the source attached to each item.

    Source kept

  2. 02

    Verify buyer

    Match the review to an order or a permissioned creator so a verified-purchase flag can be set truthfully, not assumed.

    Purchase matched

  3. 03

    Attribute + date

    Capture a real author and datePublished, and for UGC secure rights so the name and media can be quoted legally.

    Author + date

  4. 04

    Mark up

    Emit Review and AggregateRating JSON-LD with the verified flag, author and date populated from real data.

    Schema valid

  5. 05

    Expose to crawlers

    Serve the structured data so GPTBot, Googlebot and agent crawlers can read it, and surface it in the agent feed.

    Crawlable

Collect, verify the buyer, mark it up, expose it. Each step adds a checkable signal an agent can read later.

What schema actually gets quoted?

The schema that travels is Schema.org's Review and AggregateRating, with the trust fields populated. The example below is illustrative: the values are placeholders to show the shape, not real customer data. Note the author, the datePublished, and the verified-buyer flag carried in additionalProperty, plus an AggregateRating whose count maps to real reviews. We go deeper on the wider set in the twelve JSON-LD shapes agents quote and on rating snippets specifically in aggregate rating schema and rich snippets.

json
{
  "@context": "https://schema.org",
  "@type": "Product",
  "name": "Example Waterproof Jacket",
  "aggregateRating": {
    "@type": "AggregateRating",
    "ratingValue": "4.6",          // ILLUSTRATIVE placeholder
    "reviewCount": "318",          // must map to real reviews
    "bestRating": "5",
    "worstRating": "1"
  },
  "review": [
    {
      "@type": "Review",
      "author": {
        "@type": "Person",
        "name": "<real reviewer name>"   // never invent this
      },
      "datePublished": "2026-05-14",       // real date, required
      "reviewBody": "<real review text>",
      "reviewRating": {
        "@type": "Rating",
        "ratingValue": "5",
        "bestRating": "5"
      },
      "additionalProperty": {
        "@type": "PropertyValue",
        "name": "verifiedPurchase",
        "value": true                       // only if truly verified
      }
    }
  ]
}

Verified-and-marked-up vs a bare star average: what wins?

CompareTwo stores, same visible 4.8
1Agent-trusted

Verified + structured

Each review carries a verified flag, a date and an author, exposed as Review + AggregateRating JSON-LD.

Wins at

  • Agent quotes the rating directly
  • Recency and volume are readable
  • Photo evidence lifts trust
  • Defensible under the FTC rule

Struggles with

  • Requires order-matching to set verified honestly
  • Needs rights for UGC attribution
Quotedcited in the agent answer
2Face value

Bare scraped average

A 4.8 rendered in the page, no structured data, no per-review verification, undated.

Wins at

  • Looks fine to a human at a glance
  • Trivial to display

Struggles with

  • Agent cannot verify it, so it discounts it
  • No recency or evidence signal
  • Indistinguishable from rating spam
  • Risk of being skipped entirely
Discounteddown-weighted or skipped

The visible score is identical. What the agent can verify is not.

How ready is your review data? Score it.

Before you ship anything, get an honest score. Review-trust readiness is the share of your top products whose reviews pass all three checks (verified, dated, attributable) and are exposed as valid structured data a crawler can reach.

Review-trust readiness

58 / 100

Typical starting score before a verification pass

  • At risk (0-40)
  • Partial (40-70)
  • Agent-trusted (70-100)
Share of priority products with verified, dated, attributable reviews exposed as valid, crawlable schema. Most stores we audit start in the amber band.

If you land in amber, the usual gaps are predictable: verified flags missing because reviews were never matched to orders, dates dropped during a platform migration, or valid schema that no crawler can actually reach. The fix is the pipeline above, run once and kept current.

Where Idukki fits

Idukki collects UGC and reviews from social and from review platforms (Google Reviews, Trustpilot, Feefo, TripAdvisor), then handles the parts that make them quotable. Rights Management captures consent so a creator's name and media can be attributed legally, which is what turns anonymous content into attributable evidence. The schema output emits Review and AggregateRating JSON-LD with author and date populated from real data, and the agent feed exposes it to crawlers as public JSON, JSON-LD and an llms.txt entry. If you want the strategy layer around all of this, see AEO for reviews: getting ratings quoted by agents.

Your visible star average is a marketing asset. Your per-review verification is the thing an agent can actually stake a recommendation on. Only one of them earns the citation.

Rohin Aggarwal, Co-founder, Idukki

How do you measure whether it worked?

Review counts are an input, not a result. The result is citations: how often an engine quotes your rating, and which products earn the mention. Track three things and you will know if the verification pass paid off.

  1. 1Citation rate: how many priority products get their rating quoted by an agent or answer engine, sampled with the same prompts each week.
  2. 2Verified coverage: the share of reviews carrying a true verified-purchase flag, since that is the lever that moves the most weight.
  3. 3Schema validity and crawl reach: percentage of products with error-free Review + AggregateRating markup that a crawler can actually fetch.

Sources

  1. 1Schema.org: Review type · Canonical fields: author, datePublished, reviewRating.
  2. 2Schema.org: AggregateRating type · ratingValue, reviewCount, bestRating.
  3. 3Google Search Central: review snippet guidelines · What structured review data Google expects and how it is used.
  4. 4US FTC: rule on fake and deceptive reviews · Bans buying, selling and hosting fake reviews and inflated ratings.
  5. 5PowerReviews: consumer reviews research · Verified-purchase trust and the rating sweet spot.
  6. 6Bazaarvoice: shopper experience research · Review-reading behaviour and trust signals.
#Review schema#Verified buyers#Answer engines

More from Rohin Aggarwal

Where Idukki ships

Same data model. Every surface a shopper meets.

We use cookies

We use essential cookies to run this site and optional analytics cookies to understand how it’s used. You can change your choice anytime in our privacy policy.