Home  /  Insights  /  Why AI Answers Quote Reddit and YouTube About You
Insights

Why AI Answers Quote Reddit and YouTube About You

Why AI Answers Quote Reddit and YouTube About You

Ask ChatGPT, Gemini, or Google’s AI Overviews about almost anything and look at where the answer comes from. Not a polished publication, not the official source, but a Reddit thread from three years ago and a YouTube video with forty thousand views. For a lot of executives and public figures, that is a genuinely unsettling thing to discover: the machine that now writes their first impression is reading a comment section.

It is not a glitch. It is the design. Understanding why these two platforms became load-bearing sources for AI answers is the first step to controlling what those answers say about you.

The numbers are not subtle

Independent studies of AI citations all land in the same place. Across ChatGPT, Google AI Mode, Gemini, Perplexity, and AI Overviews, Reddit and YouTube consistently rank among the most-cited domains anywhere, alongside Wikipedia (Search Engine Land). One analysis found Reddit claiming the single top spot as the most-cited domain in AI-generated answers (Press Gazette). Another found that Wikipedia and Reddit together account for more than a quarter of all ChatGPT citations in the United States, while the Wall Street Journal, New York Times, and Bloomberg did not appear in the top twenty at all (Profound).

Sit with that comparison for a second. The most authoritative newsrooms in the country are being out-cited, by a wide margin, by a forum and a video host. That is the world your reputation now lives in.

Why Reddit, of all places

Three forces made Reddit indispensable to the AI systems, and none of them is going away.

The first is money and access. In 2024 Google signed a content licensing deal with Reddit, reported at roughly sixty million dollars a year, giving Google direct, real-time access to Reddit’s data to train and ground its models (Bloomberg Law). Reddit has since signed similar arrangements with other AI companies. When a platform is both licensed and structurally easy to read, it becomes a default (Columbia Journalism Review).

The second is the shape of the content. Reddit is millions of real people asking real questions and arguing toward answers, organized into tidy question-and-answer threads with votes attached. That structure is almost perfectly suited to how a language model works: a clear question, several ranked human responses, and signals about which one the crowd trusted. The model does not have to interpret a brochure. The answer is already laid out.

The third is the one most people miss. After years of searchers adding “reddit” to the end of their Google queries to escape low-quality marketing content, the engines concluded that people trust peer experience over polished copy. The AI systems inherited that conclusion. They reach for Reddit precisely because it reads as unfiltered and human, which is exactly what makes it dangerous when the unfiltered, human thing being said about you is wrong.

Why YouTube became the other half of the answer

YouTube earned its place for a related but distinct reason: it is video the machines can read. Every video carries a transcript, a title, a description, timestamps, and structured metadata, which together hand an AI system clean, extractable text tied to a specific moment and topic. The assistant never has to watch anything. It reads the transcript.

That readability has made YouTube one of the most-cited domains in Google’s AI Overviews, and its share of AI citations has climbed sharply, in some measurements overtaking Reddit as the leading social source over the course of 2025 (Adweek). It helps that Google owns YouTube and has every incentive to surface it. For a public figure, this means a single interview clip, a conference talk, or a critic’s video essay can become the source an assistant leans on to summarize you, long after the moment has passed.

What this actually means for your reputation

Put the two together and the picture is clear. The AI answer about you is increasingly assembled from the most quotable thread and the most readable video, not the most accurate or most authoritative source. A grievance posted by one anonymous account, if it sits in a well-trafficked subreddit, can carry more weight in an AI summary than your own website or a favorable feature in a major paper. A years-old clip, stripped of its context, can define you to anyone who asks a machine who you are.

This is the part that catches sophisticated people off guard. They have spent years and real money making sure the right things appear when someone Googles them. None of that work was wasted, but it was aimed at the ten blue links. The AI answer is a different surface with a different set of preferred sources, and it has to be managed deliberately, on its own terms. I wrote about that broader shift in GEO vs SEO and in the practical exercise of what ChatGPT says about you.

What you do about it

The instinct, when someone learns a hostile Reddit thread is feeding their AI answer, is to try to delete it. That instinct is almost always wrong, and acting on it tends to make things worse. The durable approach is to change what the machine has to read.

That means building a presence the AI systems can find and trust as readily as they find the thread: substantive, well-structured content under your control, credible third-party coverage, accurate and claimed entity records, and, where it genuinely serves you, a real footprint on the very platforms the engines favor. It means monitoring what the assistants actually say, because the answers drift as the models update, and catching a wrong one on your own schedule rather than in the middle of a board meeting. And when content crosses the line into something fake, defamatory, or against a platform’s own rules, it means addressing that through the proper channels rather than feeding it attention.

This is the core of what we do under Search and AI Visibility: treat search results and AI answers as one surface to be managed together, with the AI sources understood for what they are.

The machine writing your first impression is reading a comment section. The work is making sure it has something better to read.

The platforms behind the answers will keep changing. Reddit led, YouTube surged, and something else will rise next. What does not change is the principle. If you do not give the AI systems an accurate, accessible version of your story, they will assemble one from whatever is easiest to quote.

Sources

More in this series
Generative AI Consulting: Native, Not RetrofittedGEO vs SEO: What Changes When AI Answers the QuestionLink Building That Actually Works (and What to Avoid)What Is llms.txt and Why It Matters for AI VisibilityWhat Does ChatGPT Say About You?Answer Engine Optimization (AEO), Explained for Leaders