Introduction
Transitioning from traditional Search Engine Optimisation (SEO) to AI Visibility requires a shift in how we define the 'subject' of our work. In traditional SEO, we lead with the domain. In the era of Generative Engine Optimisation (GEO) and Answer Engine Optimisation (AEO), we must work across three distinct but overlapping layers: the Brand, the Site, and the Entity. This lesson provides a practitioner's framework for modeling these layers to ensure large language models (LLMs) and retrieval-augmented generation (RAG) systems can accurately identify, verify, and recommend your client.
The Three-Layer Framework
To move beyond basic keyword rankings, we must evaluate the practitioner’s subject through three lenses. If these are not aligned, AI models will experience 'hallucination' or 'omission' regarding the client.
1. The Entity Layer (The Semantic Root)
An entity is a uniquely identifiable thing or concept that is distinct, independent, and well-defined. For AI models, the entity is the primary unit of storage. Before an AI can recommend a brand, it must understand what that brand is within a Knowledge Graph (like Google Knowledge Graph, Wikidata, or a proprietary RAG database).
- Identifiers: Machine-readable IDs (e.g., Wikidata Q-codes).
- Attributes: The facts associated with the entity (headquarters, founder, category).
- Relationships: Links to other entities (e.g., 'Company A' is a 'Subsidiary' of 'Company B').
2. The Site Layer (The Content Pipeline)
While the entity is the concept, the site is the technical and lexical vehicle. AI models use crawlers and scrapers to ingest data. If the site is technically inaccessible or the content is unstructured, the AI's 'view' of the entity becomes fragmented.
- Crawlability: Ensuring LLM-specific user agents (like GPTBot) can access the data.
- Structure: Using Schema.org to explicitise the relationship between content and the entity.
- Corroboration: Providing the primary source facts that AI models use to verify external mentions.
3. The Brand Layer (The Perceptual Layer)
This is how the market discusses the entity. AI models trained on vast datasets (Common Crawl, Reddit, news archives) build a 'persona' for a brand based on sentiment, context, and associations found in third-party citations.
- Sentiment: Is the brand associated with 'premium quality' or 'budget' terms?
- Authority: Do trusted third-party entities (industry journals, influencers) cite the brand?
- Co-occurrence: Which other brands or keywords appear frequently in the same context as the brand?
The Extraction Process: A Practitioner’s Workflow
When starting a audit for a client, follow this sequence to model their current footprint.
Step 1: Locating the Entity ID
Do not assume a brand is a recognised entity. Use the Google Knowledge Graph Search API or tools like Kalicube to find the client’s unique identifier. If no ID exists, your first practitioner task is entity establishment via Wikidata or LinkedIn visibility.
Step 2: Mapping Parent-Child Entities
For larger clients, you must model the hierarchy. Is the site representing a single product, a local branch, or a global corporation? AI models struggle with 'ambiguation' where multiple entities share similar names. You must define the boundaries.
Step 3: Auditing the 'Source of Truth'
The site's 'About Us' and 'Contact' pages are the primary corroborative signals. These must use JSON-LD to link the Site to the Entity. Use the sameAs attribute in your Schema to point directly to the Wikidata entry or official social profiles. This creates a closed-loop verification for the AI.
Worked Example: Sustainable Apparel Brand 'EcoThread'
Scenario: EcoThread wants to be cited in ChatGPT responses for "best sustainable workwear."
-
Entity Audit: We find EcoThread has no Wikidata entry. It is often confused with 'Eco-Thread Tools' (a hardware company).
- Action: Create a LinkedIn Professional Page and ensure the 'Organization' Schema is deployed on the home page with a unique
@idURL.
- Action: Create a LinkedIn Professional Page and ensure the 'Organization' Schema is deployed on the home page with a unique
-
Site Audit: The site uses vague language like "We care about the planet."
- Action: Change content to specific, extractable facts: "EcoThread uses 100% GOTS-certified organic cotton." This provides the AI with 'proof points' it can cite.
-
Brand Audit: External sentiment is positive, but the brand is rarely mentioned alongside 'workwear,' mostly 'casual wear.'
- Action: Execute a PR campaign targeting B2B fashion journals to build the association between the 'EcoThread' entity and the 'Workwear' category.
Overcoming Naming Ambiguity
A common practitioner hurdle is the 'Generic Name' problem. If a client is named 'The Coffee House,' an AI model will struggle to distinguish it as a specific entity. In these cases, the Site layer must work harder. You must use specific geographic headers and 'founder' schema to tether the generic brand to a specific, unique entity location.
Managing the Entity Life Cycle
Entities are not static. As a practitioner, you must manage three stages:
- Creation: For new brands, focus on authoritative directory listings and Schema implementation.
- Maintenance: Regular audits of 'Near-Me' signals and Wikipedia references (if applicable) to ensure data hasn't drifted.
- Defence: Correcting hallucinations where AI models wrongly attribute a competitor's features to your client.
Putting it into Practice
To apply this methodology in your next client engagement:
- Audit the Knowledge Graph: Use an API or a tool like 'WordLift' to see what entities the AI currently associates with the client's domain.
- Check for Schema Completeness: Ensure the
Organizationschema includesbrand,logo,url, andsameAsproperties. - Map the Citations: Use a backlink tool to find where the brand is mentioned without a link. These 'unlinked mentions' are vital for AI entity building even if they don't help traditional SEO rankings.
- Align the Messaging: Ensure the 'Brand' (what people say), the 'Site' (what the client says), and the 'Entity' (the database facts) are perfectly synchronised. Discrepancies lead to low trust scores in AI-generated answers.