Introduction
Entity analysis is the foundation of modern search visibility. As we transition from strings to things, search engines and Large Language Models (LLMs) no longer just index keywords; they attempt to understand the relationships between concepts, organisations, people, and places. To be visible in AI-generated answers and Google’s Knowledge Panels, your brand must be recognised as a distinct entity within the global Knowledge Graph. This lesson provides a systematic framework for auditing a brand’s presence across the three most influential entity nodes: Wikidata, Wikipedia, and the Google Knowledge Graph.
The Triple-Node Audit Framework
To audit a brand entity effectively, we look at three primary layers of validation:
- The Structured Layer (Wikidata): The machine-readable database that stores factual data points.
- The Contextual Layer (Wikipedia): The narrative validation that provides semantic depth and authority.
- The Proprietary Layer (Google Knowledge Graph): How a search engine synthesises these sources into a formal entity ID.
1. Auditing Wikidata Coverage
Wikidata is arguably the most important data source for AI visibility. It is the structured backbone for Wikipedia and a primary feed for Google, Bing, and various LLMs. Unlike Wikipedia, which requires high 'notability' and narrative flow, Wikidata focuses on structured facts.
How to audit:
- Search by Label: Use the Wikidata search interface to find the brand. Check for aliases (e.g., 'IBM' vs 'International Business Machines').
- Check the Statement Count: A weak entity has 3–5 statements. A strong entity has 20+. Look for missing critical properties like
instance of (P31),industry (P452),official website (P856), andheadquarters location (P159). - Identifier Integrity: Verify if external identifiers are linked. Does the Wikidata entry link to the brand’s LinkedIn, Twitter, or Official Gazettes (like Companies House in the UK)?
2. Auditing Wikipedia Presence
Wikipedia is the 'gold standard' for topical authority. However, it is also the hardest to influence due to strict neutrality and notability requirements.
Points of Audit:
- Notability Criteria: Evaluate if the brand has enough independent, high-quality, third-party coverage to sustain an entry. If a brand lacks a page, look for 'mention' coverage—is the brand cited in other relevant industry pages?
- Citations and Links: Audit the existing citations on the page. Are they dead links? Are they biased? AI models use the citation graph to measure trust.
- Accuracy and Vandalism: Check the 'View History' tab. Frequent reverts or edit wars indicate a de-stabilised entity that might be less trusted by search algorithms.
3. Probing the Google Knowledge Graph
Google maintains its own private graph, partly derived from public sources and partly from its own crawling. You must verify if Google has assigned a unique Knowledge Graph ID (MID) to the brand.
The API Check:
Use the Google Knowledge Graph Search API (or tools like the Kalicube Explorer or the Google APIs Explorer).
- Query the Brand Name: Does an ID (starting with
/m/or/g/) appear? - Result Score: Google provides a 'resultScore'. A high score indicates high confidence in the entity's uniqueness.
- The Knowledge Panel: Does a panel appear in the SERP? Is it 'claimed'? If a panel appears but offers no 'Claim this knowledge panel' button, it suggests the entity is already managed or Google is highly confident in its mapping.
Worked Example: Auditing a Mid-Market Fintech Agency
Let’s imagine we are auditing 'FinTech Flow', a hypothetical UK-based agency.
Step 1: Wikidata. We find a Wikidata entry, but it only lists 'official website'. It lacks P159 (Headquarters) and P112 (Founder). Verdict: Incomplete entity. LLMs may struggle to connect the agency to its location or leadership.
Step 2: Wikipedia. No page exists. However, the founder is mentioned in the 'List of Fintech Entrepreneurs'. Verdict: We have a 'seed' entity status. We should use Schema markup on the agency site to point to the founder’s Wikipedia mention.
Step 3: Google Knowledge Graph. A search for the Brand ID via API returns /g/11h_xxxx. This confirms Google recognises the entity independently of a Wikipedia page. The result score is 12 (relatively low). Verdict: The entity is 'known' but not 'trusted'.
The Role of Schema.org in Entity Audits
If the Wikidata and Wikipedia layers are weak, your website's Organization schema becomes the 'source of truth'. During an audit, ensure you are using sameAs properties to bridge the gap. Link your website to your Wikidata ID and social profiles using sameAs to help search engines reconcile different mentions into one entity.
Putting it into Practice
- Identify the MID: Use the Google Knowledge Graph Search API to find your brand's unique ID.
- Map the Gap: Create a spreadsheet comparing your Wikidata statements against a top competitor. Note every missing property.
- The 'SameAs' Reconciliation: Update your website’s JSON-LD schema to include all identified entity URLs (Wikidata, Wikipedia, Social Profiles).
- Monitor the Panel: Use a VPN to check how your Knowledge Panel looks in different regions. Inconsistencies often reveal conflicting entity data in different languages or regions.