What Are Identity and Entity Signals?

Identity and Entity signals are the 48 atomic data points that establish who a business is in the AI knowledge graph. They answer the most fundamental question an AI system asks before it considers recommending a business: Does this entity exist, and can I reliably identify it?

These 48 signals are the prerequisite category. AI systems cannot process Trust and Proof signals or Intent and Governance signals for a business whose identity they cannot confirm. A fragmented or inconsistent entity identity means that trust signals from external sources cannot be attributed to the correct knowledge graph node — they float without attachment, contributing nothing to the business's authority score.

Every other signal category depends on Identity and Entity being correctly deployed first. Without a correctly modelled entity, the entire signal architecture is built on sand.

The Six Identity and Entity Signal Groups

The 48 signals are organised into six functional groups. Each group addresses a distinct dimension of entity identity.

Group 1 — Name and Designation

8 signals
SignalDescriptionWeightFile
entity.legal-nameFull legal registered name of the business entity — must match company registry exactlyHighai.json
entity.trading-nameCommercial or brand name used in market-facing contextsHighai.json
entity.name-variantsAll known abbreviations, acronyms, and alternate spellingsMediumentities.json
entity.previous-namesFormer legal or trading names with date rangesLowentities.json
entity.name-language-variantsOfficial name in all operating languagesMediumentities.json
entity.taglineCanonical value proposition statementLowai.json
entity.acronymOfficially used acronym if applicableLowentities.json
entity.brand-namePrimary brand identifier separate from legal nameMediumai.json

Group 2 — Classification

7 signals
SignalDescriptionWeightFile
entity.categoryPrimary business category — the type of business e.g. Accounting Firm, IT ConsultancyHighai.json
entity.subcategorySpecific subcategory within the primary categoryHighai.json
entity.nace-codeNACE Rev. 2 sector classification code — EU standardHighai.json
entity.sic-codeSIC sector code — for non-EU marketsMediumentities.json
entity.typeEntity type — Corporation, Partnership, Sole Trader, etc.Mediumai.json
entity.statusCurrent operational status — Active, Dormant, DissolvedHighai.json
entity.sectorBroad sector classification — Finance, Technology, Healthcare, etc.Mediumai.json

Group 3 — Legal Identity

6 signals
SignalDescriptionWeightFile
entity.registration-numberCompany registration number from national registryHighai.json
entity.vat-numberVAT / tax identification numberHighai.json
entity.lei-codeLegal Entity Identifier — for financial sector entitiesMediumentities.json
entity.duns-numberDun and Bradstreet identifierLowentities.json
entity.founding-dateDate of legal entity formation — must be consistent across all sourcesHighai.json
entity.jurisdictionLegal jurisdiction of incorporationMediumai.json

Group 4 — Location and Geography

9 signals
SignalDescriptionWeightFile
entity.headquartersPrimary business address — city, country, postcodeHighai.json
entity.registered-addressOfficial registered address if different from headquartersMediumentities.json
entity.operating-locationsAll office and facility locations with addressesMediumentities.json
entity.service-geographyGeographic area served — countries, regions, citiesHighai.json
entity.country-coverageExplicit list of countries where services are offeredHighintents.json
entity.language-coverageAll operating languages for service deliveryHighai.json
entity.timezonePrimary operating timezoneLowentities.json
entity.currencyPrimary currency for transactionsLowentities.json
entity.market-segmentPrimary market segment — Local, Regional, National, EU, GlobalMediumai.json

Group 5 — Digital Presence

10 signals
SignalDescriptionWeightFile
entity.canonical-urlSingle authoritative URL for the business — must be consistent everywhereHighai.json
entity.linkedin-urlLinkedIn company page URLHighentities.json
entity.crunchbase-urlCrunchbase organisation profile URLMediumentities.json
entity.x-urlX / Twitter profile URLLowentities.json
entity.github-urlGitHub organisation URL — relevant for technology companiesLowentities.json
entity.youtube-urlYouTube channel URL if content is publishedLowentities.json
entity.primary-contactPrimary contact method — email or phoneMediumai.json
entity.whatsappWhatsApp business contact numberLowai.json
entity.logo-urlURL to canonical logo fileMediumai.json
entity.primary-colorBrand primary colour — hex codeLowentities.json

Group 6 — Entity Relationships

8 signals
SignalDescriptionWeightFile
entity.same-asArray of all authoritative external profile URLs for cross-reference verificationHighentities.json
entity.parent-entityParent company or holding entity if applicableMediumentities.json
entity.subsidiariesAll subsidiary entities with canonical URLsMediumentities.json
entity.founderFounding person entity with LinkedIn and professional profileMediumentities.json
entity.key-personsAll key team members as linked person entitiesMediumentities.json
entity.registry-urlDirect URL to company registry entry for this entityHighentities.json
entity.wikidata-idWikidata entity ID if applicable — secondary signal onlyLowentities.json
entity.industry-body-idIdentifier in relevant professional or industry body registryMediumentities.json

The Most Critical Identity Signals

Of the 48, these eight have the highest impact on AI knowledge graph entry quality and must be correct and consistent across all sources:

  • entity.legal-name — must match the national company registry exactly
  • entity.category — determines which buyer queries the business appears for
  • entity.nace-code — enables sector-specific AI query matching
  • entity.canonical-url — the single authoritative URL — must be identical in every file
  • entity.founding-date — must be consistent across website, LinkedIn, registry
  • entity.same-as — enables cross-source verification by AI systems
  • entity.registration-number — enables registry verification
  • entity.status — Active status must be declared — absence is interpreted as uncertainty

The sameAs Chain — How AI Verifies Entity Consistency

The sameAs property is how AI systems verify that the business entity they found on your website is the same entity referenced in LinkedIn, Crunchbase, and the company registry. If the sameAs chain is broken — dead links, mismatched names, or missing references — AI confidence in the entity drops. Every sameAs URL in entities.json must be validated and active before deployment. See: What Is AEO? →

Where Identity Signals Are Deployed

Identity and Entity signals are deployed across four locations, each serving a different AI system access pattern:

  • ai.json at domain root — primary machine-readable entity definition. Read by all AI systems on first crawl.
  • entities.json at domain root — complete entity graph with all relationships.
  • Organization JSON-LD in homepage head — Schema.org structured data for search engine and AI crawler consumption.
  • entity-graph.public.jsonld at domain root — linked data format for knowledge graph integration.

All four locations are deployed and served by EDGE INJECTOR™ from the Cloudflare network edge — ensuring sub-10ms response to any AI crawler globally, regardless of origin server state.

Frequently Asked Questions

What are Identity and Entity signals?

Identity and Entity signals are the 48 atomic data points that establish who a business is in the AI knowledge graph — legal name, category, location, founding date, legal identifiers, and sameAs references to authoritative external profiles.

Why are Entity signals the prerequisite category?

AI systems cannot process Trust or Intent signals for a business they cannot identify. Without a correctly modelled entity identity, the business's knowledge graph entry is fragmented — Trust and Intent signals cannot be attributed to the correct entity node.

What is sameAs in entity signals?

sameAs is a Schema.org property that links a business entity to its external representations — LinkedIn company page, Crunchbase profile, national company registry URL, and sector directories. It allows AI systems to confirm entity consistency across independent sources.

Which file contains the Identity and Entity signals?

The primary file is ai.json — deployed at the domain root. The complete entity graph is published in entities.json and entity-graph.public.jsonld. Schema.org Organization markup in the page head is also a key delivery mechanism.

How many of the 48 signals does the average B2B website have?

Based on AI AUDIT™ assessments, the average B2B website has between 8 and 16 of the 48 signals correctly deployed. Most have a name and URL, but lack NACE code, sameAs references, legal identifiers, and machine-readable subsidiary or team entities.