Enigma Knowledge

Implementation

Business Graph: Understanding Corporate Structures for KYB

February 5, 2026

How graph-based data models reveal business relationships, ownership structures, and risk patterns that traditional databases miss.

A business graph is a data model that represents companies as interconnected nodes (brands, legal entities, locations, and people) linked by ownership, operational, and affiliation relationships. Traditional databases store business records as isolated rows; graphs make relationships first-class data, revealing structures that matter for KYB verification.

This guide explains how business graphs work, what they reveal, and why they matter for KYB verification and risk detection.

Why Graphs?

The Limits of Tables

Traditional databases store business information in tables:

GTL Services LLC

  • Address: 1209 Orange St
  • EIN: 12-3456789
  • State: DE
  • Owner: Smith Holdings LLC

This works for simple lookups: "Find GTL Services LLC." But it fails when relationships matter:

  • Who ultimately owns GTL Services LLC? (Smith Holdings is another company)
  • What other businesses share this address?
  • Is this the same entity operating as "Green Thumb Landscaping" elsewhere?
  • Are there concerning patterns in the ownership network?

To answer these questions with tables, you need complex joins, recursive queries, and significant engineering. With graphs, you just traverse relationships.

Graphs Make Relationships Explicit

In a graph, both entities and their connections are stored:

[Person: Jane Smith]
       │
       │ owns (60%)
       ↓
[Entity: Smith Holdings LLC]
       │
       │ owns (100%)
       ↓
[Entity: GTL Services LLC]
       │
       ├── operates_as → [Brand: Green Thumb Landscaping]
       │
       ├── registered_at → [Location: 1209 Orange St, Wilmington DE]
       │
       └── located_at → [Location: 456 Main St, Columbus OH]

Query: "Who ultimately owns GTL Services LLC?" Graph traversal: Start at GTL Services LLC → follow 'owns' edges upward → reach Jane Smith.

Graph Structure

Node Types

Business graphs typically include these node types:

Person nodes: Natural persons who own, control, or represent businesses

  • Beneficial owners
  • Directors and officers
  • Registered agents (individuals)
  • Authorized signers

Entity nodes: Legal structures recognized by jurisdictions

  • Corporations
  • LLCs
  • Partnerships
  • Trusts
  • Foreign entities

Brand nodes: Customer-facing business identities

  • Trade names (DBAs)
  • Franchise brands
  • Product/service brands

Location nodes: Physical and registered addresses

  • Headquarters
  • Operating locations
  • Registered agent addresses
  • Mailing addresses

Edge Types (Relationships)

Edges connect nodes with typed, often attributed relationships:

Ownership edges:

  • owns: Person → Entity, Entity → Entity
  • Attributes: ownership percentage, effective date, type (direct/indirect)

Operational edges:

  • operates_as: Entity → Brand
  • located_at: Brand → Location, Entity → Location
  • registered_at: Entity → Location

Role edges:

  • officer_of: Person → Entity (with role: CEO, CFO, etc.)
  • director_of: Person → Entity
  • agent_for: Person → Entity, Entity → Entity

Temporal attributes:

  • Start/end dates for relationships
  • Historical versions
  • Change events

What Can a Graph Actually Show You?

Ownership Chains

The primary use case: tracing beneficial ownership through corporate layers.

Simple ownership:

Jane Smith (Person)
    │ owns 100%
    ↓
ABC Company (Entity)

UBO: Jane Smith owns 100%

Layered ownership:

Jane Smith (Person)
    │ owns 60%
    ↓
Smith Holdings LLC (Entity)
    │ owns 80%
    ↓
GTL Services LLC (Entity)

Effective ownership: Jane Smith owns 60% × 80% = 48% of GTL Services LLC

Complex ownership:

Jane Smith ──owns 40%──→ Holding A ──owns 50%──→ Target Co
          ──owns 60%──→ Holding B ──owns 30%──→ Target Co

Total effective ownership: (40% × 50%) + (60% × 30%) = 20% + 18% = 38%

Graphs handle these calculations through path traversal and aggregation.

Hidden Connections

Graphs reveal relationships that aren't obvious from individual records:

Shared registered agents:

[Agent: ABC Registered Agents Inc.]
    │
    ├── agent_for → [Entity: Company 1]
    ├── agent_for → [Entity: Company 2]
    ├── agent_for → [Entity: Company 3]
    └── agent_for → [872 more entities...]

A formation agent servicing hundreds of entities isn't suspicious on its own. But if those entities were all formed on the same day, share the same beneficial owner, and are now applying for merchant accounts...

Address overlap:

[Location: 1209 Orange St, Wilmington DE]
    │
    ├── registered_at ← [Entity: Company A]
    ├── registered_at ← [Entity: Company B]
    └── registered_at ← [15,000 more entities...]

This is a well-known registered agent address (Corporation Trust Center). Expected for Delaware entities. But if a company claims this as their operating location, that's a red flag.

Officer networks:

[Person: John Doe]
    │
    ├── officer_of → [Entity: Recently formed LLC 1]
    ├── officer_of → [Entity: Recently formed LLC 2]
    ├── officer_of → [Entity: Recently formed LLC 3]
    └── [all filed same day, same agent, similar names...]

Patterns emerge that record-by-record review misses.

Network Patterns

Graph analysis reveals structural patterns:

Star pattern: One node connected to many others

  • Person connected to many entities → possible nominee, formation agent, or serial entrepreneur
  • Address connected to many entities → registered agent office or possible address fraud

Chain pattern: Sequential ownership layers

  • Depth of ownership chain correlates with complexity/opacity
  • Very deep chains may indicate intentional obscuration

Circular pattern: Ownership loops

  • A owns B owns C owns A
  • May be legitimate (cross-holdings) or concerning (ownership obscuration)

Cluster pattern: Densely connected groups

  • Group of entities sharing addresses, officers, formation dates
  • May indicate related-party network, shell company factory, or fraud ring

Temporal Patterns

Graphs with temporal data reveal evolution:

  • Burst formation: many entities created in a short period
  • Ownership churning: frequent ownership changes
  • Status changes: Active to Dissolved to Reinstated patterns
  • Officer turnover: frequent officer/director changes

How Do Graphs Fit into KYB Workflows?

Verification

Enrich verification with graph context:

Application received: "Green Thumb Landscaping" at "456 Main St, Columbus OH"

Graph lookup:

  1. Resolve "Green Thumb Landscaping" to known brands
  2. Find connected entities (GTL Services LLC)
  3. Verify entity exists and is active
  4. Retrieve ownership graph
  5. Identify beneficial owners

Result: Complete picture of business identity and ownership for verification decision.

Risk Scoring

Graph features improve risk models:

Ownership depth

  • Calculation: Layers to reach natural person
  • Risk Signal: Complexity/opacity

Address concentration

  • Calculation: Other entities at same address
  • Risk Signal: Shell company risk

Agent entity count

  • Calculation: Other entities sharing agent
  • Risk Signal: Formation agent pattern

Formation timing

  • Calculation: Days since registration
  • Risk Signal: Established vs. new

Network density

  • Calculation: Connections among related entities
  • Risk Signal: Related-party risk

Officer overlap

  • Calculation: Shared officers with other applicants
  • Risk Signal: Potential fraud ring

These features are impossible to calculate without graph structure.

Ongoing Monitoring

Graphs enable event-driven monitoring:

Change detection:

  • New ownership edge added → ownership change alert
  • New entity added to owner's network → related entity alert
  • Officer resignation → control change alert
  • Entity status change → operational status alert

Periodic review:

  • Re-traverse ownership for UBO refresh
  • Check for new concerning patterns
  • Validate continued accuracy

Building a Business Graph

Data Sources

Graphs are constructed from multiple sources:

Authoritative sources:

  • Secretary of State filings (entities, agents, officers)
  • Business registries (ownership, status)
  • UBO registries (where available)

Commercial sources:

  • Business information aggregators
  • Corporate data providers
  • Commercial registry compilations

Web/public sources:

  • Business websites (brands, locations)
  • Google Business Profiles
  • Social media presence

Transaction/operational sources:

  • Payment processing records
  • Customer applications
  • Internal CRM data

Entity Resolution

Entity resolution is prerequisite to graph construction. Before connecting nodes with edges, you must know which records represent the same entity.

The graph actually helps with resolution; shared relationships provide evidence for matching:

  • Two records share an officer → more likely same entity
  • Two records share address AND phone → strong match signal

This creates a virtuous cycle: better resolution enables richer graphs; richer graphs improve resolution.

Graph Technology

Graph databases: Purpose-built for graph storage and traversal

  • Neo4j, Amazon Neptune, TigerGraph
  • Native graph query languages (Cypher, Gremlin)
  • Optimized for relationship traversal

Relational with graph queries: Graph features on traditional databases

  • PostgreSQL with recursive CTEs
  • SQL Server with graph tables
  • Slower for deep traversal but familiar

Knowledge graph platforms: Semantic graph infrastructure

  • Schema and ontology management
  • Reasoning and inference capabilities
  • Often used for enterprise knowledge management

Maintenance

Graphs require ongoing maintenance:

Regular updates:

  • Pull fresh data from sources
  • Apply changes to graph
  • Version historical state

Quality monitoring:

  • Detect orphan nodes (unconnected entities)
  • Identify data gaps
  • Validate relationship accuracy

Performance tuning:

  • Index frequently traversed paths
  • Partition large graphs
  • Cache common queries

Practical Considerations

Starting Simple

You don't need a full knowledge graph to benefit from graph thinking:

Phase 1: Ownership chains

  • Store ownership relationships in any database
  • Query with recursive joins or simple traversal
  • Calculate effective ownership percentages

Phase 2: Address and agent networks

  • Link entities sharing addresses
  • Identify registered agent patterns
  • Add basic network analysis

Phase 3: Full business graph

  • Multiple node types (persons, entities, brands, locations)
  • Multiple edge types with attributes
  • Graph database for performance
  • Advanced analytics

Buy vs. Build

Build if:

  • Core competitive advantage
  • Unique data sources
  • Specific requirements not met by vendors
  • Graph expertise in-house

Buy/partner if:

  • Speed to market critical
  • Standard KYB use cases
  • Limited graph expertise
  • Prefer operational simplicity

Many organizations use commercial business graphs for baseline coverage and enrich with proprietary data.

Business Graphs in Agentic Workflows

AI agents that make or inform decisions about business counterparties need the graph's properties more than human analysts do, and they need them exposed in a queryable, traversable form.

What agents need from a business graph API

When a human analyst works with a business graph, they can browse a visualization, follow links, and apply judgment to what they see. An AI agent needs the same information surfaced through a structured API that supports graph-native query patterns.

Traversal queries: The agent needs to ask "who owns this entity?" and receive the immediate owners, then follow those owners upward. A flat list of all connected entities won't do. This hop-by-hop traversal is the natural query model for ownership chain analysis and should be a first-class API capability.

Relationship-typed retrieval: An agent investigating a specific question ("are any of this entity's officers also officers of entities on a watchlist?") needs to retrieve edges of a specific type, not all edges. Graph APIs that expose typed relationship queries let agents construct targeted investigations without retrieving entire entity neighborhoods.

Confidence and provenance on edges matter too. Just as node attributes should carry source metadata and freshness timestamps, graph edges should indicate when a relationship was established, when it was last verified, and the source for that relationship. An ownership edge derived from a FinCEN BOI filing is more authoritative than one inferred from a web crawl.

Subgraph snapshots: For agent workflows that need to reason about an entity's full network context, not just a single hop, the ability to retrieve a bounded subgraph (e.g., "all entities within two ownership hops of this node, with their key attributes") is more efficient than multiple sequential traversal queries.

Graphs and the data quality problem

A business graph is only as reliable as the underlying entity resolution that built it. Two graph edges connecting the wrong entities, because entity resolution produced a false match, propagate that error through every downstream graph query and agent inference.

This makes the virtuous cycle described earlier (better resolution enables richer graphs; richer graphs improve resolution) not just a capability story but a reliability story. Organizations investing in graph infrastructure for agentic workflows should invest equally in the entity resolution layer that populates it.

For a full map of how the graph fits into the broader agentic KYB architecture, see The Business Identity Stack for Agentic AI.

Key Takeaways

  • Graphs model relationships as first-class data, not an afterthought
  • Ownership chains become traversable with simple queries
  • Hidden patterns emerge: shell companies, fraud rings, suspicious networks
  • Risk scoring improves with graph features that tables can't provide
  • Entity resolution and graphs reinforce each other; better resolution enables richer graphs
  • Start simple. Ownership chains add value before full graph infrastructure.
  • Maintenance matters. Graphs require ongoing updates and quality monitoring.
  • Agentic workflows require graph-native APIs: traversal queries, typed relationship retrieval, and edge-level confidence metadata