ChangelogBook a demoSign up

Identifier rules

Identity resolution is only available on Business tier plans. You can use it with or without Customer Studio.
AudienceHow you’ll use this article
Data teamsUnderstand how identifier rules influence matching behavior and identity stability.
Platform adminsDebug unexpected merges or splits in resolved identities.

Overview

Identifier rules control how records are grouped into resolved identities during identity resolution runs.

You configure these rules per identity graph to influence how records merge and when identities stay separate. They’re most useful when you want to:

  • Validate that identities look correct
  • Debug unexpected merges or splits
  • Refine identity quality as your data evolves

Identifier rules work alongside your identifier mappings and model setup. They don’t clean data or introduce new identifiers, but determine how mapped identifiers are used during matching.


Prerequisites

Before configuring identifier rules, you must have:

Identifier rules are configured per identity graph and can only use identifiers that were mapped during graph creation.


Where to find identifier rules

To view or edit identifier rules:

  1. Go to Identity Resolution.
  2. Open the identity graph you want to inspect.
  3. Select the Rules tab.

If you have view-only permissions for the source, you can see these settings but can’t edit them.


How identifier rules affect matching

When Identity Resolution evaluates a record, identifier rules determine whether that record:

  • Joins an existing resolved identity, or
  • Forms a new identity

At a high level:

  • Identifier priority controls which identifiers are trusted first when matching records.
  • Limit values per profile enforces boundaries that prevent identities from growing too large or merging unrelated records.

Together, these controls balance coverage (linking related records) with accuracy (avoiding incorrect merges).


Identifier priority

The Identifier priority list defines which identifiers are eligible to link records and the order in which they’re evaluated.

In the UI, you’ll see rows like Match on email or Match on anonymous_id, which you can reorder from highest to lowest priority.

Merge rules

  • Identifiers at the top of the list are treated as more authoritative.
  • Lower-priority identifiers are evaluated only after higher-priority identifiers have been considered.
  • Identifiers not listed are still recorded in outputs but won’t be used to merge records.

Identifier priority affects matching behavior only. It does not control Golden Record value selection.

You can add additional identifiers to the priority list by clicking Add identifier.


Only identifiers that were mapped during model configuration are available here. If an identifier does not appear in the dropdown, go to the Models tab in your identity graph and click Configure to assign it first.

Evaluate the most reliable identifiers first — for example, user_id > email > phone > anonymous_id.


Adjust ordering based on your data quality and business logic.


Limit values per profile

Limit rules define how many distinct values of a given identifier an identity is allowed to contain.

Each limit rule specifies:

  • An identifier type (for example, email, user_id, anonymous_id)
  • A maximum number of values per identity

Limit rules

Limit rules help you:

  • Prevent over-merging caused by shared, recycled, or low-quality identifiers
  • Enforce business constraints (for example, one user_id per person)
  • Keep identities usable for analytics and downstream activation

Limit rules are often used to enforce business constraints (for example, one user_id per identity or one CRM ID per account).


Start with conservative limits for authoritative identifiers and expand cautiously after reviewing match results.

Overly permissive limits can create superclusters — large identities formed by shared or recycled identifiers (for example, shared emails or placeholder IDs).


If you observe unexpectedly large identities, revisit identifier priority and limits before rerunning the graph.


How limits interact with priority

When a proposed merge would violate a limit, Identity Resolution uses identifier priority to decide what happens:

  • Higher-priority identifier limit violated
    The merge is rejected, and the record remains in a separate identity. This prevents unrelated identities from collapsing together.

  • Lower-priority identifier limit violated
    The identity retains only the first N values (where N is the limit) for that identifier type. Additional values are ignored for matching purposes.

Identifiers not listed in Identifier priority are treated as lowest priority for these checks.


Example: how identifier priority prevents over-merging

Assume you’re resolving identities using email and anonymous_id, with the following records:

Timeemailanonymous_id
12:01ANON001
12:02ANON001
12:03john.doe@acme.comANON001
12:04ANON001
12:05ANON002
12:06john@dundermifflin.comANON002
12:07ANON002
12:08carl.smith@acme.comANON002

Configuration

  • Identifier priority: email (higher), anonymous_id (lower)
  • Limit rule: max 1 email value per identity

Resolution behavior

  • For ANON001, all records share the same anonymous_id and only one email appears, so they form a single identity (HT1).
  • For ANON002, early records form one identity (HT5) with john@dundermifflin.com.
  • When carl.smith@acme.com appears, merging would introduce a second email.

Because email is higher priority and exceeds the limit, the record is not merged and instead forms a new identity (HT8).

Resulting identities

Timeht_idemailanonymous_id
12:01HT1ANON001
12:02HT1ANON001
12:03HT1john.doe@acme.comANON001
12:04HT1ANON001
12:05HT5ANON002
12:06HT5john@dundermifflin.comANON002
12:07HT5ANON002
12:08HT8carl.smith@acme.comANON002

This example shows how identifier priority and limits work together to prevent unrelated people from being merged into the same identity.

IDs are assigned at the time the identity is first created and are not necessarily sequential


Common implementation patterns

While every dataset is different, many teams:

  • Place stable internal IDs (for example, user_id or CRM ID) at the highest priority
  • Place email below internal IDs
  • Place device or anonymous_id lower to prevent cross-user merging
  • Apply strict limits to authoritative identifiers (for example, max 1 user_id per identity)

Treat these as starting points and adjust based on your data quality and business model.


When to revisit identifier rules

You may want to review or adjust identifier rules if you notice:

  • Unexpected identity splits
  • Lower match rates than expected
  • Overly large identities (superclusters)
  • Activation issues caused by shared identifiers

After making changes, rerun your graph and review results in the Summary and Profiles tabs before activating downstream data.


Next steps

Ready to get started?

Jump right in or a book a demo. Your first destination is always free.

Book a demoSign upBook a demo

Need help?

Our team is relentlessly focused on your success. Don't hesitate to reach out!

Feature requests?

We'd love to hear your suggestions for integrations and other features.

Privacy PolicyTerms of Service

Last updated: Feb 11, 2026

On this page
  • Overview
  • Prerequisites
  • Where to find identifier rules
  • How identifier rules affect matching
  • Identifier priority
  • Limit values per profile
  • How limits interact with priority
  • Example: how identifier priority prevents over-merging
  • Common implementation patterns
  • When to revisit identifier rules
  • Next steps

Was this page helpful?