Identity resolution is a critical process in Salesforce Data Cloud that ensures the data about a single customer from multiple sources is consolidated into one accurate, unified profile.
What is it?
Identity matching involves comparing data across different records to determine if they belong to the same customer. This process uses predefined rules to match records accurately.
Key Details:
Matching Rules:
Rules that specify which fields to compare when identifying matches. Common fields include:
Weighted Matching:
Each field is given a weight based on its importance. For example:
Example:
Two records:
What is it?
Deduplication eliminates redundant records by merging duplicates into a single, clean record.
Key Details:
Merge Records:
Combines information from duplicates into one profile.
Priority Rules:
Example:
What is it?
A unified profile is the result of merging data from various sources to create a single, comprehensive view of the customer.
Key Features:
Example:
For "John Doe," a unified profile might include:
What are Matching Rules?
Rules that determine how records are compared during the identity resolution process.
Types of Matching:
Exact Matching:
Requires fields to match exactly.
Fuzzy Matching:
Allows slight variations in data to match.
Threshold Configuration:
Set a score threshold to determine a match.
What are Reconciliation Rules?
Rules for resolving conflicts when two records have different values for the same field.
How it works:
Example of Conflict Resolution:
Mastering identity resolution ensures your data is accurate, reliable, and ready for actionable insights.
An Identity Graph is a core component of Identity Resolution in Salesforce Data Cloud. It dynamically establishes relationships between different identity attributes (such as email, phone numbers, and social media IDs) to create a unified customer profile.
Multi-Source Identity Mapping
Graph-Based Relationship Building
Continuous Updates
A customer named John Doe might have multiple identity records across different platforms:
| Data Source | Identity Attribute |
|---|---|
| CRM System | Customer ID: 12345 |
| Email System | Email: [email protected] |
| Social Media | Twitter Handle: @john_doe |
| E-Commerce | Purchase History under Name: J. Doe |
Without an Identity Graph, these records would be stored separately. With Identity Graph, Salesforce Data Cloud automatically detects and consolidates them, ensuring John Doe is recognized as a single customer.
When merging customer records, businesses use two primary matching techniques to determine whether multiple records belong to the same person.
Definition:
Deterministic matching relies on exact matches of unique identifiers to link records.
Key Features:
Example of Deterministic Matching:
| Customer Record 1 | Customer Record 2 | Match? |
|---|---|---|
Email: [email protected] |
Email: [email protected] |
Match |
Phone: +1-555-1234 |
Phone: +1-555-5678 |
No Match |
Definition:
Probabilistic matching calculates a similarity score between multiple fields and determines matches based on probability thresholds.
Key Features:
Example of Probabilistic Matching:
| Customer Record 1 | Customer Record 2 | Similarity Score | Match? |
|---|---|---|---|
Name: John Doe |
Name: J. Doe |
85% | Match |
Email: [email protected] |
Email: [email protected] |
90% | Match |
Address: 123 Main St |
Address: 123 Main Str. |
95% | Match |
| Matching Type | Use Case | Pros | Cons |
|---|---|---|---|
| Deterministic Matching | When unique identifiers (email, customer ID) are available | Highly accurate, reduces false positives | Fails when minor discrepancies exist |
| Probabilistic Matching | When no unique identifier is available, but data has similarities | Flexible, can match variations | Risk of false positives |
Incorrect identity resolution can lead to major business risks, such as misaligned customer data, ineffective marketing, and compliance issues.
| Matching Issue | Definition | Business Impact |
|---|---|---|
| False Positive (Incorrect Match) | Different customers are mistakenly merged into one profile | Causes data confusion, leading to irrelevant marketing or security risks |
| False Negative (Missed Match) | The same customer is mistakenly treated as separate individuals | Causes incomplete customer profiles, impacting personalization and customer service |
Identity Resolution must handle millions of records efficiently, ensuring high-speed processing and accurate identity linking.
| Processing Type | Use Case | Pros | Cons |
|---|---|---|---|
| Batch Matching | Overnight processing of large historical datasets | Efficient for large-scale identity resolution | Not useful for real-time actions |
| Real-Time Matching | Dynamic updates to customer profiles as new data arrives | Enables instant personalization | More computationally expensive |
| Optimization Step | Before Optimization | After Optimization |
|---|---|---|
| Unindexed Matching | 12 hours for 10M records | N/A |
| Indexed Matching Fields | N/A | 4 hours |
| AI-Based Matching with Machine Learning | N/A | 3 hours |
By mastering these Identity Resolution techniques, businesses can build accurate, scalable, and real-time customer identity systems in Salesforce Data Cloud.
What is deterministic matching in Data Cloud identity resolution?
Deterministic matching links records using exact identifier matches such as email address or CRM contact ID.
Deterministic matching relies on precise identifier equality. If two records share the same identifier (for example identical email address), the system confidently assumes they represent the same person.
These rules provide high accuracy because they use trusted identifiers. However, they may fail when identifiers differ across systems—for example when a customer uses multiple emails.
Demand Score: 90
Exam Relevance Score: 94
What is probabilistic matching in identity resolution?
Probabilistic matching uses statistical similarity across multiple attributes to determine whether records belong to the same individual.
Instead of requiring exact identifier matches, probabilistic matching evaluates attributes such as:
name similarity
phone numbers
location
device identifiers
Each attribute contributes to a confidence score. When the score exceeds a defined threshold, the records are considered a match.
Demand Score: 88
Exam Relevance Score: 90
What is a reconciliation rule in Data Cloud identity resolution?
A reconciliation rule determines which attribute value becomes the authoritative value when multiple records are merged.
When identity resolution matches multiple records for the same individual, there may be conflicting values such as different phone numbers or addresses.
Reconciliation rules resolve these conflicts by applying logic such as:
source system priority
most recent value
highest data quality score
Demand Score: 87
Exam Relevance Score: 92
Why might duplicate customer profiles still appear after identity resolution?
Duplicates can occur if matching rules do not include the identifiers needed to link records across systems.
Identity resolution depends entirely on available identifiers. If two systems store different identifiers (for example phone vs email), and those identifiers are not included in the matching rules, the system cannot detect that the records represent the same individual.
Demand Score: 86
Exam Relevance Score: 91
What is the Unified Individual in Data Cloud?
The Unified Individual is the consolidated customer profile created after identity resolution links multiple source records.
It aggregates identifiers, engagement events, transactions, and attributes across all connected systems.
Demand Score: 85
Exam Relevance Score: 90
Why are identity resolution rules critical for segmentation accuracy?
Because segmentation operates on unified profiles rather than individual source records.
If identity resolution fails to link related records, segmentation may treat the same customer as multiple individuals, resulting in inaccurate audience targeting.
Demand Score: 84
Exam Relevance Score: 88