01 · Foundation

Foundation & readiness.

Data footprint, governance, sensitivity, and readiness gates before platform selection.

Most data collaboration projects fail before the platform decision because the data, rights, governance, and outputs are not clear.

Scope this playbook → Run the 5-minute assessment →

First-party data strategy is not one thing

Four strategy lanes.

Each lane has a different operating model. Confusion starts when they're treated as the same problem.

01
CRM strategy

Known customer records, consent, lifecycle, loyalty, retention, and customer value.
02
Behavioral data strategy

Site, app, content, product usage, engagement, and unauthenticated signals.
03
Clean room strategy

Governed matching, analysis, measurement, activation, or enrichment between two or more parties.
04
Enterprise data strategy

The operating model that connects data, governance, cloud, identity, analytics, activation, and decision workflows.

Data footprint

Footprint before platform fit.

The collaboration shape is decided by what data lives where, who owns it, how sensitive it is, and what governance must wrap it. Pick the platform from the decision, not the logo.

"If the use case cannot be tied to a business or customer outcome, it is not ready for a clean room."

Data sensitivity strategy

Sensitivity determines the collaboration pattern.

Read the pyramid from the base up: public, low-risk data sits at the wide bottom; logic and IP that must never leave sit at the narrow apex. The more sensitive the data, the more the product shifts from share to clean room to native app.

Logic / IP must also be protected native app, UDF, model container, code-based restrictions
Both sides cannot expose raw data symmetric clean room, multi-party collaboration, approved workloads
Individual records cannot be shown clean room queries, approved templates, output restrictions
Exact statistics cannot be exposed aggregation policy, thresholds, differential privacy
PII or regulated identifiers hashed IDs, clean room match, masking policies
Recipient-specific data private listing, secure share, row access policy
Public or low-risk data marketplace listing, open sample, public documentation

"The more sensitive the data, the more the product shifts from share to clean room to native app."

Practical artifact

The collaboration canvas.

A clean room or cloud workflow should not start until these twelve questions are answered. Now extended with the semantic, evaluation, and agent-exposure fields the richer stack requires.

Ambition

What are we trying to improve?

Business benefit

How will this improve revenue, retention, efficiency, growth, risk, or customer value?

Customer benefit

Why is this good for the end user or customer experience?

Data in

What data is required, what format is it in, and where does it live?

Legal basis

What consent, rights, permitted use, and processing basis exist?

Match logic

How will the data connect safely? What identifiers, keys, or aggregation rules apply?

Data out

What output is allowed: insight, aggregate, model score, audience, export, or activation?

KPI

How will success be measured?

Production path

What happens after the POC if it works?

Semantic context

What definitions, metadata, synonyms, metric logic, or approved questions are needed for business users to trust the output?

Evaluation loop

How will accuracy, completeness, business usefulness, and edge cases be tested over time?

Agent / API exposure

Will this workflow be exposed to agents, APIs, dashboards, apps, or automated decision systems? If so, what controls apply?

Four readiness gates

Before the platform conversation.

Most vendors skip from data readiness to platform selection. That is why POCs stall. Governance, semantics, and evaluation need to be designed before the workflow scales.

01
Data readiness
- Do we know what data exists?
- Is it documented?
- Is ownership clear?
- Is refresh cadence known?
- Are sensitive fields tagged?
02
Governance readiness
- Who can access what?
- What outputs are allowed?
- What is the consent or legal basis?
- What is auditable?
- What needs approval?
03
Semantic readiness
- Are business metrics defined?
- Are synonyms and value mappings documented?
- Are example questions available?
- Are benchmark answers known?
- Can business users understand the data?
04
Agentic readiness
- Can tools be safely exposed?
- Are usage limits defined?
- Are model / agent outputs evaluated?
- Is tracing available?
- Is feedback captured?

First-party readiness

First-party marketing analytics readiness.

Before a platform or a clean room, five readiness questions decide whether a marketing-analytics workflow can actually run — identity, data, tech, measurement, and activation.

01
Identity readiness
- Is there a consistent first-party customer ID?
- When is it assigned — site visit, login, purchase, app session, CRM record, or offline event?
- What share of users can be connected to that ID?
- Which systems capture it?
- Does it connect across online and offline data?
- Which hashed, pseudonymous, or platform IDs are in use?
- What consent state is attached?
02
Data readiness
- Where does CRM data live?
- Where do site / app events live?
- Where does offline sales or product data live?
- Which media-exposure and campaign logs are accessible?
- Are GA4 / BigQuery exports configured?
- Are schemas documented?
- Are timestamps, campaign taxonomy, and customer keys consistent?
03
Tech readiness
- What tools execute campaigns today?
- Which platforms collect, transform, analyze, visualize, and activate data?
- What is the current path from insight to activation?
- What sits outside GMP / Google?
- Which CDP, CRM, email, DSP, attribution, tag-management, and warehouse tools are used?
04
Measurement readiness
- What business questions need answers?
- What KPIs matter?
- Which channels need attribution, MMM, lift, or incrementality?
- What output can leave each environment?
- What is the production measurement cadence?
05
Activation readiness
- Where can outputs be activated?
- Which audiences are eligible?
- What suppression rules exist?
- What CRM, paid-media, owned, and site / app paths are available?
- Who owns the activation handoff?

Run the readiness assessment →

Meta signal readiness

Meta signal readiness — where Meta media is material.

Before deeper Meta analytics, the signal pipeline has to hold. These checks decide whether Meta measurement, attribution, and lift can be trusted — see the Meta Advanced Analytics deep dive.

Is the Pixel implemented correctly?
Is the Conversions API (CAPI) implemented?
Are event IDs used for deduplication across Pixel and CAPI?
Is Event Match Quality (EMQ) monitored?
Are offline / CRM events mapped to the right event taxonomy?
Are app events and SKAN constraints understood?
Are Aggregated Event Measurement (AEM) priorities documented?
Are attribution windows documented?
Is consent captured and respected?
Are datasets / event sources organized?
Is the output policy clear before advanced analytics?

Maturity model

Enterprise collaboration maturity.

Most companies do not jump from media-led to decision-orchestrated. They need a roadmap.

01
Media-led

Campaign execution, platform reports, broad audiences, limited customer-data use, basic reach and frequency.
- mostly platform reporting
- limited owned data
- limited link to business outcomes
- last-click or channel-only measurement
- teams work in silos
02
Data-explorative

Some owned data is used for targeting, suppression, exploration, or platform uploads.
- CRM or web/app data exists
- direct platform uploads
- early audience segments
- single-channel testing
- limited clean room experimentation
03
Collaboration-ready

Governed data, clear business questions, clean room or cloud path, output policy, and POC criteria exist.
- use cases are prioritized
- match keys are defined
- measurement approach is clear
- governance owner exists
- platform fit is based on output
04
Decision-orchestrated

Repeatable workflows connect identity, measurement, activation, insights, and optimization to business KPIs.
- recurring clean room workflows
- deduplicated cross-channel measurement
- predictive or LTV models
- business KPI optimization
- in-house analytics or clear partner operating model
- data collaboration feeds planning and investment decisions

Open vs governed

Open collaboration does not mean open access.

Enterprise buyers want interoperability, but not uncontrolled movement. The commercial promise is open collaboration with governed access — work across tools, engines, clouds, and partners while preserving policy, consent, auditability, and output control.

01
Open access

Anyone with credentials can move, copy, or export data. No policy boundary.
02
Secure sharing

Data is shared with named partners under contract — but downstream use is largely unmonitored.
03
Clean room

Match, query, and aggregation happen inside a governed environment. Raw data does not leave.
04
Governed output

Approved aggregates, scores, or activations can be exported — under output policy.
05
Activation rights

Defined permission to use the output to activate (suppress, target, optimize) — separately scoped.
06
Agent / tool permissions

Specific tools, queries, and APIs are allow-listed for agentic systems with usage controls and tracing.

Foundation looks ready? Pick the right environment next.

Once governance, footprint, semantics, and agentic readiness are mapped, the next decision is which collaboration environment fits the job.

Scope this playbook → Next: Platform Fit → Back to overview →

Foundation & readiness.

CRM strategy

Behavioral data strategy

Clean room strategy

Enterprise data strategy

Ambition

Business benefit

Customer benefit

Data in

Legal basis

Match logic

Data out

KPI

Production path

Semantic context

Evaluation loop

Agent / API exposure

Data readiness

Governance readiness

Semantic readiness

Agentic readiness

Identity readiness

Data readiness

Tech readiness

Measurement readiness

Activation readiness

Media-led

Data-explorative

Collaboration-ready

Decision-orchestrated

Open access

Secure sharing

Clean room

Governed output

Activation rights

Agent / tool permissions

Foundation looks ready? Pick the right environment next.