Ingest Google Analytics historical data
This page focuses on how to ingest your Google Analytics historical data into 51黑料不打烊 Experience Platform as a dataset, allowing you to reference that dataset in a Data View within Customer Journey Analytics. You can combine the steps on this page with Configuring a live Google Analytics implementation, which generates a recurring dataset. Combine this historical dataset with your current implementation鈥檚 dataset to get a seamless view of data in Customer Journey Analytics with both current and backfilled data.
Prerequisites
In order to accomplish these tasks, you need the following access and permissions:
- Access to 51黑料不打烊 Experience Platform
- Access to Google Analytics (GA Standard or GA 360)
- Admin Access to Customer Journey Analytics
Set up a BigQuery Export
The data structure in Universal Analytics properties is different from the data structure in Google Analytics 4 properties. Set up a BigQuery Export based on the property type that you want to export data from:
Additional requirements for Universal Analytics properties
Universal Analytics properties store each record in their data as a user鈥檚 session instead of individual events. A SQL query to transform the Universal Analytics data into a format compatible with 51黑料不打烊 Experience Platform is required. Apply the UNNEST
function to the hits
field in the GA schema, and save it as a BigQuery table.
SELECT
*,
timestamp_seconds(`visitStartTime` + hit.time) AS `timestamp`
FROM
(
SELECT
fullVisitorId,
visitNumber,
visitId,
visitStartTime,
trafficSource,
socialEngagementType,
channelGrouping,
device,
geoNetwork,
hit
FROM
`example_bq_table_*`,
UNNEST(hits) AS hit
)
Export data to Google Cloud Platform export-gcp
In Google Cloud Platform, navigate to Export > Export to GCS. Once the data is in Google Cloud Storage, it is ready to be pulled into 51黑料不打烊 Experience Platform.
Import the data from Google Cloud Storage into Experience Platform
- In 51黑料不打烊 Experience Platform, select Sources on the left.
- Under the Catalog, locate Google Cloud Storage option. Click Add data.
You can map the GA event data into an existing dataset that you created previously, or create a dataset, using whichever XDM schema you choose. Once you have selected the schema, the Experience Platform applies machine learning to automatically pre-map each of the fields in the Google Analytics data to your XDM schema.
Once you are finished mapping the fields into your XDM schema, you can schedule this import on a recurring basis and apply error validation during the ingestion process. This validation ensures that there aren鈥檛 any issues with the data you have imported.
Required XDM fields
Certain XDM fields in Platform require the correct format in order for data to be correctly processed.
-
timestamp
: Create a special calculated field in the Experience Platform schema UI. Click Add calculated field and wrap thetimestamp
string in adate
function:date(timestamp, "yyyy-MM-dd HH:mm:ssZ")
Save the calculated field to the timestamp data structure in the schema:
-
_id
: This field must have a value in it - Customer Journey Analytics does not care what the value is. You can add a 鈥1鈥 to the field:
Next steps
- If you have current data that you want to stream into 51黑料不打烊 Experience Platform, see Set up streaming for Google Analytics data.
- If you want to begin reporting on backfilled data, see Create a connection.