Create the Luma propensity model schemas and datasets
This tutorial provides you with the prerequisites and assets required for all other Adobe Experience Platform Data Science Workspace tutorials. Once you complete it, the following schemas and datasets are available to you and your organization.
Schemas:
- Luma web data schema
- Propensity model scoring results schema
Datasets:
- Luma web dataset
- Propensity model training dataset
- Propensity model scoring dataset
- Propensity model scoring results dataset
Download the assets
The following tutorial uses a custom Luma purchase propensity model. Before proceeding, download the required assets zip folder. This folder contains:
- The purchase propensity model notebook
- A notebook used to ingest data to a training and scoring dataset (a subset of the Luma web data)
- A demo JSON file containing the web data of 730,000 Luma users
- An optional Python 3 exploratory data analysis (EDA) notebook that can help you understand the web data and the model
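Before uploading anything, it can help to confirm that the downloaded archive holds the files listed above. The following is a minimal sketch using only the Python standard library; the function name `summarize_archive` and the archive path in the usage comment are hypothetical, since this tutorial does not state the actual file names.

```python
import zipfile
from collections import Counter
from pathlib import PurePosixPath


def summarize_archive(zip_path):
    """Return (sorted file names, file count per extension) for a zip archive."""
    with zipfile.ZipFile(zip_path) as archive:
        # Skip directory entries so only real files are counted.
        names = sorted(n for n in archive.namelist() if not n.endswith("/"))
    by_extension = Counter(PurePosixPath(n).suffix.lower() for n in names)
    return names, dict(by_extension)


# Example usage (the path is a placeholder, not the real asset name):
# names, by_extension = summarize_archive("luma-assets.zip")
# print(names)
# print(by_extension)
```

If the list above is accurate, you would expect to see notebook files (`.ipynb`) and one JSON file in the result.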
Create the Luma web data schema and ingest the data
To create a model, you need a dataset in Platform that is used to train and score the model. The following video tutorial from the Data Science Workspace course walks you through creating the Luma schema and ingesting the data used by the purchase propensity model.
Create the training, scoring, and scoring results datasets
To run the recipe builder notebook or use the API to train and score a model, you need to specify the datasets and schemas used for training and scoring. The following video tutorial walks you through setting up the training, scoring, and scoring results datasets, as well as the scoring results schema used in the Luma purchase propensity model.
Next steps
By following this tutorial, you have successfully created the required schemas and datasets for the Luma propensity model. You're now ready to continue to the next tutorial and create the model using the recipe builder notebook.
Additionally, you can explore the data using the provided Exploratory Data Analysis (EDA) notebook. This notebook can be used to help understand patterns in the Luma data, check data sanity, and summarize the relevant data for the predictive propensity model. To learn more about Exploratory Data Analysis, visit the EDA documentation.
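As a rough illustration of what a data sanity check can look like, the sketch below counts how many records are missing each required field. It is not taken from the EDA notebook; the function name and the field names (`userId`, `timestamp`, `pageViews`) are hypothetical stand-ins for whatever fields your Luma web data schema defines.

```python
from collections import Counter


def missing_field_counts(records, required_fields):
    """Count, per field, how many records lack it or hold None or an empty string."""
    missing = Counter()
    for record in records:
        for field in required_fields:
            value = record.get(field)
            if value is None or value == "":
                missing[field] += 1
    # Report every required field, including those with no missing values.
    return {field: missing[field] for field in required_fields}


# Hypothetical sample records; real Luma web data will have different fields.
sample = [
    {"userId": "u1", "timestamp": "2020-01-01T00:00:00Z", "pageViews": 3},
    {"userId": "u2", "timestamp": None, "pageViews": 1},
    {"userId": "", "pageViews": 5},
]
print(missing_field_counts(sample, ["userId", "timestamp", "pageViews"]))
# {'userId': 1, 'timestamp': 2, 'pageViews': 0}
```

A field with a high missing count is worth investigating before training, because it may point to an ingestion or schema-mapping problem.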