Skip to main content

Snowplow

Shaped can ingest data in real-time from your Snowplow instance, for either Snowplow Community Edition, or Snowplow BDP (Behavioral Data Platform).

Preparation

Shaped consumes data from Snowplow via AWS Kinesis streams. In order to deploy a dataset backed by AWS Kinesis, you should create a dataset of schema type CUSTOM and configure the dataset with the following fields:

- deploy_realtime: true
- tenant_aws_account_id: <your_aws_account_id>
- column_schema: <your_unified_event_schema>

As mentioned in the AWS Kinesis documentation, when creating a dataset with the deploy_realtime field set to true, Shaped will automatically provision an AWS Kinesis stream and a corresponding IAM role within the Shaped Ingestion AWS account with appropriate permissions to write to the Kinesis stream, and to be assumed by your AWS principal. This IAM role and Kinesis stream can be retrieved for you by the Shaped team.

Snowplow Configuration

Events can be forwarded from Snowplow to Shaped in two manners:

  • Snowbridge: Snowplow's native tooling to forward events to destinations with minimal configuration.
  • Custom Kinesis Forwarding: Using a custom Kinesis Client Library application to forward events from your Snowplow enriched data stream to Shaped.