Scale requirements

We're continuously raising our scale limits to support larger traffic loads.

How we ingest and process your data at scale

Shaped has a real-time, durable feature store that ingests and transforms your data for training and serving.

Data Ingestion Architecture

Shaped continuously encodes your data with fresh multi-modal embeddings and trains your retrieval and ranking models with best-in-class MLOps.

Training Pipeline Architecture

Shaped has a serverless real-time serving system that scales with your requests based on latency, request volume and pod memory.

Inference Architecture

Shaped’s cloud API supports the following scale limits for each tenant:

Dimension	Limit
Unique users	50 million
Unique items	50 million
Unique events	250 million
Personal filters	50 million
Requests per second	500
Train frequency	2 hours
Event and filter ingestion	< 30 seconds
User and item catalog ingestion	10 minutes
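
To see whether a planned workload fits within these limits, a quick check along the following lines may help. This is a minimal sketch, not part of any Shaped API: the `LIMITS` dictionary, its key names, and the `exceeded_limits` helper are hypothetical names introduced here for illustration, and the numbers are copied from the table above.

```python
# Per-tenant limits copied from the table above.
# The key names are illustrative, not official Shaped identifiers.
LIMITS = {
    "unique_users": 50_000_000,
    "unique_items": 50_000_000,
    "unique_events": 250_000_000,
    "personal_filters": 50_000_000,
    "requests_per_second": 500,
}


def exceeded_limits(workload: dict[str, int]) -> dict[str, tuple[int, int]]:
    """Return {dimension: (expected, limit)} for each dimension over its limit."""
    unknown = set(workload) - set(LIMITS)
    if unknown:
        raise KeyError(f"unknown dimensions: {sorted(unknown)}")
    return {
        name: (value, LIMITS[name])
        for name, value in workload.items()
        if value > LIMITS[name]
    }


# Hypothetical workload estimate for one tenant.
workload = {
    "unique_users": 12_000_000,
    "unique_events": 400_000_000,
    "requests_per_second": 350,
}

over = exceeded_limits(workload)
for name, (value, limit) in over.items():
    print(f"{name}: {value:,} exceeds the limit of {limit:,}")
```

Any dimension the check reports as over the limit is a good thing to raise when you get in touch.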

Please get in touch if you have specific performance or scale constraints that you want us to meet: Schedule a call