Ngram (Sequential)
Description
The Ngram policy implements a simple, frequency-based sequential model. It predicts the next item based on the conditional probability derived from counts of the immediately preceding n-1
items (the n-gram context). It's effective at capturing short-term co-occurrence patterns in user behavior sequences.
Policy Type: ngram
Supports: scoring_policy
Configuration Example
scoring_policy_ngram.yaml
policy_configs:
scoring_policy:
policy_type: ngram
n: 3 # Sequence length (e.g., 3 for trigrams - uses last 2 items)
laplace_smoothing: 0.05 # Smoothing factor to handle unseen sequences (avoids zero probability)
References
- Concept based on N-gram language models. See: Jurafsky, D., & Martin, J. H. (2009). Speech and Language Processing. Prentice Hall.
- Wikipedia: Word n-gram language model