
7/15/2025, 10:30:00 AM

Customer acquisition costs are rising fast. For D2C brands, the path to sustainable growth runs through retention, not acquisition. The SegMark Engine is Cosminder's answer—a hybrid segmentation and retargeting system that combines classical RFM analysis with predictive XGBoost modeling to identify, score, and activate high-value customer segments at scale.
Built on 120,000 transaction and behavioral logs from a D2C skincare brand, SegMark delivered a 64.7% uplift in repeat purchase rate and ₹22L in incremental net revenue within six weeks.
The Direct-to-Consumer model gives brands direct access to first-party data—transactions, browsing behavior, engagement signals. But most brands underutilize this asset, relying on generic campaigns that treat every customer the same.
As competition intensifies and CAC climbs, the math is clear:
SegMark was built to turn raw customer data into a precision retention engine.
RFM analysis segments customers along three behavioral dimensions:
Each customer is scored on a 1–5 scale using quintile binning across all three dimensions. This produces intuitive segments like Champions (R=5, F=5, M=5), Potential Loyalists (recent but infrequent), and At-Risk (once-valuable customers who've gone quiet).
RFM gives a clean snapshot of historical value—but it's backward-looking. It tells you who was valuable, not who will be.
To shift from descriptive to predictive analytics, SegMark layers an XGBoost classifier on top of the RFM scores. XGBoost (Extreme Gradient Boosting) is an ensemble method that builds decision trees sequentially, with each tree correcting the errors of its predecessors.
The predictive target: will the customer make a repeat purchase within 30 days?
The RFM scores serve as expert-engineered features—distilling complex transactional history into three high-signal numerical inputs. These are augmented with behavioral features:
This hybrid approach—classical marketing framework plus modern ML—injects domain expertise directly into the model.
The trained XGBoost classifier achieved strong, balanced results on the holdout test set:
| Metric | Score | |--------|-------| | Accuracy | 92.15% | | Precision | 88.32% | | Recall | 85.11% | | F1-Score | 86.68% |
High precision means minimal wasted spend on unlikely converters. High recall means we capture the vast majority of customers who would have converted.
The model outputs a probability score (0.0–1.0) for each customer. Combined with historical RFM data, this creates four actionable segments:
Prime for Upsell — Probability > 0.85 AND high monetary score. The brand's best customers, highly likely to buy again. Strategy: early access, premium product launches, exclusive loyalty rewards.
Nurture to Loyalty — Probability 0.6–0.84. Engaged and interested, but need a gentle push. Strategy: personalized recommendations, free shipping incentives, sample-with-purchase offers.
Win-Back Opportunity — Low recency score AND probability 0.4–0.59. Once-active customers the model still sees potential in. Strategy: multi-channel (email + SMS) win-back with 25% off, 48-hour urgency.
Monitor & Maintain — Probability < 0.4. Low predicted engagement. Strategy: low-cost automated newsletter only. No active spend.
A model sitting in a notebook generates zero revenue. The SegMark pipeline pushes predictions directly into Klaviyo via API:
repeat_purchase_score is written as a custom property via Klaviyo's Profile API.This creates a living system—not a one-time analysis. As customer behavior evolves, the model rescores, segments shift, and marketing flows adapt automatically.
To establish causal impact (not just correlation), 10% of each targetable segment was held out as a randomized control group receiving only generic communications. Results after six weeks:
| Metric | Baseline (Control) | Post-SegMark (Treatment) | Relative Uplift | |--------|-------------------|-------------------------|-----------------| | Repeat Purchase Rate | 17.3% | 28.5% | +64.7% | | Average Order Value | ₹3,200 | ₹3,550 | +10.9% | | Incremental Net Revenue | — | — | +₹22,00,000 |
The ₹22L figure is incremental revenue—additional revenue generated above what the control group baseline would have produced. This is the true, isolated impact of personalized segmentation.
The SegMark architecture—data pipeline → predictive model → marketing automation—is a reusable foundation. Planned extensions include:
SegMark proves that data is not a byproduct of commerce—it is the central asset around which competitive strategy is built. The combination of RFM's intuitive behavioral framework with XGBoost's predictive power creates a segmentation engine that is both interpretable for marketers and performant for data scientists.
The result is a growth flywheel: every customer interaction generates new data, which refines predictions, which drives more effective marketing, which generates more engagement. This continuous loop is what separates brands that scale from brands that stall.
Disclaimer: This case study is based on Cosminder's SegMark Engine implementation for a D2C skincare brand. For technical details, model code, or partnership inquiries, contact support@cosminder.com.
Sources:
A data-driven approach to turning customer behavior into revenue