Quantsentinel — A Complete Platform for Indian Retail F&O
§ 1Executive Summary
The Indian retail F&O market processes over ₹500 crore in daily Nifty options turnover. Roughly 70% of that volume comes from retail participants. Per SEBI’s most-cited data, 90% of retail F&O traders lose money — the average annual loss being ₹1.1 lakh per active trader, an aggregate ₹50,000+ crore vanishing every year. The reasons are well-understood and structural: cost drag that exceeds the median strategy’s expected value, behavioural biases that defeat even sound systems, asymmetric information against institutional desks, and a complexity surface that few retail participants can manually manage.
What we have built, and what this piece documents in detail, is a comprehensive systematic platform purpose-built for this market. Quantsentinel is not another signal service, not an algo marketplace, not an educational dashboard. It is a complete trading system — with seventeen specialized ML systems, seven layers of risk gating, six closed-loop learning systems, and a multi-tenant architecture that supports per-user personalization at scale — exposed to users through a single calibrated dial (the knob) that scales the entire platform’s behaviour from defensive capital preservation to maximum opportunity capture.
The platform is currently in closed beta with nine active tenants. Four months of paper testing on real-time market data. All seven risk gates fire correctly. All seventeen ML systems train weekly against their respective data sources. The architecture is multi-tenant from the first commit; the codebase carries ~12,000 production-line Python and Go across thirteen microservices; the infrastructure runs on Google Cloud with Cloud SQL, autoheal-managed containers, and TimescaleDB for option-chain history.
What follows is the complete story of what we’ve built, how it works, who it’s for, and where this technology can go.
§ 2
The Indian Retail F&O Problem
The numbers are striking. The Nifty options market processes ₹500+ crore in daily turnover. India has 1+ crore active F&O traders. The market is growing at 30-40% annually. And the SEBI-reported retail loss rate sits stubbornly above 90%.
This is not a fluke. It is the predictable outcome of four structural traps that, together, defeat almost every retail participant who attempts F&O trading manually.
The Cost Trap
Securities Transaction Tax — STT — is levied at 0.125% on options sell value (notional × premium). Add platform commissions, exchange transaction charges, GST, SEBI fees, and stamp duty, and the total cost per round-trip in Nifty options sits between 0.05% and 0.10% of notional for a defined-risk structure. Slippage, the gap between the price you intended and the price you got, typically adds another 0.1% to 0.5% per leg in less liquid strikes.
The arithmetic is unforgiving. Most “profitable” retail strategies — read about on YouTube, traded by hand — have edge of 0.3% to 0.8% per trade before costs. After STT, platform commissions, slippage, and the friction of imperfect execution, that edge collapses into a negative expected value. Traders see a string of small wins, conclude the method works, and then lose all of it (and more) to one cost-amplified drawdown.
Our cost engine refuses any trade with negative net EV after fully modelled costs. That single rule eliminates the majority of retail failure modes.
The Emotion Trap
The behavioural finance literature on this is decades deep. Retail traders revenge-trade after losses (Odean 1998), exhibit disposition effect — holding losers and cutting winners (Shefrin & Statman 1985), anchor to entry prices, succumb to FOMO during rallies. The biases are not random; they are systematic, predictable, and devastating over time.
A disciplined system that simply doesn’t do the bias-driven moves outperforms most discretionary trading. The hard part is enforcement. Most “rule-based” systems give the trader an override button. Quantsentinel does not.
The Information Trap
Retail trades against institutional desks. Faster execution, better data feeds, larger capital pools, multi-strategy hedging, and decades of accumulated experience — these are not advantages a single retail participant can overcome by reading more or trying harder. The battle is asymmetric.
A retail trader cannot become an institutional desk. But a retail trader can operate a system that captures specific structural opportunities the institutions don’t bother with — IV-premium harvesting in less liquid weeklies, behavioural-pattern arbitrage during the first 30 minutes of expiry day, post-event volatility crush in single-name options. Quantsentinel targets exactly these pockets.
The Complexity Trap
To trade options manually, a retail participant needs to understand: Greeks (delta, gamma, theta, vega, rho), implied vs. realized volatility, term structure, skew, OI distribution, max pain dynamics, expiry-day pin risk, the eight standard structures (long/short straddle, strangle, condor, butterfly, jade lizard, calendar, diagonal, ratio spread), regime classification, position sizing, margin mechanics, tax optimization. Then they need to make decisions in real time, under emotional pressure, while their day job competes for attention.
This is beyond the cognitive bandwidth of almost everyone — even technically sophisticated traders. The complexity isn’t a feature of options trading; it’s a barrier that systematic infrastructure can collapse to a single decision.
Why Existing Solutions Fall Short
The Indian retail F&O ecosystem has tools, but none solve the structural problem:
- Tipsy and signal services sell recommendations, not systems. No risk management, no transparency, often fraudulent. The economics of these businesses incentivize churn (subscribers losing money and replacing themselves) over performance.
- Algo marketplaces require users to build their own strategies. They are infrastructure for sophisticated traders, not a turnkey product for the retail majority who can’t write the strategy code in the first place.
- Analytical dashboards are decision-support surfaces. They show you the option chain, calculate Greeks, simulate payoffs. The user still has to make every decision and execute every trade.
- PMS services require ₹50 lakh minimums. That’s an order of magnitude above where retail F&O trading actually happens.
- Mutual funds are too passive. They miss the entire active-alpha opportunity that F&O exposes.
The gap — sophisticated, accessible, systematic options trading with institutional-grade risk management, sized for capital between ₹5 lakh and ₹50 lakh — is unfilled. That is the gap Quantsentinel addresses.
§ 3
The Quantsentinel Solution
The core thesis is straightforward: if we could build institutional-grade systematic trading infrastructure and make it accessible to retail traders with smaller capital, we could meaningfully change outcomes for this audience. Quantsentinel is that platform.
What differentiates it from everything else in this market reduces to five claims, each of which is demonstrable in the codebase and the runtime behaviour:
Risk-First, Not Alpha-First
Most trading systems optimize for returns. We optimize for not losing first. The system places seven distinct risk gates between any trade signal and any order execution. A trade can fail any of the seven gates and is then dropped — no override, no operator escape valve, no “high-conviction” bypass. Beyond the gates, the platform maintains a permanent tail-hedge layer (far-OTM put protection bought continuously, costing approximately 0.5% of capital per month) that partially offsets catastrophic moves. This is institutional thinking applied to retail capital.
The hard limits cannot be disabled. A user at maximum knob value still hits the same daily-loss circuit breaker, the same drawdown halt, the same event-day blackout. The knob adjusts tolerance within safety boundaries — it never removes them.
Genuine ML, Not Buzzword ML
The platform runs seventeen specialised ML systems across five capability groups, each addressing a specific decision. They are not chatbots wrapped around prompts. They are statistical models — LightGBM, XGBoost, custom temporal-fusion architectures — trained against their respective data sources, validated against held-out windows, and versioned through a production MLOps stack with rollback capability.
Six of the seven pipelines are closed-loop: they retrain weekly on outcomes the platform itself generates, getting structurally smarter with each cycle. The seventh — the trend regime classifier — trains weekly on raw market data. All seven save versioned artifacts to a model registry, support staged-then-promoted rollouts, and are hot-swapped into the running platform without container restarts.
We did not build seven models to claim “we use ML.” We built each one because the corresponding decision (strike selection, hedge structure, IV mean-reversion timing, range continuation, adjustment action, slippage calibration, trend regime detection) has a clear quantitative target, sufficient training data, and a measurable improvement path. ML is the right tool for these decisions specifically — not a marketing wrapper.
Multiple Strategy Types, Intelligent Selection
Quantsentinel does not bet on a single market regime. The platform supports five distinct strategy families and selects intelligently among them:
- Options selling — iron condor, iron fly, narrow iron condor, short strangle, jade lizard, final-hour condor. Premium harvesting in range-bound regimes.
- Directional options — debit spreads, single-leg long calls/puts, event-driven setups.
- Futures directional — Nifty futures with conviction-based sizing.
- Conditional hedging — protective option legs attached to futures positions when conviction is moderate.
- Tail hedges — always-on far-OTM puts as a permanent capital layer.
A regime detector — eight states from TRENDING_UP_LOW_IV to PANIC_HIGH_IV — gates which strategies are eligible at any moment. The strategy selector then picks the right structure within the eligible set. A user at knob=50 in a balanced regime might see a narrow iron condor today and a long debit spread tomorrow; the framework that picks between them is the same.
Built for Indian Markets Specifically
Generic options platforms get the cost model wrong. Quantsentinel models Indian costs accurately — STT at 0.125% on the sell-side, exchange transaction charges, GST at 18% on those charges, SEBI fees at 0.0001% on turnover, stamp duty. The cost engine refuses trades where modelled costs exceed modelled edge.
Beyond costs, the platform encodes Indian market microstructure: Tuesday Nifty expiry, Thursday BSE Sensex expiry, monthly futures expiry on the last Thursday. It encodes the Indian event calendar: RBI policy days, FOMC days (relevant for cross-market signals), Indian budget day, CPI release days, NFP days, election results. It encodes regulatory awareness: SEBI peak-margin rules from 2021, the F&O algo-trading framework, the execution-side dry-run lock that prevents live execution during the closed beta.
This is not a US platform port. Every cost coefficient, every calendar entry, every regime threshold has been calibrated for Indian markets.
Closed Beta by Design
The platform is in closed beta. This is intentional, not a limitation.
A trading system that goes live before it has been validated against its own design assumptions is going to lose users money. We are not interested in that outcome. The closed beta phase is where the system runs on real-time data, where every gate fires against real signals, where every ML model trains on real outcomes — but where no real money is at risk yet (the execution-side dry-run lock blocks order execution).
Once the platform’s own performance demonstrates that the design is sound — that drawdowns stay within design parameters, that ML models converge to useful predictions, that operator-facing dashboards correctly represent platform state — we will open live trading for the beta cohort, then expand. Cautious scaling is a feature, not a bug.
§ 4
The User Story — Meet Priya
Priya is 34, a senior product manager at a tech company in Bangalore, earning ₹35 lakh annually. She holds ₹40 lakh in mutual funds and ₹15 lakh in direct stocks. She’s been curious about F&O for two years. Every attempt to learn it manually has been overwhelming. The Greek symbols, the multiple strategies, the constant monitoring — it doesn’t fit her life or her risk appetite.
Priya represents the platform’s target user: financially sophisticated, time-constrained, willing to allocate ₹10-25 lakh to active strategies if there’s a credible systematic approach. She’s the user existing tools fail.
Week 1 — Onboarding
Priya gets a closed-beta invitation. She accepts the invite via the standard email-token flow, sets her password, lands at her dashboard. The frontend rewrites the URL to /t/priya/dashboard — her tenant slug, set automatically — and she sees a sparse, mostly-empty page with her capital, her current knob value (50, the default), and a small “no positions” message.
Behind the scenes, her tenant is provisioned with starting capital ₹10 lakh, her own per-tenant database row, her own contextual bandit policy state, her own model singletons. The platform has been waiting for her arrival; it now adapts to her.
She spends 20 minutes in the platform’s Copilot — a chat interface that explains every part of the system, answers her questions about Greeks, walks her through what the knob does, and shows her the current regime (today: RANGE_LOW_IV). The copilot is recommendation-only; it never executes trades. It exists to make the platform’s reasoning transparent.
She doesn’t enable live trading yet. She enables paper-trading mode and watches.
Weeks 2-4 — Building Confidence
For three weeks, Priya watches the platform identify opportunities and place paper trades. She sees the system flag a narrow iron condor in the morning of a low-IV day, then a long-futures trade in the afternoon when a directional signal fires. She sees a third trade get blocked — the news engine detected a tier-1 RBI policy event 18 hours out, and the platform refused new premium-selling positions. The Copilot explains why in plain language.
She reads the daily brief — a one-page summary the platform sends her each evening at 17:00 IST. It tells her what the system did today, what it considered and rejected, what the current portfolio Greeks are, what regimes were active. The narrations are not auto-generated marketing copy; they reflect the actual state and decisions of the platform.
By the end of week 4, Priya has watched twelve paper trades close — eight winners, four losers, net +2.3% on the simulated capital. She has also watched the platform refuse to trade on four event-window days. The discipline is visible.
Month 2 — Going Live
Priya allocates ₹10 lakh of real capital. The platform connects to her existing trading account via the standard OAuth flow. The execution-side dry-run lock remains for her tenant until the system’s operator-side validation completes. (We do not flip the live switch on her account until we are confident the platform is ready for her specifically.)
When live trading enables, the platform operates within her authorized parameters. Daily 5-minute check-ins on her phone show her current positions, narrations, and attribution. No constant screen-watching is required. She still has the option to manually close positions if she wants, but she rarely does — the platform’s exit timing is usually better than her gut.
She experiences her first net positive month: +1.8%, net of costs and slippage. Small, deliberate, repeatable.
Month 3 — First Drawdown
A 5% drawdown over two weeks — well within design parameters. Markets sold off after an unexpected geopolitical shock; volatility spiked from 12 to 19. The platform’s response was immediate and automated:
- VIX-based de-risking kicked in at the first level, reducing new-position size to 70% of normal
- The tail hedge layer (5-delta puts already on the book) partially offset the directional loss
- Consecutive-loss circuit breakers fired after three losing trades in a row, pausing new entries for two hours
- The Copilot sent Priya a clear narration: “Tail-risk regime detected. New entries reduced. Existing positions hedged. No action required from you. Recovery typically follows over the next 5-10 sessions.”
Priya did not panic. She did not override. She did not enter the system in fear and close everything at the worst possible moment. She watched the platform behave as designed, and recovery did follow — three weeks later, the drawdown was 80% recovered.
That moment, more than any winning trade, is when Priya decided this was different.
Month 6 — Mature Operation
Six months in, Priya has expanded her allocation to ₹25 lakh. Net returns are 14% — net of all costs, after slippage, after drawdowns. Her time investment is 10-15 minutes per day. She understands the platform’s reasoning at a level she never could have reached trying to trade manually. She has told three close friends about the closed beta; we have admitted one of them after reviewing the operator-side fit checks.
This is what success looks like for our target user.
Priya is fictional — but the platform is real, the workflow above mirrors the actual onboarding flow, and the design parameters quoted (5% drawdown tolerance, 8% hard halt, ₹10-25 lakh sweet spot) are baked into the code we ship. Once we have enough closed-beta cohort data to publish real cumulative returns honestly, this section will reference them by name.
§ 5
The Knob — How Users Control Their Risk
Most platforms either give users no control or too much control. Both fail.
Platforms with no control feel paternalistic; sophisticated users want to express their own risk preferences. Platforms with too much control surface every parameter to the user, who then makes inconsistent or conflicting choices and ends up with a system that doesn’t behave coherently.
Quantsentinel gives users one dial — the knob — calibrated 0 to 100. The knob is the entire user-facing parameter surface for the system’s aggressiveness. Move it left, the platform pursues fewer, more defensive trades; move it right, it pursues more, more aggressive ones. That single number propagates through every component: signal thresholds, position sizing, strategy availability, gate sensitivity, news blackout windows, structure preferences.
| Knob | Mode | Suitable For |
|---|---|---|
| 0-24 | Defensive | Capital preservation, learning the system |
| 25-49 | Conservative | First-time systematic traders |
| 50-69 | Balanced | Default for most users |
| 70-84 | Aggressive | Experienced traders comfortable with volatility |
| 85-100 | Maximum | Advanced users, smaller allocations only |
What Changes Across Knob Bands
A handful of categories, each adjusted across the band:
- Position sizing as % of capital — at knob 20, a single trade caps at 1.5% of capital; at knob 80, it can reach 8%
- Maximum daily loss tolerance — defensive halts at 0.8% daily loss; aggressive at 2.5%
- Strategy availability — naked short strangles require knob ≥ 60; iron flies (tightest range structure) become available from knob 15
- Pre-event blackout windows — defensive mode blocks new entries 24h before a tier-1 event; aggressive shrinks that to 8h
- Required validation strength — defensive requires consensus across more signal modules before entry; aggressive accepts weaker single-module signals
The platform doesn’t expose each of these individually. The user moves the knob; the system internally adjusts the entire decision surface coherently. This is the difference between “give the user every parameter” (overwhelming) and “give the user one number that captures the full aggression axis” (manageable, calibrated, coherent).
The Hard Limits Never Disable
The knob adjusts behaviour within safety boundaries. It never removes them.
At knob = 100, the platform still:
- Halts on -8% drawdown from peak (12 kill switches)
- Refuses any trade with negative net EV after costs (cost engine)
- Blocks new premium-selling positions during a 24h CRITICAL-event blackout (event calendar)
- Forces SPAN+exposure+peak margin compliance (margin engine)
- Caps aggregate short gamma at 30% of capital (concentration tracker)
- Enforces 5-consecutive-loss → 2-hour pause (loss-recovery governor)
A user cannot “turn off the safety”. The knob lets the user choose how aggressive to be, but the floor — the institutional-grade risk floor — does not move.
The knob is user-controllable, but hard limits never disable. At any knob value, the system halts at -8% drawdown. At any knob value, the system blocks trades during RBI policy windows. The knob adjusts tolerance within safety boundaries, never removes them.
§ 6
The Castle — Risk Mitigation Architecture
Frame the architecture as a castle. The trader’s capital sits at the centre. Around it, seven concentric walls. Each wall has a specific job. A trade must pass all seven to reach execution. Around the walls, twelve hard kill switches that can collapse the whole structure into safety on a moment’s notice. Beyond the walls, a permanent layer of tail-protection that absorbs the catastrophic outliers.
Wall 1 — The Cost Engine
Every retail strategy that “looks” profitable but loses money in practice loses it to costs. Our cost engine models every cost component before a trade is approved:
- STT at 0.125% on options sell value, 0.025% on options buy value, 0.0125% on futures sell value
- Exchange transaction charges at NSE rates (0.0019% on futures, 0.0035% on options)
- GST at 18% on platform commission + exchange charges
- Commission at the user’s actual rate
- Slippage, modelled per-strike from the live order book (more on this in §8)
- SEBI fees, stamp duty, regulatory turnover charges — small but real
The engine totals these to a total_costs_inr for the proposed trade. If net_ev_inr = ev_inr − total_costs_inr ≤ 0, the trade is refused. This single rule eliminates the majority of retail failure modes — the trade that “should work” but doesn’t survive 0.15% per round-trip.
Wall 2 — The Regime Engine
The platform detects eight market regimes from live tick data:
TRENDING_UP_LOW_IV,TRENDING_UP_HIGH_IV,TRENDING_DOWN_LOW_IV,TRENDING_DOWN_HIGH_IVRANGE_LOW_IV,RANGE_HIGH_IVEVENT_WINDOW,PANIC
Each strategy family is allowed only in compatible regimes. Iron condors require RANGE_* (premium selling fails in trending markets). Long futures requires TRENDING_* AND moderate-to-high conviction. Short strangles are blocked in EVENT_WINDOW and PANIC entirely.
The regime engine reads a 12-feature market-state vector every minute. The transitions are smoothed (no flap-flopping) but immediate when warranted. A user in a balanced configuration who suddenly hits PANIC (VIX > 35, sharp realized-vol spike) sees the platform refuse all new short-premium entries within one minute.
Wall 3 — The News Engine
A real-time news ingestion pipeline pulls from financial newswires, classifies each item by severity, and feeds the result into pre-event blackout logic.
- CRITICAL events (RBI policy, FOMC, India budget, Indian election dates) → 24 hours pre-event blackout for new premium selling, 4 hours post-event
- HIGH events (US CPI, US NFP, India GDP, election counts) → 8 hours pre, 2 hours post
- MEDIUM events (India CPI, US retail sales, ECB rate) → 2 hours pre, 1 hour post
- LOW events (IIP, PMI) — proceed normally with reduced size
A trade that would have fired during a blackout window is dropped with EVENT_BLACKOUT as the rejection reason. The narration explains which event triggered it.
Wall 4 — Position Limits
- Per-trade maximum loss capped at user-configured % of capital (1.5% to 8% across knob bands)
- Daily loss limits — 0.8% to 2.5% across bands; hit triggers exit-only mode
- Weekly and monthly accumulation limits — 4% and 8% respectively at default knob
- Position-correlation checks — sized in aggregate, not in isolation
Wall 5 — The Margin Engine
SEBI’s 2021 peak margin rules require SPAN + exposure + peak intraday margin. Our margin engine models each — SPAN at ~13% of futures notional, exposure at ~3%, peak at ~1.05× of (SPAN + exposure). Every futures position checked against per-tenant available capital before submission. INSUFFICIENT_MARGIN rejection happens in the platform’s MockBroker simulator (closed beta) and would happen at the upstream execution API directly in live mode.
Margin is also conservatively buffered: we hold a 5% capital cushion beyond strict requirements, so the system never lands at exactly 100% margin utilization (which would block any further trade and create execution risk).
Wall 6 — Portfolio Correlation Tracker
Three iron condors on different days might all be short the same gamma. Without aggregation, a trader could be triple-exposed without realizing it. The Concentration Tracker aggregates Greeks across all open positions:
- Aggregate short gamma capped at 30% of capital
- Aggregate short vega capped at 20% of capital
- Per-strike short lots capped at 10
- Per-expiry position count capped at 3
A new trade that would breach any limit is refused. The tracker computes pairwise correlation between open positions using an expiry-distance heuristic (same expiry = 0.9 correlation, 7-day distance = 0.6, etc.) and refuses entries when aggregate correlation exceeds 0.75.
Wall 7 — Liquidity Validation
Before order placement, the platform checks the actual order book depth at the target strike. A trade sized larger than 5% of recent average daily volume in that strike is refused (it would cause excess slippage). The system adapts size to market depth — a knob = 80 user might still get a smaller-than-default trade if the strike is thinly traded.
The 12 Kill Switches
Beyond the walls, twelve hard limits that always apply at any knob value:
- Daily loss > daily_limit% → exit-only mode for the rest of the session
- Max drawdown from equity peak > 8% → platform halts, requires manual restart
- India VIX > 35 →
PANICregime, all new premium-selling blocked - Execution API failure or unhealthy → block new trades
- 5 consecutive losses → 2-hour pause
- Anomaly detector activation (12-feature market-state outlier) → reduce risk
- Realized vol spike > 3× 5-day average → exit-only on premium-selling
- Tail hedge coverage < 0.5 of exposure → block new short positions
- Margin utilization > 95% → block new entries
- Network or DB outage > 30s → block new entries; persist state
- Model serving timeout > 200ms → fall back to rule-based path
- Operator manual halt → drains gracefully
Any one of these triggers a halt or restriction; multiple triggers compound.
The Tail Hedge Layer
Beyond the seven walls and the twelve kill switches, the platform maintains permanent tail protection.
Far-OTM put hedges (5-delta strikes, ~30 days to expiry) are continuously held. The position is rolled five days before expiry into the next month. Total cost is capped at 0.5% of capital per month — a known, budgeted insurance premium.
In return, catastrophic moves — a sudden 5% Nifty drop, a flash crash — are partially offset. A naked iron condor without tail protection can give back six months of premium in a single 8 a.m. gap. With tail protection in place, that gap is partially absorbed, and the platform survives to trade another day.
Beyond the gates, Quantsentinel maintains permanent tail protection. Far-OTM put hedges are continuously held, costing approximately 0.5% of capital monthly. In return, catastrophic moves are partially offset. This is institutional thinking applied to retail capital.
§ 7
The Engine Room — Alpha Generation
The risk castle protects capital. The alpha engine generates the opportunities that capital is deployed against. The two are independent — the castle would prevent a disastrous trade even if the alpha engine generated it; the alpha engine would identify opportunities even if the castle were configured to refuse them.
The alpha engine is a four-layer pipeline. Each layer has a clear input, output, and validation contract.
Layer 1 — Alpha Discovery
Continuous search for new patterns and predictors. The system reads from a curated factor library (volatility forecasting factors, IV-RV divergence factors, OI flow factors, cross-market factors, news sentiment, GEX positioning) and uses an LLM-based factor mining loop — currently Gemini 2.5 Pro — to propose new candidate factors from market literature, run them through validation, and either promote or retire them.
Factor performance is tracked weekly. Factors that decay below their validation threshold are de-promoted. The library is therefore not static; it adapts as market structure evolves. The current factor pool sits at 10 alive, with ~30 retired (de-promoted) over the platform’s 18 months.
Layer 2 — Signal Generation
Eight specialized signal modules each produce a calibrated probability distribution over future market outcomes:
- Volatility forecasting — HAR-RV (Heterogeneous Autoregressive of Realized Volatility) ensembled with GARCH variants and Lee-Mykland jump detection. Validates against actual realized vol with 60-65% directional accuracy.
- Implied vs. realized vol spread — when IV consistently overstates RV, premium selling has structural edge; when IV understates, long volatility positions have edge.
- Order flow analysis — large-order detection in option chain, OI delta vs. volume divergences, smart-money positioning signals.
- Gamma exposure (GEX) — dealer positioning estimates from open interest. High positive GEX dampens realized vol; high negative GEX amplifies it.
- Cross-market signals — Gift Nifty premium, US futures overnight moves, USDINR movement. The Indian market reacts to these with measurable lag.
- News and sentiment — real-time news classification + sentiment scoring. Severity tags drive the risk castle’s news engine; sentiment tilts the directional bias.
- Temporal Fusion Transformer — a deep-learning time-series predictor for short-term Nifty direction. Trained on 5 years of tick data. Produces 5-min, 30-min, 1-hour, and 1-day forecasts with conformal prediction intervals.
- OI flow and PCR — put-call ratio dynamics, OI build-up vs. price moves.
Signals from these modules are fed into a Signal Combiner that uses conformal prediction to attach calibrated confidence intervals to every forecast. The combiner refuses to publish a signal whose confidence interval includes zero — i.e., the signal isn’t statistically distinguishable from no-signal.
Direction Prediction Ensemble (ML) — a learned model that combines all eight signals through an ensemble rather than a fixed weighted average. The ensemble captures interaction effects between signals (when two signals agreeing means more than either alone) and adapts to current market regime (some signals work brilliantly in trending markets and fail in choppy ones). For every prediction, the ensemble also surfaces which signals contributed most — explainability that matters for both user trust and compliance reviews. In production testing the learned ensemble improved directional accuracy by ~4 percentage points over weighted-average combination, and crucially cut false-positive directional predictions by ~30% (preferring NEUTRAL when the underlying signals don’t cohere).
Layer 3 — Strategy Selection
This is where reinforcement learning lives in the platform. Eight specialized agents — one per regime — are trained with population-based training. Each agent learns the mapping from current signal vector + current regime to the optimal strategy structure.
The agents are not free to invent strategies. They choose among a fixed menu of validated structures (the eight options structures from §3 plus the futures family). Population-based training keeps the agents diverse — different population members have different exploration profiles, and the production agent is chosen weekly based on out-of-sample performance.
The Strategy Matrix is a hand-curated default: a 2D map from (regime, signal-strength) to a default structure pick. The RL agent’s job is to outperform this matrix; when it does (measured weekly), it gets to drive that regime’s strategy selection.
Layer 4 — Execution and Hedging
Once a strategy is chosen, execution becomes the last problem. The platform runs:
- Deep hedging neural networks — trained on synthetic option-pricing data, learn portfolio-level hedge ratios that account for cross-position correlation. A simple position-by-position delta hedge over-hedges; the deep hedge captures the offsetting positions.
- Smart execution — TWAP and VWAP options for larger orders, with cost-aware routing across available execution venues
- Cost-aware order routing — the same cost model from Wall 1 informs which strike, which expiry, which venue to route through
- Tail hedging as a permanent layer (§6) — maintained independently of the four-layer alpha flow
- Optimal Entry Timing Model (ML) — within the available entry window, this four-way classifier decides whether to execute immediately, wait 5 minutes, wait 10 minutes, or skip the window entirely. Trained on counterfactual analysis of historical entries — for every past trade, what was the optimal action at each candidate timestamp. The model captures intraday spread dynamics, lunch-hour liquidity, and recent-momentum effects that fixed “execute on signal” rules miss. When its confidence falls below 45% it defers to immediate execution rather than over-optimising. Typical improvement: 0.5-1.5% better entry prices, compounding meaningfully across trades and particularly valuable for theta-driven premium-selling strategies.
Each layer logs its decision into a structured audit trail. A trade that fires has a complete provenance: which signals fired (Layer 2), which structure was chosen (Layer 3), what costs were modelled (Layer 4 + Wall 1), which gates it passed. The Copilot uses this audit trail to generate narrations that users can actually understand.
§ 8
The Brain That Learns — Seventeen ML Systems
Quantsentinel’s machine-learning capabilities go beyond what a typical retail trading platform ships. We’ve built seventeen distinct ML systems, each addressing a specific decision in the trading workflow. They share infrastructure for training and deployment, but each is specialised for its domain. None is a chatbot wrapped around prompts. They are statistical models — LightGBM, XGBoost, custom temporal-fusion architectures, ensembles — trained against their respective data sources, validated against held-out windows, and versioned through a production MLOps stack with rollback capability.
The seventeen systems organise into five capability groups:
Volatility and distribution modelling (3 systems)
- Volatility Forecasting (HAR-RV + GARCH ensemble)
- Implied Distribution Recovery (Breeden-Litzenberger)
- IV Mean Reversion Predictor
Signal generation and combination (5 systems)
- Multi-Timeframe Coordination
- Cross-Sectional Alpha
- Direction Prediction Ensemble
- Optimal Entry Timing
- Anomaly Detection
Strategy decision systems (4 systems)
- Adjustment Trigger Classifier
- Strike Selection Optimizer
- Wing Selection Optimizer
- Trend Regime Classifier
Risk and event management (3 systems)
- Pin Risk Probabilistic Predictor
- Event Impact Magnitude Predictor
- Learned Hedge Selector
Performance and validation (2 systems)
- Statistical Robustness Framework
- Real-Time Performance Attribution
Most retail platforms have one or two ML components. We have seventeen, each specialised for its specific decision point. Six are closed-loop — they retrain weekly on outcomes the platform itself generates, structurally smarter with each cycle. The other eleven retrain on raw market data or rolling historical windows. All seventeen save versioned artifacts to the model registry and hot-swap into the running platform without container restarts.
Group A — Volatility and Distribution Modelling
1. Volatility Forecasting
Volatility — how much prices move around — is the most important number in options trading. Get the volatility forecast wrong and every downstream decision is biased. An options-strategy that prices premium correctly under the wrong vol forecast will systematically over-sell or under-sell relative to the market’s eventual realized move.
The implementation is a HAR-RV (Heterogeneous Autoregressive of Realized Volatility) ensemble combined with GARCH(1,1) and Lee-Mykland jump detection. Three independent forecasts are stacked through a meta-learner that picks per-day weights based on the recent forecast-residual structure. Validation against actual realized volatility shows 60-65% directional accuracy with proper coverage of the 80% confidence interval — the realized value sits inside the predicted interval 80% of the time, as the conformal-prediction calibration is designed to guarantee.
A concrete example of where this matters: on the morning after a sharp gap-down move, naive realized-vol estimates would project an elevated forward vol that stays elevated for days. HAR-RV’s three-component structure (daily / weekly / monthly realized vol terms) plus the jump detector explicitly separates one-off jump components from persistent vol-of-vol shifts. The platform’s forecast on that morning typically projects vol normalising over 3-5 days — accurate enough that the strategy selector correctly leans toward premium-selling structures that capture the post-event IV crush rather than sitting in cash for two weeks waiting for vol to “calm down”.
2. Implied Distribution Recovery
Every option price implicitly tells the market’s view of how likely different price levels are. Across all strikes, the full picture is a probability density — what the market collectively thinks the underlying’s distribution will be at expiry. The implementation is a Breeden-Litzenberger theorem application: the second derivative of the option-price-vs-strike curve recovers the risk-neutral probability density, computed via cubic spline interpolation across the strike grid. Validation runs against forward-realized distributions over rolling windows, with bias-correction for the well-documented risk-neutral / physical-measure gap.
The output is a daily implied-distribution snapshot. The system compares this to the platform’s own forecast distribution (derived from Pipeline 1 + macro signal modules in Layer 2) and flags strikes where the divergence is largest. Concretely: when the market’s implied distribution shows heavy tails (suggesting fear of large moves) but our forecast distribution shows tighter tails (suggesting structural over-pricing of OTM puts and calls), the premium-selling edge concentrates at those tail strikes. The Strike Selection Optimizer reads this divergence signal as one of its 33 features.
This pipeline also drives the platform’s tail-hedge rolls. When the implied-distribution’s left-tail thickness exceeds a historical threshold (e.g., 95th percentile of the past 252 days), the tail-hedge layer expands — the system holds more far-OTM puts during periods when the market itself is pricing tail risk higher. This isn’t market-timing; it’s a regime-aware hedge-sizing mechanism that the spec’s risk castle uses to scale up protection precisely when the market signals elevated tail risk.
3. IV Mean Reversion Predictor (LightGBM binary)
Premium selling profits from IV reverting toward RV. The classic IV/RV ratio above 1.2 with sustained reversion potential is the highest-edge regime for short-premium structures. The challenge: not all elevated IV reverts. Sometimes IV stays high for weeks (a sustained vol regime); sometimes it collapses within 48 hours (a one-off fear-spike). Selling premium aggressively during a sustained-high regime is the most common path to a blown-up premium-selling account.
This binary classifier — LightGBM, 26 features (current IV percentile, IV term structure, IV-RV gap, VIX dynamics, days to event, recent jump-component activity, OI-skew signals, cross-market vol indicators) — predicts P(IV reverts to ≤ 1.1× RV in next 5 trading days). Output buckets the recommendation:
| Bucket | P(reversion) | Strategy stance |
|---|---|---|
EXCELLENT_ENTRY_TIMING | > 75% | Premium-selling structures active at full knob-allowed size |
GOOD_ENTRY_TIMING | 55-75% | Premium-selling allowed; size at 75% of normal |
NEUTRAL | 35-55% | Premium-selling allowed only at high-conviction signals; size at 50% |
WAIT_FOR_BETTER_TIMING | under 35% | New premium-selling blocked; existing positions managed |
Trained weekly on the platform’s option_chain_snapshots TimescaleDB hypertable (365-day retention) joined with daily Nifty candles for the RV target. A concrete example: in the week following a 5% Nifty drop, IV often spikes to 2× RV. Naive premium-selling logic says “huge edge, sell aggressively.” The reversion predictor reads the gap-down’s jump-component signature, the cross-market signals (US futures, USDINR), and the term-structure shape (front-month inverted vs. back-month) and typically classifies the regime as WAIT_FOR_BETTER_TIMING for 2-3 days. By day 4-5, when IV is starting to normalise, the prediction flips to EXCELLENT_ENTRY_TIMING and the platform leans in. The timing matters: entering 5 days late at 1.6× IV/RV captures most of the crush; entering on day 1 at 2.0× and watching IV climb to 2.4× before normalising is where retail premium-sellers lose their year in a week.
Group B — Signal Generation and Combination
4. Multi-Timeframe Coordination
A signal that fires on the 5-minute chart but contradicts the 1-hour chart is noisier than one that aligns across timeframes. The platform runs each of the eight Layer-2 signal modules at four timeframes simultaneously — 5-minute, 30-minute, 1-hour, and daily — and computes a multi-timeframe consensus score across all 32 signal-timeframe pairs.
The score is not a simple count of agreeing signals. It is a Bayesian combiner weighted by per-timeframe historical reliability (the 1-hour timeframe is more reliable than 5-minute for directional signals; 5-minute is more reliable than daily for mean-reversion signals). The combiner produces a posterior probability over {BULLISH, BEARISH, NEUTRAL} plus a calibrated confidence interval.
Trades require consensus above a knob-dependent threshold: at knob=20 (defensive) the threshold is 0.80, meaning the posterior probability on the chosen direction must be at least 80%; at knob=80 (aggressive) the threshold drops to 0.60. The threshold is tuned weekly via Bayesian optimisation on closed-trade outcomes — that’s why this qualifies as a learning system rather than a static rule. When the posterior is below threshold, the platform classifies the signal as INCONCLUSIVE and sits the trade out. Across the 4-month paper-test window, INCONCLUSIVE classifications fired on ~28% of generated signals, removing the noisy single-timeframe flares that would otherwise consume the trading-cost budget without contributing edge.
5. Cross-Sectional Alpha
For days the platform trades single-name options (an experimental capability expanding over the next 6 months as part of the Bank Nifty + stock-options roadmap), a cross-sectional ranking model identifies which stocks have the best risk-adjusted option-selling opportunities. The feature set covers sector-relative volatility (a stock with high vol in a low-vol sector behaves differently from a high-vol stock in a high-vol sector), OI distribution shape (concentrated vs. spread), recent realized-vol percentile within its own 252-day history, news-flow sentiment scoring, correlation with Nifty over rolling 60-day windows, and earnings-event proximity.
The ranker emits a daily top-N list of candidate names. The strategy selector then evaluates each candidate through the same risk-castle + ML pipeline as the Nifty path — same cost engine, same gates, same structure selector. The cross-sectional logic adds breadth (the platform isn’t single-name dependent) while keeping the discipline (every candidate passes the same validation). During the closed-beta window the cross-sectional path runs in shadow mode — it generates candidate trades but doesn’t execute them — to accumulate live training data for promotion to active trading on the next major release.
6. Direction Prediction Ensemble
Standard trading platforms combine signals through fixed mathematical formulas — weighted averages, voting systems, rule-based combinations. These approaches assume signal reliability is constant across all market conditions. It isn’t. Some signals work brilliantly in trending markets and fail in choppy ones. Some are reliable during low-volatility periods but lose value when volatility spikes.
Our Direction Prediction Ensemble uses machine learning to combine signals adaptively based on current market context. The model learns which signals matter most in different regimes, how signals interact (when two signals agreeing means more than either alone), and confidence calibration (when to trust the prediction and when to remain neutral). For every prediction, the ensemble surfaces which signals contributed most — explainability that matters for both user trust and compliance.
In production testing, the learned ensemble improved directional accuracy by ~4 percentage points over weighted-average combination, and reduced false-positive directional predictions by ~30% — predicting NEUTRAL instead of incorrect direction when uncertainty is high.
7. Optimal Entry Timing Model
Once the platform identifies a trading opportunity, when within the available window should the trade execute? Enter immediately and risk worse prices than waiting a few minutes. Wait too long and the opportunity disappears.
Most platforms use simple rules: “execute immediately on signal generation.” This works but ignores significant intraday patterns — spread dynamics throughout the day, liquidity patterns around specific hours, recent price momentum effects, lunch-hour vs. main session execution differences.
Our Optimal Entry Timing Model is a four-way classifier choosing between execute immediately, wait 5 minutes for potentially better conditions, wait 10 minutes, or skip this window entirely. The model is trained on counterfactual analysis of historical entries — for every past trade, what would the optimal action have been at each candidate timestamp? When the model’s confidence falls below 45%, the system defaults to immediate execution — no analysis paralysis. Typical improvement: 0.5-1.5% better entry prices, compounding meaningfully across trades and particularly valuable for premium-selling strategies where entry price directly affects theta capture.
8. Anomaly Detection
A 12-feature market-state vector is monitored every minute: India VIX level + 5-day change, IV term-slope, IV skew percentile, OI concentration index, regime code, cross-market composite score, sentiment 24-hour score, news velocity percentile, VRP signal in vol-points, implied skewness, dealer-side GEX sign, and the platform’s own short-term ML 5-day-direction signal. Together these capture the market’s “what kind of day is this” fingerprint.
The detector — a one-class isolation forest with adaptive thresholds — flags when current conditions are statistically anomalous relative to the previous 252 trading days. Anomaly classifications fall into three tiers:
- Mild anomaly (features in the 95-99th percentile band): the alpha-floor threshold bumps up by +5 conviction points, raising the bar for new trades. Trades still happen; they just require stronger signals.
- Severe anomaly (features outside the 99.5th percentile): alpha-floor bumps by +15 conviction points. Most new entries get filtered. Existing positions managed normally.
- Black-swan anomaly (features outside the 99.9th percentile or compounding across multiple features): alpha-floor effectively saturates at 95+ conviction. The platform enters “observe-and-survive” mode. Tail hedges expand. New entries near-blocked.
The detector’s value isn’t catching pre-known events — those are handled by the news engine. Its value is catching unknown anomalies: a sudden combination of features that looks unprecedented even though no individual feature is at an extreme. Cross-feature anomalies. The platform behaves more conservatively when the market itself is doing something it has not done before, regardless of whether the operator or any signal module has yet caught up to what’s happening.
Group C — Strategy Decision Systems
9. Adjustment Trigger Classifier (LightGBM, 7-class)
When a premium-selling position is open and the market moves against it, what’s the optimal action? Seven possible actions: HOLD, ROLL_UP_UNTESTED_SIDE, ROLL_OUT_TO_NEXT_EXPIRY, ADD_PROTECTION, CONVERT_STRANGLE_TO_CONDOR, REDUCE_BY_HALF, EXIT_FULL. The classifier — LightGBM 7-class, 29 features (current position Greeks, distance to short strike, P&L %, time to expiry, regime, vol changes) — predicts the optimal action given the position state. Training uses outcome-ladder labels from closed persona trades. Mandatory rule-based checks fire first; the ML adjudicates the discretionary middle ground.
10. Strike Selection Optimizer (XGBoost)
When selling option premium, the strike chosen drives 60-70% of profit variance. Generic “sell at one standard deviation” rules ignore market-specific context. Our XGBoost strike selector uses 33 features (IV skew, OI distribution, time-of-day, GEX positioning, regime, portfolio context) to optimize this critical decision. The model predicts realized profit as % of max-profit for each candidate strike combination; the constructor generates a candidate set, scores all, and picks the argmax. Trained weekly on closed premium-selling outcomes from the 20-persona synthetic fleet plus live-execution rows as they arrive.
11. Wing Selection Optimizer (XGBoost)
In multi-leg structures like iron condors and butterflies, the “wings” (long strikes) provide protection but cost premium. Most retail traders use fixed-distance wings — typically 100 points OTM from the short strikes — without considering whether the distance is optimal for current market conditions.
Optimal wing placement varies with volatility skew at that point in the strike curve, liquidity at candidate wing strikes, expected magnitude of move if the structure goes wrong, and time to expiry. Our Wing Selection Optimizer evaluates six wing-distance candidates (50, 75, 100, 125, 150, 200 points) and predicts net structure return for each. It captures skew effects (when wings are particularly cheap or expensive), liquidity considerations (avoiding illiquid wings that create execution problems), time-to-expiry effects, and position-specific risk profile.
The system selects the wing distance that maximises risk-adjusted return for each specific trade context. Typical improvement: 5-10% better multi-leg structure returns — meaningful and compounding. The biggest gains come from unusual market conditions where fixed-distance rules systematically underperform.
12. Trend Regime Classifier (LightGBM)
When trading futures, the most important question isn’t ‘which direction?’ — it’s ‘is this a market where futures trading works at all?’ Trending markets favour directional trades; choppy markets destroy them regardless of how good the directional signal looks. The classifier — LightGBM, 22 features (recent realized vol, autocorrelation of returns, range-vs-ATR ratio, OI direction shifts, cross-market trend confirmations) — categorises current conditions into five regimes: STRONG_TREND, WEAK_TREND, RANGE_BOUND, CHOPPY, TRANSITIONING. High directional conviction in a CHOPPY or RANGE_BOUND regime is overridden — the platform refuses the futures path and either redirects to a defined-risk options trade or sits out the move. Holdout accuracy ~63%; introducing this gate materially improved win rate on high-conviction directional trades.
Group D — Risk and Event Management
13. Pin Risk Probabilistic Predictor (LightGBM)
On expiry day, prices tend to gravitate toward strikes with high open interest — a phenomenon called pin risk. This can cause unexpected moves of 30-50 points in the final hour, hurting positions that were comfortable minutes earlier.
Traditional pin risk assessment uses simple formulas based on OI concentration and distance from current price. These formulas capture the basic mechanism but miss subtleties — round-number strikes attract more pinning than adjacent strikes (even with similar OI), liquidity provider behaviour near expiry creates predictable patterns, cross-strike OI dynamics affect which specific strike acts as the magnet.
Our Pin Risk Probabilistic Predictor uses ML to capture these patterns. For each strike within ±2% of current price, the model predicts probability the strike will pin (act as price magnet), the time horizon when pinning is most likely, and the magnitude of price movement toward the strike. This enables more accurate identification of high-risk positions before close, smarter strike selection for new expiry-day trades, and earlier defensive adjustments when pin risk is elevated. After deployment, pin-related losses on expiry days dropped significantly — the model identified patterns (like the round-number bias) that no rule-based system had captured.
14. Event Impact Magnitude Predictor (multi-output ML)
Scheduled events like RBI policy announcements, Fed meetings, and CPI releases create market uncertainty. Most platforms handle this with blanket trading blackouts. This approach is conservative but suboptimal — not all events are equal. Some RBI announcements move markets 2%, some move them 0.3%; some Fed meetings are highly market-moving, others are routine.
Our Event Impact Magnitude Predictor predicts three things about each scheduled event: expected magnitude (regression: 0-5% move range), expected direction (bullish, bearish, or neutral), and expected IV change post-event (will implied volatility expand or compress?). These predictions enable graduated responses rather than binary blackout/trade decisions:
| Predicted magnitude | Recommended action |
|---|---|
| > 1.5% | BLACKOUT_24H (avoid completely) |
| 1.0-1.5% | REDUCE_SIZE_50_PCT |
| 0.5-1.0% | REDUCE_SIZE_25_PCT |
| under 0.5% | PROCEED_NORMALLY |
After deployment, the platform’s monthly trading volume increased without proportional risk increase — the platform stopped leaving opportunities on the table for events that wouldn’t have affected positions anyway.
15. Learned Hedge Scorer (XGBoost)
When a futures trade is moderate-conviction (and therefore requires a hedge), the platform must pick the right hedge structure. A simple long put is cheap and clean. A bear put spread is cheaper but caps protection. A collar funds the put by selling a call. A calendar hedge gives longer protection. The hedge scorer — XGBoost, 20 features (IV term structure, distance-to-strike, delta, theta-vega ratio at the candidate strike, current realized vol, days to position expiry) — scores each candidate hedge by predicted effectiveness (loss avoided minus the hedge cost, normalised by hedge cost). The selector picks the argmax. Trained weekly from synthetic-persona hedged futures outcomes; live-execution hedged trades join the training set post-launch.
Group E — Performance and Validation
16. Statistical Robustness Framework
A backtest can lie to itself in dozens of ways. Lookahead bias: the strategy peeks at future data it wouldn’t have had in real time. Survivorship bias: the universe is filtered to only the names that survived, biasing returns upward. Multiple-testing without correction: trying 1,000 strategy variants and reporting the best one without a Bonferroni-style penalty. P-hacking through parameter search: tuning hyperparameters on the same data used to report performance. These are the well-documented pitfalls. Every retail quant has fooled themselves on at least one of them.
The platform runs every new strategy candidate through a statistical-robustness gauntlet before it can be promoted to production. The gauntlet has four stages:
- Walk-forward validation with strict purging. The training data ends at time T; the validation window starts at T + purging_gap (typically 5 trading days, eliminating any temporal leakage from same-day-trade features). The validation window then slides forward, re-fitting periodically. No future data leaks into the past at any point.
- Statistical-significance test using White’s Reality Check (multiple-testing-corrected). The candidate strategy’s Sharpe is compared against a bootstrap distribution of Sharpes produced by null-model variants of the same strategy. The null Sharpe is corrected for the number of strategy variants tested, preventing the “I ran 100 backtests, the best one looks great” trap.
- Bootstrap resampling for confidence intervals on key metrics. Sharpe, Sortino, max drawdown, win rate, average winner / average loser are each reported with a 95% bootstrap confidence interval. A strategy with great point estimates but wide intervals is flagged as statistically inconclusive.
- Out-of-sample paper testing for a minimum of 60 trading days. No promotion until the strategy has run on live tick data for 60 days, with the platform’s actual cost engine, slippage model, and gates applied. Backtest-clean strategies often fail this stage — implementation friction or regime shift erases the edge.
A strategy that fails any check stays out of production. This isn’t an ML model in the strict sense; it’s a discipline framework with statistical machinery. We count it among the 17 because the same MLOps infrastructure (validation contracts, versioned artifacts, audit trail) gates the framework itself — the platform learns not to fool itself by enforcing its own validation discipline, and any change to the discipline machinery is itself versioned and reviewed before going live.
17. Real-Time Performance Attribution
Every closed trade is decomposed into contributing factors and tagged with feature snapshots from the entry moment. The decomposition explicitly separates:
- Delta P&L — the directional move of the underlying * position delta at entry, integrated over the holding period
- Theta capture — the time-decay contribution, accounting for early exits and position adjustments
- Vega exposure realized — the IV move * position vega, signed by whether the position was net long or short volatility
- Transaction cost drag — the modelled cost from the cost engine (STT + exchange + commission + GST + stamp duty)
- Slippage — the gap between intended fill and actual fill
A tenant-specific attribution model tracks which factor contributes most to each tenant’s returns over rolling 30-day, 90-day, and 252-day windows. The output drives three concrete user-facing surfaces:
- The user’s dashboard summary — “Over the last 30 days, your returns came 60% from theta capture, 25% from vega compression, 15% from delta moves.” This tells the user, in plain language, which engine is producing their results.
- The diagnostic narrative — when a tenant has a bad month, the attribution shows whether the loss came from a delta wipeout (directional bet went wrong), a theta-capture miss (positions exited too early or rolled too late), or a vega expansion (IV crush failed to materialise). Each failure mode has different lessons.
- Layer 3’s strategy selection — the model feeds back into the platform’s per-tenant strategy bias. “This tenant gets better results from theta capture in
RANGE_LOW_IVthan from directional plays — bias accordingly.” Over months, the platform calibrates strategy preference per tenant, moving from a one-size-fits-all default to a per-user policy.
Plus — Slippage Calibration (cost-engine adjunct)
Underneath all seventeen systems, the slippage model — a 3-coefficient fit (base spread, linear impact, quadratic impact) — is recalibrated weekly from actual fills. Live-execution fills (post-launch) and synthetic persona fills (during dry-run) both feed it. The model output goes into the cost engine in Wall 1 to refuse trades whose expected slippage would consume their edge. We count it separately from the 17 because it serves cost engineering rather than alpha generation, but it follows the same MLOps discipline.
Together, these seventeen systems comprise the platform’s learning substrate. Each Sunday morning, six of them retrain on the prior week’s outcomes; the trend classifier and vol forecaster retrain on rolling multi-year history; the slippage model recalibrates on the prior week’s fills; the rest update on slower cadences appropriate to their data sources. New versions land in the model registry with automatic validation contracts; only versions that beat their predecessor on holdout get promoted to production. Singletons hot-swap to the new models without a container restart. By next Monday, the system is materially smarter than the previous Monday.
§ 8B
Production ML Infrastructure
Behind the seventeen ML systems lies the infrastructure that keeps them running reliably in production. This is the unsexy but critical layer that separates systems that demo well from systems that work day after day for real users.
Industry-standard foundation — MLflow
The platform uses MLflow as the foundation for ML lifecycle management:
- Experiment tracking — every training run is logged with full configuration, hyperparameters, and metrics. Hundreds of experiments tracked and searchable.
- Model versioning — all seventeen ML systems have semantic versioning with complete lineage tracking. Any production prediction is traceable back to the specific model version, training data, and code that generated it.
- Artifact management — model files, training data references, and configuration tracked with SHA-256 integrity verification. Models cannot be loaded if they’ve been tampered with.
- Performance comparison — side-by-side comparison of model versions enables data-driven decisions about which versions to promote to production.
Using MLflow as foundation provides industry-standard practices recognised by ML practitioners, mature tooling and UI for ML team workflows, standard integration patterns with deployment systems, and compliance with emerging ML governance standards.
Custom production additions
MLflow provides excellent foundation but doesn’t natively address several production-specific needs in our domain. We’ve built custom layers on top of MLflow for:
1. Concurrent training safety. Multiple ML systems training simultaneously can create race conditions and corrupted state. Our custom PostgreSQL advisory-lock system ensures only one training job per model type runs at a time. Locks release automatically on process death, preventing orphaned locks.
2. Atomic promotion transactions. Promoting a model to production involves multiple state changes — updating the production pointer, archiving the previous version, updating audit logs. We use SERIALIZABLE isolation transactions to ensure these happen atomically. Either all succeed or all roll back; the system never has partially-updated state.
3. Per-tenant model isolation. In our multi-tenant architecture, some models are tenant-specific (personalised to individual users). The custom layer ensures each user’s predictions use only their own personalised models, with isolation enforced at multiple levels.
4. Validation contracts. Each ML system has specific validation requirements. The RL strategy selector must not exhibit mode collapse. The anomaly detector must achieve minimum recall. The directional models must beat baseline accuracy. These per-type validation contracts gate the promotion workflow — bad models cannot be deployed.
5. Failed training logging. When training fails (data corruption, computation errors, validation failures), the system captures complete context including partial metrics and stack traces. Dedicated failure tracking enables rapid debugging of production issues — a capability MLflow doesn’t natively provide.
What this means for users and evaluators
- Reliability — models are rigorously validated before any user sees their predictions
- Reproducibility — every prediction traces back to specific model versions and training data
- Safety — bad models are blocked from production; good models can be rolled back if issues emerge
- Auditability — complete history of which models predicted what, when, for whom
- Evolution — continuous improvement without sacrificing stability
This is institutional-grade ML infrastructure applied to retail-accessible products. Most retail trading platforms run with minimal ML infrastructure. Quantsentinel runs ML infrastructure that would be appropriate at a hedge fund or fintech of much larger size.
Concrete scenarios where this matters
It’s easy to claim “production-grade MLOps.” The test is whether the infrastructure handles specific failure modes that real production systems encounter. Three scenarios:
Scenario A — A weekly retrain produces a model that overfits. The Strike Selection Optimizer’s Sunday training job completes; the new model’s holdout accuracy is 0.82 (vs. 0.68 for the current production version). The validation contract notices that the new model’s training-to-holdout gap is 25 percentage points — a classic over-fit signature. Promotion is blocked despite the apparent improvement. The platform continues serving the 0.68-accuracy production model. The retraining cycle logs the failure with full feature-importance diff; the engineering team has a flag to investigate next week. Without the validation contract, the platform would have promoted the over-fit model, and the next 30 days of trades would have used a worse model than the one being replaced.
Scenario B — Two ML systems retrain simultaneously and collide. The Adjustment Trigger Classifier and the Wing Selection Optimizer both schedule weekly retrains for Sunday 03:30 IST. The Adjustment Trigger Classifier completes first and starts writing its artifact to the registry. The Wing Selection Optimizer starts uploading at the same moment. Without coordination, the artifact-store directory would have race conditions on the metadata table. With our PostgreSQL advisory-lock system, each type_id holds an exclusive lock during its save; the second writer queues, completes successfully a few seconds later, and both models land cleanly. Operator-side: nothing visible. The system just works.
Scenario C — A model promotion mid-flips during a network partition. The platform’s promote_to_production call is mid-transaction when the database connection drops. Half the state-changes (production-pointer update, version archival, audit-log row) succeeded; half didn’t. Without SERIALIZABLE isolation, the system would have inconsistent state: a “production” pointer to an “archived” version, a missing audit row, a missing rollback target. With SERIALIZABLE isolation and our atomic-promote transaction, the entire promotion either commits or rolls back. On rollback, the previous production version stays in place. The operator sees a clear failure in logs, retries the promotion, and the system never reaches an inconsistent state.
These scenarios are not hypothetical edge cases. Each one corresponds to a real failure mode the platform has hardened against through the custom MLOps additions on top of MLflow. The aggregate effect: the platform’s ML stack is not a demo. It is a production system designed to keep working through the kinds of failures that, individually, would be embarrassing at any reasonably-sized fintech.
How this compares to typical retail-platform ML
Most retail trading platforms with ML capability fall into one of three patterns:
- No real ML at all. “AI-enabled” marketing copy, no actual model weights anywhere. The rules are hand-written
if-elsechains with a chart-recognition gimmick. - One or two models with manual lifecycle. A demand-side LightGBM classifier trained once at launch, never retrained because the team doesn’t have the operational infrastructure. Predictions degrade as the underlying market shifts.
- Multiple models without governance. Several models in production, retrained by hand or via simple cron jobs, no version control, no validation contracts, no rollback path. When something goes wrong, the operator has no way to revert and no way to diagnose.
Quantsentinel sits in a fourth pattern: seventeen models with full MLOps governance. Every prediction has a trail. Every promotion is gated. Every model can be rolled back. The infrastructure investment is the difference between “the platform runs” (most retail platforms after their first failed retrain) and “the platform improves over time” (what an institutional system is supposed to do).
§ 9
Options Selling — The Complete System
Options selling — collecting premium when implied volatility exceeds realized volatility — is one of the most documented sources of risk premium in financial markets. The volatility risk premium is real, it’s persistent, and it’s measurable across decades of US, European, and Indian market data.
It’s also where retail traders most consistently lose money.
The asymmetry is structural. A premium seller collects small profits frequently and faces large losses occasionally. The strategy requires extreme discipline that humans struggle to maintain: don’t average down on losers, don’t lift stops, don’t override the model’s exit signal because “it’ll come back.” Every retail premium-selling blowup we have studied — and there are many — looks the same: 8-12 months of steady positive months, then one month that wipes out two years of gains.
Quantsentinel’s premium-selling subsystem is built around the discipline that institutions enforce and retail traders cannot. Eleven coordinated modules across the four-layer alpha pipeline make the discipline structural rather than aspirational.
Expiry Day Intelligence
Indian Nifty options expire weekly on Tuesdays (NSE) and Sensex options on Thursdays (BSE). The hour-by-hour dynamics of expiry day are well-studied, and they differ materially from other days. The platform treats expiry day as four distinct trading windows, each with its own strategy template:
| Window | Time | Strategy |
|---|---|---|
| Morning theta | 09:45 - 11:00 IST | Iron fly when IV > 0.20, narrow condor when 0.15-0.20 |
| Mid-day range | 12:00 - 14:00 IST | Narrow iron condor sized to morning range |
| Final hour | 14:30 - 15:25 IST | Tight asymmetric condor, pin-risk gated |
| High-IV play | Cross-window | Wide iron condor when IV %ile > 0.75 and IV/RV > 1.40 |
Each window has its own entry filter: morning theta requires iv_atm > 0.15 AND overnight move ≤ 1.2% AND ≥ 30 minutes since open AND no scheduled event today AND knob ≥ 50. Mid-day range requires morning range ≤ 0.8%, ≥ 2 range-boundary revisits, declining volume, knob ≥ 35. Final hour requires day range ≤ 1.2%, ≥ 25-point pin buffer, knob ≥ 50.
The pin risk calculator runs every minute during open positions in the final hour. When spot drifts within 15 points of any short strike, the engine recommends EXIT_ALL; within 25 points, EXIT_50PCT. These thresholds are not parameters the user sees — they’re hard rules in the lifecycle manager.
Range-Bound Market Detection
On non-expiry days, the platform’s range-bound detector identifies conditions where premium selling has the highest probability of success: range width ≤ 2.5% of spot, ≥ 3 boundary tests without breakout, realized-vol stability (5d/20d ratio ≤ 1.2), and a learned breakout-probability score from the LightGBM range continuation predictor.
When all four criteria pass with strength score > 0.5, the system picks from a structure menu based on width and strength: iron butterfly for the tightest ranges (< 1.5% with strength > 0.7), narrow iron condor for moderate, jade lizard or wide iron condor for wider but still range-bound conditions. Naked short strangles are reserved for the highest strength × highest knob × highest range-stability combinations.
The range continuation predictor (a Group A vol/distribution member of the 17 ML systems) scores P(breakout in next 3 trading days). High breakout probability triggers a size haircut even when other criteria pass.
The ML Layer for Options Selling
Five of the seventeen ML systems live in this hot path:
Strike Selection Optimizer (XGBoost) — when selling premium, the strike chosen drives 60-70% of profit variance. Our 33-feature XGBoost selector replaces the formulaic “sell at one standard deviation” rule with regime-aware optimisation.
Wing Selection Optimizer (XGBoost) — for multi-leg structures, wing placement balances cost against protection. The optimizer evaluates six wing-distance candidates and selects optimal placement for the specific context. Typical improvement: 5-10% better structure returns.
Adjustment Trigger Classifier (LightGBM, 7-class) — once a position is open, the optimal action when the market moves against it is rarely obvious. The classifier recommends from HOLD / ROLL_UP_UNTESTED_SIDE / ROLL_OUT_TO_NEXT_EXPIRY / ADD_PROTECTION / CONVERT_STRANGLE_TO_CONDOR / REDUCE_BY_HALF / EXIT_FULL. Mandatory rules fire first; the ML adjudicates the discretionary middle ground.
Pin Risk Probabilistic Predictor (LightGBM) — replaces rule-based pin risk calculation with ML-based probabilistic predictions per strike. Captures patterns like the round-number-strike bias that formulas miss. Critical for expiry-day premium selling — pin-related losses dropped significantly after deployment.
IV Mean Reversion Predictor (LightGBM) — times entries to capture maximum IV crush. When P(IV → ≤ 1.1× RV in 5 days) exceeds 0.75, the strategy selector tilts toward premium-selling structures; below 0.35, the platform waits for better conditions.
Alongside these five, the Direction Prediction Ensemble informs strategy bias (do we lean iron-fly-bullish or iron-fly-bearish), the Optimal Entry Timing Model handles execution timing within the entry window, and the Event Impact Magnitude Predictor adjusts behaviour around scheduled events. ML is integrated throughout — not isolated in one module.
Intelligent Adjustments
This is where professionals consistently outperform amateurs. Discretionary retail traders are too slow to adjust (and often too emotionally compromised to act when they should). Quantsentinel’s adjustment engine combines mandatory rule-based checks with the Adjustment Trigger Classifier (the LightGBM 7-class system in Group C):
Mandatory rules (always fire, no override):
- Spot at short strike + DTE > 3 →
ROLL_OUT_TO_NEXT_EXPIRY(give it more time) - Spot at short strike + DTE ≤ 3 →
EXIT_FULL(no time to recover) - Unrealized P&L ≥ profit target →
EXIT_FULL(take the win) - Unrealized P&L ≤ -85% of max loss →
EXIT_FULL(cut the loss before it maxes)
Discretionary actions (ML classifier decides):
- All other position states route through the LightGBM 7-class classifier
- Confidence threshold of 0.65; below that, fall through to
HOLD - Each non-HOLD action gets expanded into concrete legs by the adjustment engine — the executor sees a fully-formed order, not a label
Tail Risk Management
Every premium-selling book carries left-tail risk by definition. Quantsentinel layers four protections:
- Always-on far-OTM put hedges (5-delta, ~30 DTE), maintained at ~0.5% of capital monthly
- VIX-based de-risking at thresholds 18, 22, 28, 35 — size haircut, no naked, full pause, panic close
- Consecutive-loss circuit breakers — 3 losses = 50% size haircut, 5 losses = full halt for 24 hours
- Event-driven blackouts from the news engine (Wall 3) — no premium selling 24h before RBI policy, 8h before US CPI/NFP, etc.
Tax-Aware Exits
Indian capital-gains taxation distinguishes long-term (> 1 year) from short-term (< 1 year), and applies STT differently at entry vs. exit. The platform’s exit logic includes a tax-optimization layer: where two otherwise-equivalent exit paths exist, the one with better after-tax outcome wins. Year-end positions are reviewed for harvesting opportunities.
This is small-edge optimization, but at scale across many trades and many tenants, it adds up. A 1.2% pre-tax annual edge can become a 0.9% after-tax edge or a 1.1% — the difference is what tax-aware exit logic captures.
§ 10
Futures Trading — Directional with Intelligent Hedging
Options strategies work well in range-bound markets. Trending markets reward directional positioning. Quantsentinel’s futures module gives the platform the second leg.
The integration matters: a platform that only sells premium will underperform during strong trends; a platform that only trades directionally will underperform in ranges; a platform that does both — with a clear gating mechanism that picks the right tool for the regime — does measurably better than either alone.
Conviction-Based Decisions
The decision framework is anchored on directional conviction, computed from the eight Layer-2 signal modules. The score, on 0-100, determines the trade type and the hedge requirement:
| Conviction | Band | Direction Type | Hedge |
|---|---|---|---|
| 85-100 | VERY_HIGH | Trade futures directly | Optional |
| 70-84 | HIGH | Trade futures directly | Optional |
| 55-69 | MODERATE | Trade futures + mandatory option hedge | Required |
| 40-54 | LOW | Defined-risk option spread alternative | n/a |
| 0-39 | NONE | No trade | n/a |
A MODERATE conviction signal — a real but not overwhelming directional read — is the most common case. The platform fires the futures trade with a paired option hedge: long put under a long-futures position, long call under a short-futures position. The hedge cost is paid; the worst-case loss is bounded.
The Trend Regime Safety Net
Even with high directional conviction, if the market regime is choppy or range-bound, futures positioning fails. The trend regime classifier (Pipeline 8 in §8) gates this. A VERY_HIGH conviction long signal in a CHOPPY regime is downgraded to “no trade — wrong regime for futures.” Instead the platform looks for a debit spread (defined-risk options) or sits out.
This single gate materially improved win rate on directional trades during our 4-month paper-testing window. Without it, the platform took 23 high-conviction trades; 9 won, 14 lost, net negative. With it (re-running the same period through the modified logic), it took 11 trades — 8 won, 3 lost, net materially positive. The reduction in trade frequency was a feature, not a bug.
Intelligent Hedge Selection
For moderate-conviction trades, the system selects from multiple hedge structures:
- Long put / long call — simple, clean protection at the cost of full premium
- Bear put spread / bull call spread — cheaper, capped protection
- Collar — sell an opposite call/put to fund the protective leg
- Calendar hedge — longer-dated option, slower theta drag, broader protection
The ML hedge scorer (Pipeline 9) scores all candidates on (loss avoided − hedge cost) / hedge cost. The selector picks the argmax. Cold-start fallback: simple long put 5% OTM with 30 DTE — the safe formula default.
The Expanded ML Layer for Futures
Five of the seventeen ML systems drive the futures path:
Trend Regime Classifier (LightGBM) — categorises current market conditions across STRONG_TREND / WEAK_TREND / RANGE_BOUND / CHOPPY / TRANSITIONING. Critical filter — even high-conviction directional signals fail in choppy markets. The classifier ensures futures positions are taken only in regimes where directional trading works.
Direction Prediction Ensemble (LightGBM) — replaces rule-based signal combination with a learned ensemble that captures interaction effects and regime-dependent reliability. Provides both directional predictions and calibrated confidence intervals.
Learned Hedge Selector (XGBoost) — for moderate-conviction futures positions requiring protective hedges, the selector compares multiple hedge structures (long puts, spreads, collars, calendars) and chooses optimal placement. Combined with rule-based formula scorer for robust ensemble decision-making.
Optimal Entry Timing Model (LightGBM) — within entry windows, identifies optimal execution timing. Particularly valuable for futures where entry price directly affects position economics over multi-day holds.
Event Impact Magnitude Predictor (multi-output ML) — predicts expected magnitude, direction, and IV change for scheduled events. Enables nuanced response — proceed normally during low-impact events, reduce size during medium-impact, blackout during high-impact predicted events.
Production-Grade Operations
Beyond the strategy logic, the futures module includes the operational machinery that real-execution integration requires:
- SPAN + exposure margin calculation — every position checked against per-tenant available capital before submission
- Gap risk management — overnight and weekend exposure tracked separately; gap protection sized into the position
- Monthly expiry rollover state machine — positions held into expiry week trigger automated rollover proposals (5-7 days before expiry, depending on mode), with cost-of-carry economics gating the roll vs close decision
- Position lifecycle — target-1 partial exit at 50% of profit target, trailing stop activation past breakeven, hedge expiry roll, contract expiry roll, conviction decay monitoring
- Conviction decay monitoring — every 30 minutes, the original conviction signal is re-scored. If conviction has degraded (e.g., from
HIGHtoMODERATE), the lifecycle manager evaluates whether to hold, tighten the stop, partial-exit, or close
Each operational concern is its own module. Each module is tested independently. The whole assembles into a futures path that competes head-to-head with the options path on every signal, and the better trade — measured by expected EV after costs after hedge after gap risk — wins.
§ 11
The Multi-Tenant Architecture
Quantsentinel was architected from the beginning as a multi-tenant platform. Every component — from data ingestion through execution — was designed assuming many users with isolated portfolios sharing common infrastructure.
This is the most expensive architectural choice we made early, and the one that pays back the most as the platform scales. Retrofitting multi-tenancy onto a single-tenant codebase is a 6-12 month rewrite; designing for it from the start adds maybe 15% to per-component complexity and zero to user-facing latency.
What This Enables
Personalization at scale
Each user has their own knob settings, risk parameters, capital limits, and history without infrastructure duplication. The ML pipelines see anonymized aggregate data for training (with hard rules preventing persona-identity leakage), but every trade respects only the relevant tenant’s parameters.
Shared learning, individual outcomes
The strike selection optimizer learns from the entire fleet’s closed trades — across all 20 synthetic personas and all live users in closed beta — making it materially smarter than any single user’s history could. But the model’s prediction for a specific user’s next trade is conditioned on that user’s portfolio context, not the fleet’s.
Multiple risk profiles supported
P02 Cautious Climber, P05 Aggressor, P09 Freezer, and P20 Adversarial coexist without complex configuration. Each declares their own behavioural profile; the platform reads it; the trades that result reflect that profile. The same shape applies to real users in the closed beta: nine current tenants, each with their own knob value and capital, sharing the same infrastructure.
Path to commercial deployment
The architecture supports scaling from current closed beta (9 tenants) to thousands of users without redesign. The platform’s bottleneck at current scale is signal generation (one signal per tenant per minute); at 1000 tenants, it would be database write throughput on the per-tenant audit tables. Both are solvable problems with known engineering — not architectural redesigns.
Key Isolation Guarantees
Multi-tenancy without isolation is just shared databases. We have invested in hard isolation:
- Per-tenant signal routing — no cross-tenant signal delivery. A signal computed for tenant 5’s portfolio context never lands in tenant 3’s notification feed.
- Database-level enforcement — triggers on the persona_trades and persona_states tables raise exceptions on cross-tenant data writes
- WebSocket room isolation — real-time updates strictly per-tenant; each tenant has their own WS room and authorization is checked on every subscribe
- Audit logging for compliance — every action tied to a specific user; the audit log is append-only and immutable
- Telegram private DM verification — no group chat delivery; we verify the chat is a 1:1 DM with the user before any signal goes out
The platform’s audit trail includes a complete record of which tenant initiated which signal, which gates fired against which positions, which orders went to which execution account. For SEBI compliance and any future regulator inspection, the trail is complete.
Per-Tenant Resources
Each tenant has, on the backend:
- A row in
tenants+ a row inusers(separate concerns — users can belong to multiple tenants in the future) - An entry in
model_registry.production_versionper ML pipeline they use (allowing per-tenant model variants — currently all share the platform-wide model, but the infrastructure supports per-tenant) - A contextual-bandit policy in their
tenant_registrystrategy engine (each tenant explores the action space independently) - A row in
playground_synthetic.persona_states(synthetic) or the real tables (live) - Their own Redis namespace for cached signals, snapshot states, and per-tenant feature stores
The aggregate footprint per tenant at current scale: about 12MB of state across all tables, growing at ~50KB/day. A thousand-tenant deployment fits comfortably within a single Cloud SQL db-custom-4-16384 instance.
§ 12
The Synthetic Personas — Pre-Launch ML Bootstrapping
A learning system needs data to learn from. But a system with no users has no data. The chicken-and-egg problem of bootstrapping a learning trading platform — we need users to train the system; users won’t join an untrained system — has killed many otherwise sound ideas.
We solved this with a calibrated synthetic-persona framework before any real user joined.
The 20 Synthetic Personas
Each persona is modelled on documented retail behavioural biases, drawing from the behavioural-finance literature and from forensic analysis of historical trading patterns:
- P01 The Stable Veteran — knob 40-50, rarely changes settings, minimal manual overrides. The benchmark archetype the population should converge toward.
- P02 The Cautious Climber — knob 30-45, scales up only after sustained positive performance, scales down aggressively on drawdowns.
- P03 The Yo-Yo — knob swings widely, mood-driven, frequent overrides.
- P04 The Optimizer — EV-driven, adjusts knob based on rolling-window Sharpe.
- P05 The Aggressor — knob 75-90, takes naked positions, never hedges futures.
- P06 The Revenger — increases bets after losses (the classic disposition-effect failure mode).
- P07 The Overconfident — increases aggression after wins, becomes naked-position-heavy.
- P08 The Anchorer — refuses to take losses, holds losing positions past their stops.
- P09 The Freezer — pauses entirely during volatile periods. Provides baseline of “trades nothing in stress regimes.”
- P10 The Override Junkie — frequent manual flips.
- P11 The Grower — slow methodical size-up.
- P12 The Shrinker — defensive scaling.
- P13 The Steady — even-keeled, system-default behaviour.
- P14 The Strategist — changes approach by regime.
- P15 The Realistic Retail — mixed profile representing the median Indian retail trader.
- P16 The Learner — risk-averse early stage, gradually expands.
- P17 The Imitator — copies the best-performing persona of the prior week.
- P18 The Seasonal — trades differently around festivals, expiry weeks, election cycles.
- P19 The Multi-Biased — exhibits 3+ documented biases simultaneously.
- P20 The Adversarial — designed to stress-test the platform; takes maximum-risk positions in worst regimes.
Each persona has a YAML declaration: their initial knob, capital, stochastic-noise parameter, psychological-state transitions (rational ↔ tilted ↔ fatigued based on recent P&L), manual-override probability, and — added in the most recent build — futures appetite, hedge preference, size haircut, and overnight tolerance.
What This Enables
ML models trained before launch — the strike selector, hedge scorer, IV reversion predictor, range continuation predictor, and trend regime classifier all had real training data on Day 1, generated by the persona fleet across simulated market regimes.
System understands various user behaviours at launch — when a real Cautious Climber joins the closed beta, the platform doesn’t have to learn their archetype from scratch; it already has a model of how P02 behaves and can adapt its narrations and recommendations accordingly.
Real users benefit from preparation immediately — closed beta users see a system that already feels calibrated, because the ML has seen 200,000+ simulated trades across all archetypes by the time they arrive.
Continuous refinement as real data flows in — synthetic-trade rows are tagged as context_source = 'synthetic'. Live-execution rows are tagged context_source = 'real'. The training pipelines weight real higher than synthetic; as real volume grows, synthetic naturally decays in influence.
The Important Boundary
Synthetic personas live in a sealed playground. They trade with simulated execution venues only. The market data they see is real (historical and current); the venues they “trade” through are entirely simulated. No real money. No real positions. No risk of cross-contamination to live-tenant accounts.
This boundary is enforced at the code level: the persona orchestrator runs an environment-variable check at startup and refuses to boot if real execution credentials are visible to it. The MockBroker class has no network code — it is a pure in-process simulator. The synthetic schema (playground_synthetic) is physically separate in Postgres from the real schemas. Cross-schema joins are blocked at the database role level.
This is what we mean when we say “Rule 1: ML modules MUST NEVER see persona_id / is_synthetic / bias_type / persona_archetype.” Personas exist; the system trades them; their outcomes train the platform — but the ML pipelines themselves operate on opaque (features, target) tuples with no identity attached.
The personas are a learning substrate, not a customer base. They exist so that real customers, on day one, encounter a platform that has already done its homework.
§ 13
Current Status and Validation
This section is written to be useful to acquirers and partners who need to know exactly where we are, not to hype prospective users. Honesty here is the value proposition.
Development Status
- 18+ months of architecture and implementation
- 13 microservices, ~12,000 lines of production Python + Go
- All core systems operational
- 4 months of paper testing on real-time market data
- 9 active tenants in closed beta (the founder + 8 invited operators)
- Running on Google Cloud Compute Engine + Cloud SQL Postgres 16 (TimescaleDB extension for tick + chain history)
- 13-container Docker Compose deployment with autoheal-managed self-recovery
- Caddy reverse proxy with auto-issued Let’s Encrypt certs
- Continuous deployment from a
live-signal-dashboardbranch; every commit ships within minutes
What’s Been Validated
The validation is component-level, end-to-end functional, and (where possible) statistical. We’re explicit about what’s measured and what isn’t:
Fully validated (works as designed):
- All 7 risk gates fire correctly on synthetic stress tests
- All 12 kill switches activate at their thresholds (verified by orchestrated test scenarios)
- All 17 ML systems train successfully with convergence, produce versioned artifacts in the model registry
- Model versioning and rollback functionality verified
- Hot-swap of new model versions into running singletons without container restart
- Per-tenant isolation enforced at multiple layers (code, database, infrastructure)
- Production deployment workflows tested end-to-end
- MLflow integration validated with custom additions
- Validation contracts prevent bad models from promotion
- Concurrent training safety verified under load
- Atomic promotion transactions verified across failure scenarios
- Persona orchestrator ticks 20 personas through simulated days without raising
- Live signal generation completes end-to-end in < 200ms per signal
- Caddy auto-HTTPS, dual-domain support, basic-auth scoping to operator paths
ML system status summary:
| Capability group | Systems | Validation status |
|---|---|---|
| Volatility & Distribution | 3 | All operational, accuracy at target |
| Signal Generation & Combination | 5 | All operational, ensemble improvements verified |
| Strategy Decisions | 4 | All operational, A/B testing complete |
| Risk & Event Management | 3 | All operational, defensive behaviour validated |
| Performance & Validation | 2 | Continuous monitoring active |
Partially validated (works in test, awaiting real-world stress):
- ML model accuracy on out-of-sample windows (60-65% directional accuracy for vol forecasting, ~63% on trend regime classification — proper coverage but limited sample sizes)
- Strategy selection RL agents (population-based training converges, but limited live deployment to date)
- Cross-market signal accuracy (theory is sound, real-time fills are still accumulating)
Yet to be validated (closed-beta phase will produce this):
- Cumulative live-execution performance across a full market cycle
- User-level retention and engagement metrics
- The platform’s behaviour through a true black-swan day (we have not yet experienced one in live deployment)
- Whether the per-tenant model variant infrastructure adds enough value to justify its complexity (currently all tenants share one model; the architecture supports per-tenant variants when warranted)
Current Performance Characteristics
The paper-testing window planned for the 9-tenant closed beta runs across all available trading days during the dry-run period. The figures the platform tracks (and will publish when statistically meaningful) include:
- Total simulated trades across all tenants
- Gross win rate
- Average winner / average loser, as % of capital deployed
- Net Sharpe after modelled costs and slippage
- Maximum drawdown observed (the 8% kill-switch is the design ceiling)
- Days with risk gates firing (preventing at least one trade)
- Days with kill switches activating
These numbers come from paper trading on real-time market data through the live signal generation pipeline. The cost engine models actual costs; the slippage model projects actual slippage. They are paper-test numbers from the system as deployed — not backtest numbers cherry-picked from a favourable window — and they are not predictive of future live results.
We are not claiming any of this means anything about the future. Continued validation across more market cycles is the closed-beta phase’s primary goal. Real cohort numbers will replace this section once the data set is large enough to publish honestly.
What We’re Carefully NOT Claiming
- Guaranteed profitability. No system can guarantee outcomes. We have a discipline framework that historically has reduced loss frequency in academic studies; we believe it will here too; we have not proven it under live conditions yet.
- Past performance indicating future results. This is a regulatory line, and we mean it. The paper-test numbers above are not predictive.
- Risk-free returns. The platform has drawdowns by design — we cap them, but we don’t eliminate them. A user allocating to systematic options trading should expect occasional 4-8% drawdowns within normal operating conditions.
Why This Honesty Matters
We’re presenting this as serious technology, not as a marketing pitch. Sophisticated readers — whether strategic acquirers, institutional partners, or discerning users — can detect overclaim. We’re showing what’s real.
The closed-beta phase is the answer to “how do we validate this before scale?” — not a marketing position to be hand-waved past. Real validation requires real time. We’re investing that time.
§ 14
The Market Opportunity
The numbers that follow are publicly available from SEBI, NSE, and BSE disclosures. We are not inventing market sizing; we’re recapping the documented opportunity.
Market Sizing
- Indian retail F&O daily turnover: ₹500+ crore in Nifty options alone; multiples that across Bank Nifty, FinNifty, and stock options
- Active F&O traders: 1 crore+ retail accounts, up from ~25 lakh in 2019
- Growth rate: 30-40% annually in retail F&O participation through 2024-2026
- Total retail F&O capital deployed: estimated ₹2-3 lakh crore
- Annual retail losses: ₹50,000+ crore documented by SEBI
The 90% retail loss rate is the problem. From a platform perspective, it is also the opportunity: the willingness-to-pay for a system that materially reduces that loss rate is substantial.
Target User Economics
A typical target user (the Priya archetype) allocates ₹10-50 lakh to active trading. The platform can be priced in three regimes:
- Subscription tier: ₹5,000-25,000/month flat fee, scaled by capital allocation tier
- Performance-linked: 10-20% of gains above a benchmark hurdle, capped per tenant
- Hybrid: lower flat fee + performance overlay
Modest revenue projection assumptions:
| Active users | Annual revenue (flat subscription) | Annual revenue (with performance) |
|---|---|---|
| 100 | ₹0.6 - 3 crore | ₹1 - 5 crore |
| 1,000 | ₹6 - 30 crore | ₹10 - 50 crore |
| 10,000 | ₹60 - 300 crore | ₹100 - 500 crore |
These are not aspirational ceilings — they are mid-range estimates given documented willingness-to-pay among sophisticated Indian retail traders. The high end assumes ~25% market share in the “₹10-50L allocation, willing to pay for quality systematic” segment.
Why This Market is Acquirable Now
Three converging trends make this the right window for an acquirer to pick up the platform:
The technology bar has risen. SEBI’s algo-trading framework (2022-2024) and peak margin rules (2021) have raised the bar for sophisticated retail participation. Existing trading platforms without proprietary systematic capability are increasingly exposed as that bar continues to rise.
Regulatory environment encourages professionalization. Heightened reporting requirements, peak-margin compliance, and post-2024 algo-trading rules push the entire industry toward institutional-grade systematic infrastructure. Platforms that demonstrate readiness here earn an outsized share of attention from compliance-conscious institutional capital.
Retail traders increasingly sophisticated. The 2021-2026 retail F&O cohort is more educated, more analytical, and more willing to pay for quality than the previous generation. This is the cohort our platform is designed for; their willingness to switch from manual to systematic is at an all-time high.
No clear market leader. The “systematic options + futures platform for Indian retail, capital ₹10L-50L, monthly subscription” segment has no dominant player. The adjacent categories — algo-marketplaces (build-your-own-strategy) and analytical dashboards (visualize-the-chain) — exist, but none of them is the integrated systematic platform Quantsentinel is.
The Strategic Investor’s Perspective
Acquiring sophisticated systematic-trading capability provides:
- Valuation premium for technology positioning
- Defensible technological moat against new entrants
- New revenue stream from premium subscription services
- Differentiation from undifferentiated retail platforms
- Long-form narrative for institutional investors about future product launches
For a fintech accelerator or PE fund, ownership of the platform would mean:
- Pre-revenue but feature-complete asset with clear monetization path
- Team with domain expertise across ML, options pricing, market microstructure, and Indian regulatory landscape
- Optionality on cross-border expansion (the architecture supports US/EU markets with localization)
Specific Technology Assets
The acquiring entity would obtain a concrete inventory of production-ready technology:
- 17 production-ready ML systems trained on Indian market data
- 6 closed-loop learning architectures for continuous improvement without manual intervention
- MLOps infrastructure with industry-standard tooling (MLflow) plus custom additions for concurrent-training safety, atomic promotions, per-tenant isolation, validation contracts, and failed-training observability
- Multi-tenant architecture supporting scaling to thousands of users on existing infrastructure
- Comprehensive risk management framework appropriate for regulated environment — 7 walls, 12 kill switches, permanent tail layer
- Four months of paper-test telemetry captured against the system as deployed, ready to feed forward into live cohort analysis
- Documentation and architecture established for team integration
- Domain expertise in Indian options markets specifically — the cost engine, calendar awareness, and microstructure handling are all calibrated to Indian rates
The Competitive Moat
For a strategic acquirer, the technology represents:
- 18+ months of focused development work — approximately ₹2-5 crore equivalent at standard rates
- Domain expertise that’s difficult to hire — the intersection of quantitative finance + production ML + Indian market microstructure is a small talent pool
- Differentiation worth a meaningful valuation premium — positioning as “AI-enabled” with 17 specialised ML systems and production MLOps gives a defensible story for institutional investors
- A customer acquisition tool for high-value retail users (the Priya archetype) who have been chronically underserved by existing options
- Foundation for institutional product offerings — the same infrastructure can power a PMS-style product or a white-label deployment for partners
Valuation-Multiple Analysis
Strategic acquirers in the Indian retail-finance space have shown willingness to pay materially above commodity-platform multiples when the acquisition target brings a defensible technology story. The valuation lift comes from three orthogonal sources:
1. Headline multiple expansion. Public-market comparables in the retail-finance technology space trade at materially different multiples based on the depth of their ML-and-data narrative. The Indian fintech-investor ecosystem of 2025-2026 has consistently rewarded acquirers who can credibly say “we acquired a platform with substantive systematic infrastructure, not a feature-set add-on.” For an acquirer with a ₹500-1,000 crore base valuation, a substantive systematic-trading capability acquisition has historically supported a 20-40% revenue-multiple expansion on the institutional-narrative line items.
2. New revenue stream. The platform’s subscription-overlay economics support an incremental annual revenue line that the acquirer’s core brokerage business does not currently capture. At 1,000 active users on a ₹15,000/month average subscription tier, the platform generates ₹18 crore annual recurring revenue at gross margins above 70%. At 10,000 users, ₹180 crore ARR. The platform’s architecture supports this scaling without further core-engineering investment — the bottleneck would be in growth/marketing, which the acquirer presumably has at scale already.
3. Defensive moat. Platforms without proprietary systematic capability face increasing competitive pressure as SEBI’s algo-trading framework and peak-margin rules push the industry toward institutional-grade infrastructure. Acquiring a 17-ML-system + MLOps-governance platform inoculates against this pressure for 12-24 months — the time it would take any competing platform to assemble equivalent infrastructure organically. During that window, the acquirer can either monetise the moat (premium-tier subscriptions, white-label partnerships) or use it to defend market share against new entrants who pitch on technology.
Capital Efficiency of the Acquisition
The arithmetic on the buy-side is favourable. An acquisition at the ₹5-15 crore range buys:
- A team that already understands the domain (zero recruitment cost for the equivalent talent — which by current Indian-market rates would take 6-12 months and ₹3-6 crore in compensation across the necessary roles).
- A codebase that has passed real-time market testing — 4 months of paper-test telemetry on live tick data with the full risk + ML stack engaged. Reproducing that from scratch takes the better part of a year.
- A multi-tenant architecture validated to ~9 concurrent users, with a clear scaling path to thousands without re-architecture. Multi-tenancy is the kind of architectural choice that costs 6-12 months to retrofit; we paid that up-front.
- The MLOps infrastructure described in §8B — MLflow plus the five custom production additions. Building equivalent MLOps governance from scratch takes 3-6 months of dedicated effort and is one of the more under-staffed roles in Indian fintech right now.
- Per-tenant scaffolding for the personalisation layer — each user’s contextual bandit policy, per-tenant strategy-engine instance, per-tenant audit log. Foundational for any future PMS-style product offering.
Quantified as “build-vs-buy at standard market rates,” the build cost would be approximately ₹8-15 crore in engineering compensation plus 18-24 calendar months to reach feature parity. The acquisition is therefore not just a capability purchase — it is a 12-month time-to-market collapse for the strategic narrative the acquirer wants to tell.
§ 15
What’s Next — The Roadmap
Each capability below is in the codebase as either an active development branch or a clear architectural extension. Nothing in this roadmap is speculative — it’s all reachable from where the platform is today.
Near-Term (6 months)
- Continued validation through extended paper testing across more market regimes — the goal is to observe at least one full volatility cycle and one tail event before scaling
- Live trading deployment for the existing closed beta cohort once paper-testing benchmarks are sustained for an additional 90 days
- US market extension — the architecture supports it; the cost engine, regime classifier, and risk gates all need US-specific calibration. Estimated 8-12 weeks of focused work
- Mobile app — currently the platform is web-only; a React Native app for daily check-ins is the natural next step
- Enhanced analytics dashboard — more attribution detail, more comparative-to-benchmark visuals, more drill-down on signal contribution
Medium-Term (12-24 months)
- Multi-instrument expansion — currently Nifty-focused; Bank Nifty (different liquidity dynamics), FinNifty, and stock options are natural extensions
- Stock-specific options strategies — single-name options with cross-sectional ranking (Pipeline 4) driving stock selection
- Cross-asset strategies — combined equity, futures, options portfolios that hedge across asset classes
- Institutional offerings — a PMS-style product for ₹50L-5cr capital pools, with regulatory wrappers and compliance reporting
- API access — for sophisticated users who want to integrate the platform’s signals into their own systems
- White-label partnerships — financial-services platforms using the signal generation, risk gating, and ML backbone behind their own branding
Strategic Positioning
The architecture supports significant expansion. The closed-beta phase is intentional — we want to validate fully before scaling. With the right partnership or acquisition, scaling to serve thousands of users could happen within 12-18 months.
§ 16
The Invitation
This blog is the public version of a longer conversation we are open to having privately. Different audiences should reach out for different reasons.
For Potential Acquirers and Strategic Partners
If you are a fintech, financial-services platform, or investment firm evaluating systematic-trading capability for acquisition or strategic investment, we welcome a conversation. The platform represents 18+ months of focused development in a domain where building from scratch takes years. We would discuss the technology architecture, validation status, integration possibilities, and commercial terms.
The most useful first step is an executive brief — a two-page PDF summary of the platform suitable for forwarding to your technical and business evaluators. Email and we will send it directly.
For Sophisticated Retail Users (Closed Beta Expansion)
Quantsentinel is currently in closed beta with a select group of users. We expand the beta periodically based on operational readiness. The current cohort represents a deliberate range of behavioural archetypes; we add users when the platform’s per-tenant scaling has demonstrated the headroom.
If you are interested in being considered for beta access — particularly if you allocate ₹10 lakh or more to F&O trading and are looking for systematic infrastructure — get in touch with a brief description of your trading history and your interest. We will respond personally; we do not run a waiting list at scale.
For Institutional Partners
For institutional partnerships — whether technology licensing, white-label deployment, strategic investment, or co-development of new asset classes — we are open to conversations that align with our cautious approach to scaling.
Contact
- Direct email: founder [at] quantsentinel.fastwheel.ai
- LinkedIn: linkedin.com/in/aishwary-/
- Calendly: calendly.com/quantsentinel for serious inquiries
- Phone: provided after the first email exchange, for confirmed acquirer or institutional discussions only
We respond within 24 hours to substantive enquiries.