volarixs - applied AI & ML to finance

Explore our latest posts on machine learning, market dynamics, strategy architecture and design

Feature Engineering
Jun 2, 2026

Shrinking the Feature Space: PCA & Autoencoders

Many features are redundant or noisy. High dimensionality = harder to generalize.

PCA
Autoencoders
Features
9 min read
Strategy
May 24, 2026

How Asset Managers Can Implement AI & Machine Learning

Part 2: Infrastructure, Governance & Roadmap. What it takes to implement AI in asset management.

AI Implementation
Governance
Roadmap
18 min read
Deep Learning
May 20, 2026

Neural Networks for Market Data: MLPs, CNNs & LSTMs

We are selective with deep learning. Expensive to train, easy to overfit, harder to debug.

Neural Networks
MLP
LSTM
12 min read
Research
May 14, 2026

Signal Half-Life and Decay: How Long Do ML Edges Really Last?

If you discover a signal today, how long will it work?

Signal Decay
Half-Life
Edge Persistence
13 min read
Strategy
May 7, 2026

How Asset Managers Can Use AI & Machine Learning in Investment Decisions

Part 1: Use Cases & Value. Real-world use cases: idea generation, regime analysis, risk management.

Asset Management
AI & ML
Use Cases
15 min read
Volatility
Apr 27, 2026

Modeling Market Turbulence: GARCH, EGARCH & HAR

Volatility ≠ returns: heavy tails, clustering, mean reversion. Dedicated volatility models are essential.

GARCH
EGARCH
HAR
10 min read
Time Series
Apr 9, 2026

ARIMA, SARIMAX & VAR: When Classical Time-Series Still Win

Explicitly model temporal dependence with transparent structure.

ARIMA
SARIMAX
VAR
9 min read
Benchmarks
Mar 31, 2026

Volatility Forecasting Benchmarks: GARCH, HAR, and ML

Compare GARCH, HAR, and ML models for volatility forecasting.

Volatility
GARCH
HAR
11 min read
Machine Learning
Mar 24, 2026

How Market Regimes Break ML Models

Financial machine learning rarely fails because the model is 'bad'. It fails because the market regime changed.

Regimes
ML
Backtesting
8 min read
Models
Mar 17, 2026

Boosted Trees for Alpha: XGBoost & LightGBM

Gradient boosting dominates tabular ML. Learn how XGBoost and LightGBM deliver strong performance.

XGBoost
LightGBM
Boosting
11 min read
Features
Mar 10, 2026

The 19 Most Important Features for Equity Return Forecasting

Most ML performance in finance doesn't come from the model — it comes from the features.

Features
Alpha
Equities
12 min read
Methodology
Feb 27, 2026

Rolling Windows for Financial ML: A Complete Guide

If you use financial data and your model does not use a rolling window, the backtest is wrong.

Rolling Windows
Time Series
Backtesting
10 min read
Evaluation
Feb 16, 2026

Beyond Sharpe: A Research Framework for Evaluating ML Trading Strategies

Sharpe ratio is dangerously incomplete for ML strategies.

Evaluation
Metrics
Sharpe
15 min read
Models
Jan 28, 2026

Random Forests in Finance: Nonlinear Signals Without the Drama

Tree-based ensembles capture nonlinearities and interactions in market data.

Random Forest
Extra Trees
Trees
10 min read
Models
Jan 5, 2026

From Linear Regression to Lasso: Fast, Interpretable Baselines

Linear and regularized regressions still do serious work in finance.

Linear Regression
Ridge
Lasso
12 min read
Regimes
Dec 12, 2025

Market Regimes, Clusters & HMMs: Teaching Models to Respect the Environment

Episodes where statistical properties are stable enough: high vol vs low vol, risk-on vs risk-off.

K-Means
GMM
HMM
11 min read
Architecture
Nov 23, 2025

Building a Universe-Wide Prediction Grid

An alpha factory needs predictions for every asset at multiple horizons from multiple models.

Prediction Grid
Scaling
Alpha Factory
14 min read
Evaluation
Oct 8, 2025

Regime-Conditioned Performance: Measuring ML Robustness

Most backtests report a single Sharpe. But ML models fail by regime.

Regimes
Robustness
Performance
12 min read
Research
May 14, 2026
13 min read

Signal Half-Life and Decay: How Long Do ML Edges Really Last?

One of the most important questions in systematic trading: "If I discover a signal today, how long will it work?"

1. What Is Signal Half-Life?

Every edge has a shelf life. A signal that looks brilliant the day you discover it is usually decaying from the moment you trade it — the only real question is how fast. Signal half-life puts a number on that decay, and it's one of the most consequential numbers in systematic trading: it tells you how long a model's edge is worth holding before it's mostly gone.

More formally: take a prediction signal s_t (say, an ML-predicted excess return) and bucket observations by signal strength. For a given bucket, track the cumulative forward return over a holding horizon h.

As you extend h, the incremental edge erodes. The half-life is the horizon at which cumulative return reaches roughly 50% of its maximum — equivalently, the point where the marginal information from holding one more day drops below the threshold that makes it worth the risk and cost.

Signal Decay & Half-Life Explorer

Half-Life: 8 days
Plateau: 29 days
Max Cumulative Return: 1.221

The decay curve shows how signal edge erodes over time. Half-life is the point where cumulative return reaches 50% of maximum. Notice how different models and regimes affect signal persistence.

2. Measuring Decay Curves

Practical method:

  1. Sort daily observations by signal value into quantiles (e.g. top 10%)
  2. For each entry date t, track realised returns for h = 1,...,H days
  3. Compute average cumulative return for each h
  4. Plot decay curves: E[R_{0→h}^(top)] vs h

You typically see strong positive edge for small h, flattening or reversal beyond a certain horizon.

3. Half-Life for ML vs Simple Models

Research use-case: compare decay curves for:

  • ML models (e.g. XGBoost, LSTM)
  • Simple models (e.g. 12-month momentum, value)

Questions: Are ML signals shorter-lived or longer-lived? Do ML signals mostly capture microstructure/short-horizon effects? Does model complexity correlate with shorter half-life?

4. Stability Across Regimes and Universes

Half-life is not constant:

  • in high-vol regimes, edges may compress (shorter half-life)
  • in calm regimes, edges may extend
  • in small caps, microstructure edges may be strong but short-lived
  • in large caps, slower fundamental edges may dominate

Thus we measure half-life by regime, by asset bucket, and by horizon type.

5. Decay and Portfolio Construction

Knowing half-life helps:

  • choose optimal holding periods
  • avoid holding positions beyond their edge horizon
  • design staggered rebalancing policies
  • combine multiple signals with different half-lives

Example: ML microstructure signals (half-life 1–2 days) → trade with high turnover, tight risk. Medium-term factor signals (half-life 20–60 days) → trade with lower turnover, larger sizing.

6. How This Maps to volarixs

Decay analysis only works if you keep the raw material: a model's predictions over time, the horizon each one was made for, and the market regime it was made in. That's exactly what volarixs records. Every experiment stores multi-horizon predictions — 1-day, 5-day, 21-day, 63-day and beyond — for each model, and carries the regime context those predictions were generated under.

That history is the foundation a decay curve is built on. From it, you can measure how persistence actually varies rather than assuming a single number:

  • per experiment, per model, and per prediction horizon
  • and, because regime labels travel with every run, per regime

The goal is to move the question from “Is this model predictive?” to “How long does this edge persist, for which assets, and under which regimes?” — the version that actually governs how long you hold a position and how often you rebalance. That's the lens volarixs is built around, and the explorer above is a simplified, hands-on version of the same idea.

Signal Decay
Half-Life
Edge Persistence
Research

Get new research in your inbox

Applied AI & ML for the buy-side — new research on signals, regimes, and strategy design, straight to your inbox. No noise.

See the data behind the decay

Explore how volarixs records multi-horizon predictions and regime context across every experiment — the raw material for measuring edge persistence.