Date: 2026-03-24 Panel: Opus 4.6 + Sonnet 4.6 + Grok 4.2 Reasoning + Gemini Pro 3.1 Judge: Opus 4.6 Question: Additional data sources + advanced calculated metrics for Forex-Macro desk edge over retail Kalshi bettors
All four panelists delivered substantive, well-structured responses with concrete data sources, formulas, and prioritization. No hallucinated APIs, no vaporware. Overlap was high on core themes (carry models, regime detection, liquidity indices, COT positioning), which signals genuine consensus rather than groupthink -- each arrived via different reasoning paths.
| Dissent | Ruling | Reason |
|---|---|---|
| How many regimes? | 4 regimes (Opus): Risk-On, Risk-Off, Stagflation, Transition | Stagflation is distinct for forex; Transition prevents false signals |
| Custom Dollar Index? | Yes, but Phase 2 — simple trade-weighted from OANDA | Useful but not urgent; DXY proxy works for now |
| Fed speech NLP? | Deferred | High effort, moderate reward. Just track dates as vol catalysts |
| Foreign central bank APIs? | FRED only — FRED mirrors the key foreign yields | ECB SDW, BoJ, BoE have painful non-standard APIs |
| Google Trends? | Skip | Noisy, rate-limited, poor forex signal. COT serves this better |
| Term Premium? | Skip | Requires replicating academic models. Yield curve shape captures same info simpler |
| Geopolitical Risk Index? | Phase 3 — trivial to collect, marginal edge |
| # | Source | Frequency | Method |
|---|---|---|---|
| 1 | FRED bulk pull (15+ new series) | Daily | FRED API — TIPS yields (DFII5/10), breakevens (T5YIE, T10YIE, T5YIFR), foreign 10Y (6 countries), WALCL, RRPONTSYD, WTREGEN, BAMLH0A0HYM2, NFCI, ADSINDEX |
| 2 | CME FedWatch probabilities | Daily | Derive from Fed Funds futures or scrape |
| 3 | CFTC COT (forex futures) | Weekly | Bulk CSV download |
| 4 | Econoday consensus + actual | Daily | Scrape consensus for major releases |
| 5 | Atlanta Fed GDPNow | ~Weekly | FRED or scrape |
| # | Metric | Formula | Priority |
|---|---|---|---|
| M1 | Rate Expectations Deviation | deviation = kalshi_prob - cme_fedwatch_prob | HIGHEST — closest to arbitrage |
| M2 | Carry-to-Volatility Score | carry_score = rate_diff / realized_vol_20d | Core FX signal |
| M3 | Net Liquidity Index | net_liq = WALCL - RRPONTSYD - WTREGEN; impulse = 30d change; z-score over 2Y | Explains risk asset direction |
| M4 | Economic Surprise Index | surprise_z = (actual - consensus) / historical_std; ESI = EWMA(surprise_z, span=90) | Maps to Kalshi macro contracts |
| M5 | Macro Regime Classifier | 4 regimes from: 2s10s_z, vix_z, hy_spread_z, breakeven_direction | Gates all other signals |
| # | Metric | Formula |
|---|---|---|
| M6 | COT Positioning Signal | net_spec_pct z-scored 52w; contrarian at ±1.5 |
| M7 | Financial Conditions Index | mean(z(10Y), z(HY_spread), z(MOVE), z(USD)) |
| M8 | Yield Curve Shape | level/slope/curvature/butterfly, all z-scored 2Y |
| M9 | Real Yield Gap (G10) | (US_real_10Y) - (foreign_real_10Y) for Germany, Japan |
Custom Dollar Index, GPR Index, BIS REER, correlation regime breaks, cross-asset momentum, Fed speech calendar flags
Fed speech NLP, Term Premium models, Google Trends, foreign CB direct APIs, TIC/COFER/AAII/BIS quarterly data
fred_series table covers 70% of data needsThis ruling is final and binding.