๐Ÿค– AI Hedge Fund Simulation โ€” Multi-Round Debate Results

Generated: May 24, 2026 ยท 3 debate rounds ยท 10+ AI models ยท Data: tournament_picks (3,149 picks, 34 models)

Round 1: Initial Pick Debate (7 Models)

Models: Risk Manager, Portfolio Manager, Cerebras GPT-OSS-120B, DeepSeek V4, KiloCode, Kimi, Cursor

Consensus Top 5

RankPickDirectionEntryModels Agreeing
1TLT (Bond)LONG$87.667/7
2SPY (ETF)SHORT$726.807/7
3GLD (ETF)LONG$257.937/7
4CL=F (Commodity)SHORT$68.255/7
5XOM (Equity)SHORT$116.915/7

Key Findings

Swarm Prompts Used

Click to expand
# Pick Debate Prompt (fed to 7 engines via swarm_run.py + subagents)

You are one of 7 independent AI models evaluating the same curated pick list.
You MUST respond in your own voice. Do not defer to other models.

## The Picks (Forward-Tested Only, tournament_picks DB)
EQUITY: MSFT LONG 447.89 (WR 58%, n=164), XOM SHORT 116.91 (WR 58%, n=164)
CRYPTO: AVAXUSDT SHORT 22.83 (WR 65%, n=23), SOLUSDT LONG 157.39 (WR 65%, n=23)
ETF: SPY SHORT 726.80 (WR 65%, n=23), GLD LONG 257.93 (WR 65%, n=23)
COMMODITY: CL=F SHORT 68.25 (WR 65%, n=23), NG=F SHORT 3.90 (WR 65%, n=23)
BOND: TLT LONG 87.66 (WR 65%, n=23), SHY SHORT 80.27 (WR 58%, n=164)
PENNY: MVST LONG 1.80 (WR 0%, n=0)

TASK: Rank YOUR top 5 picks. Veto 1-2 picks. Flag systemic issues.

Model-Specific Insights

Risk Manager: "XOM SHORT is the most honest pick โ€” 30% confidence on 58% WR, good RR 2.1. The system tells you it's uncertain. That's a feature."
Portfolio Manager: "Top 5 trim: TLT, SPY, GLD, CL=F, AVAXUSDT โ€” dropping the conflicting SOL, redundant XOM, seasonal NG, low-conviction SHY/MSFT."
Cerebras (gpt-oss-120b): "Absence of minimum-sample-size gate. A simple rule: require 50 observations for WR>60% would reduce false positives."
DeepSeek (v4-flash): "The n-threshold is applied inconsistently. Equity n=164 vs crypto n=23 โ€” not statistically equivalent."

Round 2: Full Cross-Asset + IPO/Mutual Fund Gap Analysis

Models: Multi-Asset Allocator, Financial Data Architect

Expanded Pick List (All 9 Asset Classes)

Class#1#2#3Status
EQUITYPG SHORT $167.37GOOGL LONG $186.63META LONG $620.71ACTIVE
CRYPTOSOLUSDT LONG $157.39ETHUSDT SHORT $2150BTCUSDT SHORT $78,626ACTIVE
ETFSPY SHORT $726.80GLD LONG $257.93XLE SHORT $91.74ACTIVE
COMMODITYSI=F SHORT $34.93CL=F SHORT $68.25โ€”ACTIVE
BONDTLT LONG $87.66BND LONG $71.47SHY SHORT $82.36ACTIVE
PENNYMVST $1.80KULR $2.20QBTS $1.500 RESOLVED
FUTURESES=F LONG 5600GC=F LONG 2500CL=F SHORT 730 RESOLVED
FOREX๐Ÿ”ด BLOCKED โ€” Kill gate: 57.3% WR, -0.39% avg PnLBLOCKED
IPO/Mutual Fundsโณ ZERO DATA โ€” Infrastructure not builtGAP

Multi-Asset Allocator: Top 10 Composite Scores

RankPickScoreWRnRR
1SPY SHORT4.5962.5%1241.9
2TLT LONG4.3562.5%1241.8
3SHY SHORT3.8662.5%1241.6
4GLD LONG2.9062.5%1241.2
5WMT SHORT2.6764.0%112.1

Score = Confidence ร— WR ร— RR ร— ln(n+1), normalized per class.

IPO/Mutual Fund Gap Analysis

Financial Data Architect recommendations:

Round 3: Hedge Fund Simulation โ€” Quant + Persona + PM

Models: Quant Researcher, Behavioral Analyst, Hedge Fund PM ($500k AUM) | ๐Ÿ”„ RUNNING

Methodology

Data Sources Used (All Free/Open)

SourceAsset ClassesWhat It Provides
Yahoo Finance (yfinance)EQUITY, ETF, COMMODITY, BONDOHLCV, fundamentals, options
FRED (St. Louis Fed)ALL (macro context)Fed Funds, CPI, VIX, yield curve
CoinGeckoCRYPTOPrice, volume, market cap
SEC EDGAREQUITY (IPO tracking)S-1 filings, 10-K, 8-K
Polymarket/KalshiALL (sentiment)Prediction market probabilities
CFTC COT ReportsCOMMODITY, FOREXPositioning data

Personas Deployed This Round

PersonaAsset ClassMethod
quant_systematicALLEV, Sharpe, Kelly, risk parity
behavioral_narrativeALLNews sentiment, narrative consistency, earnings catalysts
hedge_fund_pmALLPosition sizing, risk budget, correlation management
invert_losersEQUITYFade underperformers (WR 64%, n=11)
risk_parityALLEqual risk contribution across assets (WR 60%, n=124)
vol_arbCRYPTO,COMMODITYVolatility arbitrage (WR 65%, n=23)

Not financial advice. Educational/research purposes only. Data: tournament_picks table (ejaguiar1_stocks). AI Tournament ยท Curated Picks