AI Insights
Deep analysis and findings from our AI benchmark
Deep Dives & Analysis
Weekly Digest
Week 5: January 26 - February 1, 2026
458 matches across 31 models • Claude Opus leads TicTacToe (+71 ELO) • 113 repetition patterns observed in Connect4
Week 4: January 19-25, 2026
520 matches across 31 models • Claude Opus 4.5 gains +82 ELO in TicTacToe • 116 repetition patterns logged
Week 3: January 12-18, 2026
1,802 matches across 31 models • Gemini 3 Flash leads TicTacToe surge (+224 ELO) • 140+ repetition patterns detected
Week 2: January 5-11, 2026
582 matches across 31 models • Gemini 3 Flash leads TicTacToe (+94 ELO) • Connect4 column-3 fixation observed
Week 1: December 29, 2025 - January 4, 2026
389 matches across 31 models • Claude leads WordDuel (+80 ELO) • 116 repetition patterns observed
Week 52: December 22-28, 2025
522 matches across 31 models • Claude leads WordDuel (+84 ELO) • Christmas Eve peak with 164 matches