AI Art & Creativity AI Model Comparisons AI News & Updates9 Min Read Artur MarkusonMarch 16, 2026 University of Montreal Tests AI Against 100,000 Humans on Creativity—GPT-4 Beats 72% But Top 10% Still Win GPT-4 now outperforms the average human on standardized creativity tests. But before you fire your creative team, consider this: when…
AI Coding & Development AI Model Comparisons AI News & Updates10 Min Read Artur MarkusonMarch 5, 2026 Moonshot AI Releases Kimi K2.5 with 100-Agent Swarm Feature—Trained on 15 Trillion Tokens, Beats GPT-5.2 on Coding and Video Benchmarks The AI scaling wars just took an unexpected turn: a Chinese startup released a model that orchestrates 100 parallel agents instead of chasing…
AI Art & Creativity AI Model Comparisons Human and AI10 Min Read Artur MarkusonFebruary 24, 2026 University of Montreal Study Proves AI Beats Average Humans on Creativity Tests—But Top 10% Still Outperform GPT-4 The world’s largest creativity study just revealed an uncomfortable truth: half of humanity is now less creative than a language model.…
AI Model Comparisons AI News & Updates Machine Learning8 Min Read Artur MarkusonFebruary 22, 2026 Google Gemini 3.1 Pro Scores 77.1% on ARC-AGI-2—2.5x Jump Over Predecessor in Single Generation Google just doubled AI reasoning capability in 90 days while keeping the price identical. The assumption that frontier AI improves linearly…
AI Model Comparisons AI News & Updates AI Startups & Companies10 Min Read Artur MarkusonFebruary 16, 2026 Snorkel AI Commits $3M to Open Benchmarks Grant—Targeting the ‘Biggest Blind Spot’ Where AI Models Excel on Tests But Fail in Production Claude Opus 4.6 just scored 76% on MRCR v2—up from 18.5% on its predecessor. GPT-5.3-Codex hit 77.3% on Terminal-Bench 2.0. Neither score…
AI Model Comparisons AI News & Updates Natural Language Processing10 Min Read Artur MarkusonFebruary 11, 2026 Claude Opus 4.6 Scores 76% on Long-Context Retrieval—4X Better Than Its Predecessor at 18.5% A 310% improvement in a single release isn’t iteration—it’s a discontinuity. Anthropic just proved that model performance can…
AI Model Comparisons AI News & Updates AI Startups & Companies8 Min Read Artur MarkusonFebruary 4, 2026 ChatGPT’s Market Share Drops to 61.3% as Gemini Surges 237% Year-Over-Year—The AI Chatbot Monopoly Era Ends ChatGPT lost 25 percentage points of market share in 12 months. The company that ate its lunch isn’t a startup—it’s Google, the…
AI Model Comparisons AI News & Updates AI Tools & Platforms10 Min Read Artur MarkusonJanuary 22, 2026 TII’s Falcon-H1R 7B Outperforms 47B Models on Math Reasoning While Running on a 16GB Laptop A 7-billion parameter model just scored 88.1% on AIME-24 math reasoning, beating models with 47 billion parameters. The parameter count arms…
AI Model Comparisons AI News & Updates Natural Language Processing9 Min Read Artur MarkusonJanuary 20, 2026 Google’s 12B TranslateGemma Outperforms Its Own 27B Model: Open Translation Hits 55 Languages with MetricX Score of 3.60 Google’s smaller translation model just beat its larger sibling on standardized benchmarks, forcing us to reconsider everything we…
AI Model Comparisons AI News & Updates Machine Learning10 Min Read Artur MarkusonJanuary 15, 2026 TII’s Falcon-H1R 7B Outperforms 47B Models on Math Reasoning While Running on a 16GB Laptop A 7-billion parameter model just scored 88.1% on AIME-24 math reasoning, crushing NVIDIA’s 47B Nemotron at 49.7%. The assumption that…