Posts | Florian Brand

Blog posts

2026-06-10The Myth of unsafe Open Source AI
2026-05-06The vibes in China’s AI labs
2026-02-26Quo vadis, LLM benchmarks?
2025-10-14Local models are (not) cope
2025-09-19You're not using LLMs enough
2025-07-06Living in the Agentic Era
2025-03-09Using OpenAI's Deep Research to save Time and Money
2024-11-04A Guide to LLMs for Programmers
2024-08-23Sane Python dependency management with uv
2024-08-04How FastHTML sparked my joy in web development

Guest posts & Talks

2026-06-03[Talk] LLM benchmarks in the time of agents
2026-05-16Artifacts 21: Open model bonanza! Gemma 4, DeepSeek V4, Kimi K2.6, MiMo 2.5, GLM-5.1 & others. On CAISI's V4 assessment.
2026-04-10MirrorCode: Evidence that AI can already do some weeks-long coding tasks
2026-03-30Artifacts 20: New orgs! New types of models! With Nemotron Super, Sarvam, Cohere Transcribe, & others
2026-03-03Artifacts 19: Qwen 3.5, GLM 5, MiniMax 2.5 - Chinese labs' latest push of the frontier
2026-02-13What do "economic value" benchmarks tell us?
2026-02-02Artifacts 18: Arcee's 400B MoE, LiquidAI's underrated 1B model, new Kimi, and anticipation of a busy month
2026-01-05Artifacts 17: NVIDIA, Arcee, Minimax, DeepSeek, Z.ai and others close an eventful year on a high note
2025-12-3117 predictions for AI in 2026
2025-12-23Why benchmarking is hard
2025-12-18Open models: Hot or Not with Nathan Lambert & Florian Brand
2025-12-142025 Open Models Year in Review
2025-11-23Artifacts 16: Who's building models in the U.S., China's model release playbook, and a resurgence of truly open models
2025-10-30What does OSWorld tell us about AI's ability to use computers?
2025-10-18Artifacts 15: It's Qwen's world and we get to live in it, on CAISI's report, & GPT-OSS update
2025-09-11Artifacts 14: NVIDIA's rise, "Swiss & UAE DeepSeek," and a resurgence of open data
2025-08-17Ranking the Chinese Open Model Builders
2025-08-11Artifacts 13: The abundance era of open models
2025-07-22Artifacts 12: Chinese models continue to dominate throughout the summer 🦦
2025-06-26Artifacts 11: Visualizing China's open models market share, Arcee's models, and VLAs for robotics
2025-06-13What skills does SWE-bench Verified evaluate?
2025-05-29Artifacts 10: New DeepSeek R1 0528!, more permissive licenses, everything as a reasoner, and from artifacts to agents
2025-04-21Artifacts 09: RLHF book draft, where the open reasoning race is going, and unsung heroes of open LM work
2025-03-20Artifacts 08: The return of ~30B models, side effects of OpenAI's proposed DeepSeek ban, and yet another reasoning roundup
2025-02-19Artifacts 07: Alpaca era of reasoning models, China's continued dominance, and tons of multimodal advancements
2025-01-27Artifacts 06: Reasoning models, China's lead in open-source, and a growing multimodal space

Interviews and media mentions