Tag Archive

LLM

24 articles with this tag

AI News | July 20, 2026

Qwen3.8-Max Review: I Tested Alibaba's 2.4T Model

I ran Alibaba's new 2.4T Qwen3.8-Max-Preview through 4 real coding tests. Results rival Fable 5 and Grok 4.5 — with one big catch: speed.

Tags: Alibaba, Qwen, Qwen-3-8-Max (+3 more)

AI News | July 9, 2026

Grok 4.5 Review: I Tested SpaceXAI's Cheap Coder

I ran Grok 4.5 through my usual coding tests — website builds, a Go poker sim, a site audit. It's fast, cheap, and it found a bug no other model caught.

Tags: SpaceXAI, Grok, Cursor (+3 more)

AI News | July 7, 2026

MAI-Code-1-Flash Review: I Tested Microsoft's First Coding Model

I tested Microsoft's MAI-Code-1-Flash coding model on real projects. Fast and cheap, yes, but here's why I won't be switching from Kimi K2.7 Code.

Tags: Microsoft, MAI-Code, LLM (+2 more)

AI Coding | June 30, 2026

Context Engineering for AI Agents: A Field Guide

AI agents drift, forget, and derail on long tasks. Learn context engineering — 8 practical rules to keep your agents reliable, grounded, and on-goal.

Tags: AI Agents, Agentic AI, LLM (+2 more)

AI News | June 26, 2026

When Is DeepSeek V5 Coming Out? The Honest 2026 Answer

DeepSeek V5 has no announced release date. Here's what the July 24 deprecation actually means, plus V4 Pro pricing, vision API status, and Claude vs GPT-5.

Tags: DeepSeek, LLM, AI News (+1 more)

AI News | June 10, 2026

Claude Fable 5 Review: Best AI Coding Model Yet

My hands-on Claude Fable 5 review. I ran my usual coding tests and it one-shotted a poker sim no model ever beat. Best coding model yet, with caveats.

Tags: Anthropic, Claude, LLM (+4 more)

AI News | June 1, 2026

MiniMax M3 Review: Finally Matching GPT-5.5 & Opus?

I ran my usual coding tests — two websites, a poker sim, and a code audit. Here's how MiniMax M3 actually stacks up against GPT-5.5 and Opus 4.8.

Tags: MiniMax, LLM, AI Coding (+2 more)

AI News | May 27, 2026

Google Antigravity 2.0 Review: I Tested Gemini 3.5 Flash

Hands-on Antigravity 2.0 review: I tested Gemini 3.5 Flash on real coding tasks. Fast, impressive design — but the hidden token cost changes everything.

Tags: Google, LLM, Gemini (+3 more)

AI News | May 5, 2026

DeepSeek V4 Review: I Tested It on Real Code

DeepSeek V4 is here. I ran it through a TypeScript codebase audit, a poker simulation, and two web designs. Here's how it really compares to Opus 4.7 and GPT-5.5.

Tags: DeepSeek, LLM, AI News (+1 more)