Sebastian Raschka, LLM Research Engineer

Latest Articles

Controlling Reasoning Effort in LLMs

Jul 18, 2026

How LLMs Learn Low-, Medium-, and High-Effort Reasoning Modes

Using Local Coding Agents

Jun 27, 2026

Using Open-Weight Models in Local Coding Harnesses as an Alternative to Claude Code and Codex Subscriptions

LLM Research Papers: The 2026 List (January to May)

Jun 6, 2026

A curated roundup of notable LLM research papers that came out this year

May 16, 2026

From Gemma 4 to DeepSeek V4, How New Open-Weight LLMs Are Reducing Long-Context Costs

Jul 28, 2026

Kimi K3 Architecture Notes

Short architecture note on Kimi K3, including LatentMoE, Kimi Delta Attention, Attention Residuals, NoPE, multimodality, and inference-ef...

Jul 26, 2026

A Few Notable Open-Weight Models This Week

Short note on the architectures of six new open-weight models, including Nanbeige 4.2, Laguna S 2.1, Motif-3-Beta, Solar Open 2, Antares ...

Jul 25, 2026

Short correction note for the random seed in Listing 6.5 on page 198 of Build a Reasoning Model From Scratch.

Jul 16, 2026

Inkling: A New Open-Weight 975B MoE with a Few Surprises

Architecture and benchmark notes on Thinking Machines Lab's 975B Inkling MoE, including short convolutions, relative-position bias, train...

Jul 12, 2026

200,000 Subscribers

Short note celebrating Ahead of AI reaching 200,000 subscribers.

More notes

Hello, I'm Sebastian Raschka, PhD