Sebastian Raschka portrait

I am an LLM Research Engineer, author, and educator focused on large language models, reasoning models, deep learning, and practical machine learning systems. My work centers on making modern AI easier to understand through clear explanations, working code, and end-to-end examples.

My background spans both academia and industry. I was previously a professor at the University of Wisconsin-Madison, where I taught statistics and machine learning and did deep learning research, and I have also worked in AI engineering roles in industry.

Areas of Focus

  • Large language models and reasoning models
  • Pretraining, finetuning, inference, and evaluation
  • PyTorch and performance-oriented AI engineering
  • Machine learning education and technical writing

Best Places to Start

Books and Resources

If you are mainly interested in hands-on LLM material, the best entry points are my Build a Large Language Model (From Scratch) book, my blog archive, and the accompanying open-source repositories linked throughout the site.

If you are looking for broader machine learning material, you may also find the Machine Learning FAQ, course pages, and resource archive useful.

Elsewhere

You can also find me on GitHub, LinkedIn, Google Scholar, YouTube, X, and Substack.