The learning platform for builders

Master the code and the math
behind modern AI.

Free, comprehensive courses on Python, C, C++, and the mathematical foundations of large language models — taught from first principles to production-ready skills.

4 Courses
200+ Lessons
57 Modules
Free Forever

Choose Your Track

Every course is self-contained — no fluff, no filler. Clear explanations, worked examples, and hands-on exercises from page one.

Landmark Papers

Rebuild breakthrough machine learning papers from scratch. There's no better way to build deep intuition than implementing the ideas yourself.

NLP, Reasoning
2025

Less is More: Recursive Reasoning with Tiny Networks

Alexia Jolicoeur-Martineau

Recursive reasoning with tiny networks, focusing on latent state updates and answer refinement.

CV, Transformer
2020

Vision Transformer (ViT)

Dosovitskiy et al.

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. Applies a Transformer directly to sequences of image patches for vision tasks.
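
The patching step can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: it assumes a single-channel image stored as a nested list, and the function name `image_to_patches` is hypothetical. A real ViT would also project each flattened patch linearly and add position embeddings.

```python
def image_to_patches(image, patch_size):
    """Split an H x W image (nested lists) into flattened square patches.

    Patches are returned in row-major order, one flat list per patch.
    """
    height, width = len(image), len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image dimensions must be divisible by patch_size")

    patches = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            # Flatten one patch_size x patch_size block into a single "token".
            patch = [
                image[row][col]
                for row in range(top, top + patch_size)
                for col in range(left, left + patch_size)
            ]
            patches.append(patch)
    return patches


# A 4x4 image holding the values 0..15, split into 2x2 patches.
image = [[row * 4 + col for col in range(4)] for row in range(4)]
patches = image_to_patches(image, 2)
```

Each entry of `patches` plays the role that a word token plays in a text Transformer.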

RL, Reinforcement Learning, Deep Learning
2015

Human-level Control through Deep Reinforcement Learning

Mnih, Kavukcuoglu et al.

Deep Q-Network (DQN) combining Q-learning with deep neural networks for end-to-end learning of action values from raw pixels.
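
The Q-learning target at the heart of DQN can be sketched as follows. This is a simplified illustration: the helper names `td_target` and `q_update` are hypothetical, and the tabular update stands in for the gradient step that a DQN takes on its network weights.

```python
def td_target(reward, next_q_values, done, gamma=0.99):
    """Bellman target: r + gamma * max_a' Q(s', a'), or just r at episode end."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)


def q_update(q_value, target, lr=0.1):
    """Move the current estimate a fraction lr of the way toward the target."""
    return q_value + lr * (target - q_value)


# One transition: reward 1.0, next-state action values [0.5, 2.0].
target = td_target(1.0, [0.5, 2.0], done=False, gamma=0.9)
new_q = q_update(0.0, target, lr=0.5)
```

In the full method, `next_q_values` would come from a separate target network and transitions would be sampled from a replay buffer.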

Optimization, Deep Learning, Adam
2014

Adam: A Method for Stochastic Optimization

Kingma, Ba

Adaptive moment estimation optimizer combining the benefits of RMSProp and momentum, computing an individual adaptive learning rate for each parameter from estimates of the first and second moments of the gradients.
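
A single Adam update for one scalar parameter can be sketched as follows. This is a minimal version using the default hyperparameters from the paper. The function names and the toy objective f(x) = (x - 3)^2 are assumptions chosen for illustration.

```python
import math


def adam_step(param, grad, m, v, t, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """Apply one Adam update; t is the 1-based step count."""
    m = beta1 * m + (1 - beta1) * grad         # first moment (momentum-like average)
    v = beta2 * v + (1 - beta2) * grad * grad  # second moment (RMSProp-like average)
    m_hat = m / (1 - beta1 ** t)               # bias correction for zero initialization
    v_hat = v / (1 - beta2 ** t)
    param = param - lr * m_hat / (math.sqrt(v_hat) + eps)
    return param, m, v


def minimize_quadratic(steps, lr=0.01):
    """Run Adam on the toy objective f(x) = (x - 3)^2, starting from x = 0."""
    x, m, v = 0.0, 0.0, 0.0
    for t in range(1, steps + 1):
        grad = 2 * (x - 3)  # derivative of (x - 3)^2
        x, m, v = adam_step(x, grad, m, v, t, lr=lr)
    return x


x_after_one_step = minimize_quadratic(1)
x_after_100_steps = minimize_quadratic(100)
```

Because of the bias correction, the first step has magnitude close to `lr` regardless of the gradient's scale. That is one reason Adam is relatively insensitive to gradient magnitude.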

NLP, RNN
1997

Long Short-Term Memory Networks

Hochreiter & Schmidhuber

Implementing core LSTM components from scratch: LSTM cells with gates, forward/backward passes, BPTT, initialization, dropout masks, packed sequences, bidirectional LSTMs, and full LSTM blocks.

CV, GAN
2014

Generative Adversarial Networks

Goodfellow et al.

Framework for estimating generative models via an adversarial process.

NLP, Transformer
2017

Attention Is All You Need

Vaswani et al.

The seminal transformer architecture replacing recurrence and convolutions entirely with self-attention mechanisms.
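
Scaled dot-product attention, the core operation of the architecture, can be sketched in plain Python. This is a single-head illustration on nested lists, without masking, batching, or learned projections. The function names are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Compute softmax(Q K^T / sqrt(d_k)) V, one query row at a time."""
    d_k = len(keys[0])
    outputs = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in keys
        ]
        weights = softmax(scores)
        # Output is the attention-weighted average of the value vectors.
        outputs.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return outputs


# Two identical keys receive equal weight, so the output is the mean of the values.
result = attention(
    queries=[[1.0, 0.0]],
    keys=[[1.0, 0.0], [1.0, 0.0]],
    values=[[0.0, 2.0], [4.0, 6.0]],
)
```

Each output row is a convex combination of the value vectors, so every position can draw on every other position in a single step.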

RL, VAE, RNN
2018

World Models

Ha & Schmidhuber

Training generative neural network models of popular reinforcement learning environments to learn a compressed representation of the spatial and temporal aspects of the environment.

NLP, RNN
1980

Recurrent Neural Networks

Hopfield et al.

Fundamental sequential processing architecture forming the basis of modern recurrent neural architectures.

Engineering Devlog

In-depth technical articles on AI safety, systems scaling, and the future of software engineering.

Latest Articles

Featured: Banned: The Inside Story of the Claude Fable 5 Shutdown

DeepML Engineering Devlog

Case studies, model safety evaluations, and codebase scaling audits — written by engineers, for engineers.

Ready to start building?

Pick a track, open a lesson, and write your first line of code. Everything is free — no sign-up, no credit card.

Browse Courses