The learning platform for builders

Master the code and the math
behind modern AI.

Free, comprehensive courses on Python, C, C++, and the mathematical foundations of large language models — taught from first principles to production-ready skills.

Explore Courses Read the Devlog

4Courses

200+Lessons

57Modules

FreeForever

What You'll Learn

Choose Your Track

Every course is self-contained — no fluff, no filler. Clear explanations, worked examples, and hands-on exercises from page one.

∑

Featured25 chapters

Math for LLMs

Linear algebra, calculus, probability, and optimization — taught through transformers, VAEs, and modern deep learning.

Python

The lingua franca of AI. Data structures, OOP, and scientific computing — built from first principles.

C

Understand memory, pointers, and how computers actually work. Low-level intuition every serious engineer needs.

C++

Modern C++ for high-performance systems. STL, memory models, and production-grade engineering patterns.

Research & Implementations

Landmark Papers

Rebuild breakthrough machine learning papers from scratch. There's no better way to build deep intuition than implementing the ideas yourself.

NLPReasoning

2025

Less is More: Recursive Reasoning with Tiny Networks

Alexia Jolicoeur-Martineau

Recursive reasoning with tiny networks, focusing on latent state updates and answer refinement.

CVTransformer

2020

Vision Transformer (ViT)

Dosovitskiy et al.

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale - applying Transformers directly to image patches for vision tasks.

RLReinforcement LearningDeep Learning

2015

Human-level Control through Deep Reinforcement Learning

Mnih, Kavukcuoglu et al.

Deep Q-Network (DQN) combining Q-learning with deep neural networks for end-to-end learning of action values from raw pixels.

OptimizationDeep LearningAdam

2014

Adam: A Method for Stochastic Optimization

Kingma, Ba

Adaptive moment estimation optimizer combining benefits of RMSProp and momentum, computing individual adaptive learning rates.

NLPRNN

1997

Long Short-Term Memory Networks

Hochreiter & Schmidhuber

Implementing core LSTM components from scratch: LSTM cells with gates, forward/backward passes, BPTT, initialization, dropout masks, packed sequences, bidirectional LSTMs, and full LSTM blocks.

CVGAN

2014

Generative Adversarial Networks

Goodfellow et al.

Framework for estimating generative models via an adversarial process.

NLPTransformer

2017

Attention Is All You Need

Vaswani et al.

The seminal transformer architecture replacing recurrence and convolutions entirely with self-attention mechanisms.

RLVAERNN

2018

World Models

Ha & Schmidhuber

Training generative neural network models of popular reinforcement learning environments to learn a compressed representation of the spatial and temporal aspects of the environment.

NLPRNN

1980