Papers

Implementing core LSTM components from scratch: LSTM cells with gates, forward/backward passes, BPTT, initialization, dropout masks, packed sequences, bidirectional LSTMs, and full LSTM blocks.
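As a rough illustration of the first item in that list (an LSTM cell with gates and its forward pass), here is a minimal pure-Python sketch. The function names (`init_params`, `lstm_step`, `lstm_forward`), the parameter layout, the uniform initialization range, and the forget-gate bias of 1.0 are all assumptions made for this example, not something the listing specifies. It uses the standard gate equations and leaves out the backward pass, dropout, packing, and bidirectionality.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, v):
    # Multiply a matrix (list of rows) by a vector (list of floats).
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def init_params(input_size, hidden_size, seed=0):
    # Hypothetical layout: one input matrix W, one recurrent matrix U and
    # one bias b per gate. The uniform range and the forget-gate bias of
    # 1.0 are common heuristics, assumed here for illustration.
    rng = random.Random(seed)
    scale = 1.0 / math.sqrt(hidden_size)

    def mat(rows, cols):
        return [[rng.uniform(-scale, scale) for _ in range(cols)]
                for _ in range(rows)]

    params = {}
    for gate in ("i", "f", "g", "o"):
        params["W_" + gate] = mat(hidden_size, input_size)
        params["U_" + gate] = mat(hidden_size, hidden_size)
        params["b_" + gate] = [1.0 if gate == "f" else 0.0] * hidden_size
    return params


def lstm_step(x, h_prev, c_prev, p):
    # Pre-activation for one gate: W x + U h_prev + b.
    def pre(gate):
        wx = matvec(p["W_" + gate], x)
        uh = matvec(p["U_" + gate], h_prev)
        return [a + b + c for a, b, c in zip(wx, uh, p["b_" + gate])]

    i = [sigmoid(z) for z in pre("i")]    # input gate
    f = [sigmoid(z) for z in pre("f")]    # forget gate
    g = [math.tanh(z) for z in pre("g")]  # candidate cell values
    o = [sigmoid(z) for z in pre("o")]    # output gate

    # New cell state: keep part of the old state, add gated candidates.
    c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c_prev, i, g)]
    # New hidden state: gated, squashed cell state.
    h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
    return h, c


def lstm_forward(xs, params, hidden_size):
    # Run the cell over a sequence, starting from zero states.
    h = [0.0] * hidden_size
    c = [0.0] * hidden_size
    hs = []
    for x in xs:
        h, c = lstm_step(x, h, c, params)
        hs.append(h)
    return hs, (h, c)
```

Calling `lstm_forward(xs, init_params(2, 3), 3)` on a list of length-2 input vectors returns one hidden vector of length 3 per time step, plus the final `(h, c)` pair.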

CV, GAN

2014

Generative Adversarial Networks

Goodfellow et al.

Framework for estimating generative models via an adversarial process.

NLP, Transformer

2017

Attention Is All You Need

Vaswani et al.

The seminal Transformer architecture, which replaces recurrence and convolutions entirely with self-attention.

RL, VAE, RNN

2018

World Models

Ha & Schmidhuber

Training generative neural network models of popular reinforcement learning environments to learn a compressed representation of the spatial and temporal aspects of the environment.

NLP, RNN

1980

Recurrent Neural Networks

Hopfield et al.

Foundational sequence-processing architecture that underlies modern recurrent networks.