Library Search

Search Lessons

Search guides, examples, practice, and runnable labs with ranked results and highlighted snippets.

Reset

80 results for Python

PythonPython BasicsGuide

Guide

Python / Python Basics / Guide

Python is known for its clean and readable syntax. Unlike other languages, Python uses indentation (whitespace) to define code blocks instead of curly braces {}.

34 match hitsMatched in path, content
PythonStringsGuide

Guide

Python / Strings / Guide

print(text.strip().replace("World", "Python")) # "Hello Python"

42 match hitsMatched in content
PythonPerformanceGuide

Guide

Python / Performance / Guide

Python Performance & Optimization

42 match hitsMatched in content
Math for LLMsMultivariate CalculusGuide

Notes

Math for LLMs / Multivariate Calculus / Notes

Basic Python and NumPy; familiarity with operator overloading

50 match hitsMatched in content
Math for LLMsMathematical FoundationsLab

Theory Lab

Math for LLMs / Mathematical Foundations / Theory Lab

concepts from all 17 sections with runnable Python, NumPy, and PyTorch code.

41 match hitsMatched in content
Math for LLMsMathematical FoundationsLab

Theory Lab

Math for LLMs / Mathematical Foundations / Theory Lab

concepts from all 15 sections with runnable Python code, connecting abstract mathematics

46 match hitsMatched in content
Math for LLMsLinear Algebra BasicsLab

Theory Lab

Math for LLMs / Linear Algebra Basics / Theory Lab

Theory Lab > Runnable lab version for web reading. # Vector Spaces and Subspaces — From Axioms to Interpretability > A vector space is not a collection of arrows. It is a collection of objects that *behave like* arrows — and that behaviour is the entire point.

50 match hitsMatched in content
Math for LLMsFunctional AnalysisLab

Theory Lab

Math for LLMs / Functional Analysis / Theory Lab

Theory Lab > Runnable lab version for web reading. # Normed Spaces - Theory Notebook This notebook is the interactive companion to notes.md. It turns normed-space definitions into computations: unit balls, convergence, operator norms, dual norms, contractions,

40 match hitsMatched in content
PythonControl FlowGuide

Guide

Python / Control Flow / Guide

Match-case statements (Python 3.10+)

37 match hitsMatched in content
Math for LLMsMultivariate CalculusLab

Theory Lab

Math for LLMs / Multivariate Calculus / Theory Lab

Theory Lab > Runnable lab version for web reading. # Chain Rule and Backpropagation — Theory > *"Backpropagation is an algorithm for computing gradients efficiently in a computational graph. At its heart, it is nothing more than the chain rule of calculus appl

37 match hitsMatched in content
Math for LLMsMultivariate CalculusLab

Theory Lab

Math for LLMs / Multivariate Calculus / Theory Lab

Theory Lab > Runnable lab version for web reading. # Optimality Conditions — Theory Notebook > *"The art of optimization is knowing not just how to move, but when to stop."* Interactive theory: first-order conditions, second-order tests, convexity, Lagrange mu

37 match hitsMatched in content
Math for LLMsStatisticsLab

Theory Lab

Math for LLMs / Statistics / Theory Lab

Theory Lab > Runnable lab version for web reading. # Bayesian Inference — Theory Notebook > Posterior reasoning turns uncertainty into something we can compute with, visualize, and act on. This notebook is the interactive companion to notes.md. It follows the

37 match hitsMatched in content
Math for LLMsStatisticsLab

Theory Lab

Math for LLMs / Statistics / Theory Lab

Theory Lab > Runnable lab version for web reading. # Regression Analysis — Theory Notebook > Regression turns feature-response relationships into a concrete language for prediction, uncertainty, and regularization. This notebook is the interactive companion to

37 match hitsMatched in content
Math for LLMsFourier Analysis And Signal ProcessingLab

Theory Lab

Math for LLMs / Fourier Analysis And Signal Processing / Theory Lab

FNO spectral convolution layer (1D, NumPy/pure-Python implementation)

37 match hitsMatched in content
PythonNetworking ApisGuide

Guide

Python / Networking Apis / Guide

Networking & APIs in Python

34 match hitsMatched in content
Math for LLMsAdvanced Linear AlgebraLab

Theory Lab

Math for LLMs / Advanced Linear Algebra / Theory Lab

Theory Lab > Runnable lab version for web reading. # Positive Definite Matrices — Theory Notebook > *"Positive definiteness is the matrix condition that makes everything work."* Interactive exploration of positive definite matrices, Cholesky decomposition, Sch

36 match hitsMatched in content
Math for LLMsAdvanced Linear AlgebraLab

Theory Lab

Math for LLMs / Advanced Linear Algebra / Theory Lab

Theory Lab > Runnable lab version for web reading. # Matrix Decompositions > *"The purpose of computing is insight, not numbers — and no insight is more powerful than factoring a matrix into pieces whose structure you understand."* This notebook implements and

36 match hitsMatched in content
Math for LLMsFunctional AnalysisLab

Theory Lab

Math for LLMs / Functional Analysis / Theory Lab

Theory Lab > Runnable lab version for web reading. # Kernel Methods - Theory Notebook This notebook is the executable companion to notes.md. It turns kernel definitions into concrete computations: Gram matrices, PSD checks, feature maps, RKHS-style prediction,

36 match hitsMatched in content
PythonTestingGuide

Guide

Python / Testing / Guide

Testing in Python

33 match hitsMatched in content
Math for LLMsML Specific MathLab

Theory Lab

Math for LLMs / ML Specific Math / Theory Lab

Theory Lab > Runnable lab version for web reading. # Loss Functions - Theory Notebook This notebook is the executable companion to notes.md. It turns loss formulas into curves, gradients, masking rules, and stability checks. ### Code cell 2 import numpy as np

35 match hitsMatched in content
PythonModules PackagesGuide

Guide

Python / Modules Packages / Guide

Python Standard Library

32 match hitsMatched in content
PythonPackagingGuide

Guide

Python / Packaging / Guide

Python Packaging & Distribution

32 match hitsMatched in content
Math for LLMsAdvanced Linear AlgebraLab

Theory Lab

Math for LLMs / Advanced Linear Algebra / Theory Lab

Theory Lab > Runnable lab version for web reading. # Matrix Norms > *"The art of doing mathematics consists in finding that special case which contains all the germs of generality." — David Hilbert* Interactive theory notebook for the Matrix Norms section of A

34 match hitsMatched in content
Math for LLMsCalculus FundamentalsLab

Theory Lab

Math for LLMs / Calculus Fundamentals / Theory Lab

Theory Lab > Runnable lab version for web reading. # Series and Sequences — Theory Notebook > *"An infinite series is merely a sequence of partial sums — the question is whether that sequence has a limit."* Interactive exploration of sequences, series converge

34 match hitsMatched in content
Math for LLMsInformation TheoryLab

Theory Lab

Math for LLMs / Information Theory / Theory Lab

Theory Lab > Runnable lab version for web reading. # Fisher Information Fisher information measures local statistical distinguishability. This notebook is the interactive companion to notes.md and builds the chapter through concrete score computations, KL-curv

34 match hitsMatched in content
Math for LLMsGraph TheoryLab

Theory Lab

Math for LLMs / Graph Theory / Theory Lab

Theory Lab > Runnable lab version for web reading. # Spectral Graph Theory — Theory Notebook > *"To understand a graph, listen to its spectrum. The eigenvalues of the Laplacian are the resonant frequencies of the graph."* This notebook provides interactive der

34 match hitsMatched in content
Math for LLMsStatisticsLab

Theory Lab

Math for LLMs / Statistics / Theory Lab

Theory Lab > Runnable lab version for web reading. # Time Series — Theory Notebook > Time order changes what counts as signal, what can be predicted, and how uncertainty should evolve with horizon. This notebook is the interactive companion to notes.md. It mov

33 match hitsMatched in content
Math for LLMsGraph TheoryLab

Theory Lab

Math for LLMs / Graph Theory / Theory Lab

=== 5.1 Python dict-of-lists adjacency list ===

33 match hitsMatched in content
Math for LLMsGraph TheoryLab

Theory Lab

Math for LLMs / Graph Theory / Theory Lab

Theory Lab > Runnable lab version for web reading. # Graph Neural Networks — Theory Notebook > *"A graph neural network is a machine that reads a graph and learns by listening to its neighbors."* Interactive derivations covering: GCN propagation, over-smoothin

33 match hitsMatched in content
Math for LLMsGraph TheoryLab

Theory Lab

Math for LLMs / Graph Theory / Theory Lab

Theory Lab > Runnable lab version for web reading. # Random Graphs — Theory Notebook > *"In the random graph, order emerges from chaos — the giant component appears suddenly, like a phase transition in physics, when the average degree crosses one."* Interactiv

33 match hitsMatched in content
PythonData StructuresGuide

Guide

Python / Data Structures / Guide

from_string = list("Python") # ['P', 'y', 't', 'h', 'o', 'n']

30 match hitsMatched in content
PythonData Science MlGuide

Guide

Python / Data Science Ml / Guide

Data Science and Machine Learning Fundamentals with Python

30 match hitsMatched in content
PythonType HintsGuide

Guide

Python / Type Hints / Guide

Type hints (introduced in Python 3.5) add optional static typing to Python. They don't change how your code runs but provide metadata for tools and humans.

30 match hitsMatched in content
Math for LLMsStatisticsLab

Theory Lab

Math for LLMs / Statistics / Theory Lab

Theory Lab > Runnable lab version for web reading. # Estimation Theory > *"The problem of statistical estimation is one of the most fundamental in all of science: given observations drawn from some unknown process, what can we infer about that process?"* > — S

32 match hitsMatched in content
Math for LLMsInformation TheoryLab

Theory Lab

Math for LLMs / Information Theory / Theory Lab

Theory Lab > Runnable lab version for web reading. # Entropy — Theory Notebook > *"Information is the resolution of uncertainty."* — Claude Shannon Interactive derivations: entropy computations, MaxEnt proofs, entropy rate of Markov chains, perplexity, differe

32 match hitsMatched in content
Math for LLMsNumerical MethodsLab

Theory Lab

Math for LLMs / Numerical Methods / Theory Lab

print(f'Python float: {sys.float_info}')

32 match hitsMatched in content
Math for LLMsFourier Analysis And Signal ProcessingLab

Theory Lab

Math for LLMs / Fourier Analysis And Signal Processing / Theory Lab

Theory Lab > Runnable lab version for web reading. # §20-01 Fourier Series — Theory Notebook > *'Any periodic motion is a superposition of simple oscillations.'* — Fourier Interactive derivations for §20-01. Run top-to-bottom. | Section | Content | |---------|

32 match hitsMatched in content
PythonPython BasicsCode

Practice

Python / Python Basics / Practice

01 - Python Basics: Exercises

5 match hitsMatched in path, content
PythonPackagingCode

Examples

Python / Packaging / Examples

requires-python = ">={python_version}"

31 match hitsMatched in content
Math for LLMsMathematical FoundationsLab

Theory Lab

Math for LLMs / Mathematical Foundations / Theory Lab

with runnable Python, NumPy, and PyTorch code — from set-theoretic definitions through

31 match hitsMatched in content
Math for LLMsProbability TheoryLab

Theory Lab

Math for LLMs / Probability Theory / Theory Lab

Theory Lab > Runnable lab version for web reading. # Expectation and Moments > *"The expectation of the product of two independent random variables equals the product of their expectations — a theorem so simple it conceals its own depth."* Interactive theory n

31 match hitsMatched in content
Math for LLMsStatisticsLab

Theory Lab

Math for LLMs / Statistics / Theory Lab

Theory Lab > Runnable lab version for web reading. # Descriptive Statistics — Theory Notebook > *"The first step in wisdom is knowing what you don't know — and in statistics, that begins by knowing what your data actually looks like."* — John Tukey Interactive

31 match hitsMatched in content
Math for LLMsInformation TheoryLab

Theory Lab

Math for LLMs / Information Theory / Theory Lab

Theory Lab > Runnable lab version for web reading. # KL Divergence — Theory Notebook > *"The most important single quantity in information theory and in machine learning is the Kullback-Leibler divergence."* — David MacKay Interactive exploration of KL diverge

31 match hitsMatched in content
Math for LLMsGraph TheoryGuide

1 Intuition To 6 Sparse Matrix Formats

Math for LLMs / Graph Theory / 1 Intuition To 6 Sparse Matrix Formats

The Python dictionary-of-lists adjacency representation, while correct and flexible, has poor cache locality. Each list is a separate Python object allocated at a random memory address, causing cache misses during graph traversal.

31 match hitsMatched in content
PythonPython BasicsCode

Examples

Python / Python Basics / Examples

01 - Python Basics: Examples

4 match hitsMatched in path, content
PythonAutomationGuide

Guide

Python / Automation / Guide

print(f"Python: {platform.python_version()}")

28 match hitsMatched in content
PythonDebugging ProfilingGuide

Guide

Python / Debugging Profiling / Guide

print(f"{name=}, {age=}") # Python 3.8+: name='Alice', age=30

28 match hitsMatched in content
Math for LLMsCalculus FundamentalsLab

Theory Lab

Math for LLMs / Calculus Fundamentals / Theory Lab

Theory Lab > Runnable lab version for web reading. # Limits and Continuity - Examples This notebook demonstrates limits and continuity concepts with practical examples and visualizations. ## Topics Covered 1. Limit Intuition (Numerical Approach) 2. One-Sided L

30 match hitsMatched in content
Math for LLMsCalculus FundamentalsLab

Theory Lab

Math for LLMs / Calculus Fundamentals / Theory Lab

Theory Lab > Runnable lab version for web reading. # Integration — Theory Notebook > *"To integrate is to make whole — to recover from the rate of change the quantity that changed."* Interactive exploration of integration: Riemann sums, the Fundamental Theorem

30 match hitsMatched in content
Math for LLMsMultivariate CalculusLab

Theory Lab

Math for LLMs / Multivariate Calculus / Theory Lab

Theory Lab > Runnable lab version for web reading. # Partial Derivatives and Gradients > *"The gradient is the generalization of the derivative to functions of several variables, and it points in the direction in which the function increases most rapidly."* In

30 match hitsMatched in content
Math for LLMsInformation TheoryLab

Theory Lab

Math for LLMs / Information Theory / Theory Lab

Theory Lab > Runnable lab version for web reading. # Cross-Entropy > Cross-entropy is the bridge from information theory to the actual loss curves we use to train classifiers and language models. This notebook is the interactive companion to notes.md. We move

30 match hitsMatched in content
PythonRegexGuide

Guide

Python / Regex / Guide

Regular Expressions are patterns used to match character combinations in strings. Python's re module provides regex support.

27 match hitsMatched in content
Math for LLMsAdvanced Linear AlgebraLab

Theory Lab

Math for LLMs / Advanced Linear Algebra / Theory Lab

Theory Lab > Runnable lab version for web reading. # Eigenvalues and Eigenvectors — Theory Notebook > *"The eigenvalues of a matrix are not just numbers — they are the heartbeat of the linear map, the frequencies at which it resonates, the rates at which it re

29 match hitsMatched in content
Math for LLMsCalculus FundamentalsLab

Theory Lab

Math for LLMs / Calculus Fundamentals / Theory Lab

Theory Lab > Runnable lab version for web reading. # Derivatives and Differentiation > *"The derivative is the instantaneous rate of change — and backpropagation is just the chain rule applied to a computation graph."* Interactive theory notebook covering: der

28 match hitsMatched in content
Math for LLMsGraph TheoryLab

Theory Lab

Math for LLMs / Graph Theory / Theory Lab

notes is demonstrated with executable Python.

28 match hitsMatched in content
Math for LLMsFunctional AnalysisLab

Theory Lab

Math for LLMs / Functional Analysis / Theory Lab

Theory Lab > Runnable lab version for web reading. # Hilbert Spaces: Theory Notebook Hilbert spaces add angle, projection, orthogonal coordinates, and self-duality to normed spaces. This notebook is the interactive companion to notes.md. ### Code cell 2 import

28 match hitsMatched in content
Math for LLMsML Specific MathLab

Theory Lab

Math for LLMs / ML Specific Math / Theory Lab

Theory Lab > Runnable lab version for web reading. # Activation Functions - Theory Notebook Executable companion to notes.md. ### Code cell 2 import numpy as np import matplotlib.pyplot as plt import matplotlib as mpl try: import seaborn as sns sns.set_theme(s

28 match hitsMatched in content
Math for LLMsML Specific MathLab

Theory Lab

Math for LLMs / ML Specific Math / Theory Lab

Theory Lab > Runnable lab version for web reading. # Normalization Techniques - Theory Notebook Executable companion to notes.md. ### Code cell 2 import numpy as np import matplotlib.pyplot as plt import matplotlib as mpl try: import seaborn as sns sns.set_the

28 match hitsMatched in content
Math for LLMsML Specific MathLab

Theory Lab

Math for LLMs / ML Specific Math / Theory Lab

Theory Lab > Runnable lab version for web reading. # Sampling Methods - Theory Notebook Executable companion to notes.md. ### Code cell 2 import numpy as np import matplotlib.pyplot as plt import matplotlib as mpl try: import seaborn as sns sns.set_theme(style

28 match hitsMatched in content
Math for LLMsFourier Analysis And Signal ProcessingLab

Theory Lab

Math for LLMs / Fourier Analysis And Signal Processing / Theory Lab

Theory Lab > Runnable lab version for web reading. # Fourier Transform — Theory Notebook > *"The Fourier transform is a mathematical prism: it refracts a signal into its constituent frequencies, revealing structure invisible in time."* This notebook provides i

28 match hitsMatched in content
PythonCli ApplicationsGuide

Guide

Python / Cli Applications / Guide

python -m pytest --verbose

25 match hitsMatched in content
Math for LLMsMath For Specific ModelsLab

Theory Lab

Math for LLMs / Math For Specific Models / Theory Lab

Theory Lab > Runnable lab version for web reading. # Reinforcement Learning This notebook is the executable companion to notes.md. It keeps the MDPs tiny so Bellman equations, TD targets, policy gradients, PPO clipping, and preference losses can be inspected n

27 match hitsMatched in content
Math for LLMsMath For LLMsLab

Theory Lab

Math for LLMs / Math For LLMs / Theory Lab

Theory Lab > Runnable lab version for web reading. # Tokenization Math This notebook is the executable companion to notes.md. It implements tiny BPE, unigram/Viterbi, entropy, fertility, special-token, and context-cost examples from scratch. ### Code cell 2 im

27 match hitsMatched in content
Math for LLMsMath For LLMsLab

Theory Lab

Math for LLMs / Math For LLMs / Theory Lab

Theory Lab > Runnable lab version for web reading. # Embedding Space Math This notebook is the executable companion to notes.md. It turns embedding lookup, vector similarity, analogy structure, position encodings, output gradients, and diagnostics into small c

27 match hitsMatched in content
Math for LLMsMath For LLMsLab

Theory Lab

Math for LLMs / Math For LLMs / Theory Lab

Theory Lab > Runnable lab version for web reading. # Attention Mechanism Math This notebook is the executable companion to notes.md. It checks scaled dot-product attention, masks, entropy, multi-head shapes, KV-cache cost, ALiBi bias, and efficient-attention i

27 match hitsMatched in content
Math for LLMsMath For LLMsLab

Theory Lab

Math for LLMs / Math For LLMs / Theory Lab

Theory Lab > Runnable lab version for web reading. # Positional Encodings This notebook is the executable companion to notes.md. It checks sinusoidal rows, learned table sizes, relative offsets, RoPE rotations, ALiBi bias, and decode position ids. ### Code cel

27 match hitsMatched in content
Math for LLMsLLM Training Data PipelineLab

Theory Lab

Math for LLMs / LLM Training Data Pipeline / Theory Lab

Theory Lab > Runnable lab version for web reading. # Data Format Standards - Theory Notebook Executable companion to notes.md. ### Code cell 2 import numpy as np import matplotlib.pyplot as plt import matplotlib as mpl try: import seaborn as sns sns.set_theme(

27 match hitsMatched in content
Math for LLMsLLM Training Data PipelineLab

Theory Lab

Math for LLMs / LLM Training Data Pipeline / Theory Lab

Theory Lab > Runnable lab version for web reading. # JSONL Generation - Theory Notebook Executable companion to notes.md. ### Code cell 2 import numpy as np import matplotlib.pyplot as plt import matplotlib as mpl try: import seaborn as sns sns.set_theme(style

27 match hitsMatched in content
Math for LLMsLLM Training Data PipelineLab

Theory Lab

Math for LLMs / LLM Training Data Pipeline / Theory Lab

Theory Lab > Runnable lab version for web reading. # Quality Checks - Theory Notebook Executable companion to notes.md. ### Code cell 2 import numpy as np import matplotlib.pyplot as plt import matplotlib as mpl try: import seaborn as sns sns.set_theme(style="

27 match hitsMatched in content
Math for LLMsLLM Training Data PipelineLab

Theory Lab

Math for LLMs / LLM Training Data Pipeline / Theory Lab

Theory Lab > Runnable lab version for web reading. # Full Dataset Assembly - Theory Notebook Executable companion to notes.md. ### Code cell 2 import numpy as np import matplotlib.pyplot as plt import matplotlib as mpl try: import seaborn as sns sns.set_theme(

27 match hitsMatched in content
Math for LLMsLLM Training Data PipelineLab

Theory Lab

Math for LLMs / LLM Training Data Pipeline / Theory Lab

Theory Lab > Runnable lab version for web reading. # Contamination and Dedup Audits - Theory Notebook Executable companion to notes.md. ### Code cell 2 import numpy as np import matplotlib.pyplot as plt import matplotlib as mpl try: import seaborn as sns sns.s

27 match hitsMatched in content
Math for LLMsLLM Training Data PipelineLab

Theory Lab

Math for LLMs / LLM Training Data Pipeline / Theory Lab

Theory Lab > Runnable lab version for web reading. # Documentation and Governance - Theory Notebook Executable companion to notes.md. ### Code cell 2 import numpy as np import matplotlib.pyplot as plt import matplotlib as mpl try: import seaborn as sns sns.set

27 match hitsMatched in content
Math for LLMsLLM Training Data PipelineLab

Theory Lab

Math for LLMs / LLM Training Data Pipeline / Theory Lab

Theory Lab > Runnable lab version for web reading. # Data Mixture Optimization - Theory Notebook Executable companion to notes.md. ### Code cell 2 import numpy as np import matplotlib.pyplot as plt import matplotlib as mpl try: import seaborn as sns sns.set_th

27 match hitsMatched in content
Math for LLMsEvaluation And ReliabilityLab

Theory Lab

Math for LLMs / Evaluation And Reliability / Theory Lab

Theory Lab > Runnable lab version for web reading. # Capability Benchmarks Capability benchmarks estimate what a model can do under a stated protocol; reliability begins when the protocol, metric, and uncertainty are explicit. This notebook is the executable c

27 match hitsMatched in content
Math for LLMsEvaluation And ReliabilityLab

Theory Lab

Math for LLMs / Evaluation And Reliability / Theory Lab

Theory Lab > Runnable lab version for web reading. # Calibration and Uncertainty Calibration asks whether confidence matches correctness; uncertainty methods decide when a model should answer, abstain, or return a set of plausible answers. This notebook is the

27 match hitsMatched in content
Math for LLMsEvaluation And ReliabilityLab

Theory Lab

Math for LLMs / Evaluation And Reliability / Theory Lab

Theory Lab > Runnable lab version for web reading. # Robustness and Distribution Shift Robustness evaluation measures how model risk changes when the test distribution, prompt surface, subgroup, or adversary changes. This notebook is the executable companion t

27 match hitsMatched in content
Math for LLMsEvaluation And ReliabilityLab

Theory Lab

Math for LLMs / Evaluation And Reliability / Theory Lab

Theory Lab > Runnable lab version for web reading. # Error Analysis and Ablations Error analysis turns aggregate scores into failure structure; ablations test which component actually caused an improvement. This notebook is the executable companion to notes.md

27 match hitsMatched in content
Math for LLMsEvaluation And ReliabilityLab

Theory Lab

Math for LLMs / Evaluation And Reliability / Theory Lab

Theory Lab > Runnable lab version for web reading. # Online Experimentation and AB Testing Online experiments connect offline model evidence to causal user and system impact through randomized comparison, statistical inference, and trust checks. This notebook

27 match hitsMatched in content
Math for LLMsAlignment And SafetyLab

Theory Lab

Math for LLMs / Alignment And Safety / Theory Lab

Theory Lab > Runnable lab version for web reading. # Instruction Tuning and SFT Supervised fine-tuning aligns a pretrained next-token model with demonstrated instruction-following behavior by optimizing response tokens under a curated chat protocol. This noteb

27 match hitsMatched in content
Math for LLMsAlignment And SafetyLab

Theory Lab

Math for LLMs / Alignment And Safety / Theory Lab

Theory Lab > Runnable lab version for web reading. # Preference Optimization RLHF and DPO Preference optimization learns from comparisons, either by fitting a reward model and optimizing a KL-regularized policy or by directly optimizing policy log-ratios. This

27 match hitsMatched in content