CHAPTER 18 · Alignment and Safety

This chapter contains the following subtopics:

01 · Instruction Tuning and SFT
02 · Preference Optimization RLHF and DPO
03 · Red Teaming and Safety Evaluations
04 · Policy and Guardrails
05 · Human in the Loop and Monitoring