Skip to main content

Fine-tuning generative large language models

In this presentation, we go over three fine-tuning techniques, namely Instruction fine-tuning, Domain adaptation fine-tuning, and Reinforcement Learning with Human Feedback (RLHF).

Download PDF

In this presentation, we go over three fine-tuning techniques, namely (a) Instruction fine-tuning, (b) Domain adaptation fine-tuning, and (c) Reinforcement Learning with Human Feedback (RLHF), and explain when to use which one.