Learning Dynamics of LLM Finetuning

Title: Learning Dynamics of LLM Finetuning
Presenter: Linli Zhang
Abstract: Learning dynamics, which describes how learning on specific training examples influences the model's predictions on other examples, gives us a powerful tool for understanding the behavior of deep learning systems. The authors study the learning dynamics of large language models during different types of finetuning by analyzing the step-wise decomposition of how influence accumulates among different potential responses. Their framework allows a uniform interpretation of many interesting observations about the training of popular algorithms for both instruction tuning and preference tuning. In particular, they propose a hypothetical explanation of why specific types of hallucination are strengthened after finetuning, e.g., the model might use phrases or facts from the response to question B to answer question A, or the model might keep repeating similar simple phrases when generating responses. They also extend their framework and highlight a unique "squeezing effect" to explain a previously observed phenomenon in off-policy direct preference optimization (DPO), where running DPO for too long makes even the desired outputs less likely. The framework also provides insights into where the benefits of on-policy DPO and other variants come from. The analysis not only offers a novel perspective for understanding LLM finetuning but also inspires a simple, effective method to improve alignment performance.
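To make the "step-wise decomposition" above more concrete, the influence of a single gradient step can be sketched schematically as below; the symbols A, K, and G are this announcement's illustrative shorthand and not necessarily the paper's exact notation.

\Delta \log \pi^{t}(\mathbf{y} \mid \mathbf{x}_o) \;\approx\; -\eta \, \mathcal{A}^{t}(\mathbf{x}_o)\, \mathcal{K}^{t}(\mathbf{x}_o, \mathbf{x}_u)\, \mathcal{G}^{t}(\mathbf{x}_u, \mathbf{y}_u)

Here (\mathbf{x}_u, \mathbf{y}_u) is the example used for the update at step t, \mathbf{x}_o is any other prompt being observed, \mathcal{G}^{t} is the loss gradient for the updated example, \mathcal{K}^{t} is an empirical kernel term measuring how similarly the model represents the two prompts, and \mathcal{A}^{t} maps the resulting change in logits back to log-probabilities. Read this way, one update can raise or lower the probability of responses to prompts the model was never trained on, which is the lens behind the hallucination and "squeezing effect" observations in the abstract.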
Paper link:
Disclaimer: The presenter is not one of the authors!
LLM seminar
Seminar on Large Language Models in the CS Department