In this talk, I will present a systematic evaluation of large language models (LLMs) in multi-turn conversational settings, focusing on the “lost-in-conversation” phenomenon. I will introduce a novel benchmarking methodology that transforms single-turn tasks into multi-turn interactions using a simulated user and a classifier-based evaluation pipeline. Through large-scale simulations across diverse tasks, I will demonstrate that LLMs often fail to recover from early misinterpretations, exhibit high unreliability across repeated runs of the same task, and produce verbose, bloated responses. I will also discuss the limitations of current mitigation strategies, such as agent-based concatenation and temperature tuning, and highlight the implications for future LLM design and evaluation.
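
To make the setup concrete, the sketch below shows one plausible shape for such a simulation loop: a single-turn task is split into shards, a simulated user reveals them turn by turn, and a classifier decides when the assistant has committed to a full answer that can be scored. All names here (`assistant`, `simulate_user`, `classify`, `check_answer`) are hypothetical stand-ins, not the actual pipeline presented in the talk.

```python
from typing import Callable, List, Optional

# Hypothetical interfaces: each is a stand-in for an LLM-backed component.
Assistant = Callable[[List[dict]], str]                  # chat history -> assistant reply
UserSimulator = Callable[[List[str], List[dict]], str]   # remaining shards + history -> next user turn
Classifier = Callable[[str], str]                        # reply -> "answer_attempt" | "clarification" | "other"


def run_sharded_conversation(
    shards: List[str],                     # a single-turn task split into partial instructions
    assistant: Assistant,
    simulate_user: UserSimulator,
    classify: Classifier,
    check_answer: Callable[[str], bool],   # task-specific correctness check
    max_turns: int = 10,
) -> Optional[bool]:
    """Reveal the task shard by shard and score the first full answer attempt.

    Returns True/False for a correct/incorrect attempt, or None if the
    conversation ends without the assistant ever committing to an answer.
    """
    history: List[dict] = []
    remaining = list(shards)

    for _ in range(max_turns):
        # The simulated user produces the next turn, typically revealing one more shard.
        user_turn = simulate_user(remaining, history)
        if remaining:
            remaining.pop(0)
        history.append({"role": "user", "content": user_turn})

        reply = assistant(history)
        history.append({"role": "assistant", "content": reply})

        # The classifier decides whether the reply is a full answer attempt
        # (as opposed to a clarification question or partial progress).
        if classify(reply) == "answer_attempt":
            return check_answer(reply)

    return None
```

Running many such simulations per task is one way to separate average capability from run-to-run unreliability, which is the kind of aggregate behavior the talk examines.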