← Back to UX News Feed

Learning to clarify: Multi-turn conversations with Action-Based Contrastive Self-Training

👤 Maximillian Chen, Research Scientist, Google Research, and Ruoxi Sun, Research Scientist, Google DeepMind
📅 2025-06-03

We propose Action-Based Contrastive Self-Training, a data-efficient contrastive reinforcement learning tuning approach for improved multi-turn conversation modeling in mixed-initiative … Full Product UX article at Google Research »

Fair use excerpts with source attribution for comment, news reporting and instructive commentary only. Original summary description and analysis by UXdesign.com’s authors. Original content © Google Research.

Learning to clarify: Multi-turn conversations with Action-Based Contrastive Self-Training

Access UX News