← Back to UX News Feed
Learning to clarify: Multi-turn conversations with Action-Based Contrastive Self-Training
👤 Maximillian Chen, Research Scientist, Google Research, and Ruoxi Sun, Research Scientist, Google DeepMind
📅 2025-06-03

We propose Action-Based Contrastive Self-Training, a data-efficient contrastive reinforcement learning tuning approach for improved multi-turn conversation modeling in mixed-initiative …
Full Product UX article at Google Research »
Fair use excerpts with source attribution for comment, news reporting and instructive commentary only. Original summary description and analysis by UXdesign.com’s authors. Original content © Google Research.
Google Research
Access UX News
Login or create an account to
- Save as favorite
- Upvote/downvote articles
- Share via socials
- Comment on articles
- Submit an article
Product UX News Categories
Next UX News Fine-tuning LLMs with user-level differential privacy »