Learning to clarify: Multi-turn conversations with Action-Based Contrastive Self-Training

👤 Maximillian Chen, Research Scientist, Google Research, and Ruoxi Sun, Research Scientist, Google DeepMind
📅 2025-06-03

We propose Action-Based Contrastive Self-Training, a data-efficient contrastive reinforcement learning tuning approach for improved multi-turn conversation modeling in mixed-initiative …

Fair use excerpts with source attribution for comment, news reporting and instructive commentary only. Original summary description and analysis by UXdesign.com’s authors. Original content © Google Research.

Google Research

