Training Conversational Agents on Noisy Data
The development of conversational agents that can interact with people in the real world presents many difficult challenges. First, natural language understanding and perception of context must be performed based on continuous data from sensors that often contain a high rate of sensor noise. Second, conversational agents must respond appropriately to natural human behavior that spans vast action spaces, thus data-efficient collection strategies are necessary in order to acquire large amounts of training data. This session will address both sides of the challenge: (1) using data-efficient strategies during the utterance collection and annotation phases to optimize the trade-off between cost and quality when collecting training data, (2) using data-driven approaches to train and generate behaviors for a conversational agent despite noisy data and lack of training labels. Finally, we will discuss field studies and past research for a conversational humanoid robot to autonomously generate socially appropriate locomotion and speech behaviors that was deployed in the real world to provide guidance and services to people.
Session ID: Presentation Type: On-Demand Session (Recorded)
Date / Time: [Content On-Demand] @ On Demand ET (US)
Presented by:Appen collects and labels images, text, speech, audio, video, and other data used to build and continuously improve the world’s most innovative artificial intelligence systems. Click for more details and additional sessions and content.
To view this session, register for the conference.