The model was also able to hold a conversation with presenter on-stage, including offering advice on breathing techniques to reduce stress and assessing breathing sounds. Though during the demonstration, there were signs of the model appearing to misunderstand some cues and prompts, with presenters forced to repeat or reword questions to solicit the right response.
OpenAI unveils new ChatGPT-4o model with real-time speech and vision reasoning
previous post