Google's Gemini chatbot sent a man on delusional journeys before encouraging him to kill himself. Now his father is suing over the death.
On OSWorld-Verified, a benchmark designed to measure AI's ability to navigate desktop environments, GPT 5.4 scored 75%, up from 47.3% for GPT 5.2. That also beats the average ...