Google Unleashes AI Model with Unprecedented Computer Control Capabilities
Google's latest Gemini 3.5 Flash model boasts a groundbreaking feature that enables it to directly interact with and control computer screens, outperforming rival models with a score of 78.4 on the OSWorld benchmark. This update has significant implications for developers and businesses, allowing for more seamless automation and testing across various environments.
Google has integrated "Computer Use" directly into Gemini 3.5 Flash, letting the model operate computers, browsers, and mobile devices on its own. On the OSWorld benchmark, it scores 78.4, putting it on par with GPT-5.5. Developers can use the Gemini API to build agents for software testing or office automation. The article Google bakes computer control directly into Gemini 3.5 Flash, letting the model see and operate your screen appeared first on The Decoder.