Google is working on an artificial intelligence (AI) product to automate web browsing tasks, reported The Information, citing sources.  

Commonly referred to as a computer-using agent, the AI system is designed to perform tasks such as research, shopping, and booking flights.  

This technology, code-named Project Jarvis, is expected to be previewed as early as December 2024, coinciding with the release of Google’s next flagship Gemini large language model (LLM), which will enhance the capabilities of the product. 

Google’s move mirrors efforts by other technology companies, such as Anthropic, which recently announced a similar capability in its AI models, allowing the technology to interpret screen content, select buttons, enter text, and navigate websites. 

Announcing the upgraded Claude 3.5 Sonnet, and Claude 3.5 Haiku, a new model, Anthropic said: “At this stage, it is still experimental—at times cumbersome and error-prone. We are releasing computer use early for feedback from developers, and expect the capability to improve rapidly over time.” 

Amazon-backed Anthropic added that Asana, Canva, Cognition, DoorDash, Replit, and The Browser Company have already started using the capability. 

How well do you really know your competitors?

Access the most comprehensive Company Profiles on the market, powered by GlobalData. Save hours of research. Gain competitive edge.

Company Profile – free sample

Thank you!

Your download email will arrive shortly

Not ready to buy yet? Download a free sample

We are confident about the unique quality of our Company Profiles. However, we want you to make the most beneficial decision for your business, so we offer a free sample that you can download by submitting the below form

By GlobalData
Visit our Privacy Policy for more information about our services, how we may use, process and share your personal data, including information of your rights in respect of your personal data and how you can unsubscribe from future marketing communications. Our services are intended for corporate subscribers and you warrant that the email address submitted is your corporate email address.

Microsoft-supported OpenAI is also exploring this domain, with ambitions for its AI models to autonomously conduct web-based research using a computer-using agent, code-named Strawberry.  

This technology is designed to enable OpenAI’s AI to perform deep research by autonomously navigating the internet, applying specialised processing methods to pre-trained AI models. 

While current LLMs excel at summarising texts and generating prose, they often struggle with tasks that are intuitively easy for humans, such as identifying logical fallacies or playing simple games.  

Strawberry aims to address these shortcomings by enhancing reasoning in AI models, a step considered vital for a range of applications, including scientific research and software development. 

OpenAI’s strategy involves focusing on long-horizon tasks, which require planning and executing a series of actions over time.