• CO/AI
  • Posts
  • The Browser Control Race 🟢

The Browser Control Race 🟢

OpenAI, Anthropic, and Google are all developing similar capabilities. This isn't surprising - the web browser represents the primary interface through which most knowledge workers interact with information and services.

Today in AI

  • News roundup

  • Today’s big story

  • This week on the podcast

  • AI events

News roundup

The top stories in AI today.

NEW LAUNCHES

The latest features, products & partnerships in AI

AI AGENTS

Launches, research & more from the AI Agent Report Newsletter

AI MODELS

Deployment, research, training & infrastructure

IMPLEMENTATION

Announcements, strategies, predictions & tools

HARDWARE

Computers, phones, chips & AI powered devices

What’s happening in AI right now

The AI wars enter a new phase of browser control and context

Google's ambitions to develop an AI system that autonomously controls web browsers signals the next major battleground in artificial intelligence. Code-named "Jarvis," this system represents more than just another incremental advance - it points to a future where AI agents actively navigate and manipulate the web on our behalf, rather than merely processing our queries.

The browser becomes the battlefield

The race for browser control is heating up, with OpenAI and Anthropic also developing similar capabilities. This isn't surprising - the web browser represents the primary interface through which most knowledge workers interact with information and services. An AI that can effectively navigate websites, conduct research, and complete transactions would fundamentally change how we interact with digital services.

What's particularly interesting about Google's approach is the timing - they're planning to demonstrate this technology alongside their next Gemini model update, suggesting they see browser control and large language models as complementary technologies that become more powerful when combined.

The context revolution

But browser control is only part of the story. Google is simultaneously pushing the boundaries of what's possible with context windows. Their recent Kaggle competition challenging developers to test Gemini 1.5's ability to process over 100,000 tokens represents a significant leap forward. Ethan Mollick points out that this expanded context window allows AI to maintain "near-perfect recall" across massive amounts of information - the equivalent of reading and perfectly remembering thousands of pages of text.

Implications for business

  1. Knowledge Work: The combination of browser control and expanded context windows could automate significant portions of research, analysis, and routine decision-making tasks. Companies need to start thinking about how to redesign workflows around these capabilities.

  2. Customer Service: AI agents that can actually navigate websites and systems could handle complex customer service tasks that currently require human intervention.

  3. Privacy and Security: Browser-controlling AI raises serious questions about data privacy and system security. Organizations will need robust frameworks to govern how these systems interact with sensitive information.

Looking ahead

The combination of browser control and expanded context windows suggests we're entering a new phase where AI moves from being purely reactive to actively engaging with the digital world on our behalf. This shift will likely accelerate the pace of change in how we interact with technology and conduct many knowledge work tasks.

We publish weekly AI agent research, news, and strategies. Learn More Here

This week on the podcast

Can’t get enough of our newsletter? Check out our podcast Future-Proof.

In this episode, the hosts Anthony Batt and Shane Robinson with guest Joe Veroneau from Conveyor discuss outsmarting paperwork. Conveyor is a company that helps automate security reviews and document sharing between companies. They use AI technology, specifically language models, to automate the process of filling out security questionnaires. This saves customers a significant amount of time and improves the quality of their responses.

CO/AI Future-Proof AI podcast on Spotify
CO/AI Future-Proof AI podcast on Apple
CO/AI Future-Proof AI podcast on YouTube

AI events

Best way to get AI literate? Go to some awesome events.

We’re thrilled to share that COAI is partnering with HumanX 2025—the AI conference that’s set to redefine the future of technology. Taking place on March 10-13, 2025 at The Fontainebleau Las Vegas, this forum is where the brightest minds in AI will gather to shape what’s next. And we want you to join us!

Why attend HumanX?
HumanX isn’t just another tech conference. It’s a unique opportunity to:

  • Connect with industry leaders, C-suite executives, policymakers, and innovators from around the globe.

  • Learn from top-tier speakers like Kevin Weil, Clara Shih, and Sridhar Ramaswamy about the latest AI trends and how they’re transforming cross-functional industries.

  • Explore personalized strategies and solutions to drive your business forward with AI.

Whether you’re a startup, an established business, or an AI pro looking to make meaningful connections, HumanX is the place where AI meets opportunity.

Exclusive offer for our community
As a valued member of our community, we’re excited to extend an exclusive offer to attend HumanX 2025. Register now with our code HX25p_coai and save $250 on general admission!

How'd you like today's issue?

Have any feedback to help us improve? We'd love to hear it!

Login or Subscribe to participate in polls.

Reply

or to participate.