Rabbit's web-based 'large action model' agent arrives on r1 as early as this week

Devin Coldewey

September 23, 2024 at 1:43 p.m.·7 min read

The Rabbit r1 was the must-have gadget of early 2024, but the blush fell off it pretty quick when the company's expansive promises failed to materialize. CEO Jesse Lyu admits that "on day one, we set our expectations too high" but also said that an update coming to devices this month will finally set the vaunted Large Action Model free on the web.

While skeptics may (justifiably) see this as too little, too late, or another shifting of goalposts, Rabbit's aspiration of building a platform-agnostic agent for web and mobile apps still has fundamental — if still largely theoretical — value.

Speaking to TechCrunch, Lyu said that the last six months have been a whirlwind of shipping, bug fixes, improving response times, and adding minor features. But despite 16 over-the-air updates to the r1, it remains fundamentally limited to interacting with an LLM or accessing one of seven specific services, like Uber and Spotify.

"That was the first-ever version of the LAM, trained on recordings collected from data laborers, but it isn't generic — it only connects to those services," he said. Whether or not it was what they call the LAM is pretty much academic at this point; whatever the model was, it didn't provide the capabilities Rabbit detailed at its debut.

A generalist web-based agent

But Rabbit is ready to release the first generic version, which is to say not specific to any app or interface, of the LAM, which Lyu demonstrated for me.

This version is a web-based agent that reasons out the steps to do any ordinary task, like buying tickets to a concert, registering a website, or even playing an online game. "Our goal is very clear: At the end of September, your r1 will suddenly do lots more things. It should support anything you can do on any website," Lyu said.

Given a task, it first breaks that task down into steps, then starts executing them by analyzing what it sees on screen: buttons, fields, images, regardless of position or appearance. Then it interacts with the appropriate element based on what it has learned in general about how websites work.

I asked it (through Lyu, who was operating it remotely) to register a new website for a film festival. Taking an action every few seconds, it searched for domain registries on Google, picked one (a sponsored one, I think), put film festival in the domain box, and from the resulting list of options picked "filmfestival2023.com" for $14. Technically I hadn't given it any constraints like "for 2025" or "horror festival" or anything.

Similarly, when Lyu asked it to search for and buy an r1, it quickly found its way to eBay, where dozens were on sale. Perhaps a good result for a user but not for the founder of the company presenting to the press! He laughed it off and did the prompt again with the addition that it should buy only from the official website. The agent succeeded.

Next, he had it play Dictionary.com's daily word game. It took a bit of prompt engineering (the model found an out in that it could quickly finish by hitting "end game") but it did it.

Which browser does it use, though? A fresh, clean one in the cloud, Lyu said, but they are working on local versions, like a Chrome extension, that would mean you can use existing sessions and it wouldn't have to log into your services.

To that end, as users are understandably (and rightly) wary of giving any company full access to their credentials, the agent is not equipped with those. Lyu suggested that a walled-off small language model with your credentials could be privately invoked in the future to perform logins. It seems to be an open question how this will work, which is somewhat to be expected given the newness of the space.

An example of UI analysis inside apps from the Rabbit website.

Still learning

The demo showed me a couple things. First, if we give the company and its developers the benefit of the doubt that this isn't all some elaborate hoax (as some believe), it does appear to be a working, general-purpose web agent. And that would be, if not a first in itself, certainly the first to be easily accessible to consumers.

"There are companies doing verticals, for Excel or legal documents, but I believe this is one of the first general agents for consumers," Lyu said. "The idea is you can say anything that can be achieved through a website. We'll have the generic agent for websites first, then for apps."

Second, it showed that prompt engineering is still very much needed. How you phrase a request can easily be the difference between success and failure, and that's probably not something ordinary consumers will tolerate.

Lyu cautioned that this is a "playground version," not final by any means, and that although it is a fully functioning general web agent, it still can be improved in many ways. For instance, he said, "the model is smart enough to do the planning, but isn't smart enough to skip steps." It wouldn't "learn" that a user prefers not to buy their electronics on eBay, or that it should scroll down after searching to avoid the wall of sponsored results.

User data won't be harvested to improve the model — yet. Lyu attributed this to the fact that there's basically no evaluation method for a system like this, so it is difficult to say quantitatively whether improvements have been made. A "teach mode" is also coming, though, so you can show it how to do a specific type of task.

Interestingly, the company is also working on a desktop agent that can interact with apps like word processors, music players, and of course browsers. This is still in the early stages, but it's working. "You don't even need to input a destination, it just tries to use the computer. As long as there is an interface, it can control it."

Third, there is still no "killer app," or at least no obvious one. The agent is impressive, but I personally would have little use for it, being unfortunately sitting in front of a browser for 8 hours a day anyway. There are almost certainly some great applications, but none sprang to mind that makes the utility of a browser-based automaton as obvious as that of, say, a robot vacuum.

Why not an app, again?

I raised the common objection to the entire Rabbit business model, essentially that "this could be an app."

Lyu has clearly heard this criticism many times, and he was confident of his answer.

"If you do the math, it doesn't make sense," he said. "Yes, it's technically achievable, but you're going to piss off Apple and Google from day one. They will never let this be better than Siri or Gemini. Just like there's no way Apple intelligence is going to control Google stuff better, or vice versa. And they take 30% of revenue! If at the beginning we'd just built an app, we'd never have this momentum."

The rabbit r1 in use. Hand model: Chris Velazco of The Washington Post.

The fundamental pitch Rabbit is making is that there can be a third-party AI or device that can access and operate all your other services, and from outside them, like you are. "A cross-platform, generic agent system," as Lyu called it. "We'll control every UI, and the website is a good start. Then we'll go to Windows, to MacOS, to phones."

Speaking of which: "We never said we'd never build a phone in the future." Isn't that antithetical to their original thesis of a smaller, simpler device? Maybe, maybe not.

In the meantime, they're working on starting to fulfill the promises they made early this year. The new model should be available to any r1 owner sometime this week when the OTA update goes out. Instructions on how to invoke it will arrive then as well. Lyu cautioned expectant users with his characteristic understatement.

"We're setting the expectations right. It's not perfect," he said. "It's just the best the human race has achieved so far."

Reuters
Exclusive-US to propose ban on Chinese software, hardware in connected vehicles, sources say
WASHINGTON (Reuters) -The U.S. Commerce Department is expected on Monday to propose prohibiting Chinese software and hardware in connected and autonomous vehicles on American roads due to national security concerns, two sources told Reuters. The Biden administration has raised serious concerns about the collection of data by Chinese companies on U.S. drivers and infrastructure as well as the potential foreign manipulation of vehicles connected to the internet and navigation systems.
Insider Monkey
Analyst on iPhone 16: I Do Not Think People Are ‘Running Out to Buy These Phones’
We recently published a list of Top 10 Buzzing AI Stocks Now. Since Apple Inc (NASDAQ:AAPL) ranks 3rd on the list, it deserves a deeper look. Following the aggressive rate cut by the Federal Reserve, the market roared to new highs but quickly lost enthusiasm as investors look for clues on what might be ailing the economy that […]
Business Insider
Apple Intelligence will drive sales for iPhone 16. Just wait for it, analyst says.
Apple Intelligence won't roll out until October but it'll still be a a big booster to iPhone 16 sales in the coming months, analysts say.
Barrons.com
An Intel-Qualcomm Megamerger Is a Bad Idea. Here’s Why.
Intel is now in play. Late Friday, The Wall Street Journal reported that Qualcomm had approached Intel about a takeover. The Journal story cited people familiar with the situation, noting that “a deal is far from certain.”
Engadget
See the iPhone 16’s game-changing battery removal process in new iFixit teardown
A teardown by iFixit now shows the new battery removal process in action, and it looks easier than ever.
HuffPost
'Grifter' Melania Trump Gets Blunt Reminder After Awkward New Sales Pitch
The former first lady unveiled her latest "collectibles," but critics spotted one key problem.
Fortune
Data scientist nails the Trump gaffe that started what looks today like a building Harris landslide
"That event, and not the debate that just made things worse for Trump, marked the decisive turning point in the campaign."
HuffPost
Authoritarianism Expert Spots New Donald Trump Boast That ‘Sends A Chill Down My Spine’
Ruth Ben-Ghiat also examined Donald Trump and JD Vance's latest "really disturbing" turn.
The Canadian Press
Boy abducted from California in 1951 at age 6 found alive on East Coast more than 70 years later
OAKLAND, Calif. (AP) — Luis Armando Albino was 6 years old in 1951 when he was abducted while playing at an Oakland, California park. Now, more than seven decades later, Albino has been found thanks to help from an online ancestry test, old photos and newspaper clippings.
The Daily Beast
Why Trump, 78, Can’t Rally Like He Did Before
What a difference eight years makes, especially at his age. Donald Trump is holding way fewer rallies than he did during his previous presidential runs, in part because he’s older and enjoys staying in at Mar-a-Lago, Axios reports.The former president did 72 rallies in the summer leading up to the 2016 election, barn-burning events that demonstrated Americans’ enthusiasm about his bid. This summer, he did 24, just over a third as many.According to people on Trump’s team, besides his inclination
The Daily Beast
Emily Ratajkowski Reveals Disturbing Link Between Diddy and Menendez Brothers
Emily Ratajkowski is calling attention to the connections she claims nobody wants to talk about between Sean “Diddy” Combs and the notorious Menendez brothers.“With everything that’s coming out about Diddy and the allegations and also this new Menendez brothers show called Monsters I think we need to have a conversation about male sexual assault,” the model and actress says in a video posted to TikTok Sunday.Ratajkowski notes that the reason Diddy was able to “hide in plain sight for so long” wa
BuzzFeed
This Woman's Outfit For A Dinner With Her Husband's Boss Is Going Viral Online — Here's Why
"As soon as he saw me, he said I was going to embarrass him."
HuffPost
Mary Trump Pinpoints The Family Root Of Uncle's Latest Attack On Kamala Harris
Donald Trump's niece also highlighted the former president's total inability to recognize himself as one thing.
HuffPost
Marjorie Taylor Greene’s Boyfriend Emits ‘Major Karen Vibes’ In Online Complaint
Brian Glenn of Real America’s Voice griped about an issue that united many critics in mockery.
The Wrap
Olivia Nuzzi and RFK Jr’s Affair Was Exposed Because He Bragged to Friends About Her Nudes | Report
The star NY Magazine Washington correspondent is on leave pending a third-party investigation The post Olivia Nuzzi and RFK Jr’s Affair Was Exposed Because He Bragged to Friends About Her Nudes | Report appeared first on TheWrap.
FTW Outdoors
Tom Brady's disgusted reaction to Cowboys penalty was his latest great broadcast moment
We have found a niche for Tom Brady to fill as Fox's lead analyst on NFL games, and it's just this: ripping on the Dallas Cowboys. He did it in Week 2 as the Cowboys melted down against the New Orleans Saints, and it happened again as he was ranting during Dallas's
The Daily Beast
Kamala Harris Plotted to Stop Me Getting a Job, Kimberly Guilfoyle Says
Vice President Kamala Harris tried to block Kimberly Guilfoyle—the former prosecutor turned Fox News host turned MAGA beau to Donald Trump Jr.—from getting a job in the San Francisco district attorney’s office over 20 years ago, even going so far as to falsely pose as a member of the hiring committee, according to allegations in a New York Times report.While Harris says she never suggested Guilfoyle couldn’t have a job, former District Attorney Terence Hallinan, their boss at the time, largely b
People
She Killed Her Mom and Invited Friend to See the Body. Here's What a Psychiatrist Said Carly Gregg Was Thinking
Carly Gregg was sentenced to life in prison for fatally shooting her mother when she was 14 years old
WSJ
Mexico Is Building a $7.5 Billion Trade Route to Compete With Panama Canal
The Panama Canal isn’t as reliable as it once was and Mexico is racing to build a new corridor connecting the Pacific and Atlantic Oceans that would help fill the gap. WSJ explores whether it will lead to faster or cheaper shipping.
FTW Outdoors
Raheem Morris had a perfect 4-word response to no-call on Chiefs' pass interference
This post has been updated because an earlier version included an inaccuracy. Raheem Morris wasn't going to say anything, but by saying just four words, you KNOW what he meant. The Atlanta Falcons head coach was clearly fumin

This 'versatile' Coach Outlet tote bag is 66% off right now: 'Goes with everything'

Rabbit's web-based 'large action model' agent arrives on r1 as early as this week

A generalist web-based agent

Still learning

Why not an app, again?

Latest Stories

Exclusive-US to propose ban on Chinese software, hardware in connected vehicles, sources say

Analyst on iPhone 16: I Do Not Think People Are ‘Running Out to Buy These Phones’

Apple Intelligence will drive sales for iPhone 16. Just wait for it, analyst says.

An Intel-Qualcomm Megamerger Is a Bad Idea. Here’s Why.

See the iPhone 16’s game-changing battery removal process in new iFixit teardown

'Grifter' Melania Trump Gets Blunt Reminder After Awkward New Sales Pitch

Data scientist nails the Trump gaffe that started what looks today like a building Harris landslide

Authoritarianism Expert Spots New Donald Trump Boast That ‘Sends A Chill Down My Spine’

Boy abducted from California in 1951 at age 6 found alive on East Coast more than 70 years later

Why Trump, 78, Can’t Rally Like He Did Before

Emily Ratajkowski Reveals Disturbing Link Between Diddy and Menendez Brothers

This Woman's Outfit For A Dinner With Her Husband's Boss Is Going Viral Online — Here's Why

Mary Trump Pinpoints The Family Root Of Uncle's Latest Attack On Kamala Harris

Marjorie Taylor Greene’s Boyfriend Emits ‘Major Karen Vibes’ In Online Complaint

Olivia Nuzzi and RFK Jr’s Affair Was Exposed Because He Bragged to Friends About Her Nudes | Report

Tom Brady's disgusted reaction to Cowboys penalty was his latest great broadcast moment

Kamala Harris Plotted to Stop Me Getting a Job, Kimberly Guilfoyle Says

She Killed Her Mom and Invited Friend to See the Body. Here's What a Psychiatrist Said Carly Gregg Was Thinking

Mexico Is Building a $7.5 Billion Trade Route to Compete With Panama Canal

Raheem Morris had a perfect 4-word response to no-call on Chiefs' pass interference