AI21 Labs' new AI model can handle more context than most

Kyle Wiggers

Updated July 24, 2024 at 2:14 p.m.·3 min read

Increasingly, the AI industry is moving toward generative AI models with longer contexts. But models with large context windows tend to be compute-intensive. Or Dagan, product lead at AI startup AI21 Labs, asserts that this doesn't have to be the case -- and his company is releasing a generative model to prove it.

Contexts, or context windows, refer to input data (e.g. text) that a model considers before generating output (more text). Models with small context windows tend to forget the content of even very recent conversations, while models with larger contexts avoid this pitfall -- and, as an added benefit, better grasp the flow of data they take in.

AI21 Labs' Jamba, a new text-generating and -analyzing model, can perform many of the same tasks that models like OpenAI's ChatGPT and Google's Gemini can. Trained on a mix of public and proprietary data, Jamba can write text in English, French, Spanish and Portuguese.

Jamba can handle up to 140,000 tokens while running on a single GPU with at least 80GB of memory (like a high-end Nvidia A100). That translates to around 105,000 words, or 210 pages -- a decent-sized novel.

Meta's Llama 2, by comparison, has a ~4,000-token context window -- on the smaller side by today's standards -- but only requires a GPU with ~12GB of memory in order to run. (Context windows are typically measured in tokens, which are bits of raw text and other data.)

On its face, Jamba is unremarkable. Loads of freely available, downloadable generative AI models exist, from Databricks' recently released DBRX to the aforementioned Llama 2.

But what makes Jamba unique is what's under the hood. It uses a combination of two model architectures: transformers and state space models (SSMs).

Transformers are the architecture of choice for complex reasoning tasks, powering models like GPT-4 and Google's Gemini, for example. They have several unique characteristics, but by far transformers' defining feature is their "attention mechanism." For every piece of input data (e.g. a sentence), transformers weigh the relevance of every other input (other sentences) and draw from them to generate the output (a new sentence).

SSMs, on the other hand, combine several qualities of older types of AI models, such as recurrent neural networks and convolutional neural networks, to create a more computationally efficient architecture capable of handling long sequences of data.

Now, SSMs have their limitations. But some of the early incarnations, including an open source model called Mamba from Princeton and Carnegie Mellon researchers, can handle larger inputs than their transformer-based equivalents while outperforming them on language generation tasks.

Jamba in fact uses Mamba as part of the core model -- and Dagan claims it delivers three times the throughput on long contexts compared to transformer-based models of comparable sizes.

"While there are a few initial academic examples of SSM models, this is the first commercial-grade, production-scale model," Dagan said in an interview with TechCrunch. "This architecture, in addition to being innovative and interesting for further research by the community, opens up great efficiency and throughput possibilities."

Now, while Jamba has been released under the Apache 2.0 license, an open source license with relatively few usage restrictions, Dagan stresses that it's a research release not intended to be used commercially. The model doesn't have safeguards to prevent it from generating toxic text or mitigations to address potential bias; a fine-tuned, ostensibly "safer" version will be made available in the coming weeks.

But Dagan asserts that Jamba demonstrates the promise of the SSM architecture even at this early stage.

"The added value of this model, both because of its size and its innovative architecture, is that it can be easily fitted onto a single GPU," he said. "We believe performance will further improve as Mamba gets additional tweaks."

South China Morning Post
Apple falls: iPhone maker out of China's top 5 as Huawei ascends
Apple has fallen out of the top 5 ranking of smartphone vendors in China, according to data trackers, marking the first time in years the iPhone maker has fallen so low in one of its most important markets. iPhone shipments in China in the three months ended June declined 2 per cent year on year, bumping Apple down to No 6 on Canalys' list of top vendors by shipments, putting it behind Vivo, Oppo, Honor, Huawei Technologies and Xiaomi, according to a report from the market research firm on Thurs
Reuters
Apple's China smartphone shipments drop 6.7% as Huawei surges, data shows
BEIJING (Reuters) -Apple's smartphone shipments in China fell by 6.7% in the second quarter of 2024, as the tech giant faced intensifying competition from rivals like Huawei, according to data from market research firm Canalys. Apple's total shipments for the quarter ending in June stood at 9.7 million units, down from 10.4 million units in the same quarter last year, Canalys data shows. In contrast, Huawei's smartphone shipments surged 41% year-on-year to 10.6 milion in the quarter, bolstered by the launch of its new Pura 70 series in April.
Associated Press
A neurological disorder stole her voice. Jennifer Wexton takes it back on the House floor.
When Jennifer Wexton rose Thursday to speak on the House floor, something she has done countless times before, the congresswoman used a voice she thought was gone forever. After a rare neurological disorder robbed her of her ability to speak clearly, Wexton has been given her voice back with the help of a powerful artificial intelligence program, allowing the Virginia Democrat to make a clone of her speaking voice using old recordings of speeches and appearances she made as a congresswoman.
Engadget
The Morning After: OpenAI reveals its AI-powered search engine, SearchGPT
The biggest news stories this morning: AI video startup Runway reportedly trained on ‘thousands’ of YouTube videos without permission, The best cameras for 2024, WhatsApp hits 100 million monthly active US users.
South China Morning Post
Chinese AI start-up Baichuan raises US$700 million from Alibaba, Tencent, Xiaomi
Baichuan AI, one of China's four so-called artificial intelligence (AI) tigers, raised about 5 billion yuan (US$687.6 million) in a new funding round that valued the start-up at more than 20 billion yuan, the company said on Thursday. The Beijing-based firm's latest round was backed by some of the biggest names in Chinese technology, including Alibaba Group Holding, Tencent Holdings and Xiaomi, along with some state-backed funds. Alibaba owns the South China Morning Post. China International Cap
Engadget
OpenAI unveils SearchGPT, an AI-powered search engine
The launch of SearchGPT comes amid growing competition in AI-powered search.
Sky News
£7.7 million bounty offered in hunt for members of North Korea-backed hacking group
The UK, US and South Korea have accused a North Korea-backed cyber group of carrying out an online espionage campaign to steal military and nuclear secrets. The "Andariel" group has been compromising organisations around the globe as it attempts to get hold of sensitive and classified technical information and intellectual property data, according to the UK's National Cyber Security Centre (NCSC). The centre, along with the FBI in the US and South Korea's national intelligence service, have issued a joint warning and advisory note about Andariel's actions.
Barrons.com
Apple’s AI iPhone Could Take the Stock This High
Apple stock hasn’t racked up Nvidia -like gains since ChatGPT’s launch almost two years ago—but that doesn’t prevent the iPhone maker from racking up gains from the AI fervor. For Raymond James analyst Srini Pajjuri, the stock is “a more stable AI play for volatile times.” Apple will offer Apple Intelligence AI features only on the iPhone 15 Pro and the iPhone 16, which is coming this fall.
Fortune
Apple slips from the top 5 in China, as domestic brands take all the top slots for the first quarter in history
Vivo is China's top smartphone seller by shipments, with Huawei, HONOR, Oppo and Xiaomi rounding out the top five, reports Canalys and IDC research.
USA TODAY
Get an Apple AirTag tracking device for the lowest price we've seen in months
Keep a watchful eye on your keys, wallet, luggage, and more with an Apple AirTag. Get the tracker on sale at Amazon for just $24, the lowest price we've seen in months.
Reuters
Epic Games says Fortnite returning to iOS in EU, leaving Samsung app store
Epic has been attempting to expand the distribution of its games beyond smartphone companies' official app stores, opposing steep commissions on in-app payments and users being limited to downloading applications through dedicated stores. The company also said its videogames will be leaving the Samsung Galaxy Store in protest of the phone maker's decision to block default side-loading - the installation of applications on a mobile device without using its dedicated app store - on Android devices, calling it "anticompetitive". Along the same lines, Epic said its mobile games will come to AltStore on iOS in the EU.
Bloomberg
Apple to Adopt Voluntary AI Safeguards Established by Biden
(Bloomberg) -- Apple Inc. is the latest company to agree to a set of voluntary safeguards for artificial intelligence crafted by President Joe Biden’s administration as it tries to guide the development of the emerging technology and encourage firms to protect consumers. Most Read from BloombergTrump Risks Losing Voters He Needs With Loaded Attacks on HarrisParis Sticks to Olympics Opening Event Plans After Rail SabotageFed’s Favored Price Gauge Rises at Mild Pace, Spending Holds UpHarris Just S
The Daily Beast
FBI Is Not Fully Convinced Trump Was Struck by a Bullet
FBI Director Christopher Wray revealed during a marathon testimony on Wednesday that investigators still do not know if former President Donald Trump was grazed by a bullet or a piece of shrapnel during his attempted assassination.Twice during the hours-long session, Wray told lawmakers that the FBI was still working to determine what exactly struck the former president on his right ear during a rally in Butler, Pennsylvania. “My understanding is that either it [a bullet] or some shrapnel is wha
Good Housekeeping
Céline Dion Fans Won't Believe How Much She’s Getting Paid by the Olympics
Céline Dion and Lady Gaga are performing a duet at the 2024 Paris Olympics opening ceremony. Here's how much they are reportedly being paid for one song.
The Daily Beast
Donald Trump Seen in Public Without Ear Bandage
Donald Trump ditched his ear bandage for his meeting with Israeli Prime Minister Benjamin Netanyahu on Friday. The former president’s right ear returned to public life after being injured during the assassination attempt on the former president on July 13.The former president’s large bandage became an impromptu fashion statement during the Republican National Convention with some attendees donning DIY wound dressings. Following the convention, Trump swapped out his bulky white gauze for a thin n
BuzzFeed
Kamala Harris' Press Release About Donald Trump's Fox News Appearance Is Going Viral
"Something about the question mark after 'old and quite weird' is taking me out."
Miami Herald
Ana Navarro just posted a racy throwback pic of Melania — and the Internet has opinions
The GQ spread appeared in 2000
BuzzFeed
A Bunch Of Trump Supporters' Cars Were Towed From A Dunkin' Parking Lot, And The Towing Company Name Is Unintentionally Hilarious
Yeah, this is why I'd never mess with a manager of a Dunkin'.
HuffPost
Stephen Colbert Taunts Trump With Absolutely Brutal Reminder About Melania
The "Late Show" host mocked the former president over one curious claim.
The Daily Beast
Harris Campaign Trolls ‘78-Year-Old Criminal’ Donald Trump After Fox News Appearance
Kamala Harris’ campaign trolled Donald Trump after his appearance on Fox News Thursday morning with a statement attacking his age and criminal conviction.The Republican gave his two-cents to Fox & Friends on a range of issues over the course of a roughly 30-minute interview, variously describing President Joe Biden as a “problemmed man” and slamming Harris as “real garbage.” Harris for President quickly hit back, releasing a: “Statement on a 78-Year-Old Criminal’s Fox News Appearance.”“After wat

This $50 Amazon smartwatch does 'everything' a name-brand watch can do — trust me, I tried it

AI21 Labs' new AI model can handle more context than most

Latest Stories

Apple falls: iPhone maker out of China's top 5 as Huawei ascends

Apple's China smartphone shipments drop 6.7% as Huawei surges, data shows

A neurological disorder stole her voice. Jennifer Wexton takes it back on the House floor.

The Morning After: OpenAI reveals its AI-powered search engine, SearchGPT

Chinese AI start-up Baichuan raises US$700 million from Alibaba, Tencent, Xiaomi

OpenAI unveils SearchGPT, an AI-powered search engine

£7.7 million bounty offered in hunt for members of North Korea-backed hacking group

Apple’s AI iPhone Could Take the Stock This High

Apple slips from the top 5 in China, as domestic brands take all the top slots for the first quarter in history

Get an Apple AirTag tracking device for the lowest price we've seen in months

Epic Games says Fortnite returning to iOS in EU, leaving Samsung app store

Apple to Adopt Voluntary AI Safeguards Established by Biden

FBI Is Not Fully Convinced Trump Was Struck by a Bullet

Céline Dion Fans Won't Believe How Much She’s Getting Paid by the Olympics

Donald Trump Seen in Public Without Ear Bandage

Kamala Harris' Press Release About Donald Trump's Fox News Appearance Is Going Viral

Ana Navarro just posted a racy throwback pic of Melania — and the Internet has opinions

A Bunch Of Trump Supporters' Cars Were Towed From A Dunkin' Parking Lot, And The Towing Company Name Is Unintentionally Hilarious

Stephen Colbert Taunts Trump With Absolutely Brutal Reminder About Melania

Harris Campaign Trolls ‘78-Year-Old Criminal’ Donald Trump After Fox News Appearance