Nvidia launches NIM to make it smoother to deploy AI models into production

Frederic Lardinois

Updated March 18, 2024 at 7:18 p.m.·2 min read

At its GTC conference, Nvidia today announced Nvidia NIM, a new software platform designed to streamline the deployment of custom and pre-trained AI models into production environments. NIM takes the software work Nvidia has done around inferencing and optimizing models and makes it easily accessible by combining a given model with an optimized inferencing engine and then packing this into a container, making that accessible as a microservice.

Typically, it would take developers weeks -- if not months -- to ship similar containers, Nvidia argues -- and that is if the company even has any in-house AI talent. With NIM, Nvidia clearly aims to create an ecosystem of AI-ready containers that use its hardware as the foundational layer with these curated microservices as the core software layer for companies that want to speed up their AI roadmap.

NIM currently includes support for models from NVIDIA, A121, Adept, Cohere, Getty Images, and Shutterstock as well as open models from Google, Hugging Face, Meta, Microsoft, Mistral AI and Stability AI. Nvidia is already working with Amazon, Google and Microsoft to make these NIM microservices available on SageMaker, Kubernetes Engine and Azure AI, respectively. They'll also be integrated into frameworks like Deepset, LangChain and LlamaIndex.

Image Credits: Nvidia

"We believe that the Nvidia GPU is the best place to run inference of these models on [...], and we believe that NVIDIA NIM is the best software package, the best runtime, for developers to build on top of so that they can focus on the enterprise applications -- and just let Nvidia do the work to produce these models for them in the most efficient, enterprise-grade manner, so that they can just do the rest of their work," said Manuvir Das, the head of enterprise computing at Nvidia, during a press conference ahead of today's announcements."

As for the inference engine, Nvidia will use the Triton Inference Server, TensorRT and TensorRT-LLM. Some of the Nvidia microservices available through NIM will include Riva for customizing speech and translation models, cuOpt for routing optimizations and the Earth-2 model for weather and climate simulations.

The company plans to add additional capabilities over time, including, for example, making the Nvidia RAG LLM operator available as a NIM, which promises to make building generative AI chatbots that can pull in custom data a lot easier.

This wouldn't be a developer conference without a few customer and partner announcements. Among NIM's current users are the likes of Box, Cloudera, Cohesity, Datastax, Dropbox
and NetApp.

“Established enterprise platforms are sitting on a goldmine of data that can be transformed into generative AI copilots,” said Jensen Huang, founder and CEO of NVIDIA. “Created with our partner ecosystem, these containerized AI microservices are the building blocks for enterprises in every industry to become AI companies.”

BBC
Americans and Chinese share jokes on 'alternative TikTok' as US ban looms
RedNote's Chinese users say it is the first time they have been able to speak directly to Americans online.
Reuters
TikTok says it will go dark Sunday in US without assurance from Biden
WASHINGTON (Reuters) -TikTok warned late Friday it will go dark in the United States on Sunday unless President Joe Biden's administration provides assurances to companies like Apple and Google that they will not face enforcement actions when a ban takes effect. The statement came hours after the Supreme Court upheld a law banning TikTok in the United States on national security grounds if its Chinese parent company ByteDance does not sell it, putting the popular short-video app on track to go dark in just two days. The court's 9-0 decision throws the social media platform - and its 170 million American users - into limbo, and its fate in the hands of Donald Trump, who has vowed to rescue TikTok after returning to the presidency on Monday.
Yahoo Canada Style
I bought a budget TCL TV from Amazon (it's on sale for $300!) — I'm so impressed that I'm going to buy another
I'm terrible with technology, but I found it easy to set up and was able to stream Netflix 10 minutes after unboxing it.
Engadget
The Nintendo Switch 2 has been revealed, here's everything we know so far
The Nintendo Switch 2 is coming in 2025. Here are all of the confirmed details, rumors and speculation regarding the upcoming console.
Engadget
Samsung Galaxy S25 Unpacked 2025 event: What to expect on January 22
The Galaxy S25 series will be unveiled on January 22. A Galaxy Ring 2 and Android XR devices are less likely — but could make cameos.
The Canadian Press
How TikTok grew from a fun app for teens into a potential national security threat
SAN FRANCISCO (AP) — If it feels like TikTok has been around forever, that's probably because it has, at least if you're measuring via internet time. What's now in question is whether it will be around much longer and, if so, in what form?
The Canadian Press
TikTok refugees are pouring to Xiaohongshu. Here's what you need to know about the RedNote app
WASHINGTON (AP) — As the fate of TikTok hangs in the balance, U.S. TikTok users are flocking to the Chinese social media app Xiaohongshu, also called RedNote – making it the top downloaded app in the U.S.
South China Morning Post
US 'TikTok refugees' spark global rush of sign-ups to China's RedNote platform
In yet another consequence of the impending US ban on short-video app TikTok, rival Chinese platform RedNote is seeing a spike in users based in other countries, mirroring the migration of American self-styled "TikTok refugees". The app - also known as Xiaohongshu, which means "little red book" - was ranked as the No. 1 free-to-use platform in Britain and Canada, as well as EU countries like Ireland and Italy, by the Apple Store on Thursday. The exact number of downloads is unknown, but the app
GuruFocus.com
Apple's $250 Price Target: The iPhone 17 Bet That Could Send Shares Soaring
Evercore sees Apple's next-gen iPhone, emerging market growth, and a resilient China as key catalysts for upside.
Futurism
Apple Halts Disastrous AI System That Was Making Up Fake News Stories and Pushing Them to iPhone Users
Apple has temporarily halted its disastrous "Apple Intelligence" feature which consistently bungled up its one task of summarizing breaking news alerts. An upcoming iOS 18.3 update will disable the summaries for news and entertainment apps, as the Washington Post's Geoffrey Fowler reports. For over a month, the company's feature has been consistently generating lies, pushing them to millions of users. Apple's admission that its feature has failed is rare for the iPhone maker. Earlier this week,
Simply Wall St.
AI Chips Today - Alchip Technologies Leads with Advanced 3DIC Design Services
Alchip Technologies has unveiled its advanced 3DIC design services tailored for AI and high-performance computing (HPC) applications. This innovative semiconductor technology involves stacking multiple integrated circuits vertically, using through-silicon vias and hybrid bonding to enhance data transfer speeds, reduce power consumption, and shrink the physical footprint compared to traditional designs. The company's design flow optimizes power delivery, die-to-die electrical interconnect, and...
The Canadian Press
The rise - and potential fall - of TikTok in the US
The possibility of the U.S. outlawing TikTok kept influencers and users in anxious limbo during the four-plus years that lawmakers and judges debated the fate of the video-sharing app. Now, the moment its fans dreaded is here, but uncertainty over TikTok’s future lingers.
Reuters Videos
What can U.S. TikTok users expect on Sunday?
STORY: :: A U.S. ban on TikTok is set to go into effect on Sunday,so what can American users of the app expect to happen?:: Stephanie Kelly, Reuters"It's not entirely clear..."One Biden administration official told NBC that Americans shouldn't expect that the app suddenly be banned on Sunday and that the administration is weighing options to make the app available to users beyond Sunday."But sources say that users attempting to open the app will be redirected to a website with information about the ban. And one TikTok lawyer told the Supreme Court last week that the app essentially goes dark."This all started back in April with a U.S. law that mandated that ByteDance, which is TikTok's Chinese parent company, either divest from the app or be faced with the ban that goes into effect January 19. Now, January 19 is also the day before President-elect Donald Trump is sworn into office. Trump reportedly is considering ways to delay the ban by 60 or 90 days, but it's not legally clear if he's able to do so."
Engadget
Prime members can now get $50 off the Kindle Colorsoft
The Amazon Kindle Colorsoft just got its first discount, but you'll need to be a Prime member to take advantage of the deal.
United Press International
Clock ticking on TikTok: Platform to go dark in U.S. unless Biden intervenes
TikTok plans to cease operations in the United States on Sunday unless President Joe Biden intervenes before he leaves office one day later.
Engadget
The Anker Prime battery with a charging base is 40 percent off, plus the rest of this week's best tech deals
The best discounts we found this week include deals on MacBooks, iPads, Anker chargers, Kindles and more.
PA Media: Money
Q&A: What does the future hold for TikTok?
The app’s future in the US is uncertain – what could that mean for users around the world.
HuffPost
I Went To A Nudist Swingers Resort Without My Girlfriend. Here's What Happened.
"What awaits a monogamous lesbian on vacation by herself with mostly heterosexual couples looking to play? A lot of fun."
BuzzFeed
"Barack Can’t Convince His Wife She Has To Do Her Duty?": The Internet Is Firing Back After A Fox News Host Questioned Why Michelle Obama Won't Attend Trump's Inauguration
"Michelle Obama doesn't have to do a thing except remain true to Michelle Obama."
HuffPost
Trump's New Official Portrait Tells Quite The Story. Body Language Experts Explain Why.
The president-elect's new eyebrow-raising photo spurred a lot of conversations online. Experts think his expression and pose reveal a lot.

30+ best Amazon Canada deals to score this weekend, starting at $10 — snow shovels, cleaning devices & more

Nvidia launches NIM to make it smoother to deploy AI models into production

Latest Stories

Americans and Chinese share jokes on 'alternative TikTok' as US ban looms

TikTok says it will go dark Sunday in US without assurance from Biden

I bought a budget TCL TV from Amazon (it's on sale for $300!) — I'm so impressed that I'm going to buy another

The Nintendo Switch 2 has been revealed, here's everything we know so far

Samsung Galaxy S25 Unpacked 2025 event: What to expect on January 22

How TikTok grew from a fun app for teens into a potential national security threat

TikTok refugees are pouring to Xiaohongshu. Here's what you need to know about the RedNote app

US 'TikTok refugees' spark global rush of sign-ups to China's RedNote platform

Apple's $250 Price Target: The iPhone 17 Bet That Could Send Shares Soaring

Apple Halts Disastrous AI System That Was Making Up Fake News Stories and Pushing Them to iPhone Users

AI Chips Today - Alchip Technologies Leads with Advanced 3DIC Design Services

The rise - and potential fall - of TikTok in the US

What can U.S. TikTok users expect on Sunday?

Prime members can now get $50 off the Kindle Colorsoft

Clock ticking on TikTok: Platform to go dark in U.S. unless Biden intervenes

The Anker Prime battery with a charging base is 40 percent off, plus the rest of this week's best tech deals

Q&A: What does the future hold for TikTok?

I Went To A Nudist Swingers Resort Without My Girlfriend. Here's What Happened.

"Barack Can’t Convince His Wife She Has To Do Her Duty?": The Internet Is Firing Back After A Fox News Host Questioned Why Michelle Obama Won't Attend Trump's Inauguration

Trump's New Official Portrait Tells Quite The Story. Body Language Experts Explain Why.