NVIDIA AI Foundry Builds Custom Llama 3.1 Generative AI Models

NVIDIA AI Foundry offers a generative AI model service spanning curation, synthetic data generation, and evaluation to deploy custom Llama 3.1 NVIDIA NIM microservices with NVIDIA NeMo Retriever microservices.

By DE Editors

July 23, 2024

NVIDIA has announced a new NVIDIA AI Foundry service and NVIDIA NIM inference microservices to advance generative artificial intelligence for enterprises with the Llama 3.1 collection of openly available models.

With NVIDIA AI Foundry, enterprises and nations can now create custom “supermodels” for their domain-specific industry use cases using Llama 3.1 and NVIDIA software, computing and expertise. Enterprises can train these supermodels with proprietary data as well as synthetic data generated from Llama 3.1 405B and the NVIDIA Nemotron Reward model.

NVIDIA AI Foundry is powered by the NVIDIA DGX Cloud AI platform, which is co-engineered with the leading public clouds, to give enterprises compute resources that scale as AI demands change.

The new offerings come at a time when enterprises, as well as nations developing sovereign AI strategies, want to build custom large language models with domain-specific knowledge for generative AI applications that reflect their business or culture.

“Meta’s openly available Llama 3.1 models mark a pivotal moment for the adoption of generative AI within the world’s enterprises,” says Jensen Huang, founder and CEO of NVIDIA. “Llama 3.1 opens the floodgates for every enterprise and industry to build state-of-the-art generative AI applications. NVIDIA AI Foundry has integrated Llama 3.1 throughout and is ready to help enterprises build and deploy custom Llama supermodels.”

“The new Llama 3.1 models are a super-important step for open source AI,” says Mark Zuckerberg, founder and CEO of Meta. “With NVIDIA AI Foundry, companies can easily create and customize the state-of-the-art AI services people want and deploy them with NVIDIA NIM.”

To supercharge enterprise deployments of Llama 3.1 models for production AI, NVIDIA NIM inference microservices for Llama 3.1 models are now available for download from ai.nvidia.com.

Enterprises can pair Llama 3.1 NIM microservices with new NVIDIA NeMo Retriever NIM microservices to create retrieval pipelines for AI copilots, assistants and digital human avatars.

NVIDIA AI Foundry provides an end-to-end service for quickly building custom supermodels. It combines NVIDIA software, infrastructure and expertise with open community models, technology and support from the NVIDIA AI ecosystem.

With NVIDIA AI Foundry, enterprises can create custom models using Llama 3.1 models and the NVIDIA NeMo platform—including the NVIDIA Nemotron-4 340B Reward model.

Once custom models are created, enterprises can create NVIDIA NIM inference microservices to run them in production using their preferred MLOps and AIOps platforms on their preferred cloud platforms and NVIDIA-Certified Systems from global server manufacturers.

NVIDIA AI Enterprise experts and global system integrator partners work with AI Foundry customers to accelerate the entire process, from development to deployment.

Sources: Press materials received from the company and additional information gleaned from the company’s website.

About DE Editors

DE's editors contribute news and new product announcements to Digital Engineering. Press releases may be sent to them via [email protected].
