Digital Engineering 24/7

Helping design and engineering professionals discover, evaluate and specify technologies and processes that shorten the design cycle and enable success.

NVIDIA Unveils AI Platform to Minimize Downtime in Supercomputing Data Centers

NVIDIA Mellanox UFM cyber-AI platform detects security threats, predicts network failures.

Latest Engineering Computing News

Latest Engineering Computing Resources

By DE Editors  

June 22, 2020

NVIDIA unveiled the NVIDIA Mellanox UFM Cyber-AI platform, which minimizes downtime in InfiniBand data centers by harnessing AI-powered analytics to detect security threats and operational issues, as well as predict network failures.

This extension of the UFM platform product portfolio applies AI to learn a data center’s operational cadence and network workload patterns, drawing on real-time and historic telemetry and workload data. Against this baseline, it tracks the system’s health and network modifications, and detects performance degradations, usage and profile changes.

The new platform provides alerts of abnormal system and application behavior, and potential system failures and threats, as well as performs corrective actions. It is also targeted to deliver security alerts in cases of attempted system hacking to host undesired applications, such as cryptocurrency mining. 

“The UFM Cyber-AI platform determines a data center’s unique vital signs and uses them to identify performance degradation, component failures and abnormal usage patterns,” says Gilad Shainer, senior vice president of marketing for Mellanox networking at NVIDIA. “It allows system administrators to quickly detect and respond to potential security threats and address upcoming failures, saving cost and ensuring consistent service to customers.”

Ecosystem Support

Organizations that have long been employing the UFM platform in their data centers have expressed strong interest in the latest offering.

“NCI plays a pivotal role in the national research landscape," says Allan Williams, associate director of services and technology at the National Computational Infrastructure (NCI Australia). "Our supercomputing infrastructure serves 5,000 researchers who use it for critical national and global activities. UFM enables us to effectively manage our supercomputers and to optimize performance.”

“We have been using the UFM platform for years in our InfiniBand data centers," says Douglas Johnson, associate director of the Ohio Supercomputer Center. "UFM and the expertise from the Mellanox networking team have been fundamental ingredients in the management of our network and the stability we’ve achieved.”

Extending UFM Platform

The UFM Cyber-AI platform complements the UFM Enterprise platform, which provides network monitoring, management, performance optimization, configuration checks and secure cable management.

NVIDIA also added today a third member of the UFM family, the UFM Telemetry platform. This tool captures real-time network telemetry data, which is streamed to an on-premises or cloud-based database to monitor network performance and validate the network configuration.

Supporting Resources

Learn more about the UFM Appliance product line.

Learn more about NVIDIA Mellanox Quantum HDR 200Gb/s InfiniBand Smart Switches.

Learn more about NVIDIA Mellanox ConnectX®-6 HDR 200Gb/s InfiniBand adapters.

Sources: Press materials received from the company and additional information gleaned from the company’s website.

 

More about NVIDIA

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and…

Cut Retrieval-Augmented Generation (RAG) Hallucinations by 50%

Most teams hit the same wall with enterprise AI: LLMs that hallucinate, pipelines that don’t scale, and infrastructure that’s harder to design than the models themselves.

Latest in NVIDIA

Latest in NVIDIA

About DE Editors

DE Editors

DE's editors contribute news and new product announcements to Digital Engineering. Press releases may be sent to them via [email protected].

Follow DE
on Facebook
on Linkedin

Related Topics

Engineering Computing   Products   Artificial Intelligence AI   Data Centers   NVIDIA   Supercomputing   All topics
 

Subscribe

Subscribe to our FREE magazine, FREE email newsletters or both!

Join over 90,000 engineering professionals who get fresh engineering news as soon as it is published.

Subscribe today

 
 

From our Sponsors

Meltio Takes Metal Additive to the Next Level
Meltio's DED technology enables industries to tailor and customize their solutions to create & repair metal parts.
Easing the Transition from ETO to CTO with Configuration Lifecycle Management
Manufacturers are discovering that the Configure-to-Order (CTO) model provides significant benefits when it comes to customization.
Siemens + Altair = The Next Chapter in Design and Simulation
With its acquisition of Altair, Siemens creates a unified simulation portfolio combining generative design with high-performance computing and AI workflows.