NVIDIA DGX-2 increases deep learning performance 10x in six months

March 27, 2018 Brandon Lewis

SAN JOSE, CA. NVIDIA has announced that its deep learning compute platform, the DGX-2, provides 10x the performance compared with the previous generation, which was released six months ago. The DGX-2 is capable of delivering two petaflops of computational power for executing deep learning workloads in a single server, supported by a 2x memory boost in NVIDIA Tesla V100 GPUs and the NVIDIA NVSwitch GPU interconnect fabric, which allows DGX-2 GPUs to communicate at up to 2.4 terabytes per second.

The DGX-2 can be outfitted with up to 16, 32 GB Tesla V100 GPUs that share a common memory space for high-performance computing (HPC) applications, while NVSwitch extends NVIDIA’s NVLink offering to deliver 5x the bandwidth of leading PCIe switches. The result is a platform with the processing capabilities of 300 servers, but 60x smaller and 18x more power efficient.

The DGX-2 is also supported by updates the NVIDIA’s deep learning and HPC software stack, including new versions of CUDA, TensorRT, NCCL, cuDNN, and a new Isaac software developer kit. These updates are available free of charge for members of the NVIDIA developer community.

For more information, including technical details, visit nvda.ws/2IRilLe.

eletter-03-27-2018

About the Author

Brandon Lewis

Brandon Lewis, Editor-in-Chief of Embedded Computing Design, is responsible for guiding the property's content strategy, editorial direction, and engineering community engagement, which includes IoT Design, Automotive Embedded Systems, the Power Page, Industrial AI & Machine Learning, and other publications. As an experienced technical journalist, editor, and reporter with an aptitude for identifying key technologies, products, and market trends in the embedded technology sector, he enjoys covering topics that range from development kits and tools to cyber security and technology business models. Brandon received a BA in English Literature from Arizona State University, where he graduated cum laude. He can be reached by email at brandon.lewis@opensysmedia.com.

Follow on Twitter Follow on Linkedin Visit Website More Content by Brandon Lewis
Previous Article
Renesas 28 nm automotive control MCU integrates 16 MB on-chip flash, up to six CPU cores
Renesas 28 nm automotive control MCU integrates 16 MB on-chip flash, up to six CPU cores

RH850/E2x MCUs incorporate 16 MB of built-in flash, up to six 400 MHz CPU cores, and deliver up to 9600 MIPS.

Next Article
Renesas Electronics unveils on-chip Flash memory MCU featuring 28nm embedded Flash technology

RH850/E2x Series is the New-Generation Automotive Control Microcontroller that Extends the Entire Renesas a...