HPC / Scientific Computing

Jul 12, 2024

Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities

First introduced in 2019, NVIDIA Megatron-LM sparked a wave of innovation in the AI community, enabling researchers and developers to use the underpinnings of...

11 MIN READ

Jul 09, 2024

Just Released: nvmath-python

nvmath-python is an open-source Python library that provides high performance access to the core mathematical operations in the NVIDIA Math Libraries. Available...

1 MIN READ

Jul 09, 2024

Building Cyber Language Models to Unlock New Cybersecurity Capabilities

General-purpose large language models (LLMs) have proven their usefulness across various fields, offering substantial benefits in applications ranging from text...

13 MIN READ

Two b&w images of a woman in a hat, one image in a higher resolution.

Jul 05, 2024

Explainer: What Is K-Means?

K-means is a clustering algorithm—one of the simplest and most popular unsupervised machine learning (ML) algorithms for data scientists.

1 MIN READ

Jul 03, 2024

Maximize GPU performance with Near-Real-Time Usage Stats on NVDashboard v0.10

At NVIDIA GTC 2024, the RAPIDS team demonstrated new features on NVDashboard v0.10 a dashboard that runs on JupyterLab, for monitoring GPU usage to help...

6 MIN READ

Jul 03, 2024

Just Released: cuDSS 0.3.0

cuDSS (Preview) is an accelerated direct sparse solver. It now supports multi-GPU multi-node platforms, and introduces a hybrid memory mode.

1 MIN READ

Jul 02, 2024

Checkpointing CUDA Applications with CRIU

Checkpoint and restore functionality for CUDA is exposed through a command-line utility called cuda-checkpoint. This utility can be used to transparently...

7 MIN READ

Jul 01, 2024

How Cutting-Edge Computer Chips are Speeding Up the AI Revolution

Featured in Nature, this post delves into how GPUs and other advanced technologies are meeting the computational challenges posed by AI.

1 MIN READ

Abstract image with three different illustrations representing HPC applications.

Jun 28, 2024

Explainer: What Is High-Performance Computing?

High-performance computing (HPC) is the art and science of using groups of cutting-edge computer systems to perform complex simulations, computations, and data...

1 MIN READ

Decorative image of light fields in green, purple, and blue.

Jun 18, 2024

Runtime Fatbin Creation Using the NVIDIA CUDA Toolkit 12.4 Compiler

CUDA Toolkit 12.4 introduced a new nvFatbin library for creating fatbins at runtime. Fatbins, otherwise known as NVIDIA device code fat binaries, are containers...

11 MIN READ

Jun 12, 2024

Introducing Grouped GEMM APIs in cuBLAS and More Performance Updates

The latest release of NVIDIA cuBLAS library, version 12.5, continues to deliver functionality and performance to deep learning (DL) and high-performance...

7 MIN READ

Jun 12, 2024

NVIDIA Sets New Generative AI Performance and Scale Records in MLPerf Training v4.0

Generative AI models have a variety of uses, such as helping write computer code, crafting stories, composing music, generating images, producing videos, and...

11 MIN READ

Decorative image of TensorRT workflow on a black background.

Jun 11, 2024

Maximum Performance and Minimum Footprint for AI Apps with NVIDIA TensorRT Weight-Stripped Engines

NVIDIA TensorRT, an established inference library for data centers, has rapidly emerged as a desirable inference backend for NVIDIA GeForce RTX and NVIDIA RTX...

8 MIN READ

Jun 10, 2024

Spotlight: Cisco Enhances Workload Security and Operational Efficiency with NVIDIA BlueField-3 DPUs

As cyberattacks become more sophisticated, organizations must constantly adapt with cutting-edge solutions to protect their critical assets. One such solution...

7 MIN READ

Jun 07, 2024

Seamlessly Deploying a Swarm of LoRA Adapters with NVIDIA NIM

The latest state-of-the-art foundation large language models (LLMs) have billions of parameters and are pretrained on trillions of tokens of input text. They...

11 MIN READ

Decorative image of green icons on a black screen behind IGX hardware.

Jun 02, 2024

Production-Ready, Enterprise-Grade Software on NVIDIA IGX Platform, Support for NVIDIA RTX 6000 ADA, and More

Real-time AI at the edge is crucial for medical, industrial, and scientific computing because these mission-critical applications require immediate data...

6 MIN READ