HPC / Scientific Computing
Jul 12, 2024
Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities
First introduced in 2019, NVIDIA Megatron-LM sparked a wave of innovation in the AI community, enabling researchers and developers to use the underpinnings of...
11 MIN READ
Jul 09, 2024
Just Released: nvmath-python
nvmath-python is an open-source Python library that provides high performance access to the core mathematical operations in the NVIDIA Math Libraries. Available...
1 MIN READ
Jul 09, 2024
Building Cyber Language Models to Unlock New Cybersecurity Capabilities
General-purpose large language models (LLMs) have proven their usefulness across various fields, offering substantial benefits in applications ranging from text...
13 MIN READ
Jul 05, 2024
Explainer: What Is K-Means?
K-means is a clustering algorithm—one of the simplest and most popular unsupervised machine learning (ML) algorithms for data scientists.
1 MIN READ
Jul 03, 2024
Maximize GPU performance with Near-Real-Time Usage Stats on NVDashboard v0.10
At NVIDIA GTC 2024, the RAPIDS team demonstrated new features on NVDashboard v0.10 a dashboard that runs on JupyterLab, for monitoring GPU usage to help...
6 MIN READ
Jul 03, 2024
Just Released: cuDSS 0.3.0
cuDSS (Preview) is an accelerated direct sparse solver. It now supports multi-GPU multi-node platforms, and introduces a hybrid memory mode.
1 MIN READ
Jul 02, 2024
Checkpointing CUDA Applications with CRIU
Checkpoint and restore functionality for CUDA is exposed through a command-line utility called cuda-checkpoint. This utility can be used to transparently...
7 MIN READ
Jul 01, 2024
How Cutting-Edge Computer Chips are Speeding Up the AI Revolution
Featured in Nature, this post delves into how GPUs and other advanced technologies are meeting the computational challenges posed by AI.
1 MIN READ
Jun 28, 2024
Explainer: What Is High-Performance Computing?
High-performance computing (HPC) is the art and science of using groups of cutting-edge computer systems to perform complex simulations, computations, and data...
1 MIN READ
Jun 18, 2024
Runtime Fatbin Creation Using the NVIDIA CUDA Toolkit 12.4 Compiler
CUDA Toolkit 12.4 introduced a new nvFatbin library for creating fatbins at runtime. Fatbins, otherwise known as NVIDIA device code fat binaries, are containers...
11 MIN READ
Jun 12, 2024
Introducing Grouped GEMM APIs in cuBLAS and More Performance Updates
The latest release of NVIDIA cuBLAS library, version 12.5, continues to deliver functionality and performance to deep learning (DL) and high-performance...
7 MIN READ
Jun 12, 2024
NVIDIA Sets New Generative AI Performance and Scale Records in MLPerf Training v4.0
Generative AI models have a variety of uses, such as helping write computer code, crafting stories, composing music, generating images, producing videos, and...
11 MIN READ
Jun 11, 2024
Maximum Performance and Minimum Footprint for AI Apps with NVIDIA TensorRT Weight-Stripped Engines
NVIDIA TensorRT, an established inference library for data centers, has rapidly emerged as a desirable inference backend for NVIDIA GeForce RTX and NVIDIA RTX...
8 MIN READ
Jun 10, 2024
Spotlight: Cisco Enhances Workload Security and Operational Efficiency with NVIDIA BlueField-3 DPUs
As cyberattacks become more sophisticated, organizations must constantly adapt with cutting-edge solutions to protect their critical assets. One such solution...
7 MIN READ
Jun 07, 2024
Seamlessly Deploying a Swarm of LoRA Adapters with NVIDIA NIM
The latest state-of-the-art foundation large language models (LLMs) have billions of parameters and are pretrained on trillions of tokens of input text. They...
11 MIN READ
Jun 02, 2024
Production-Ready, Enterprise-Grade Software on NVIDIA IGX Platform, Support for NVIDIA RTX 6000 ADA, and More
Real-time AI at the edge is crucial for medical, industrial, and scientific computing because these mission-critical applications require immediate data...
6 MIN READ