Benchmark
![Illustration showing models and NeMo.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/07/llm-megatron-core-blog-2967200-1920x1080-1-960x540.jpg)
Jul 17, 2024
NVIDIA NeMo Accelerates LLM Innovation with Hybrid State Space Model Support
Today’s large language models (LLMs) are based on the transformer model architecture introduced in 2017. Since then, rapid advances in AI compute performance...
7 MIN READ
![Decorative image of rows of GPUs.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/06/hpc-promo-mlperf-featured-960x540.jpg)
Jun 12, 2024
NVIDIA Sets New Generative AI Performance and Scale Records in MLPerf Training v4.0
Generative AI models have a variety of uses, such as helping write computer code, crafting stories, composing music, generating images, producing videos, and...
11 MIN READ
![An illustration representing an embedding model.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/06/NV-Embed-MTEB-Record-960x540.png)
Jun 10, 2024
NVIDIA Text Embedding Model Tops MTEB Leaderboard
The latest embedding model from NVIDIA—NV-Embed—set a new record for embedding accuracy with a score of 69.32 on the Massive Text Embedding Benchmark...
6 MIN READ
![Three reflective green spheres hovering above three white platforms on a neutral background.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/04/nvidia-rtx-virtual-workstation-windows-365-pcs-featured-960x540.png)
Apr 29, 2024
GPU-Powered Windows 365 Cloud PCs with NVIDIA RTX Virtual Workstation for High-End Graphics Workloads
Professional workflows have become more complex with the increased demand for graphics-intensive scenarios. From regular office applications to demanding...
7 MIN READ
![Decorative image of graphs as light web.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/03/wholegraph-storage-featured-960x540.png)
Apr 03, 2024
Optimizing Memory and Retrieval for Graph Neural Networks with WholeGraph, Part 2
Large-scale graph neural network (GNN) training presents formidable challenges, particularly concerning the scale and complexity of graph data. These challenges...
5 MIN READ
![An image of an NVIDIA H200 Tensor Core GPU.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/03/H200-Tensor-Core-GPU-e1711062873243-960x540.jpg)
Mar 27, 2024
NVIDIA H200 Tensor Core GPUs and NVIDIA TensorRT-LLM Set MLPerf LLM Inference Records
Generative AI is unlocking new computing applications that greatly augment human capability, enabled by continued model innovation. Generative AI...
11 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/03/graphic-field-1-960x540.jpg)
Mar 20, 2024
Record-Breaking NVIDIA cuOpt Algorithms Deliver Route Optimization Solutions 100x Faster
NVIDIA cuOpt is an accelerated optimization engine for solving complex routing problems. It efficiently solves problems with different aspects such as breaks,...
13 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/03/speech-ai-composite-graphic-960x540.png)
Mar 19, 2024
NVIDIA Speech and Translation AI Models Set Records for Speed and Accuracy
Speech and translation AI models developed at NVIDIA are pushing the boundaries of performance and innovation. The NVIDIA Parakeet automatic speech recognition...
8 MIN READ
![Decorative image of a computer screen against a purple background, with a dial on the side.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2023/11/rapids-ai-day-announcement-press-1920x1080-1-960x540.jpg)
Mar 18, 2024
RAPIDS cuDF Accelerates pandas Nearly 150x with Zero Code Changes
At NVIDIA GTC 2024, it was announced that RAPIDS cuDF can now bring GPU acceleration to 9.5 million pandas users without requiring them to change their code....
5 MIN READ
![Decorative image of a RAG pipeline against a black background.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/03/networking-rag-featured-960x540.png)
Mar 18, 2024
Scaling Enterprise RAG with Accelerated Ethernet Networking and Networked Storage
In the era of generative AI, where machines are not just learning from data but generating human-like text, images, video, and more, retrieval-augmented...
10 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/02/image-panels-with-multicolor-cat-960x540.png)
Feb 22, 2024
Benchmarking NVIDIA Spectrum-X for AI Network Performance, Now Available from Supermicro
NVIDIA Spectrum-X is swiftly gaining traction as the leading networking platform tailored for AI in hyperscale cloud infrastructures. Spectrum-X networking...
6 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2023/11/nvidia-grace-hopper-960x540.png)
Dec 18, 2023
Deploying Retrieval-Augmented Generation Applications on NVIDIA GH200 Delivers Accelerated Performance
Large language model (LLM) applications are essential in enhancing productivity across industries through natural language. However, their effectiveness is...
10 MIN READ
![An illustration of the NVIDIA H100.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2023/12/Top-inference-performance-H100-960x540.png)
Dec 14, 2023
Achieving Top Inference Performance with the NVIDIA H100 Tensor Core GPU and NVIDIA TensorRT-LLM
Best-in-class AI performance requires an efficient parallel computing architecture, a productive tool stack, and deeply optimized algorithms. NVIDIA released...
4 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2023/12/Oracle-NV-STAC-A2-1.png)
Dec 12, 2023
Oracle Cloud Infrastructure Sets Quantitative Financial HPC Calculations Record with NVIDIA GPUs
NVIDIA A100 Tensor Core GPUs were featured in a stack that set several records in a recent STAC-A2™ benchmark standard based on financial market risk...
1 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2023/12/bmw-cars-1-960x540.jpg)
Dec 12, 2023
Benchmarking Quantum Computing Applications with BMW Group and NVIDIA cuQuantum
Quantum computing has the potential to revolutionize various aspects of industry, ranging from numerical simulations and optimization of complex systems to...
5 MIN READ
![An illustration showing the steps "LLM" then "Optimize" then "Deploy."](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2023/12/TensorRT-LLM-Enhancements--960x540.png)
Dec 04, 2023
NVIDIA TensorRT-LLM Enhancements Deliver Massive Large Language Model Speedups on NVIDIA H200
Large language models (LLMs) have seen dramatic growth over the last year, and the challenge of delivering great user experiences depends on both high-compute...
5 MIN READ