Deep dive

Decorative image of a llama in cool sunglasses against a sunny landscape.

Jul 23, 2024

Supercharging Llama 3.1 across NVIDIA Platforms

Meta's Llama collection of large language models are the most popular foundation models in the open-source community today, supporting a variety of use cases....

8 MIN READ

An illustration representing text retrieval pipelines for RAG.

Jul 23, 2024

Develop Production-Grade Text Retrieval Pipelines for RAG with NVIDIA NeMo Retriever

Enterprises are sitting on a goldmine of data waiting to be used to improve efficiency, save money, and ultimately enable higher productivity. With generative...

6 MIN READ

An illustrations representing agnetic RAG.

Jul 23, 2024

Build an Agentic RAG Pipeline with Llama 3.1 and NVIDIA NeMo Retriever NIMs

Employing retrieval-augmented generation (RAG) is an effective strategy for ensuring large language model (LLM) responses are up-to-date and not...

7 MIN READ

Jul 23, 2024

Creating Synthetic Data Using Llama 3.1 405B

Synthetic data isn’t about creating new information. It's about transforming existing information to create different variants. For over a decade, synthetic...

15 MIN READ

Llama 3 Performance with NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server

Jul 23, 2024

Customize Generative AI Models for Enterprise Applications with Llama 3.1

The newly unveiled Llama 3.1 collection of 8B, 70B, and 405B large language models (LLMs) is narrowing the gap between proprietary and open-source models. Their...

10 MIN READ

Image of a city simulation with a 6G network.

Jul 19, 2024

Boosting AI-Driven Innovation in 6G with the AI-RAN Alliance, 3GPP, and O-RAN

The pace of 6G research and development is picking up as the 5G era crosses the midpoint of the decade-long cellular generation time frame. In this blog post,...

13 MIN READ

An illustration representing an AI model.

Jul 17, 2024

Develop Generative AI-Powered Visual AI Agents for the Edge

An exciting breakthrough in AI technology—Vision Language Models (VLMs)—offers a more dynamic and flexible method for video analysis. VLMs enable users to...

9 MIN READ

GIF of a factory floor with potential paths marked in green.

Jul 16, 2024

Building an AI Agent for Supply Chain Optimization with NVIDIA NIM and cuOpt

Enterprises face significant challenges in making supply chain decisions that maximize profits while adapting quickly to dynamic changes. Optimal supply chain...

8 MIN READ

Jul 15, 2024

Unlock Gene Networks Using Limited Data with AI Model Geneformer

Geneformer is a recently introduced and powerful AI model that learns gene network dynamics and interactions using transfer learning from vast single-cell...

6 MIN READ

Jul 12, 2024

Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities

First introduced in 2019, NVIDIA Megatron-LM sparked a wave of innovation in the AI community, enabling researchers and developers to use the underpinnings of...

11 MIN READ

An illustration showing a securit alert.

Jul 11, 2024

Defending AI Model Files from Unauthorized Access with Canaries

As AI models grow in capability and cost of creation, and hold more sensitive or proprietary data, securing them at rest is increasingly important....

6 MIN READ

Jul 11, 2024

Training Sim-to-Real Transferable Robotic Assembly Skills over Diverse Geometries

Most objects in home and industrial settings consist of multiple parts that must be assembled. While human workers typically perform assembly, in certain...

10 MIN READ

A GIF showing the creation of a building image with diffusion models.

Jul 10, 2024

Understanding Diffusion Models: An Essential Guide for AEC Professionals

Generative AI, the ability of algorithms to process various types of inputs—such as text, images, audio, video, and code—and generate new content, is...

13 MIN READ

Jul 03, 2024

Powering the Future of AI-Enabled Medical Devices with NVIDIA Holoscan and RTI Connext

The demand for real-time insights and autonomous decision-making is growing across industries, and healthcare and medical devices are no exception. Relying on...

8 MIN READ

Jul 03, 2024

Maximize GPU performance with Near-Real-Time Usage Stats on NVDashboard v0.10

At NVIDIA GTC 2024, the RAPIDS team demonstrated new features on NVDashboard v0.10 a dashboard that runs on JupyterLab, for monitoring GPU usage to help...

6 MIN READ

Jul 02, 2024

Achieving High Mixtral 8x7B Performance with NVIDIA H100 Tensor Core GPUs and TensorRT-LLM

As large language models (LLMs) continue to grow in size and complexity, the performance requirements for serving them quickly and cost-effectively continue to...

9 MIN READ