Deep dive
Jul 23, 2024
Supercharging Llama 3.1 across NVIDIA Platforms
Meta's Llama collection of large language models are the most popular foundation models in the open-source community today, supporting a variety of use cases....
8 MIN READ
Jul 23, 2024
Develop Production-Grade Text Retrieval Pipelines for RAG with NVIDIA NeMo Retriever
Enterprises are sitting on a goldmine of data waiting to be used to improve efficiency, save money, and ultimately enable higher productivity. With generative...
6 MIN READ
Jul 23, 2024
Build an Agentic RAG Pipeline with Llama 3.1 and NVIDIA NeMo Retriever NIMs
Employing retrieval-augmented generation (RAG) is an effective strategy for ensuring large language model (LLM) responses are up-to-date and not...
7 MIN READ
Jul 23, 2024
Creating Synthetic Data Using Llama 3.1 405B
Synthetic data isn’t about creating new information. It's about transforming existing information to create different variants. For over a decade, synthetic...
15 MIN READ
Jul 23, 2024
Customize Generative AI Models for Enterprise Applications with Llama 3.1
The newly unveiled Llama 3.1 collection of 8B, 70B, and 405B large language models (LLMs) is narrowing the gap between proprietary and open-source models. Their...
10 MIN READ
Jul 19, 2024
Boosting AI-Driven Innovation in 6G with the AI-RAN Alliance, 3GPP, and O-RAN
The pace of 6G research and development is picking up as the 5G era crosses the midpoint of the decade-long cellular generation time frame. In this blog post,...
13 MIN READ
Jul 17, 2024
Develop Generative AI-Powered Visual AI Agents for the Edge
An exciting breakthrough in AI technology—Vision Language Models (VLMs)—offers a more dynamic and flexible method for video analysis. VLMs enable users to...
9 MIN READ
Jul 16, 2024
Building an AI Agent for Supply Chain Optimization with NVIDIA NIM and cuOpt
Enterprises face significant challenges in making supply chain decisions that maximize profits while adapting quickly to dynamic changes. Optimal supply chain...
8 MIN READ
Jul 15, 2024
Unlock Gene Networks Using Limited Data with AI Model Geneformer
Geneformer is a recently introduced and powerful AI model that learns gene network dynamics and interactions using transfer learning from vast single-cell...
6 MIN READ
Jul 12, 2024
Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities
First introduced in 2019, NVIDIA Megatron-LM sparked a wave of innovation in the AI community, enabling researchers and developers to use the underpinnings of...
11 MIN READ
Jul 11, 2024
Defending AI Model Files from Unauthorized Access with Canaries
As AI models grow in capability and cost of creation, and hold more sensitive or proprietary data, securing them at rest is increasingly important....
6 MIN READ
Jul 11, 2024
Training Sim-to-Real Transferable Robotic Assembly Skills over Diverse Geometries
Most objects in home and industrial settings consist of multiple parts that must be assembled. While human workers typically perform assembly, in certain...
10 MIN READ
Jul 10, 2024
Understanding Diffusion Models: An Essential Guide for AEC Professionals
Generative AI, the ability of algorithms to process various types of inputs—such as text, images, audio, video, and code—and generate new content, is...
13 MIN READ
Jul 03, 2024
Powering the Future of AI-Enabled Medical Devices with NVIDIA Holoscan and RTI Connext
The demand for real-time insights and autonomous decision-making is growing across industries, and healthcare and medical devices are no exception. Relying on...
8 MIN READ
Jul 03, 2024
Maximize GPU performance with Near-Real-Time Usage Stats on NVDashboard v0.10
At NVIDIA GTC 2024, the RAPIDS team demonstrated new features on NVDashboard v0.10 a dashboard that runs on JupyterLab, for monitoring GPU usage to help...
6 MIN READ
Jul 02, 2024
Achieving High Mixtral 8x7B Performance with NVIDIA H100 Tensor Core GPUs and TensorRT-LLM
As large language models (LLMs) continue to grow in size and complexity, the performance requirements for serving them quickly and cost-effectively continue to...
9 MIN READ