Conversational AI

Jul 18, 2024

Spotlight: UneeQ Revolutionizes Customer Engagement with AI-Powered Digital Humans

With the rise of chatbots and virtual assistants, customer interactions have evolved to embrace the versatility of voice and text inputs. However, integrating...

4 MIN READ

Jul 18, 2024

Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 2, Performance Tuning

In the first part of the series, we presented an overview of the IVF-PQ algorithm and explained how it builds on top of the IVF-Flat algorithm, using the...

14 MIN READ

Jul 18, 2024

Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 1, Deep Dive

In this blog post, we continue the series on accelerating vector search using cuVS. Our previous post in the series introduced IVF-Flat, a fast algorithm for...

14 MIN READ

Jul 17, 2024

NVIDIA NeMo Accelerates LLM Innovation with Hybrid State Space Model Support

Today’s large language models (LLMs) are based on the transformer model architecture introduced in 2017. Since then, rapid advances in AI compute performance...

7 MIN READ

Jul 16, 2024

New Workshops: Customize LLMs, Build and Deploy Large Neural Networks

1 MIN READ

Jul 12, 2024

Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities

First introduced in 2019, NVIDIA Megatron-LM sparked a wave of innovation in the AI community, enabling researchers and developers to use the underpinnings of...

11 MIN READ

Jul 02, 2024

Addressing Hallucinations in Speech Synthesis LLMs with the NVIDIA NeMo T5-TTS Model

NVIDIA NeMo has released the T5-TTS model, a significant advancement in text-to-speech (TTS) technology. Based on large language models (LLMs), T5-TTS produces...

4 MIN READ

Jun 28, 2024

Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning

Full fine-tuning (FT) is commonly employed to tailor general pretrained models for specific downstream tasks. To reduce the training cost, parameter-efficient...

6 MIN READ

Jun 26, 2024

Generate High-Quality, Context-Aware Responses for Chatbots and Search Engines with Llama 3-ChatQA

Experience and test Llama3-ChatQA models at scale with performance optimized NVIDIA NIM inference microservice using the NVIDIA API catalog.

1 MIN READ

Jun 20, 2024

AI Brain Implant Restores Bilingual Communication for Stroke Survivor

Scientists have enabled a stroke survivor, who is unable to speak, to communicate in both Spanish and English by training a neuroprosthesis implant to decode...

3 MIN READ

Jun 12, 2024

Introducing Grouped GEMM APIs in cuBLAS and More Performance Updates

The latest release of NVIDIA cuBLAS library, version 12.5, continues to deliver functionality and performance to deep learning (DL) and high-performance...

7 MIN READ

An illustration representing an embedding model.

Jun 10, 2024

NVIDIA Text Embedding Model Tops MTEB Leaderboard

The latest embedding model from NVIDIA—NV-Embed—set a new record for embedding accuracy with a score of 69.32 on the Massive Text Embedding Benchmark...

6 MIN READ

Jun 04, 2024

Build Lifelike Digital Humans with NVIDIA ACE, Now Generally Available

NVIDIA ACE—a suite of technologies bringing digital humans to life with generative AI—is now generally available for developers. Packaged as NVIDIA NIMs,...

5 MIN READ

Jun 02, 2024

Streamline Development of AI-Powered Apps with NVIDIA RTX AI Toolkit for Windows RTX PCs

NVIDIA today launched the NVIDIA RTX AI Toolkit, a collection of tools and SDKs for Windows application developers to customize, optimize, and deploy AI models...

8 MIN READ

An illustration representing NeMo Guardrails.

May 31, 2024

Building Safer LLM Apps with LangChain Templates and NVIDIA NeMo Guardrails

An easily deployable reference architecture can help developers get to production faster with custom LLM use cases. LangChain Templates are a new way of...

7 MIN READ

Stylized image of a smartphone chat with a young woman smiling off to one side.

May 30, 2024

Personalized Learning with Gipi, NVIDIA TensortRT-LLM, and AI Foundation Models

Over 1.2B people are actively learning new languages, with over 500M learners on digital learning platforms such as Duolingo. At the same time, a significant...

6 MIN READ