Conversational AI
Jul 18, 2024
Spotlight: UneeQ Revolutionizes Customer Engagement with AI-Powered Digital Humans
With the rise of chatbots and virtual assistants, customer interactions have evolved to embrace the versatility of voice and text inputs. However, integrating...
4 MIN READ
Jul 18, 2024
Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 2, Performance Tuning
In the first part of the series, we presented an overview of the IVF-PQ algorithm and explained how it builds on top of the IVF-Flat algorithm, using the...
14 MIN READ
Jul 18, 2024
Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 1, Deep Dive
In this blog post, we continue the series on accelerating vector search using cuVS. Our previous post in the series introduced IVF-Flat, a fast algorithm for...
14 MIN READ
Jul 17, 2024
NVIDIA NeMo Accelerates LLM Innovation with Hybrid State Space Model Support
Today’s large language models (LLMs) are based on the transformer model architecture introduced in 2017. Since then, rapid advances in AI compute performance...
7 MIN READ
Jul 16, 2024
New Workshops: Customize LLMs, Build and Deploy Large Neural Networks
Register now for an instructor-led public workshop in July, August or September. Space is limited.
1 MIN READ
Jul 12, 2024
Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities
First introduced in 2019, NVIDIA Megatron-LM sparked a wave of innovation in the AI community, enabling researchers and developers to use the underpinnings of...
11 MIN READ
Jul 02, 2024
Addressing Hallucinations in Speech Synthesis LLMs with the NVIDIA NeMo T5-TTS Model
NVIDIA NeMo has released the T5-TTS model, a significant advancement in text-to-speech (TTS) technology. Based on large language models (LLMs), T5-TTS produces...
4 MIN READ
Jun 28, 2024
Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning
Full fine-tuning (FT) is commonly employed to tailor general pretrained models for specific downstream tasks. To reduce the training cost, parameter-efficient...
6 MIN READ
Jun 26, 2024
Generate High-Quality, Context-Aware Responses for Chatbots and Search Engines with Llama 3-ChatQA
Experience and test Llama3-ChatQA models at scale with the performance-optimized NVIDIA NIM inference microservice using the NVIDIA API catalog.
1 MIN READ
Jun 20, 2024
AI Brain Implant Restores Bilingual Communication for Stroke Survivor
Scientists have enabled a stroke survivor, who is unable to speak, to communicate in both Spanish and English by training a neuroprosthesis implant to decode...
3 MIN READ
Jun 12, 2024
Introducing Grouped GEMM APIs in cuBLAS and More Performance Updates
The latest release of the NVIDIA cuBLAS library, version 12.5, continues to deliver functionality and performance to deep learning (DL) and high-performance...
7 MIN READ
Jun 10, 2024
NVIDIA Text Embedding Model Tops MTEB Leaderboard
The latest embedding model from NVIDIA—NV-Embed—set a new record for embedding accuracy with a score of 69.32 on the Massive Text Embedding Benchmark...
6 MIN READ
Jun 04, 2024
Build Lifelike Digital Humans with NVIDIA ACE, Now Generally Available
NVIDIA ACE—a suite of technologies bringing digital humans to life with generative AI—is now generally available for developers. Packaged as NVIDIA NIMs,...
5 MIN READ
Jun 02, 2024
Streamline Development of AI-Powered Apps with NVIDIA RTX AI Toolkit for Windows RTX PCs
NVIDIA today launched the NVIDIA RTX AI Toolkit, a collection of tools and SDKs for Windows application developers to customize, optimize, and deploy AI models...
8 MIN READ
May 31, 2024
Building Safer LLM Apps with LangChain Templates and NVIDIA NeMo Guardrails
An easily deployable reference architecture can help developers get to production faster with custom LLM use cases. LangChain Templates are a new way of...
7 MIN READ
May 30, 2024
Personalized Learning with Gipi, NVIDIA TensorRT-LLM, and AI Foundation Models
Over 1.2B people are actively learning new languages, with over 500M learners on digital learning platforms such as Duolingo. At the same time, a significant...
6 MIN READ