LLMs

Jul 17, 2024

NVIDIA NeMo Accelerates LLM Innovation with Hybrid State Space Model Support

Today’s large language models (LLMs) are based on the transformer model architecture introduced in 2017. Since then, rapid advances in AI compute performance...

7 MIN READ

Jul 16, 2024

New Workshops: Customize LLMs, Build and Deploy Large Neural Networks

1 MIN READ

GIF of a factory floor with potential paths marked in green.

Jul 16, 2024

Building an AI Agent for Supply Chain Optimization with NVIDIA NIM and cuOpt

Enterprises face significant challenges in making supply chain decisions that maximize profits while adapting quickly to dynamic changes. Optimal supply chain...

8 MIN READ

Jul 15, 2024

Power Your AI Projects with New NVIDIA NIMs for Mistral and Mixtral Models

Large language models (LLMs) are growing in adoption across enterprise organizations, with many building them into their AI applications. Foundation models are...

5 MIN READ

Jul 12, 2024

Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities

First introduced in 2019, NVIDIA Megatron-LM sparked a wave of innovation in the AI community, enabling researchers and developers to use the underpinnings of...

11 MIN READ

Jul 11, 2024

Next Generation of FlashAttention

NVIDIA is excited to collaborate with Colfax, Together.ai, Meta, and Princeton University on their recent achievement to exploit the Hopper GPU architecture and...

1 MIN READ

Decorative image of a computer screen with characters and symbols streaming through it.

Jul 10, 2024

Curating Non-English Datasets for LLM Training with NVIDIA NeMo Curator

Data curation plays a crucial role in the development of effective and fair large language models (LLMs). High-quality, diverse training data directly...

12 MIN READ

Jul 09, 2024

Building Cyber Language Models to Unlock New Cybersecurity Capabilities

General-purpose large language models (LLMs) have proven their usefulness across various fields, offering substantial benefits in applications ranging from text...

13 MIN READ

Jul 03, 2024

Power Advanced Coding Capabilities with Deepseek Code LLM

Deepseek Coder v2, available as an NVIDIA NIM microservice, enhances project-level coding and infilling tasks.

1 MIN READ

Jul 02, 2024

Addressing Hallucinations in Speech Synthesis LLMs with the NVIDIA NeMo T5-TTS Model

NVIDIA NeMo has released the T5-TTS model, a significant advancement in text-to-speech (TTS) technology. Based on large language models (LLMs), T5-TTS produces...

4 MIN READ

Jul 02, 2024

Achieving High Mixtral 8x7B Performance with NVIDIA H100 Tensor Core GPUs and TensorRT-LLM

As large language models (LLMs) continue to grow in size and complexity, the performance requirements for serving them quickly and cost-effectively continue to...

9 MIN READ

Jul 02, 2024

Advancing Security for Large Language Models with NVIDIA GPUs and Edgeless Systems

Edgeless Systems introduced Continuum AI, the first generative AI framework that keeps prompts encrypted at all times with confidential computing by combining...

6 MIN READ

Jul 02, 2024

Phi-3-Medium: Now Available on the NVIDIA API Catalog

Phi-3-Medium accelerates research with logic-rich features in both short (4K) and long (128K) context.

1 MIN READ

Jul 01, 2024

StarCoder2-15B: A Powerful LLM for Code Generation, Summarization, and Documentation

Trained on 600+ programming languages, StarCoder2-15B is now packaged as a NIM inference microservice available for free from the NVIDIA API catalog.

1 MIN READ

Jul 01, 2024

Google's New Gemma 2 Model Now Optimized and Available on NVIDIA API Catalog

Gemma 2, the next generation of Google Gemma models, is now optimized with TensorRT-LLM and packaged as NVIDIA NIM inference microservice.

1 MIN READ

Jul 01, 2024

Deploy GPU-Optimized AI Software with One Click Using Brev.dev and NVIDIA NGC Catalog

Brev.dev is making it easier to develop AI solutions by leveraging software libraries, frameworks, and Jupyter Notebooks on the NVIDIA NGC catalog. You can use...

7 MIN READ