LLMs
![Illustration showing models and NeMo.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/07/llm-megatron-core-blog-2967200-1920x1080-1-960x540.jpg)
Jul 17, 2024
NVIDIA NeMo Accelerates LLM Innovation with Hybrid State Space Model Support
Today’s large language models (LLMs) are based on the transformer model architecture introduced in 2017. Since then, rapid advances in AI compute performance...
7 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/07/NVIDIA-DLI-Instructor-Workshops-960x540.png)
Jul 16, 2024
New Workshops: Customize LLMs, Build and Deploy Large Neural Networks
Register now for an instructor-led public workshop in July, August or September. Space is limited.
1 MIN READ
![GIF of a factory floor with potential paths marked in green.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/07/cuopt-promo-ai-agent-1200x675-1-960x540.gif)
Jul 16, 2024
Building an AI Agent for Supply Chain Optimization with NVIDIA NIM and cuOpt
Enterprises face significant challenges in making supply chain decisions that maximize profits while adapting quickly to dynamic changes. Optimal supply chain...
8 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/07/NVIDIA-API-Catalog-Mistral-Mixtral-960x540.png)
Jul 15, 2024
Power Your AI Projects with New NVIDIA NIMs for Mistral and Mixtral Models
Large language models (LLMs) are growing in adoption across enterprise organizations, with many building them into their AI applications. Foundation models are...
5 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/07/stacked-geometric-shapes-1-960x540.jpg)
Jul 12, 2024
Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities
First introduced in 2019, NVIDIA Megatron-LM sparked a wave of innovation in the AI community, enabling researchers and developers to use the underpinnings of...
11 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/07/cutlass-featured.png)
Jul 11, 2024
Next Generation of FlashAttention
NVIDIA is excited to collaborate with Colfax, Together.ai, Meta, and Princeton University on their recent achievement to exploit the Hopper GPU architecture and...
1 MIN READ
![Decorative image of a computer screen with characters and symbols streaming through it.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/06/nemo-curator-featured-960x540.png)
Jul 10, 2024
Curating Non-English Datasets for LLM Training with NVIDIA NeMo Curator
Data curation plays a crucial role in the development of effective and fair large language models (LLMs). High-quality, diverse training data directly...
12 MIN READ
![An illustration showing code.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/06/cyber-language-960x540.jpg)
Jul 09, 2024
Building Cyber Language Models to Unlock New Cybersecurity Capabilities
General-purpose large language models (LLMs) have proven their usefulness across various fields, offering substantial benefits in applications ranging from text...
13 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/07/DeepSeek-e1720012355496-960x540.jpg)
Jul 03, 2024
Power Advanced Coding Capabilities with Deepseek Code LLM
Deepseek Coder v2, available as an NVIDIA NIM microservice, enhances project-level coding and infilling tasks.
1 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/06/llm-composite-960x540.png)
Jul 02, 2024
Addressing Hallucinations in Speech Synthesis LLMs with the NVIDIA NeMo T5-TTS Model
NVIDIA NeMo has released the T5-TTS model, a significant advancement in text-to-speech (TTS) technology. Based on large language models (LLMs), T5-TTS produces...
4 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/06/Achieving-High-Mixtral-8x7B-Performance-with-NVIDIA-H100-Tensor-Core-GPUs-and-TensorRT-LLM.png)
Jul 02, 2024
Achieving High Mixtral 8x7B Performance with NVIDIA H100 Tensor Core GPUs and TensorRT-LLM
As large language models (LLMs) continue to grow in size and complexity, the performance requirements for serving them quickly and cost-effectively continue to...
9 MIN READ
![An image representing cybersecurity.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/07/image4-1-960x540.png)
Jul 02, 2024
Advancing Security for Large Language Models with NVIDIA GPUs and Edgeless Systems
Edgeless Systems introduced Continuum AI, the first generative AI framework that keeps prompts encrypted at all times with confidential computing by combining...
6 MIN READ
![Illustration representing Phi-3-Medium.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/07/Phi-3-Medium-960x540.png)
Jul 02, 2024
Phi-3-Medium: Now Available on the NVIDIA API Catalog
Phi-3-Medium accelerates research with logic-rich features in both short (4K) and long (128K) context.
1 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/07/abstract-lines-1-960x540.jpg)
Jul 01, 2024
StarCoder2-15B: A Powerful LLM for Code Generation, Summarization, and Documentation
Trained on 600+ programming languages, StarCoder2-15B is now packaged as a NIM inference microservice available for free from the NVIDIA API catalog.
1 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/06/ngc-press-gemma-2-model-1920x10801-1-960x540.jpg)
Jul 01, 2024
Google's New Gemma 2 Model Now Optimized and Available on NVIDIA API Catalog
Gemma 2, the next generation of Google Gemma models, is now optimized with TensorRT-LLM and packaged as NVIDIA NIM inference microservice.
1 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/06/model-icons-960x540.png)
Jul 01, 2024
Deploy GPU-Optimized AI Software with One Click Using Brev.dev and NVIDIA NGC Catalog
Brev.dev is making it easier to develop AI solutions by leveraging software libraries, frameworks, and Jupyter Notebooks on the NVIDIA NGC catalog. You can use...
7 MIN READ