Gemma 2, a 27B parameter model, is now available to researchers and developers: https://lnkd.in/gV2gXtt2 It is designed to run inference efficiently at full precision on a single Google Cloud TPU host, NVIDIA A100 80GB Tensor Core GPU, or NVIDIA H100 Tensor Core GPU.
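A quick back-of-envelope sketch of why a 27B-parameter model fits on a single 80 GB accelerator: at bfloat16/float16 "full precision", each parameter takes 2 bytes, so the weights alone come to roughly 54 GB. The figures below are rough assumptions (they ignore the KV cache, activations, and framework overhead), not official sizing guidance.

```python
# Back-of-envelope check: why a 27B-parameter model fits on a single
# 80 GB accelerator when its weights are stored in bfloat16 (2 bytes each).
def weight_memory_gb(n_params: float, bytes_per_param: int = 2) -> float:
    """Approximate weight memory in GB, ignoring KV cache and activations."""
    return n_params * bytes_per_param / 1e9

gemma2_gb = weight_memory_gb(27e9)  # ~54 GB of bf16 weights
print(f"~{gemma2_gb:.0f} GB of weights vs. 80 GB of HBM on an A100/H100")
```

In practice the remaining ~26 GB of headroom is what the KV cache and activations consume at inference time, which is why 80 GB is quoted as the single-GPU threshold.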
Mat Velloso’s Post
-
How big is Microsoft's bet on AI? In November 2023, the #Top500 list placed an Azure cloud supercomputer in third place: a Microsoft machine called Eagle that delivered 561 petaflops of performance from 1.12 million computing cores, including NVIDIA H100 GPUs and Intel Xeon Platinum 8480C CPU cores. The supercomputer provides access to #GPT-4 and other generative models hosted on the Azure cloud, and runs on Ubuntu 22.04. https://www.top500.org/ #GENAI #GPT4 #AzureCloud #AzureSupercomputer #Supercomputer #HPC #AI
top500.org
-
Founder and Chief Executive Officer at Innow8 Apps 👨💻 | Accelerating business growth through technology
Azure and NVIDIA Partner to Drive Next-Gen GPU Acceleration in AI! 🤖 Azure and NVIDIA have joined forces 🤝 to propel the AI landscape forward. By combining Azure's scalable cloud infrastructure with NVIDIA's cutting-edge GPU technology, this collaboration introduces next-gen GPU instances. 💫 🔺 These instances are poised to revolutionize AI workloads across sectors, delivering faster ⏩ training times, heightened accuracy, and accelerated insights. 📈 From healthcare 👨⚕️ to finance 💰 and beyond, this partnership paves the way for groundbreaking AI-powered solutions. #AI #CloudComputing #Innovation #NVIDIA #Azure #GPUAcceleration
Azure and NVIDIA deliver next-gen GPU acceleration for AI
https://www.artificialintelligence-news.com
-
Data Enthusiast | Data Analyst | Data Science | ML/DL/AI | Analytics | Visualization | ETL | UI/UX | NFT | Power Apps | IT | Content Writer | Jobs/Recruitment | Quoran | Follow for more
🚀 NVIDIA's latest powerhouse, the GH200 GPU, is set to revolutionize AI capabilities for select companies! Bindu Reddy of Abacus AI is thrilled to be among the first to harness this tech for open-source AI projects. With industry giants like AWS, Google Cloud, and Oracle Cloud Infrastructure gearing up to integrate GH200 into their services, the AI landscape is poised for a major leap forward. Meanwhile, competitors AMD and Intel are not far behind, with their own AI accelerators challenging NVIDIA's dominance. Get ready for a surge in AI innovation as these tech titans race to power the future! 🌐💡 #NVIDIA #GH200 #AI #MachineLearning #DeepLearning #CloudComputing #Innovation #TechNews #ArtificialIntelligence #DataScience #BigData #Analytics #OpenSource #FutureOfAI #TechTrends
http://analyticsindiamag.com
-
Founding Partner, Chairman of the Board and Investor Relator at Spindox - Adjunct Professor at Università di Pavia
In the end, it is very much a matter of hardware and GPUs. A cluster of 3,584 H100s, manufactured by NVIDIA, at cloud service provider CoreWeave completed a massive GPT-3-based benchmark in just 11 minutes. And that's just the beginning. NVIDIA and CoreWeave are now helping Inflection AI install a new cluster that will run 22,000 H100s. That is likely the largest GPU cluster for AI applications in the world, ahead of Meta's 16,000-GPU cluster announced in May. #ai #cloud #hardware #gpu
NVIDIA H100 GPUs Set Standard for Generative AI in Debut MLPerf Benchmark
blogs.nvidia.com
-
📣 There's a hot, new chip on the block! 📣 One week after OpenAI Dev Day comes NVIDIA's special address at Supercomputing 23, where they announced their latest and greatest AI chip yet. The H200 has 1.4x the memory bandwidth and 1.8x the memory capacity of the coveted H100. Releasing in 2Q24, AWS, GCP, Azure, and Oracle will be among the first to offer the new chips. ...Not that this announcement does anything to ease the unparalleled supply crunch for these chips in Silicon Valley right now. NVIDIA's GPUs are more precious than gold and harder to find than a snowflake in the Sahara. The lead time for new orders is currently six months, which might as well be a lifetime in AI. It's no wonder the stock is up +247% this year, continuing its journey up and to the right. It's the closest thing we have to a monopoly play on the computing systems used to power GenAI.
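The 1.4x and 1.8x multipliers can be sanity-checked against the publicly quoted spec-sheet numbers. The H100 and H200 figures below are assumptions taken from NVIDIA's published product specs (H100 SXM: 80 GB HBM3 at ~3.35 TB/s; H200: 141 GB HBM3e at ~4.8 TB/s), not from this post itself:

```python
# Sanity-check the quoted H200-vs-H100 ratios against spec-sheet numbers
# (assumed: H100 SXM = 80 GB HBM3 at ~3.35 TB/s; H200 = 141 GB HBM3e at ~4.8 TB/s).
h100 = {"memory_gb": 80, "bandwidth_tb_s": 3.35}
h200 = {"memory_gb": 141, "bandwidth_tb_s": 4.8}

capacity_ratio = h200["memory_gb"] / h100["memory_gb"]             # ~1.76x, quoted as 1.8x
bandwidth_ratio = h200["bandwidth_tb_s"] / h100["bandwidth_tb_s"]  # ~1.43x, quoted as 1.4x
print(f"capacity: {capacity_ratio:.2f}x, bandwidth: {bandwidth_ratio:.2f}x")
```

Both ratios round to the marketing figures, so the claim is internally consistent with the spec sheets.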
-
Founder & CEO at WhiteNoise Corporation | Tech Influencer | 30k Followers | 19 Million Impressions | DMs - Ads, Promotions, Consulting & Partnerships
https://lnkd.in/gCJYWNc8 MosaicML, just acquired by Databricks for $1.3B, published some interesting benchmarks for training LLMs on the AMD MI250 GPU and found it is ~80% as fast as an NVIDIA A100. Did the world just change? #nvidia #amd #ai #cloud #llms #mosaicml #databricks #aitraining #gpucomputing #deeplearning #neuralnetworks #technology #innovation #business Credit: Forbes
Is The AMD GPU Better Than We Thought For AI?
forbes.com
-
If AI providers are in the spotlight, the current winners are companies supplying processing power both client-side and server-side. Notably, Microsoft saw a $1B increase in market cap, and while the OpenAI deal contributes, the primary driver is the surge in earnings from its cloud business. Similarly, Nvidia's market cap rose by $2B. This suggests that the main beneficiaries of the new AI ecosystem might not be the most visible players. https://lnkd.in/eCNcyqCT
Can anything stop Nvidia’s Jensen Huang?
economist.com
-
AWS re:Invent 2023 saw a game-changing announcement from AWS and NVIDIA. AWS will be the first cloud provider to offer NVIDIA GH200 Grace Hopper Superchips, interconnected with NVIDIA NVLink technology through NVIDIA DGX Cloud and running on Amazon Elastic Compute Cloud (Amazon EC2). The NVIDIA GH200 Grace Hopper Superchip-powered instances in AWS will feature 4.5 TB of HBM3e memory, a 7.2x increase compared to current-generation NVIDIA H100-powered EC2 P5 instances. This allows developers to run larger models while improving training performance. That is a serious level of compute for training large models!
1. The NVIDIA GH200 NVL32 is a rack-scale reference design for NVIDIA GH200 Grace Hopper Superchips connected through NVLink, targeted at hyperscale data centres.
2. The CPU-to-GPU memory interconnect runs at 900 GB/s, 7x faster than PCIe Gen 5. GPUs access CPU memory in a cache-coherent way, extending the total memory available to applications.
3. NVIDIA GH200 NVL32 is ideal for LLM training and inference, recommender systems, graph neural networks (GNNs), vector databases, and retrieval-augmented generation (RAG) models.
#AWS #NVIDIA #CloudComputing #AI #EmergingTech
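The two multipliers quoted above can be reproduced from the post's own numbers plus two assumptions: PCIe Gen 5 x16 delivers roughly 128 GB/s per direction, and an EC2 P5 instance carries 8x H100 GPUs with 80 GB of HBM each. A rough arithmetic sketch under those assumptions:

```python
# Reproduce the two multipliers quoted for the GH200-based instances.
nvlink_c2c_gb_s = 900         # CPU-to-GPU interconnect bandwidth, as quoted
pcie_gen5_x16_gb_s = 128      # assumed: ~128 GB/s per direction for PCIe Gen 5 x16

gh200_memory_gb = 4.5 * 1024  # 4.5 TB of HBM3e per instance, as quoted
p5_memory_gb = 8 * 80         # assumed: a P5 instance packs 8x H100 80GB

print(f"interconnect: {nvlink_c2c_gb_s / pcie_gen5_x16_gb_s:.1f}x PCIe Gen 5")
print(f"memory: {gh200_memory_gb / p5_memory_gb:.1f}x an H100 P5 instance")
```

The division lands on ~7.0x for the interconnect and exactly 7.2x for memory, matching the announced figures.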
-
In the latest round of MLPerf inference benchmark scores, chip and server makers and cloud providers showed off the performance of top-of-the-range hardware on the newly introduced 6B-parameter GPT-J large language model (LLM) inference benchmark, which is designed to indicate system performance on LLMs like ChatGPT. #chatgpt #llms #nvidia #mlperf #eetimes #ai
Nvidia Shows Off Grace Hopper in MLPerf Inference - EE Times
https://www.eetimes.com
-
The GenAI projects are off to the races! But the underlying GPU resources remain expensive and in short supply. How can you better manage your GPU cluster to optimize these precious resources and maximize utilization of GPU, CPU, and memory? How can you re-prioritize AI/ML jobs and transparently migrate jobs between on-prem and cloud resources by checkpointing GPUs? Message me if you would like an overview of our award-winning platform, Memory Machine for AI! #GenAI #AIML #AI #GPU #MemVerge