Hugging Face’s Post

676,444 followers

Transformers v4.42 includes a new Transformer-based model capable of real-time object detection. See below for more info:

Niels Rogge

Machine Learning Engineer at ML6 & Hugging Face

3w Edited

RT-DETR is now supported in Hugging Face Transformers! 🙌

RT-DETR, short for “Real-Time DEtection TRansformer”, is a computer vision model developed at Peking University and Baidu, Inc. capable of real-time object detection. The authors claim better performance than YOLO models in both speed and accuracy. The model comes with an Apache 2.0 license, meaning people can freely use it for commercial applications. 🔥

RT-DETR is a follow-up work of DETR, a model developed by AI at Meta that successfully used Transformers for the first time for object detection. The latter has been in the Transformers library since 2020. After this, lots of improvements have been made to enable faster convergence and inference speed. RT-DETR is an important example of that as it unlocks real-time inference at high accuracy!

Big congrats to Daniel Choi for contributing this model!

* Demo notebooks (fine-tuning + inference): https://lnkd.in/eA_WzsyE
* Demo Space: https://lnkd.in/ewzWTSHA
* Paper: https://lnkd.in/eR3Qg6dm

#ai #artificialintelligence #objectdetection #huggingface #computervision
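For readers who want to try the model, inference with the Transformers integration could look roughly like the sketch below. It assumes `transformers` 4.42 or later, `torch`, and `Pillow` are installed. The checkpoint name `PekingU/rtdetr_r50vd`, the 0.5 score threshold, and the helper names `detect` and `format_detection` are illustrative choices, not taken from the post.

```python
def format_detection(label: str, score: float, box: list[float]) -> str:
    """Render one detection as 'label 0.50 [x0, y0, x1, y1]' (pure helper)."""
    coords = ", ".join(f"{v:.1f}" for v in box)
    return f"{label} {score:.2f} [{coords}]"


def detect(image_path: str, checkpoint: str = "PekingU/rtdetr_r50vd",
           threshold: float = 0.5) -> list[str]:
    """Run RT-DETR on one image and return formatted detections.

    The checkpoint name is an assumption; swap in whichever RT-DETR
    checkpoint you actually use. Weights are downloaded on the first call.
    """
    # Heavy dependencies are imported lazily so the helper above stays
    # usable without them.
    import torch
    from PIL import Image
    from transformers import RTDetrForObjectDetection, RTDetrImageProcessor

    processor = RTDetrImageProcessor.from_pretrained(checkpoint)
    model = RTDetrForObjectDetection.from_pretrained(checkpoint)
    model.eval()

    image = Image.open(image_path).convert("RGB")
    inputs = processor(images=image, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)

    # target_sizes expects (height, width); PIL's image.size is (width, height).
    target_sizes = torch.tensor([image.size[::-1]])
    result = processor.post_process_object_detection(
        outputs, target_sizes=target_sizes, threshold=threshold
    )[0]

    return [
        format_detection(model.config.id2label[label.item()], score.item(), box.tolist())
        for score, label, box in zip(result["scores"], result["labels"], result["boxes"])
    ]
```

Calling `detect("street.jpg")` (a hypothetical local image) would return one string per detected object. The linked demo notebooks remain the authoritative reference for fine-tuning and inference.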

21 Comments

Cohorte

RT-DETR benefits real-time object detection by guaranteeing real-time performance and accuracy. It employs Vision Transformers for effective multiscale feature processing, features adaptable inference speed adjustment, and supports CUDA with TensorRT, outperforming other real-time detectors in both speed and accuracy.

1 Reaction

Houston Austin Muzamhindo

AI & Data at Investec UK | Founder at IQmates | Udemy Instructor | TEDxJHB 1830 Fellow

Interesting. Need to read the paper. YOLO is CNN-based. During the X / Twitter debate between Elon and Yann, one of the points was around computer vision without CNNs (which Elon claimed Tesla is doing now and those cars need very fast inference models which certain CNN architectures are capable of).

William William

YOLOv10 was released recently. I checked the paper, and it doesn't include a comparison with this model. Do we know whether it even outperforms YOLOv10? In any case, if the code is open source, it opens a new door for real-time detection. I will have something to read before going to sleep.

Jean-Baptiste DAVID

Hey, your former colleagues are doing this! https://www.linkedin.com/feed/update/urn:li:activity:7212024869444595713/

Durga Prasad Dhulipudi

AI/ML Enterprise Architect, Expert Geospatial and Aviation

RT-DETR is faster than YOLOv8 and has better accuracy. Thanks for the object detection notebook. Do you have any similar examples for segmentation, or could you come up with one?

Christophe Devriese

Great. Well done! And yet, a bit sad. YOLO was the last great bastion of CNN models. It saddens me in a way that this works. I guess now we'll really see them off.

1 Reaction

Anshu Bhola

IoT | AI/ML | Technologist | Architectures

Wow .. faster than YOLO.

1 Reaction

Pablo Carmona Esparza

AI and Automation business solutions | LangChain developer | Zapier and Make IO ninja 🧑🏻💻🥷

Exciting development, Hugging Face! The innovation in real-time object detection keeps raising the bar. Kudos to the team!

Tushar Khete

AI Engineer | Data Scientist | Gen AI | AI | LLMOPs | DL | NLP | CV | Tech Team Lead

So impressive!

1 Reaction

Francesco Cozzolino

medium.com/@francesco.cozzolino | AI Solution Developer

Great

More Relevant Posts

Kallisto AI

1,556 followers
3w Edited
Real-Time DEtection TRansformer (RT-DETR) for real-time object detection. "we first analyze the computational redundancy present in the multi-scale Transformer encoder. Intuitively, high-level features that contain rich semantic information about objects are extracted from low-level features, making it redundant to perform feature interaction on the concatenated multi-scale features." The key in #computerVision #masking is to minimize the number of low-level #features that can be extracted by any #AI algorithm (including #transformer based ones). Yes, in any #sensor band... call it EO, IR, Multispectral, radar... or any other sensor. Yep, that is what #kallistoShield does.

1 Comment
JOB MBUVI

CEO and Founder, AI CrowdForce (AI and Automation)
3w
Unlock unparalleled AI and ML precision with the Master Data Annotator. Discover the difference at fiverr.com/jobmunene #DataAnnotation #ImageAnnotation #AI #MachineLearning #DataScience #AITrainingData #MLTrainingData #AIAnnotation #MLAnnotation #ArtificialIntelligence

Ritesh Sangani

Consultant - Data Sciences | Artificial Intelligence | Business Intelligence | Cloud | Software Development | Strategic Management
3w
Here is a comparison with YOLOv10-X:

Model          Params (M)   FLOPs (G)   APval (%)   Latency (ms)   Latency (Forward) (ms)
RT-DETR-R101   76.0         259.0       54.3        13.71          13.58
YOLOv10-X      29.5         160.4       54.4        10.70          10.60

https://lnkd.in/dibQVFyQ
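To put the latency column in more familiar terms, per-image latency in milliseconds converts to throughput as 1000 / latency. The sketch below applies that to the two rows quoted above. It assumes the figures are single-image latencies, which the comment does not state, and the names `ROWS` and `fps` are illustrative.

```python
# Rows copied from the comparison above:
# (model, params in M, FLOPs in G, AP val in %, latency in ms)
ROWS = [
    ("RT-DETR-R101", 76.0, 259.0, 54.3, 13.71),
    ("YOLOv10-X", 29.5, 160.4, 54.4, 10.70),
]


def fps(latency_ms: float) -> float:
    """Frames per second for a given per-image latency in milliseconds."""
    return 1000.0 / latency_ms


for name, params_m, flops_g, ap, latency_ms in ROWS:
    print(f"{name:<13} AP {ap:.1f}  {latency_ms:.2f} ms  ~{fps(latency_ms):.1f} FPS")

# Ratio of the two latencies: how much faster the second row is than the first.
speedup = ROWS[0][4] / ROWS[1][4]
ap_delta = ROWS[1][3] - ROWS[0][3]
print(f"YOLOv10-X is ~{speedup:.2f}x faster at {ap_delta:+.1f} AP")
```

On these numbers the two models are within 0.1 AP of each other, so the practical difference is in latency and model size rather than accuracy.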

Ayush Nath Tiwari

Aspiring Data Scientist | Generative AI Enthusiast, Deep Learning, LLMs, Statistical modeling and data analytics
2mo Edited
Excited to share my new project on "Exploring Object Detection with YOLOv8"

Explored the fascinating world of object detection using #YOLOv8, a powerful algorithm in computer vision. From pre-trained models to custom training, delved deep into enhancing detection accuracy.

YOLOv8 Model: YOLOv8, also known as "You Only Look Once," revolutionizes object detection with its real-time capabilities and high accuracy. Its efficient architecture makes it a top choice for a wide range of applications.

Project Steps:
I. Installed YOLOv8 and experimented with pre-trained COCO models.
II. Tailored training by fine-tuning the model with a specialized dataset.
III. Evaluated the model's performance and accuracy through rigorous validation and inference on test images.

GitHub Link: https://lnkd.in/gg7DKUvE

#ObjectDetection #YOLOv8 #ComputerVision #DeepLearning #AI #MachineLearning #ImageRecognition #NeuralNetworks
Data Phoenix

1,032 followers
5mo
[News] YOLOv9 promises to be the new state-of-the-art real-time object detector

YOLOv9 is a real-time object detector with performance competitive enough to become the newest state-of-the-art method. YOLOv9 features two improvements: Programmable Gradient Information (PGI) and Generalized Efficient Layer Aggregation Network (GELAN)
https://buff.ly/42SUjc5

Are you interested in AI? Check out Data Phoenix (https://buff.ly/48z3sI1) - the global AI and Data community of 8000+ Engineers, Executives, and Founders.

#DataPhoenix #AI #ML #MachineLearning #DataScience #ArtificialIntelligence #News
Yael Baron (Ben Ruby)

Deep Learning Engineer
10mo
🚀 Deci is raising the bar for LLM inference speed and accuracy. I’m proud to share that we at Deci just released DeciLM 6B, a 5.7-billion-parameter LLM with 15x the throughput of Llama 2 7B. The magic? AutoNAC + Infery-LLM.

Quick facts about DeciLM 6B:
✅ Generated with AutoNAC, Deci's cutting-edge Neural Architecture Search engine.
✅ It features a unique implementation of variable Grouped-Query Attention (GQA).
✅ Run it with Infery-LLM to get never-before-seen efficiency, cutting 90% of inference compute costs compared to running Llama 2 7B with vLLM.

Bonus! We’re also releasing an instruction-tuned model: DeciLM 6B-Instruct.

Explore and Like our models on Hugging Face:
DeciLM 6B > https://bit.ly/DeciLM-6b
DeciLM 6B-Instruct > https://lnkd.in/dckpHqsH
Learn more about Infery LLM > https://hubs.ly/Q02238CH0
Try the instruction-tuned model demo here: https://bit.ly/try-DeciLM

#DeciLM6B #generativeAI #artificialintelligence Deci AI
Mayur Bhirud

Masters in Economics || Big Data || Machine Learning || Statistics || Python || SQL || R || JAVA || AWS cloud
11mo
🧠 Delve into the core of Machine Learning with "A Gentle Introduction to Tensors" by Boaz Porat! 📚 Tensors, as multi-dimensional arrays, are the building blocks of data representation in ML. From input data like images and text to model parameters, tensors drive every aspect of learning and decision-making in algorithms. 🌐 Understanding their significance is essential to grasp the essence of modern AI. Explore the PDF to uncover the vital role tensors play in shaping the intelligence of machines. 🚀 #MachineLearning #Tensors #AIUnderstanding
Anandu kc

AI Developer
2mo Edited
💥 YOLOv10 Released!

The world of object detection just got a major upgrade with the release of YOLOv10 by researchers at Tsinghua University! This exciting new model boasts significant advancements in both speed and accuracy, addressing the shortcomings of previous YOLO versions with innovative design strategies.

Highlights of YOLOv10:
👉 Enhanced model capabilities: Incorporates large-kernel convolutions and partial self-attention modules to improve performance without significant computational cost.
👉 Real-time object detection: Excels at maintaining real-time processing speeds.
👉 NMS-Free Training: Eliminates the need for Non-Maximum Suppression (NMS), reducing inference latency.

🎯 Real-World Comparison: YOLOv8n vs. YOLOv10n
In this video, I compare YOLOv8n and YOLOv10n through real-world inference. While YOLOv10n shows a significant leap in COCO benchmarks, the visual accuracy improvement over YOLOv8n might seem subtle. However, YOLOv10n shines with faster post-processing speeds and, as you'll see in the people detection portion of the video, potentially fewer false positives, although it might miss some predictions sometimes. Check out the comparison to see the differences in action!

🔗 Research paper: https://lnkd.in/gaFiaqQJ

#YOLOv10 #ObjectDetection #MachineLearning #DeepLearning #AI #ComputerVision #RealTimeAI #TechInnovation #AIResearch #YOLOv8n #ModelComparison Ultralytics Roboflow OpenCV
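For context on the NMS-free point, the sketch below shows a plain greedy Non-Maximum Suppression pass, the post-processing step that conventional detectors run after the network and that NMS-free designs skip. It is a generic textbook version in pure Python, not code from YOLOv10 or Ultralytics; the function names, the example boxes, and the 0.5 IoU threshold are illustrative.

```python
Box = tuple[float, float, float, float]  # (x0, y0, x1, y1)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix0, iy0 = max(a[0], b[0]), max(a[1], b[1])
    ix1, iy1 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix1 - ix0) * max(0.0, iy1 - iy0)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes: list[Box], scores: list[float], iou_threshold: float = 0.5) -> list[int]:
    """Greedy NMS: visit boxes by descending score, keep a box only if it
    does not overlap an already-kept box by more than the threshold."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: list[int] = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


# Two heavily overlapping boxes and one separate box.
boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
print(nms(boxes, scores))  # [0, 2]: the lower-scoring duplicate is suppressed
```

The pairwise overlap checks grow with the number of candidate boxes, which is one reason removing this step can cut post-processing time.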

8 Comments
John Paul Prabhu

Data Scientist at Green Rider Technology
3w
Excited to share my latest project on GitHub: Automatic Number Plate Recognition (ANPR)! This system leverages advanced computer vision techniques, including YOLOv10 for object detection and EasyOCR for optical character recognition, to detect and recognize vehicle number plates from images and videos in real-time. With the integration of the Kalman filter for tracking, this project ensures high accuracy and efficiency. Check out the project repository for more details. https://lnkd.in/gFECACk4 #ComputerVision #AI #DeepLearning #ANPR #YOLOv10 #EasyOCR YOLOvX Ultralytics

3 Comments