EVE: Encoder-Free Vision-Language Models from BAAI
Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception
A live list of papers on game-playing agents and large multimodal models, accompanying the survey "A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges".
[ICLR 2024 Spotlight 🔥 ] - [ Best Paper Award SoCal NLP 2023 🏆] - Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models
[ICML 2024] Official code repo for the ICML 2024 paper "Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning with Unlabeled Data"
Papers, code, datasets, applications, and tutorials.
Facial Expression Recognition using vision language models (VLMs)
Official implementation of "Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models", accepted to ECCV 2024