Skip to content
View xiaoachen98's full-sized avatar
🎯
Focusing
🎯
Focusing
Block or Report

Block or report xiaoachen98

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
xiaoachen98/README.md

Hi there 👋

  • 🌱 I'm Lin Chen, a Ph.D. student in BIVLab, USTC.
  • 🔭 I’m working as a research intern at Shanghai AI Laboratory.
  • 💬 I'm currently looking for collaborations, feel free to contact me.

Research Projects

  • 🔥 Large-scale high-quality video-text data and superior large video-language model: ShareGPT4Video.
  • 🔥 An elite vision-indispensable multi-modal benchmark: MMStar.
  • 🔥 Large-scale high-quality image-text data and superior large multi-modal model: ShareGPT4V.
  • More Stable "Drag" Editing: FreeDrag
  • Robust & Transferable Semantic Segmentation: DDB, DTP, Rein
  • Discriminator-free Adversarial Domain Adaption: DALN

Pinned Loading

  1. InternLM/InternLM-XComposer InternLM/InternLM-XComposer Public

    InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

    Python 2.3k 138

  2. open-compass/VLMEvalKit open-compass/VLMEvalKit Public

    Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 30+ benchmarks

    Python 720 84

  3. ShareGPT4Omni/ShareGPT4Video ShareGPT4Omni/ShareGPT4Video Public

    An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

    Python 1.2k 36

  4. MMStar-Benchmark/MMStar MMStar-Benchmark/MMStar Public

    This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"

    Python 127 1

  5. Open-LLaVA-NeXT Open-LLaVA-NeXT Public

    An open-source implementation of LLaVA-NeXT.

    Python 141 4

  6. LPengYang/FreeDrag LPengYang/FreeDrag Public

    Official Implementation of FreeDrag (CVPR 2024)

    Python 402 20