Skip to content
View roma-goodok's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.
  • Align Technology
  • Moscow

Highlights

  • Pro
Block or Report

Block or report roma-goodok

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,570 95 Updated Jul 6, 2024

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 2,564 186 Updated Jul 19, 2024

MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis

Python 35 7 Updated Mar 5, 2023

Implementation for MatMul-free LM.

Python 2,701 164 Updated Jun 27, 2024
Jupyter Notebook 671 120 Updated Feb 5, 2024

Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Python 262 30 Updated Jun 18, 2024

Official code for SwinFUSE to be presented in Self-supervised Modality-agnostic Pre-training Of Swin Transformers at ISBI'24

Jupyter Notebook 4 Updated Feb 20, 2024

Implementations of recent research prototypes/demonstrations using MONAI.

Python 973 322 Updated Jul 2, 2024

A visual interface for understanding and interpreting Transformers

Svelte 73 6 Updated Oct 21, 2023

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) avai…

Jupyter Notebook 1,350 122 Updated Jul 20, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 11,312 854 Updated May 23, 2024

[T-PAMI] A curated list of self-supervised multimodal learning resources.

191 7 Updated Aug 4, 2023

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型

Python 4,253 324 Updated Jul 19, 2024
Python 1,295 70 Updated Jul 19, 2024

Official Repository for "DrEureka: Language Model Guided Sim-To-Real Transfer" (RSS 2024)

Python 751 57 Updated Jul 9, 2024

LLM inference in C/C++

C++ 61,913 8,886 Updated Jul 20, 2024

Inference code for CodeLlama models

Python 15,480 1,792 Updated May 21, 2024

OpenUI let's you describe UI using your imagination, then see it rendered live.

TypeScript 17,310 1,554 Updated Jul 20, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 13,846 1,239 Updated Jul 17, 2024

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Python 468 23 Updated Jul 6, 2024

PyTorch implementation of Semi-supervised Vision Transformers

Python 48 8 Updated Dec 23, 2022

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 12,846 837 Updated Jul 19, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,106 1,447 Updated Jul 19, 2024

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 2,562 246 Updated Jul 5, 2024

VMamba: Visual State Space Models,code is based on mamba

Python 1,877 99 Updated Jul 16, 2024

[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)

Python 634 32 Updated Apr 5, 2024

Official Implementation of SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance

Python 82 7 Updated Jul 15, 2024

This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".

Python 896 83 Updated Sep 29, 2022

[CVPR2024] OneFormer3D: One Transformer for Unified Point Cloud Segmentation

Python 266 26 Updated Apr 10, 2024
Next