xieyuquanxx
  • Harbin Institute of Technology Shenzhen
  • Shenzhen

Starred repositories

VMamba: Visual State Space Models; code is based on Mamba

Python 1,877 99 Updated Jul 16, 2024

A hassle-free Python experience

Rust 12,763 444 Updated Jul 20, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 23,241 3,301 Updated Jul 20, 2024

Robust recipes to align language models with human and AI preferences

Python 4,241 361 Updated Jul 17, 2024

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 8,611 1,353 Updated Jul 8, 2024

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Rust 28,139 1,735 Updated Jul 20, 2024

Blazingly fast LLM inference.

Rust 3,009 219 Updated Jul 20, 2024

☄🌌️ The minimal, blazing-fast, and infinitely customizable prompt for any shell!

Rust 43,226 1,859 Updated Jul 18, 2024

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 165 2 Updated Jul 15, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

891 17 Updated Jul 10, 2024

Jupyter Notebook 975 185 Updated Jul 19, 2024

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 32,162 5,482 Updated Jul 20, 2024

Some basic examples of playing with RL

Python 1,209 301 Updated Oct 11, 2023

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

C++ 6,514 2,113 Updated Jul 20, 2024

Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.

Python 166 11 Updated Jul 17, 2024

Development repository for the Triton language and compiler

C++ 12,029 1,431 Updated Jul 20, 2024

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Python 468 23 Updated Jul 6, 2024

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 9,561 1,334 Updated Jun 21, 2024

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 3,432 255 Updated Jul 9, 2024

4M: Massively Multimodal Masked Modeling

Python 1,392 78 Updated Jul 17, 2024

LLM101n: Let's build a Storyteller

23,797 1,236 Updated Jul 20, 2024

A curated list of visual reinforcement learning resources

51 2 Updated Jul 16, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 19,489 2,416 Updated Apr 28, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,556 91 Updated Jul 10, 2024

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

Python 18,940 2,879 Updated Jul 20, 2024

Python 54 Updated Dec 13, 2023

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 1,791 170 Updated Jul 20, 2024

Train transformer language models with reinforcement learning.

Python 8,819 1,086 Updated Jul 19, 2024

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 678 77 Updated Jul 15, 2024