Block or Report
Block or report Zeqiang-Lai
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (20)
Sort Name ascending (A-Z)
Stars
Language
Sort by: Recently starred
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
official code of “OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding”
This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation
[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Mu…
[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"
LayerDiffuse in pure diffusers without any GUI
PyTorch code and models for the DINOv2 self-supervised learning method.
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
CraftsMan: High-fidelity Mesh Generation with 3D Native Diffusion and Interactive Geometry Refiner
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
Benchmark for generative image models
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
Lumina-T2X is a unified framework for Text to Any Modality Generation
[ CVPR 2024 ] Implementation for "GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation"
Official implementation for "DMesh: A Differentiable Representation for General Meshes".
A free portable photo editor focused on pro-grade features, high performance, and maximum usability.