Skip to content
View hijkzzz's full-sized avatar
Block or Report

Block or report hijkzzz

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
hijkzzz/README.md

🔭 I'm a Coding Lover.

Jian Hu's GitHub stats

Pinned Loading

  1. OpenRLHF/OpenRLHF OpenRLHF/OpenRLHF Public

    An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

    Python 1.8k 170

  2. pymarl2 pymarl2 Public

    Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

    Python 583 113

  3. alpha-zero-gomoku alpha-zero-gomoku Public

    A Multi-threaded Implementation of AlphaZero

    Python 358 48

  4. cuda-neural-network cuda-neural-network Public

    Convolutional Neural Network with CUDA (MNIST 99.23%)

    C++ 166 38

  5. deep-reinforcement-learning-notes deep-reinforcement-learning-notes Public

    Deep Reinforcement Learning Notes

    117 6

  6. noisy-mappo noisy-mappo Public

    Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)

    Python 44 6