Skip to content
View jianzhnie's full-sized avatar
🎯
Focusing
🎯
Focusing
Block or Report

Block or report jianzhnie

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
jianzhnie/README.md

Hi there, I'm Robin 👋

jianzhnie's GitHub Streak


Hi there 👋

Hey, I am jianzhnie, Thanks for stopping by!

I work as a full-time Machine Learning engineer and write tutorials on basic and advanced topics (NLP, Computer vision and code - lots of it).

I read and think a lot. And sometimes I put them in a form of a painting or a piece of music. And when I need to catch a breath I go for a run.

I’m currently working on 🔭

  • Developing the open source ChatGPT, Alpaca, Vicuna and RLHF Pipeline. open-chatgpt
  • Developing nlp-toolkit nlp-toolkit
  • Developing MultiModalTransformers MultiModalTransformers
  • Developing AutoML tools for DeepLearning Project and MacheLearning Project AutoTimm | AutoTabular
  • Trying hard to reduce the Learning Machine Learning(LML) loss 😂
  • Coding everyday for better research engineering skill

I’m currently learning 🌱

  • Theoretical Machine Learning from the basic
  • Pytorch and Pytorch-lightning
  • Transformer models (BERT,GPT, T5, VIT, SwinTransformer)
  • Reinformnet Learning (DQN, A2C, PPO, SAC, TD3 ...)

How to reach me 📫

Have an awesome day!

Pinned Loading

  1. LLamaTuner LLamaTuner Public

    Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.

    Python 547 60

  2. open-chatgpt open-chatgpt Public

    The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. 从0开始实现一个ChatGPT.

    Python 170 31

  3. microsoft/nni microsoft/nni Public

    An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

    Python 13.9k 1.8k

  4. autogluon/autogluon autogluon/autogluon Public

    Fast and Accurate ML in 3 Lines of Code

    Python 7.5k 885

  5. tatsu-lab/stanford_alpaca tatsu-lab/stanford_alpaca Public

    Code and documentation to train Stanford's Alpaca models, and generate the data.

    Python 29.2k 4k

  6. deep-marl-toolkit deep-marl-toolkit Public

    MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT...

    Python 92 11