Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 637 41 Updated Jul 10, 2024

Code for creating a synthetic dialogue dataset using the Magpie method and Nemotron-4-340B-Instruct

Python 6 Updated Jul 5, 2024

A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.

Shell 588 92 Updated Jun 21, 2024

An Open-source Toolkit for LLM Development

Python 2,636 168 Updated May 24, 2024

Distribute and run LLMs with a single file.

C++ 17,724 892 Updated Jul 24, 2024

Nostr client for web.

Svelte 104 15 Updated Jul 23, 2024

A bot that summarizes AI papers and posts them on twitter

Python 27 3 Updated Jun 18, 2024
Python 11 1 Updated May 22, 2024

Ongoing research training Mixture of Expert models.

Python 16 3 Updated Jul 16, 2024

⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat experiences inside VS Code and JetBrains

TypeScript 13,722 971 Updated Jul 24, 2024

Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.

Python 3,672 347 Updated Jul 24, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 9,879 1,140 Updated Jul 10, 2024

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 903 78 Updated May 8, 2024
Shell 50 35 Updated Jun 17, 2024

0️⃣1️⃣🤗 BitNet-Transformers: Hugging Face Transformers implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch with the Llama(2) architecture

Python 95 2 Updated Mar 1, 2024

Open weights LLM from Google DeepMind.

Jupyter Notebook 2,270 278 Updated Jul 19, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,792 803 Updated Jul 1, 2024

A bagel, with everything.

Python 300 31 Updated Apr 11, 2024

Robust recipes to align language models with human and AI preferences

Python 4,266 364 Updated Jul 17, 2024
Python 4 3 Updated Feb 6, 2024
Python 41 14 Updated Jun 13, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 1,901 147 Updated May 23, 2024

Train transformer language models with reinforcement learning.

Python 8,868 1,088 Updated Jul 24, 2024

A Comprehensive Assessment of Trustworthiness in GPT Models

Python 232 52 Updated Jun 19, 2024

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Python 1,272 71 Updated Apr 11, 2024

GitHub repository for the book "Introduction to Large Language Models" (大規模言語モデル入門; 技術評論社, 2023)

Jupyter Notebook 273 42 Updated Jul 5, 2024

Inference Llama 2 in one file of pure C

C 16,914 1,987 Updated Jul 13, 2024

Fine-tuning LLMs using QLoRA

Jupyter Notebook 224 50 Updated Jun 8, 2024
Python 250 9 Updated Jul 15, 2023