All repositories

• (no description). Forks 0, stars 1, issues 6, pull requests 0. Updated Jul 19, 2024.
• Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM). License: GNU Affero General Public License v3.0. Forks 1.2k, stars 21k, issues 172, pull requests 8. Updated Jul 19, 2024.
• Drop-in, local AI alternative to the OpenAI stack. Multi-engine (llama.cpp, TensorRT-LLM, ONNX). Powers 👋 Jan. Forks 98, stars 1.8k, issues 49, pull requests 7. Updated Jul 19, 2024.
• Homebrew Website. Forks 0, stars 0, issues 0, pull requests 0. Updated Jul 19, 2024.
• (no description). Forks 1, stars 2, issues 10, pull requests 2. Updated Jul 19, 2024.
• Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA's TensorRT-LLM for GPU-accelerated inference on NVIDIA GPUs. Forks 822, stars 32, issues 13, pull requests 5. Updated Jul 19, 2024.
• cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server at runtime. License: GNU Affero General Public License v3.0. Forks 3, stars 4, issues 8, pull requests 4. Updated Jul 18, 2024.
• (no description). Forks 0, stars 0, issues 0, pull requests 0. Updated Jul 18, 2024.
• Training sound instruct llama3. Forks 0, stars 0, issues 3, pull requests 2. Updated Jul 18, 2024.
• Jan.ai Website & Documentation. Forks 6, stars 16, issues 52, pull requests 4. Updated Jul 15, 2024.
• The official Node.js / Typescript library for the OpenAI API. Forks 771, stars 1, issues 0, pull requests 6. Updated Jul 12, 2024.
• (no description). Forks 0, stars 0, issues 0, pull requests 0. Updated Jul 9, 2024.
• (no description). Forks 0, stars 0, issues 0, pull requests 0. Updated Jul 5, 2024.
• (no description). Forks 0, stars 1, issues 2, pull requests 1. Updated Jul 3, 2024.
• (no description). License: GNU Affero General Public License v3.0. Forks 0, stars 0, issues 0, pull requests 0. Updated Jun 27, 2024.
• An awesome repository of local AI tools. Forks 87, stars 1.1k, issues 6, pull requests 4. Updated Jun 21, 2024.
• C++ code that runs Python embedding. License: GNU Affero General Public License v3.0. Forks 0, stars 4, issues 1, pull requests 0. Updated May 23, 2024.
• The official Python library for the OpenAI API. Forks 2.9k, stars 1, issues 0, pull requests 1. Updated May 20, 2024.
• Make the py. Forks 0, stars 0, issues 0, pull requests 0. Updated Apr 9, 2024.
• The Triton TensorRT-LLM Backend. Forks 86, stars 0, issues 0, pull requests 0. Updated Mar 19, 2024.
• (no description). Forks 0, stars 1, issues 0, pull requests 0. Updated Mar 15, 2024.
• OpenAI compatible API for TensorRT LLM triton backend. Forks 22, stars 0, issues 0, pull requests 0. Updated Mar 15, 2024.
• This reference can be used with any existing OpenAI-integrated app to run TRT-LLM inference locally on a GeForce GPU on Windows instead of in the cloud. Forks 11, stars 0, issues 0, pull requests 0. Updated Mar 8, 2024.
• R&D experiments. License: GNU Affero General Public License v3.0. Forks 1, stars 1, issues 0, pull requests 0. Updated Mar 1, 2024.
• (no description). Forks 1, stars 1, issues 0, pull requests 0. Updated Feb 28, 2024.
• This repository contains the Helm chart for our team. Forks 0, stars 0, issues 0, pull requests 0. Updated Feb 20, 2024.
• Port of Facebook's LLaMA model in C/C++. Forks 8.9k, stars 0, issues 0, pull requests 0. Updated Feb 19, 2024.
• The AI-native database built for LLM applications, providing incredibly fast vector and full-text search. Forks 176, stars 0, issues 0, pull requests 0. Updated Feb 19, 2024.
• (no description). Forks 2k, stars 0, issues 0, pull requests 0. Updated Feb 19, 2024.
• (no description). Forks 1, stars 0, issues 0, pull requests 0. Updated Jan 17, 2024.