    Repositories list

    39 repositories

    • research

      Public
      Python
      0160 Updated Jul 19, 2024
    • jan

      Public
      Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)
      TypeScript
      GNU Affero General Public License v3.0
      1.2k21k1728 Updated Jul 19, 2024
    • cortex

      Public
      Drop-in, local AI alternative to the OpenAI stack. Multi-engine (llama.cpp, TensorRT-LLM, ONNX). Powers 👋 Jan
      C++
      Apache License 2.0
      981.8k497 Updated Jul 19, 2024
    • Homebrew Website
      MDX
      Apache License 2.0
      0000 Updated Jul 19, 2024
    • cortex.so

      Public
      TypeScript
      12102 Updated Jul 19, 2024
    • Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It includes NVIDIA's TensorRT-LLM as a submodule for GPU-accelerated inference on NVIDIA GPUs.
      C++
      Apache License 2.0
      82232135 Updated Jul 19, 2024
    • cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server at runtime.
      C++
      GNU Affero General Public License v3.0
      3484 Updated Jul 18, 2024
    • Python
      0000 Updated Jul 18, 2024
    • llama3-s

      Public
      Training sound instruct llama3
      0032 Updated Jul 18, 2024
    • docs

      Public
      Jan.ai Website & Documentation
      MDX
      616524 Updated Jul 15, 2024
    • The official Node.js / TypeScript library for the OpenAI API
      TypeScript
      Apache License 2.0
      771106 Updated Jul 12, 2024
    • C++
      0000 Updated Jul 9, 2024
    • Ruby
      0000 Updated Jul 5, 2024
    • C++
      0121 Updated Jul 3, 2024
    • ppa

      Public
      GNU Affero General Public License v3.0
      0000 Updated Jun 27, 2024
    • An awesome repository of local AI tools
      871.1k64 Updated Jun 21, 2024
    • C++ code that runs Python embedding
      C++
      GNU Affero General Public License v3.0
      0410 Updated May 23, 2024
    • The official Python library for the OpenAI API
      Python
      Apache License 2.0
      2.9k101 Updated May 20, 2024
    • pymaker

      Public
      Make the py
      0000 Updated Apr 9, 2024
    • The Triton TensorRT-LLM Backend
      Python
      Apache License 2.0
      86000 Updated Mar 19, 2024
    • Shell
      0100 Updated Mar 15, 2024
    • OpenAI compatible API for TensorRT LLM triton backend
      Rust
      MIT License
      22000 Updated Mar 15, 2024
    • This reference can be used with any existing OpenAI-integrated app to run TRT-LLM inference locally on a GeForce GPU on Windows instead of in the cloud.
      Python
      Other
      11000 Updated Mar 8, 2024
    • R&D experiments
      Jupyter Notebook
      GNU Affero General Public License v3.0
      1100 Updated Mar 1, 2024
    • 1100 Updated Feb 28, 2024
    • charts

      Public
      This repository contains the Helm chart for our team
      Smarty
      0000 Updated Feb 20, 2024
    • Port of Facebook's LLaMA model in C/C++
      C++
      MIT License
      8.9k000 Updated Feb 19, 2024
    • infinity

      Public
      The AI-native database built for LLM applications, providing incredibly fast vector and full-text search
      C++
      Apache License 2.0
      176000 Updated Feb 19, 2024
    • TypeScript
      MIT License
      2k000 Updated Feb 19, 2024
    • JavaScript
      MIT License
      1000 Updated Jan 17, 2024