EmbeddedLLM: API server for Embedded Device Deployment. Currently support IpexLLM/DirectML./CPU
windows
cpu
llama
gemma
mistral
directx-12
npu
aipc
directml
llm
model-inference
llm-serving
llm-inference
open-source-llm
phi-3
ipexllm
-
Updated
Jul 19, 2024 - Python