Mozilla-Ocho/llamafile
Commit History
Commits on Jun 5, 2024
Add stable-diffusion.cpp
jart
committed
Jun 5, 2024
3b7b1e3
Commits on Jun 3, 2024
Add Mozilla logo to README
stlhood
committed
Jun 3, 2024
8e23c73
add Mozilla logo
stlhood
committed
Jun 3, 2024
5447f2d
Commits on Jun 1, 2024
Update sever README build/testing instructions (#461)
veekaybee
committed
Jun 1, 2024
9cd8d70
Upgrade to Cosmopolitan v3.3.10 (#460)
jeromew
committed
Jun 1, 2024
7d8dd1b
Commits on May 30, 2024
Performance improvements on Arm for legacy and k-quants (#453)
ikawrakow
committed
May 30, 2024
293a528
Commits on May 29, 2024
github: delete question in favor of link to discussion [no ci] (#457)
mofosyne
committed
May 29, 2024
73088c3
github: add ci (#454)
mofosyne
committed
May 29, 2024
31419d0
Commits on May 26, 2024
github: add mention of strace and ftrace (#449)
mofosyne
committed
May 26, 2024
397175e
actions: add labeler + editorconfig github actions (#443)
mofosyne
committed
May 26, 2024
92be52a
github: delete assignees and about --> description (#448)
mofosyne
committed
May 26, 2024
ba71930
github: add issue templates (#442)
mofosyne
committed
May 26, 2024
076dfb0
Commits on May 25, 2024
Release llamafile v0.8.6
jart
committed
May 25, 2024
81cfbcf
Upgrade to Cosmopolitan v3.3.8
jart
committed
May 25, 2024
866a129
Don't print special tokens for now
jart
committed
May 25, 2024
69c2dd3
Disable GPU in llava-quantize
jart
committed
May 25, 2024
ea2a96e
Release llamafile v0.8.5
jart
committed
May 25, 2024
b79ecf4
Recompute llamafile-quantize documentation
jart
committed
May 25, 2024
07e87bf
Upgrade to Cosmopolitan v3.3.7
jart
committed
May 25, 2024
261dfe7
Add missing CPUID check
jart
committed
May 25, 2024
e75caa1
Commits on May 24, 2024
Make some more benchmark tool fixes
jart
committed
May 24, 2024
e675719
Reclaim mapped memory
jart
committed
May 24, 2024
4451c6d
Make benchmark tool work more reliably
jart
committed
May 24, 2024
8b2f8d8
Avoid crashing on llava ctrl-c
jart
committed
May 24, 2024
5c40565
Fix o/depend file build
jart
committed
May 24, 2024
aa58b3a
Commits on May 23, 2024
Add missing CPUID checks
jart
committed
May 23, 2024
91dd4d3
Restore quantize_row_q8_K()
jart
committed
May 23, 2024
50dd001
Add llama-bench command (cpu mode only)
jart
committed
May 23, 2024
9206719
Introduce bf16 cuda support
jart
committed
May 23, 2024
c0aa43e
Import upstream ggml-cuda fixes
jart
committed
May 23, 2024
0b5997d
Another performance optimization for Zen4 + refactoring (#435)
ikawrakow
committed
May 23, 2024
7cb15c6
Commits on May 22, 2024
Sync with llama.cpp upstream
jart
committed
May 22, 2024
d228e01
Commits on May 21, 2024
Fix typo in llama.h (#354)
eltociear
committed
May 21, 2024
b3aa97d
Fix f16 cpuid check
jart
committed
May 21, 2024
87d4ce1
Faster AVX2 matrix multiplications for MoE models (#428)
ikawrakow
committed
May 21, 2024
938cf72