Commits

Commits on Jun 5, 2024

Add stable-diffusion.cpp
jart
committedJun 5, 2024

Commits on Jun 3, 2024

Add Mozilla logo to README
stlhood
committedJun 3, 2024
add Mozilla logo
stlhood
committedJun 3, 2024

Commits on Jun 1, 2024

Update sever README build/testing instructions (#461 )
veekaybee
committedJun 1, 2024
Upgrade to Cosmopolitan v3.3.10 (#460 )
jeromew
committedJun 1, 2024

Commits on May 30, 2024

Performance improvements on Arm for legacy and k-quants (#453 )
ikawrakow
committedMay 30, 2024

Commits on May 29, 2024

github: delete question in favor of link to discussion [no ci] (#457 )
mofosyne
committedMay 29, 2024
github: add ci (#454 )
mofosyne
committedMay 29, 2024

Commits on May 26, 2024

github: add mention of strace and ftrace (#449 )
mofosyne
committedMay 26, 2024
actions: add labeler + editorconfig github actions (#443 )
mofosyne
committedMay 26, 2024
github: delete assignees and about --> description (#448 )
mofosyne
committedMay 26, 2024
github: add issue templates (#442 )
mofosyne
committedMay 26, 2024

Commits on May 25, 2024

Release llamafile v0.8.6
jart
committedMay 25, 2024
Upgrade to Cosmopolitan v3.3.8
jart
committedMay 25, 2024
Don't print special tokens for now
jart
committedMay 25, 2024
Disable GPU in llava-quantize
jart
committedMay 25, 2024
Release llamafile v0.8.5
jart
committedMay 25, 2024
Recompute llamafile-quantize documentation
jart
committedMay 25, 2024
Upgrade to Cosmopolitan v3.3.7
jart
committedMay 25, 2024
Add missing CPUID check
jart
committedMay 25, 2024

Commits on May 24, 2024

Make some more benchmark tool fixes
jart
committedMay 24, 2024
Reclaim mapped memory
jart
committedMay 24, 2024
Make benchmark tool work more reliably
jart
committedMay 24, 2024
Avoid crashing on llava ctrl-c
jart
committedMay 24, 2024
Fix o/depend file build
jart
committedMay 24, 2024

Commits on May 23, 2024

Add missing CPUID checks
jart
committedMay 23, 2024
Restore quantize_row_q8_K()
jart
committedMay 23, 2024
Add llama-bench command (cpu mode only)
jart
committedMay 23, 2024
Introduce bf16 cuda support
jart
committedMay 23, 2024
Import upstream ggml-cuda fixes
jart
committedMay 23, 2024
Another performance optimization for Zen4 + refactoring (#435 )
ikawrakow
committedMay 23, 2024

Commits on May 22, 2024

Sync with llama.cpp upstream
jart
committedMay 22, 2024

Commits on May 21, 2024

Fix typo in llama.h (#354 )
eltociear
committedMay 21, 2024
Fix f16 cpuid check
jart
committedMay 21, 2024
Faster AVX2 matrix multiplications for MoE models (#428 )
ikawrakow
committedMay 21, 2024