Skip to content

Tags: withcatai/node-llama-cpp

Tags

v3.14.2

Toggle v3.14.2's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
fix: `semantic-release` retry (#518) 

v3.14.1

Toggle v3.14.1's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
fix(Vulkan): include integrated GPU memory (#516) * fix(Vulkan): include integrated GPU memory - adapt to a change in `llama.cpp` * fix(Vulkan): deduplicate the same device coming from different drivers * fix: adapt Llama chat wrappers to breaking `llama.cpp` changes * fix: internal log level * docs(Vulkan): recommend installing LLVM on Windows

v3.14.0

Toggle v3.14.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
test: fix tests (#509) 

v3.13.0

Toggle v3.13.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
feat: Seed OSS support (#502) 

v3.12.4

Toggle v3.12.4's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
test: fix tests (#499) 

v3.12.3

Toggle v3.12.3's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
fix: split prebuilt CUDA binaries into 2 npm modules (#495) 

v3.12.2

Toggle v3.12.2's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
fix: CUDA 13 support (#494) fix: prebuilt binaries CUDA 13 support

v3.12.1

Toggle v3.12.1's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
fix: completion config (#490) * fix: more flexible model message prompt completion config * feat(Electron template): improve scroll

v3.12.0

Toggle v3.12.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
feat: `gpt-oss` support (#487) * feat: `gpt-oss` support * fix: Qwen3 memory estimation

v3.11.0

Toggle v3.11.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
build: fix CI config (#483) * build: update CUDA version in the CI * fix: add missing GGUF types