Updated installation instructions for BLAS backends #177

Merged
abetlen merged 1 commit into abetlen:main from joelkurian:main
May 10, 2023
Conversation

@joelkurian

Contributor

llama.cpp does not read BLAS-related environment variables when configuring the build with CMake.

Instead, the options need to be passed as command-line arguments, which configures the BLAS backend correctly. This can be done by setting the CMAKE_ARGS environment variable.
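
As a rough illustration of the CMAKE_ARGS approach, the sketch below builds the environment and command for a pip install that forwards one BLAS option to CMake. The helper name `build_install_command` is hypothetical, and the `-DLLAMA_CUBLAS=on` definition is an assumption based on the `LLAMA_CUBLAS` option named in this thread; substitute whichever option your llama.cpp version actually defines. The script only prints the command and does not run it.

```python
import os
import shlex
import sys


def build_install_command(backend_flag="LLAMA_CUBLAS", package="llama-cpp-python"):
    """Return (env, argv) for a pip install that passes a BLAS option to CMake.

    Hypothetical helper: backend_flag is assumed to be a CMake option that
    llama.cpp defines; it is forwarded as -D<flag>=on through CMAKE_ARGS.
    """
    env = dict(os.environ)
    # Append to any CMAKE_ARGS already set instead of overwriting them.
    existing = env.get("CMAKE_ARGS", "")
    env["CMAKE_ARGS"] = f"{existing} -D{backend_flag}=on".strip()
    argv = [sys.executable, "-m", "pip", "install", package]
    return env, argv


if __name__ == "__main__":
    env, argv = build_install_command()
    # Show the equivalent shell command without executing it.
    print(f"CMAKE_ARGS={shlex.quote(env['CMAKE_ARGS'])} {shlex.join(argv)}")
    # To actually run the install:
    #   import subprocess; subprocess.run(argv, env=env, check=True)
```

Passing the option through CMAKE_ARGS keeps it on the CMake command line, which is where the configure step looks for it.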

@abetlen

abetlen commented May 9, 2023

Owner

@joelkurian what's your OS? Happy to add these instructions, but the environment variables do work for me, and I'm on Linux.

@joelkurian

Contributor Author

I am on Arch Linux. The environment variables work with the make build, but behave somewhat oddly with CMake.

To make use of an environment variable, CMake needs to read it explicitly, as in $ENV{LLAMA_CUBLAS}, in llama.cpp's CMakeLists.txt, which is not the case right now.
https://github.com/ggerganov/llama.cpp/blob/master/CMakeLists.txt
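
One way to bridge this gap without touching CMakeLists.txt is to translate the environment variables into `-D` definitions before invoking CMake. The sketch below is only an illustration of that idea: `env_to_cmake_args` is a hypothetical helper, not part of llama.cpp or llama-cpp-python, and it assumes the variable names match the CMake option names exactly.

```python
import os


def env_to_cmake_args(names, environ=None):
    """Turn selected environment variables into CMake -D definitions.

    Hypothetical helper: assumes each variable name is also the name of a
    CMake option. Unset or empty variables are skipped.
    """
    environ = os.environ if environ is None else environ
    definitions = []
    for name in names:
        value = environ.get(name)
        if value:
            definitions.append(f"-D{name}={value}")
    return " ".join(definitions)


if __name__ == "__main__":
    # Prints the definitions for whichever of these variables are set.
    print(env_to_cmake_args(["LLAMA_CUBLAS"]))
```

The resulting string could then be exported as CMAKE_ARGS, so the option reaches CMake as a command-line definition rather than as an environment variable it never reads.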

@abetlen abetlen merged commit c3ed133 into abetlen:main May 10, 2023
ThongvanAlexis added a commit to ThongvanAlexis/llama-cpp-python that referenced this pull request Apr 16, 2026
Suppress nvcc template warnings (abetlen#177, abetlen#221, abetlen#550) that flood the log
with ~2500 lines of noise from llama.cpp CUDA kernels. Drop
CMAKE_BUILD_PARALLEL_LEVEL from 2 to 1 — parallel nvcc with 5 arch
targets causes sccache daemon OOM crash (os error 10054) on the
4-vCPU runner.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
