Commit 7584638

Copilot and superbarne committed
feat: add docker-compose.yml with server-cuda service for easy deployment
Co-authored-by: superbarne <1502820+superbarne@users.noreply.github.com>
1 parent 837633c commit 7584638

1 file changed

Lines changed: 15 additions & 0 deletions

docker-compose.yml

@@ -0,0 +1,15 @@
+services:
+  server-cuda:
+    image: ghcr.io/${GITHUB_USER:-codeanker}/llama.cpp:server-cuda
+    ports:
+      - "${HOST_PORT:-8080}:8080"
+    volumes:
+      - ${MODEL_DIR:-./models}:/models
+    deploy:
+      resources:
+        reservations:
+          devices:
+            - driver: nvidia
+              count: all
+              capabilities: [gpu]
+    command: -m /models/${MODEL_NAME:-model.gguf} --host 0.0.0.0 --port 8080
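
The file reads four variables, each with a default: GITHUB_USER (codeanker), HOST_PORT (8080), MODEL_DIR (./models) and MODEL_NAME (model.gguf). A minimal sketch of overriding three of them, assuming the lines below are saved as `.env` next to docker-compose.yml, where Compose looks for interpolation values. All values shown are hypothetical.

```shell
# .env (hypothetical values; anything left unset falls back to the
# default written in docker-compose.yml)

# Publish the server on host port 9090 instead of 8080.
HOST_PORT=9090

# Host directory that is mounted at /models inside the container.
MODEL_DIR=/srv/llm/models

# File inside MODEL_DIR that is passed to -m.
MODEL_NAME=my-model.gguf
```

With these in place, `docker compose config` should print the resolved file for checking, and `docker compose up -d server-cuda` should start the service. The GPU reservation presumably also requires the NVIDIA Container Toolkit on the host.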
