I found it!
The VM's CPU type needed to be set to "Host" instead of the default "x86-64-v2-AES", which does not expose AVX or AVX2 to the guest.
Thanks to "journalctl -xeu ollama" for spitting out these error messages:
Code:
ollama[778]: time=2024-03-29T09:51:08.606+01:00 level=INFO source=gpu.go:120 msg="Nvidia GPU detected via cudart"
ollama[778]: time=2024-03-29T09:51:08.606+01:00 level=INFO source=cpu_common.go:18 msg="CPU does not have vector extensions"
ollama[778]: time=2024-03-29T09:51:08.607+01:00 level=WARN source=gpu.go:151 msg="CPU does not have AVX or AVX2, disabling GPU support."
ollama[778]: time=2024-03-29T09:51:08.607+01:00 level=INFO source=routes.go:1141 msg="no GPU detected"
...and a Proxmox forum thread on "AVX2 and AVX flags" with hints towards the VM CPU type.
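For anyone who wants to check this from inside the guest before digging through logs, here is a rough sketch that reads /proc/cpuinfo and reports whether the avx and avx2 flags are visible to the VM. The function names are my own (hypothetical), and it assumes a Linux guest where /proc/cpuinfo has a "flags" line; it is not what ollama itself does internally.

```python
from pathlib import Path


def cpu_flags(cpuinfo_text):
    """Return the CPU feature flags found in /proc/cpuinfo-style text."""
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "flags":
            # The first "flags" line is enough; all cores normally match.
            return set(value.split())
    return set()


def missing_vector_extensions(cpuinfo_text, required=("avx", "avx2")):
    """List the required flags that the (virtual) CPU does not advertise."""
    flags = cpu_flags(cpuinfo_text)
    return [name for name in required if name not in flags]


def main():
    path = Path("/proc/cpuinfo")  # assumption: Linux guest
    if not path.exists():
        print("No /proc/cpuinfo on this system")
        return
    missing = missing_vector_extensions(path.read_text())
    if missing:
        print("Missing CPU flags:", ", ".join(missing))
    else:
        print("AVX and AVX2 are exposed to this VM")


if __name__ == "__main__":
    main()
```

If it reports missing flags, the CPU type can be changed in the Proxmox GUI under the VM's Hardware > Processors, or (as far as I know) on the host with "qm set <vmid> --cpu host", followed by a full stop and start of the VM.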
Now it's running perfectly, and journalctl confirms this:
Code:
ollama[779]: time=2024-03-29T10:21:40.126+01:00 level=INFO source=gpu.go:115 msg="Detecting GPU type"
ollama[779]: time=2024-03-29T10:21:40.126+01:00 level=INFO source=gpu.go:265 msg="Searching for GPU management library libcudart.so*"
ollama[779]: time=2024-03-29T10:21:40.129+01:00 level=INFO source=gpu.go:311 msg="Discovered GPU libraries: [/tmp/ollama4267936894/runners/cuda_v11/libcudart.so.11.0 /usr/local/cuda/lib64/libcudart.so.12.4.99]"
ollama[779]: time=2024-03-29T10:21:40.150+01:00 level=INFO source=gpu.go:120 msg="Nvidia GPU detected via cudart"
ollama[779]: time=2024-03-29T10:21:40.150+01:00 level=INFO source=cpu_common.go:11 msg="CPU has AVX2"
ollama[779]: time=2024-03-29T10:21:40.215+01:00 level=INFO source=gpu.go:188 msg="[cudart] CUDART CUDA Compute Capability detected: 8.6"
nvtop etc. all show proper GPU usage now.
Case closed.
Thanks!