!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

211 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda42 Servers

Load older messages


SenderMessageTime
22 Jul 2024
@hexa:lossy.networkhexa (UTC+1)hah!00:07:47
@hexa:lossy.networkhexa (UTC+1)
diff --git a/nixos/modules/services/misc/ollama.nix b/nixos/modules/services/misc/ollama.nix
index 63ee6798a6dd..d7cabb9af497 100644
--- a/nixos/modules/services/misc/ollama.nix
+++ b/nixos/modules/services/misc/ollama.nix
@@ -183,16 +183,12 @@ in
         DeviceAllow = [
           # CUDA
           # https://docs.nvidia.com/dgx/pdf/dgx-os-5-user-guide.pdf
-          "/dev/nvidia?"
-          "/dev/nvidia-caps/nvidia-cap?"
-          "/dev/nvidiactl"
-          "/dev/nvidia-modeset"
-          "/dev/nvidia-uvm"
-          "/dev/nvidia-uvm-tools"
+          "char-nvidiactl"
+          "char-nvidia-caps"
+          "char-nvidia-uvm"
           # ROCm
-          "/dev/dri/card*"
-          "/dev/dri/renderD*"
-          "/dev/kfd"
+          "char-drm"
+          "char-kfd"
         ];
         DevicePolicy = "closed";
         LockPersonality = true;
00:08:01
@hexa:lossy.networkhexa (UTC+1)device node type matching works better for me00:08:27
@hexa:lossy.networkhexa (UTC+1)also more concise, less pattern matching00:08:34
@hexa:lossy.networkhexa (UTC+1)updated the PR, please retest, if it works for both of us now00:09:33
@adam:robins.wtfadamcstephens yeah i have the rocmOverrideGfx = "10.3.0"; since the 6700 xt is outside the supported cards 00:10:09
@hexa:lossy.networkhexa (UTC+1)yeah, few if any consumer cards have official rocm support00:10:32
@hexa:lossy.networkhexa (UTC+1)and the rx 5000 series is just broken with rocm00:10:56
@adam:robins.wtfadamcstephensupdated DeviceAllow still works00:12:31
@hexa:lossy.networkhexa (UTC+1)Thanks!00:15:30
@adam:robins.wtfadamcstephensyou motivated me to fix my ollama setup :)00:23:24
@adam:robins.wtfadamcstephensapparently you need to listen on all interfaces if you want to put it behind a proxy?00:24:24
@hexa:lossy.networkhexa (UTC+1)I just hope they get tool support going and then I can have local voice control with freeform conversations and stuff 😄 00:24:39
@hexa:lossy.networkhexa (UTC+1)no, that sounds wrong00:24:52
@hexa:lossy.networkhexa (UTC+1)reverse proxy implies the opposite, that the services shouldn't need to expose it self beyond localhost and to the reverse proxy00:25:27
@hexa:lossy.networkhexa (UTC+1)that's why best case is unix sockets 🙂 00:25:37
@adam:robins.wtfadamcstephensactually i didn't try this on the unstable version, but on 24.05 version listening on localhost would return a 403 but on 0.0.0.0 would succeed00:26:52
@hexa:lossy.networkhexa (UTC+1)odd, did you enable recommendedProxySettings in nginx, or something similar in whatever you're using?00:28:29
@adam:robins.wtfadamcstephensyeah, confirmed on usntable version as well00:28:30
@hexa:lossy.networkhexa (UTC+1)X-Forwarded-For is what I'm hinting at00:28:38
@hexa:lossy.networkhexa (UTC+1)Also, you would probably expose a frontend like open-webui instead, no?00:41:46
@adam:robins.wtfadamcstephensthat was what i'm working on next00:43:17
@adam:robins.wtfadamcstephensopen-webui works pretty well. only had to disable pkce to connect it to kanidm :/01:47:59
@hexa:lossy.networkhexa (UTC+1)it is always pkce02:08:28
@adam:robins.wtfadamcstephens Yep 02:13:45
23 Jul 2024
@sbc64:matrix.orgsbc64 joined the room.03:49:16
@kaya:catnip.eekaya changed their profile picture.05:30:28
@ezzobirbezziou:matrix.orgEzzobir Bezziou joined the room.08:22:21
@sellout:matrix.orgsellout joined the room.13:32:24
@ss:someonex.netSomeoneSerge (utc+3)
In reply to @ss:someonex.net
Sounds like "why aren't we doing this yet?"
Speaking of which, we need to stop copy-pasting this stuff because why aren't these default: https://github.com/NixOS/nixpkgs/pull/328713/files#diff-2d862bf9684df6fdface7aabf2af2b1767eb17ba2f9ab8b2e7db03d22c3c0660R196-R216
13:49:06

Show newer messages


Back to Room ListRoom Version: 9