!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

319 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda
63 Servers



Sender | Message | Time
29 Apr 2024
@vid:matrix.org
I don't think I'm trying to do anything too exotic though, basically "cuda in docker with widely used hardware," so I don't know why it doesn't "just work." The only thing different is I want to use the Intel IGP for display (which works fine) and nvidia for cuda. At this point I'm willing to sacrifice the ~250 MB for using the nvidia display driver if it "just works".
12:59:12
@vid:matrix.org
one notable thing is I don't have services.xserver.videoDrivers defined at all. "intel" didn't work for some reason (12700k).
13:01:46
@matthewcroughan:defenestrate.it matthewcroughan changed their profile picture. 13:08:05
@ss:someonex.net SomeoneSerge (matrix works sometimes)
In reply to @vid:matrix.org
one notable thing is I don't have services.xserver.videoDrivers defined at all. "intel" didn't work for some reason (12700k).
You need to enable either the nvidia_x11 or the datacenter driver
13:15:09
@ss:someonex.net SomeoneSerge (matrix works sometimes)
The option is at hardware.nvidia-container-toolkit. Edit: I see now you have set it.
13:15:54
@vid:matrix.org
I don't see many references to the datacentre driver. If it'd work with a 3090, it seems like it would be good for avoiding complexity with displays.
13:19:16
@ss:someonex.net SomeoneSerge (matrix works sometimes)
I use nvidia_x11 with a 3090
13:21:05
@ss:someonex.net SomeoneSerge (matrix works sometimes)
(Note you don't have to enable the xserver or anything to use it; the option is just confusingly named)
13:22:47
@vid:matrix.org
ok, I'm sorry to ask this, but do you see anything obviously wrong in my config? nvidia-smi does find the card, but none of the libraries/docker seem to work. /run/cdi/nvidia-container-toolkit.json exists but isn't populated
13:30:06
@vid:matrix.org
there seems to be a fundamental problem when nvidia-container-toolkit is installed: every docker command yields "no help topic for <cmd>"
13:38:56
@connorbaker:matrix.org connor (he/him)
@vid do you have an example container you’re trying to run? Looks close to my setup so I could give it a try
14:30:37
@mjolnir:nixos.org NixOS Moderation Bot changed room power levels. 15:29:37
@vid:matrix.org
It was just the stock llama.cpp repo, following the instructions for the docker light setup. It was probably something I was doing wrong, but after spending a weekend on this, I got it running on Ubuntu without pulling out a single hair. I'm going to have to stick to that camp, but I will keep an eye on NixOS 'cause I really like the ideas.
15:47:43
@trexd:matrix.org trexd

I found that doing docker run --gpus=all results in

docker: Error response from daemon: could not select device driver "" with capabilities: [[gpu]].

Whereas docker run --device nvidia.com/gpu=all will detect my GPU, @vid.

My minimal settings are documented in this issue: https://github.com/NixOS/nixpkgs/issues/305312

16:05:20
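
The two reports above (a CDI spec file that "exists but isn't populated", and `--device nvidia.com/gpu=all` working where `--gpus=all` does not) both come down to which devices the CDI spec actually declares. The following is a minimal sketch, not something posted in the room, of how one might list them. It assumes the file follows the Container Device Interface JSON layout (a top-level `kind` plus a `devices` list whose entries have a `name`); the helper name `cdi_device_names` is hypothetical.

```python
import json
from pathlib import Path


def cdi_device_names(spec_path):
    """Return the fully qualified device names (kind=name) declared in a CDI spec.

    An empty list means the file is missing, empty, not valid JSON,
    or declares no devices.
    """
    path = Path(spec_path)
    if not path.is_file():
        return []
    text = path.read_text().strip()
    if not text:
        return []
    try:
        spec = json.loads(text)
    except json.JSONDecodeError:
        return []
    if not isinstance(spec, dict):
        return []
    kind = spec.get("kind", "")
    devices = spec.get("devices") or []
    return [
        f"{kind}={dev['name']}"
        for dev in devices
        if isinstance(dev, dict) and "name" in dev
    ]


if __name__ == "__main__":
    # Path taken from the message above; adjust if your spec lives elsewhere.
    names = cdi_device_names("/run/cdi/nvidia-container-toolkit.json")
    if names:
        for name in names:
            print(f"--device {name}")
    else:
        print("CDI spec is missing, empty, or declares no devices")
```

If this prints no `--device` entries, the spec was never generated properly, and a `docker run --device nvidia.com/gpu=all` invocation would have nothing to resolve against.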
30 Apr 2024
@simon88812:matrix.org Piqué joined the room. 14:20:23
1 May 2024
@hacker1024:matrix.org hacker1024

Has anyone managed to get TensorFlow and PyTorch in the same Python environment on a recent nixos-unstable? This seems to have broken at some point in the last few months.

$ nix-shell -I nixpkgs=channel:nixos-unstable -p 'python3.withPackages (ps: with ps; [ torch tensorflow ])'

this derivation will be built:
  /nix/store/p7hnwqgxp8hm52qkw787r9i9akb1y9fd-python3-3.11.9-env.drv
building '/nix/store/p7hnwqgxp8hm52qkw787r9i9akb1y9fd-python3-3.11.9-env.drv'...
error: collision between `/nix/store/k0hpynrpwp0ihh86r1walxv0dcvij9ba-python3.11-grpcio-1.62.1/lib/python3.11/site-packages/grpc/__pycache__/__init__.cpython-311.opt-1.pyc' and `/nix/store/n67kryc5dcblnvb17h04fx7ivbbjjlk6-python3.11-grpcio-1.62.1/lib/python3.11/site-packages/grpc/__pycache__/__init__.cpython-311.opt-1.pyc'
error: builder for '/nix/store/p7hnwqgxp8hm52qkw787r9i9akb1y9fd-python3-3.11.9-env.drv' failed with exit code 255;
       last 1 log lines:
       > error: collision between `/nix/store/k0hpynrpwp0ihh86r1walxv0dcvij9ba-python3.11-grpcio-1.62.1/lib/python3.11/site-packages/grpc/__pycache__/__init__.cpython-311.opt-1.pyc' and `/nix/store/n67kryc5dcblnvb17h04fx7ivbbjjlk6-python3.11-grpcio-1.62.1/lib/python3.11/site-packages/grpc/__pycache__/__init__.cpython-311.opt-1.pyc'
       For full logs, run 'nix log /nix/store/p7hnwqgxp8hm52qkw787r9i9akb1y9fd-python3-3.11.9-env.drv'.
02:41:09
@connorbaker:matrix.org connor (he/him)
I haven't but that seems about right -- they either don't pin or pin different versions of dependencies :l
04:44:10
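
The collision in the log above is between two store paths that share the name `python3.11-grpcio-1.62.1` but have different hashes, which means two distinct builds of grpcio end up in the same environment. That is consistent with the explanation given here, though the log alone does not show which dependency differs. A small sketch for pulling this out of the error line follows; the function names are hypothetical, and the regex assumes the usual `/nix/store/<32-character hash>-<name>` layout.

```python
import re

# Matches /nix/store/<hash>-<name> and captures the hash and the name.
STORE_PATH = re.compile(r"/nix/store/([0-9a-z]{32})-([^/]+)")


def colliding_packages(error_line):
    """Return a (hash, name) pair for each store path found in the error line."""
    return STORE_PATH.findall(error_line)


def same_name_different_build(error_line):
    """True if the line names exactly two store paths with equal names but different hashes."""
    pkgs = colliding_packages(error_line)
    if len(pkgs) != 2:
        return False
    (hash_a, name_a), (hash_b, name_b) = pkgs
    return name_a == name_b and hash_a != hash_b


if __name__ == "__main__":
    # Error line copied from the build log above.
    line = (
        "error: collision between "
        "`/nix/store/k0hpynrpwp0ihh86r1walxv0dcvij9ba-python3.11-grpcio-1.62.1"
        "/lib/python3.11/site-packages/grpc/__pycache__/__init__.cpython-311.opt-1.pyc' and "
        "`/nix/store/n67kryc5dcblnvb17h04fx7ivbbjjlk6-python3.11-grpcio-1.62.1"
        "/lib/python3.11/site-packages/grpc/__pycache__/__init__.cpython-311.opt-1.pyc'"
    )
    for store_hash, name in colliding_packages(line):
        print(store_hash, name)
    print("same package, different build:", same_name_different_build(line))
```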
@connorbaker:matrix.org connor (he/him)

I think my ISP hates me running nixpkgs-review almost as much as I do

[82/70/5750 built, 135/283/6926 copied (2415.7/87849.0 MiB), 2042.0/25876.6 MiB DL] connecting to 'ssh-ng://nix@nixos-build01'
05:24:16
@mjolnir:nixos.org NixOS Moderation Bot changed room power levels. 15:06:29
2 May 2024
@brandon:matrix.radiation.io joined the room. 18:18:28
3 May 2024
@kaya:catnip.ee kaya 𖤐 joined the room. 14:01:31
@ironbound:hackerspace.pl changed their profile picture. 17:47:48
4 May 2024
@ss:someonex.net SomeoneSerge (matrix works sometimes) changed their display name from SomeoneSerge (is taking time off and doesn't want to hear about it) to SomeoneSerge (Way down Hadestown). 21:03:42

Room Version: 9