!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

310 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda60 Servers

Load older messages


SenderMessageTime
24 Sep 2024
@hexa:lossy.networkhexaI think this one has been failing for me on the linear-operator package11:41:02
@connorbaker:matrix.orgconnor (he/him) As a sanity check — has anyone been able to successfully use torch.compile to speed up model training, or do they also get a python stack trace when torch tries to call into OpenAI’s triton 15:23:08
25 Sep 2024
@ss:someonex.netSomeoneSerge (matrix works sometimes)It used to work but now our t2iton is lagging 1 major version behind19:36:58
@glepage:matrix.orgGaétan LepageBecause those geniuses are not able to tag a freaking release20:20:55
@glepage:matrix.orgGaétan Lepage https://github.com/triton-lang/triton/issues/3535 20:21:18
@ss:someonex.netSomeoneSerge (matrix works sometimes)unstable-yyyy-mm-dd is ok for us; there were some minor but unresolved issues with the PR that does the bump though20:23:04
26 Sep 2024
@connorbaker:matrix.orgconnor (he/him)
In reply to @glepage:matrix.org
https://github.com/triton-lang/triton/issues/3535
Well that’s an infuriating read
16:33:18
@glepage:matrix.orgGaétan LepageIt's OK, OpenAI is just a small startup with only a few people. And deep learning is not even their main activity17:07:38
@connorbaker:matrix.orgconnor (he/him) Yeah and they're definitely not a for-profit organization 17:20:14
@adam:robins.wtf@adam:robins.wtf"open" is in their name17:24:26
@gsaurel:laas.frnim65sit's such a joke that I find it sad it was not opened one day earlier17:28:20
@glepage:matrix.orgGaétan Lepage "I propose a 200€ bounty for this PR. Please git tag the freaking commit. 21:09:04
@glepage:matrix.orgGaétan Lepage * "I propose a 200€ bounty for this PR. Please git tag the freaking commit." 21:09:07
@glepage:matrix.orgGaétan LepageThe ease of spinning up a release is a decreasing function of the project/company resources.21:09:40
@gsaurel:laas.frnim65ssame issue on a one-man project abandonned for the last year or so: https://github.com/bab2min/EigenRand/issues/5621:47:05
@gsaurel:laas.frnim65s * same issue on a one-man project abandonned for the last year or so: https://github.com/bab2min/EigenRand/issues/56 : <48h21:49:56
28 Sep 2024
@shekhinah:she.khinah.xyzshekhinah changed their profile picture.07:04:58
@kaya:catnip.eekaya 𖤐 changed their profile picture.16:55:46
1 Oct 2024
@-_o:matrix.org-_o joined the room.21:00:15
2 Oct 2024
@hexa:lossy.networkhexa Gaétan Lepage: please take care of tensordict 00:25:19
@hexa:lossy.networkhexaimage.png
Download image.png
00:25:22
@glepage:matrix.orgGaétan Lepage Sure, I will have a look right now.
I have not faced any failure on my end, weird...
06:21:33
@glepage:matrix.orgGaétan LepageIs this on staging ?06:23:26
@glepage:matrix.orgGaétan Lepage All failures that I was able to find on hydra are timeouts or upstream dependency failures.
I was able to build tensordict on all architectures...
07:05:50
@hexa:lossy.networkhexathis is on trunk11:03:39
@hexa:lossy.networkhexathen you probably need to increase meta.timeout11:04:00
@glepage:matrix.orgGaétan Lepage Now that you say it, I remember this package being stuck (indefinitly) during mass rebuilds.
I don't know if increasing the timeout will help. When everything works fine, it builds in ~1min...
Also, nothing has changed in the derivation for the past few months.
11:47:12
@justbrowsing:matrix.orgKevin Mittman (UTC-7)Back from vacation18:23:19
@justbrowsing:matrix.orgKevin Mittman (UTC-7)Redacted or Malformed Event18:32:05
@justbrowsing:matrix.orgKevin Mittman (UTC-7)
In reply to @ss:someonex.net
Kevin Mittman Hi! Do you know how dcgm uses cuda and why it has to link several versions?
See libdcgm_cublas_proxy${cudaMajor}.so
18:34:06

Show newer messages


Back to Room ListRoom Version: 9