!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

288 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda58 Servers

Load older messages


SenderMessageTime
3 Feb 2025
@ss:someonex.netSomeoneSerge (back on matrix) * Is it still broken? The attribute page shows latest eval grey. I might have interest in fixing it, I'll check tmr 22:57:50
@hexa:lossy.networkhexa
       > ERROR: noBrokenSymlinks: the symlink /nix/store/fqx2dv9vp1k0f00imgqshy6d92ykcw5d-python3.12-kaleido-0.2.1/lib/python3.12/site-packages/kaleido/executable/etc/fonts/fonts.conf points to a missing target /nix/store/2ynwbywyaxk4wgl8d3xrb9dzkdzv241x-fontconfig-2.15.0-bin/etc/fonts/fonts.conf
       > ERROR: noBrokenSymlinks: found 1 dangling symlinks and 0 reflexive symlinks
       For full logs, run 'nix log /nix/store/f7whd4p85k8b7bd8sx2bnp5jpmzycbkx-python3.12-kaleido-0.2.1.drv'.
error: 1 dependencies of derivation '/nix/store/gdg8kgy8ry0gjhpv2dws072wajkjk69l-python3.12-plotly-5.24.1.drv' failed to build
error: 1 dependencies of derivation '/nix/store/pfxw0g0npwr091cr7ks7012jl8qsg
23:00:36
@hexa:lossy.networkhexa yes, but now also due to that new hook connor (he/him) (UTC-7) introduced 23:00:45
@hexa:lossy.networkhexa * SomeoneSerge (Gand St. Pieters): yes, but now also due to that new hook connor (he/him) (UTC-7) introduced 23:00:50
@glepage:matrix.orgGaƩtan LepageRedacted or Malformed Event23:02:31
@ss:someonex.netSomeoneSerge (back on matrix) Huh? 23:10:48
@ss:someonex.netSomeoneSerge (back on matrix) * Huh? I thought it was out of tree 23:11:17
@ss:someonex.netSomeoneSerge (back on matrix) * Huh? I thought it was still out of tree 23:11:21
@hexa:lossy.networkhexait is in the current staging cycle23:58:18
@hexa:lossy.networkhexaand it is causing all sort of havoc23:58:27
@hexa:lossy.networkhexabecause no fixes were prepared in advance23:58:38
4 Feb 2025
@connorbaker:matrix.orgconnor (he/him) I apologize, I should have made sure at least stdenvs for {x86_64,aarch64}-{darwin,linux} were working
In the case of rebuild-the-world PRs, I'm not sure what is considered sufficient in terms of testing -- is there particular language (or Nix expressions) you'd want to see in the contributing guidelines?
05:07:55
@ss:someonex.netSomeoneSerge (back on matrix) Oh nice! 09:46:24
@ss:someonex.netSomeoneSerge (back on matrix)Found it09:46:27
@afdee1c:matrix.orgafdee1c joined the room.20:04:25
@zopieux:matrix.zopi.euzopieux I'm a bit confused about the nix-community cache and I wonder if my system/config is to blame, or the cache.
This cuda build succeeded and depends on nixpkgs d0bb46, which I pinned in my flake. Upon building though, nix decides to build fmpz8s6hy3yr8z6kb84h6498437d0xj1-ollama-0.5.7.drv even though per the above and per https://nix-community.cachix.org/8njyvpf8sxh8k61zvnv13cymn7szv63c.narinfo, the output should be available in the cache. nix.conf confirms the substituter/pubkey is present. Am I missing something?
21:03:07
@connorbaker:matrix.orgconnor (he/him) I haven't followed too closely how the community Hydra builds cudaPackages, but the first thing that comes to my mind is that perhaps your config.cudaCapabilities doesn't match the default set selected when it's unset? (By default, we build for a number of capabilities.) 22:28:12
5 Feb 2025
@ss:someonex.netSomeoneSerge (back on matrix) No, fmpz8s6hy3yr8z6kb84h6498437d0xj1 is the deriver of 8njyvpf8sxh8k61zvnv13cymn7szv63c.narinfo, and cachix knows about it, and if you try wget https://nix-community.cachix.org/nar/77cdeba29947cabc7c880df05f41200e0f9ee711651931ac046e64fcfd52f48b.nar.zst it actually begins to download the blob so it's not GCed 00:38:31
@ss:someonex.netSomeoneSerge (back on matrix) So Nix does behave weird here? zopieux what's it say if --builders "" -j0? 00:39:30
@ruroruro:matrix.orgruro Regarding "broken javascript", you can see eval failures on the "Evaluation Errors" tab of the nixpkgs:cuda jobset (but not on individual job pages, not sure why). 21:37:53
6 Feb 2025
@ruroruro:matrix.orgruroHere's my attempt at a PR that fixes a bunch of Eval Errors that I've outlined earlier: https://github.com/NixOS/nixpkgs/pull/37976806:38:45
@ruroruro:matrix.orgruro

Regarding the remaining 21 eval errors:

  • 13x cuda-samples depends on freeimage which is technically insecure. I am not 100% sure, if we should just filter all of the cuda-samples packages using the new filterPackagePredicates mechanism or if it might be better to do

    freeimage.overrideAttrs { meta.knownVulnerabilities = [ ]; }
    

    since cuda-samples isn't really "production-facing" anyway, so the users should only really care that the samples compile. The security risk of distributing sample code that technically has some vulnerabilities should be minimal.

  • colmap also depends on freeimage, this issue should be probably raised upstream

  • boxx and bpycv haven't been updated upstream in the last 11 months and they seem to not support any of the python versions that are currently supported in nixpkgs. So we should probably check in with the nixpkgs maintainer and remove these packages if they aren't required by something important.

  • pixinsight is (and always was) unfree, but it is explicitly listed in release-cuda.nix for some reason. Should it be removed?

  • tts because it depends on a -bin version of pytorch for some reason, which is "unfree" (bsd3 issl unfreeRedistributable). Is it possible to make it depend on a non-binary version of pytorch or should it be removed from release-cuda.nix?

  • mxnet is "actually" broken since #173463

  • truecrack-cuda is "actually" broken since #167250

  • pymc depends on pytensor is "actually" broken since #373239

06:56:17
@ruroruro:matrix.orgruro *

Regarding the remaining 21 eval errors:

  • 13x cuda-samples depends on freeimage which is technically insecure. I am not 100% sure, if we should just filter all of the cuda-samples packages using the new filterPackagePredicates mechanism or if it might be better to do
freeimage.overrideAttrs { meta.knownVulnerabilities = [ ]; }

specifically for cuda-samples. Since cuda-samples isn't really "production-facing" anyway, so the users should only really care that the samples compile. The security risk of distributing sample code that technically has some vulnerabilities should be minimal.

  • colmap also depends on freeimage, this issue should be probably raised upstream
  • boxx and bpycv haven't been updated upstream in the last 11 months and they seem to not support any of the python versions that are currently supported in nixpkgs. So we should probably check in with the nixpkgs maintainer and remove these packages if they aren't required by something important.
  • pixinsight is (and always was) unfree, but it is explicitly listed in release-cuda.nix for some reason. Should it be removed?
  • tts because it depends on a -bin version of pytorch for some reason, which is "unfree" (bsd3 issl unfreeRedistributable). Is it possible to make it depend on a non-binary version of pytorch or should it be removed from release-cuda.nix?
  • mxnet is "actually" broken since #173463
  • truecrack-cuda is "actually" broken since #167250
  • pymc depends on pytensor is "actually" broken since #373239
06:57:34
@ruroruro:matrix.orgruro *

Regarding the remaining 21 eval errors:

  • 13x cuda-samples depends on freeimage which is technically insecure. I am not 100% sure, if we should just filter all of the cuda-samples packages using the new filterPackagePredicates mechanism or if it might be better to do

    freeimage.overrideAttrs { meta.knownVulnerabilities = [ ]; }
    

    specifically for cuda-samples. Since cuda-samples isn't really "production-facing" anyway, so the users should only really care that the samples compile. The security risk of distributing sample code that technically has some vulnerabilities should be minimal.

  • colmap also depends on freeimage, this issue should be probably raised upstream

  • boxx and bpycv haven't been updated upstream in the last 11 months and they seem to not support any of the python versions that are currently supported in nixpkgs. So we should probably check in with the nixpkgs maintainer and remove these packages if they aren't required by something important.

  • pixinsight is (and always was) unfree, but it is explicitly listed in release-cuda.nix for some reason. Should it be removed?

  • tts because it depends on a -bin version of pytorch for some reason, which is "unfree" (bsd3 issl unfreeRedistributable). Is it possible to make it depend on a non-binary version of pytorch or should it be removed from release-cuda.nix?

  • mxnet is "actually" broken since #173463

  • truecrack-cuda is "actually" broken since #167250

  • pymc depends on pytensor is "actually" broken since #373239

06:57:53
@ruroruro:matrix.orgruro *

Regarding the remaining 21 eval errors:

  • 13x cuda-samples depends on freeimage which is technically insecure. I am not 100% sure, if we should just filter all of the cuda-samples packages using the new filterPackagePredicates mechanism or if it might be better to do

    freeimage.overrideAttrs { meta.knownVulnerabilities = [ ]; }
    

    specifically for cuda-samples. Since cuda-samples isn't really "production-facing" anyway, so the users should only really care that the samples compile. The security risk of distributing sample code that technically has some vulnerabilities should be minimal.

  • colmap also depends on freeimage, this issue should be probably raised upstream

  • boxx and bpycv haven't been updated upstream in the last 11 months and they seem to not support any of the python versions that are currently supported in nixpkgs. So we should probably check in with the nixpkgs maintainer and remove these packages if they aren't required by something important.

  • pixinsight is (and always was) unfree, but it is explicitly listed in release-cuda.nix for some reason. Should it be removed?

  • tts because it depends on a -bin version of pytorch for some reason, which is "unfree" (bsd3 issl unfreeRedistributable). Is it possible to make it depend on a non-binary version of pytorch or should it be removed from release-cuda.nix?

  • mxnet is "actually" broken since #173463

  • truecrack-cuda is "actually" broken since #167250

  • pymc depends on pytensor is "actually" broken since #373239

07:00:28
@ss:someonex.netSomeoneSerge (back on matrix)

Thanks for the summary!

tts because it depends on a -bin version of pytorch for some reason, which is "unfree" (bsd3 issl unfreeRedistributable). Is it possible to make it depend on a non-binary version of pytorch or should it be removed from release-cuda.nix?

Definitely shouldn't be removed, tts is a package we want maintained, and when it's broken we want to see it's broken. It was probably made to use torch-bin at some point when source build was broken? If we can move it to torch, we probably should.

colmap also depends on freeimage, this issue should be probably raised upstream

Indeed

13x cuda-samples depends on freeimage which is technically insecure. I am not 100% sure, if we should just filter all of the cuda-samples packages using ...

For the Hydra job we might as well allow the insecure freeimage? It's ok to test and cache it, we just don't want people to copy the allowInsecurePredicate configuration

09:30:54
@ss:someonex.netSomeoneSerge (back on matrix) Btw, at some point this list was used to build with allowUnfree = true instead of the more conservative allowUnfreePredicate we currently use 09:32:13
@ruroruro:matrix.orgruro Alternatively/additionally, we might want to mark torch-bin with the appropriate CUDA-specific license so that it passes the allowUnfreePredicate in release-cuda (assuming that the unfreeRedistributable part of torch-bin does indeed refer to the vendored(?) CUDA. 13:07:01
@ruroruro:matrix.orgruro I am not sure if I like the idea of adding freeimage to allowInsecurePredicate "globally" in release-cuda, as the eval failure is a useful indicator for when some package ends up depending on it. I was thinking of allowing freeimage specifically for cuda-samples. Also, it seems that cuda-samples is only present in CUDA versions <=12.3 for some reason. I wonder, why is that? 13:18:26
@stick:matrix.orgstick joined the room.14:20:42

Show newer messages


Back to Room ListRoom Version: 9