!PbtOpdWBSRFbEZRLIf:numtide.com

Nix Community Projects

604 Members
Meta discussions related to https://nix-community.org. (For project specific discussions use github issues or projects own matrix channel). Need help from an admin? Open an issue on https://github.com/nix-community/infra/issues158 Servers

Load older messages


SenderMessageTime
4 Sep 2024
@antifuchs:asf.computerantifuchs changed their display name from antifuchs to antifuchs ⚡️.18:57:28
@antifuchs:asf.computerantifuchs changed their display name from antifuchs ⚡️ to antifuchs.19:01:46
5 Sep 2024
@necoarc:transfem.dev@necoarc:transfem.dev joined the room.03:35:24
@necoarc:transfem.dev@necoarc:transfem.dev removed their display name Neco-Arc.03:49:32
@necoarc:transfem.dev@necoarc:transfem.dev removed their profile picture.03:49:32
@necoarc:transfem.dev@necoarc:transfem.dev left the room.03:49:32
@ss:someonex.netSomeoneSerge (back on matrix) joined the room.11:56:04
@ss:someonex.netSomeoneSerge (back on matrix)

Hi zowoq and Jonas Chevalier! I was wondering what has been your impression of the cuda jobset so far: whether you find it sustainable to build as is, whether it'd be sustainable to scale up? With the experience you've had so far, do you think nix-community should commit to keeping it alive and what'd it take to call it "stable" and announce it to the public?

About nix-community/infra in general, I understand you zowoq are "the official maintainer" and afaiu the infra is funded by the foundation. I wonder what is the general idea for nix-community's sustainability? Does infra require more labour that currently available? Who's taking over if you need to move to other things? Are there plans to get off the "cloud needle"?

Thanks

12:58:44
@antifuchs:asf.computerantifuchs left the room.13:58:58
@antifuchs:asf.computerantifuchs 14:25:15
@antifuchs:asf.computerantifuchs left the room.17:16:48
@aruzeta:matrix.org@aruzeta:matrix.org left the room.18:12:33
@zowoq:matrix.orgzowoq

I was wondering what has been your impression of the cuda jobset so far: whether you find it sustainable to build as is, whether it'd be sustainable to scale up?

Yes, it is sustainable. I don't understand what scale up would mean in this context?

With the experience you've had so far, do you think nix-community should commit to keeping it alive and what'd it take to call it "stable" and announce it to the public?

Yes, I think we can commit to it. It is stable enough that we added it to our docs a couple of days ago. https://nix-community.org/package-sets/

afaiu the infra is funded by the foundation.

No, we don't get any funding from the foundation. We have an open collective and some services are sponsored. https://opencollective.com/nix-community, https://nix-community.org/sponsors/

I wonder what is the general idea for nix-community's sustainability?

I'd say that basically it just depends on the open collective. @zimbatm may have a more nuanced answer for this.

Does infra require more labour that currently available?

No, not at the moment.

Who's taking over if you need to move to other things?

https://nix-community.org/administrators/ Nix community has five admins.

Are there plans to get off the "cloud needle"?

I'm assuming that this means getting our own hardware and finding somewhere to put it? Has been mentioned once or twice but nothing beyond that.

23:56:28
6 Sep 2024
@ss:someonex.netSomeoneSerge (back on matrix)

I don't understand what scale up would mean in this context?

Building more "configs": e.g. the current aarch64 jobset only builds the variant for normal plug-in pcie GPUs not for jetson boards but we could extend it (which is probably the most common aarch64 user in fact...). The jobset also only chooses the "fat" variants which include code for all available gpu architectures at once. We could spawn variants for individual architectures, which is particularly relevant for embedded systems. I'd say we don't need to enable any of these at the moment, but if/when we discover this is needed it'd mean multiplying the cost. Similarly, adding the stable branch would add to another multiplicative factor, innocuous on its own but possibly significant when coupled with other toggles

00:09:33
@zowoq:matrix.orgzowoq

the current aarch64 jobset only builds the variant for normal plug-in pcie GPUs not for jetson boards but we could extend it (which is probably the most common aarch64 user in fact...).

If jetson is the more common use case should we switch to building that instead of the current aarch64 jobset?

The jobset also only chooses the "fat" variants which include code for all available gpu architectures at once. We could spawn variants for individual architectures, which is particularly relevant for embedded systems.

If we built the individual architectures would we still need to build the fat variant? Is building an individual architecture quicker than the "fat" variant?

02:29:43
@zowoq:matrix.orgzowoq *

the current aarch64 jobset only builds the variant for normal plug-in pcie GPUs not for jetson boards but we could extend it (which is probably the most common aarch64 user in fact...).

If jetson is the more common use case should we switch to building that instead of the current aarch64 jobset?

The jobset also only chooses the "fat" variants which include code for all available gpu architectures at once. We could spawn variants for individual architectures, which is particularly relevant for embedded systems.

If we built the individual architectures would we still need to build the "fat" variant? Is building an individual architecture quicker than the "fat" variant?

02:29:58
@ss:someonex.netSomeoneSerge (back on matrix)

If jetson is the more common use case should we switch to building that instead of the current aarch64 jobset?

Imo yes

02:33:46
@zimbatm:numtide.comJonas Chevalier

I wonder what is the general idea for nix-community's sustainability?

The project is already sustainable, thanks to all of our donors <3

The main cost centre is hardware, which we scale depending on the amount of donations.

07:44:37
@zimbatm:numtide.comJonas ChevalierOne thing that could get more love is to add some reporting to the shared build boxes that people SSH into. Some MOTD that shows who else is on the box, and how many builds are currently running.07:49:43
@zimbatm:numtide.comJonas ChevalierI imagine that people step over each other sometimes07:50:02
@emilazy:matrix.orgemily yeah I frequently have to do top to see if someone else is using it 07:52:50
@zimbatm:numtide.comJonas Chevalierwould be cool if you could chat with the person if somebody is hogging the box07:56:31
@zimbatm:numtide.comJonas Chevalier you can try echo hey, who is hogging the box? | wall on the machine, but not many people know you can do that 07:57:20
@magic_rb:matrix.redalder.orgmagic_rbUnix has facilities for that07:57:21
@magic_rb:matrix.redalder.orgmagic_rbExactly07:57:28
@zimbatm:numtide.comJonas ChevalierI guess some instructions can be added to the MOTD as well07:58:07
@magic_rb:matrix.redalder.orgmagic_rbOr maybe we can symlink a markdown file into each of the users home dirs, like a manual thingie08:01:30
@emilazy:matrix.orgemilyeh, my experience is that the builders are unused a large fraction of the time08:01:43
@magic_rb:matrix.redalder.orgmagic_rb(Mark the symlink readonly or smth so it cannot be gotten rid off)08:01:46
@emilazy:matrix.orgemilyI don't mind people "hogging" it because I hog it at other times too :)08:01:50

Show newer messages


Back to Room ListRoom Version: 6