Zulip setup coordination | 80 Members | |
| Coordination to setup https://nixpkgs.zulipchat.com/, see https://github.com/NixOS/foundation/issues/143 | 22 Servers |
| Sender | Message | Time |
|---|---|---|
| 22 Feb 2024 | ||
In reply to @zimbatm:numtide.comoh yes, that is a nice list. Very pragmatic approach to making it too! | 16:43:51 | |
In reply to @delroth:delroth.netSorry just catching up. Wanted to ++ on this item | 17:32:21 | |
| Maybe we should brainstorm a decision tree process? A lot of what I'm reading throughout this thread and generally strongly agreeing with seems to end in two areas:
One can't go without the other... and each have a high reliance in all avenues on the other. | 17:34:46 | |
| Total example one without a lot of thought behind but might be a good exercise to brainstorm on: | 17:41:39 | |
Download 1000051542.jpg | 17:42:07 | |
| (sorry for lack of proper rotation or artistic capabilities) | 17:42:54 | |
| 23 Feb 2024 | ||
In reply to @delroth:delroth.net (Following a conversation with Jonas Chevalier) The CUDA team might be a candidate to come out with this proposal. I'll also note that there's an entangled issue of the less controversial ROCm/OpenCL[/Vulkan] functionality also not being tested because it requires special hardware. This concerns both cuda-maintainers and rocm-maintainers, possibly the "geospatial" team and some "unowned" subsystems ("HPC", meaning e.g. slurm and mpi, maintained by single individuals). The issues are similar in that they are about nixpkgs keeping around an amount of untested code of unknown "expiry date", and in that this happens because whether to provision (and pay for) this hardware is a decision to be made by somebody. | 13:23:01 | |
| I don't really understand what makes this different from many, many other packages in nixpkgs that require specific hardware to test and use. | 13:26:21 | |
| Do we have similarly complicated package set with specific hardware needs without any simulator? (genuine question) | 13:27:01 | |
| For example, hostapd can be tested in NixOS tests via 80211 hw simulators | 13:27:12 | |
| not sure, but you're raising the bar significantly above the usual maintenance level of a nixpkgs package | 13:27:45 | |
| that's fair | 13:27:54 | |
| a good example would be home-assistant | 13:28:02 | |
| (to be clear, I'm not saying the status quo is optimal, I'm saying that maybe it's a separate issue and not something entangled to whether we distribute CUDA / ROCm / ... stuff) | 13:30:50 | |
| (this is getting maybe a bit too detailed to be on-topic for this channel) | 13:32:43 | |
In reply to @delroth:delroth.netYes, this is an argument that can be made and I think I could try replying to it, but this is unrelated to the "decision making" and could be discussed in the Discourse thread if it happens | 13:33:07 | |
In reply to @delroth:delroth.net(By the way, the irony is not lost to me that this is, literally, a "request for comments". Except without the baggage that comes attached with the Nix* RFC process and the tacit expectation of producing a formal document proposing a design and implementation.) | 15:05:11 | |
In reply to @delroth:delroth.net* (By the way, the irony is not lost on me that this is, literally, a "request for comments". Except without the baggage that comes attached with the Nix* RFC process and the tacit expectation of producing a formal document proposing a design and implementation.) | 15:05:17 | |
| What do you think if the CUDA team wrote a Nix Improvement Proposal (just made up that name), which is like a RFC in terms of structure, but just posted to Discourse? This could be a trial run for something more liteweight than the RFC process. This would be a trial run similar to what happened for the wiki on the foundation repo, but posted more widely | 15:14:03 | |
| Just out of curiosity: who would need which hardware for testing ROCm functionality? And what is the problem with testing OpenCL/Vulkan? Is this just a general lack of the right GPUs for testing? | 15:30:14 | |
| 15:41:35 | ||
| It's best to ask rocm-maintainers (e.g. Madouura, flakeby), but very basic things like "can we actually upload an array to vram?" can be tested with consumer grade hardware (probably anything that's in the llvm's target list) and even these things do ridiculously break. The OpenCL issue isn't actually about hardware, it's just about accessing the driver. I think it's about extra "system-features", although maybe there is some sort of hardware-agnostic all-software driver. At least for vulkan there seems to be one. | 16:35:34 | |
| * It's best to ask rocm-maintainers (e.g. Madouura, flakeby), but very basic things like "can we actually upload an array to vram?" can be tested with consumer grade hardware (probably anything that's in the llvm's target list) and even these things do ridiculously break. The more realistic answer is "something that amd is going to support for a while" because the libraries (*blas, etc) only support a subset of what the compiler does. The OpenCL issue isn't actually about hardware, it's just about accessing the driver. I think it's about extra "system-features", although maybe there is some sort of hardware-agnostic all-software driver. At least for vulkan there seems to be one. | 16:42:31 | |
If a noticeable group of people who won't be dismissed as not having much of the demonstrated commitment to the project is willing to claim the change is very harmful, I find stalling a feature (possibly unstalling later if a majority of the group withdraws under a better explanation!) | 18:15:38 | |
We have «Develop» category on Discourse and we do sometimes link Discourse discussions in PRs. I agree this is a better way for things that do not try to establsh norms for everyone and don't meet strong opposition. If there is a disagreement, well, there is a reason design by committee happens again and again, it is actually best of the achievable for the goal of preserving unity | 18:22:45 | |
| I think invoking the Foundation means that Eelco Dolstra does have a veto on changing things and we know that he has consistently opposed unfree for-CPU code on Hydra, so, Foundation won't delegate because Foundation board is not in agreement there is anything that needs unblocking. | 18:27:56 | |
| You know, the cases (especially pre-RFC) when a lot of discussion happened then an absolutely unreasonable choice was made made progress, but still did not feel good | 18:34:05 | |
In reply to @zimbatm:numtide.comHaven't they literally done this, looked on the set of people who objected and their levels of influence, then started their own Hydra instance for CUDA stuff? | 18:40:38 | |
In reply to @delroth:delroth.netNo joking, for low-controversy mid-scale things I have literally made PRs to Nixpkgs and tagged them [RFC]. Some were merged eventually, some rejected. | 18:41:42 | |
| 24 Feb 2024 | ||
| In the end, someone has to take the initiative; to be the leader. Either to decide, or to drive the process, to take a risk, or to take responsibility. We often discuss developing ways to include more voices, or to distribute risk. But these situations require the opposite; to encourage a few to rise to the occasion. We've asked people why they feel blocked. A similar question would be why someone does not feel empowered to act. I'd like to ask why people don't feel empowered to decide. | 01:29:49 | |