| 30 Nov 2024 |
SomeoneSerge (back on matrix) | Ah, I was just commenting on that on github. I think open being a passthru of nvidia_x11 is a constant source of confusion and we should just move it out to nvidiaPackages. AFAIU the only way open depends on nvidia_x11 is for version numbers... | 13:00:29 |
| 2 Dec 2024 |
hexa | Gaétan Lepage: so when are we killing the tests on tensordict? | 00:53:22 |
Gaétan Lepage | :') | 07:24:29 |
Gaétan Lepage | At least the blocking one I guess | 07:24:35 |
Gaétan Lepage | Should be fixed by https://github.com/NixOS/nixpkgs/pull/361008. | 07:57:50 |
Gaétan Lepage | In reply to @glepage:matrix.org Should be fixed by https://github.com/NixOS/nixpkgs/pull/361008. I went through both tensordict and torchrl. They should be more robust now. | 14:06:35 |
Gaétan Lepage | Well, it's still not perfect hexa (UTC+1).
On the flaky Threadripper 3990X system, torchrl segfaults after the test suite has passed. It only occurs for python312Packages.torchrl but not for python311Packages.torchrl... | 14:40:15 |
Gaétan Lepage |  Download clipboard.png | 14:40:51 |
Gaétan Lepage | Of course, it builds perfectly fine on two other AMD systems (Ryzen 5 5600X and Ryzen 9 3900) | 14:41:38 |
hexa | microcode updates applied? | 14:42:05 |
Gaétan Lepage | I forgot how to check that | 14:42:35 |
Gaétan Lepage | I think you told me a while ago | 14:42:47 |
hexa | intel be like | 14:43:46 |
hexa |
[ 0.000000] microcode: updated early: 0xc6 -> 0xf8, date = 2024-02-01
| 14:43:51 |
hexa | dmesg | grep microcode | 14:44:36 |
Gaétan Lepage | [ 2.153548] microcode: microcode updated early to new patch_level=0x0830107b
| 14:45:07 |
Gaétan Lepage | Nothing matches on my Ryzen system. | 14:46:03 |
Gaétan Lepage | * TR system:
[ 2.153548] microcode: microcode updated early to new patch_level=0x0830107b
| 14:46:09 |
Gaétan Lepage | * TR system:
[ 2.153548] microcode: microcode updated early to new patch_level=0x0830107b
...
[ 2.154392] microcode: Microcode Update Driver: v2.2.
| 14:46:30 |
hexa | ❯ ./amd_ucode_info.py kernel/x86/microcode/AuthenticAMD.bin
Microcode patches in kernel/x86/microcode/AuthenticAMD.bin:
Family=0x10 Model=0x02 Stepping=0x03: Patch=0x01000083 Length=960 bytes
Family=0x10 Model=0x02 Stepping=0x02: Patch=0x01000083 Length=960 bytes
Family=0x10 Model=0x02 Stepping=0x0a: Patch=0x01000084 Length=960 bytes
Family=0x10 Model=0x06 Stepping=0x02: Patch=0x010000c7 Length=960 bytes
Family=0x10 Model=0x04 Stepping=0x03: Patch=0x010000c8 Length=960 bytes
Family=0x10 Model=0x06 Stepping=0x03: Patch=0x010000c8 Length=960 bytes
Family=0x10 Model=0x05 Stepping=0x03: Patch=0x010000c8 Length=960 bytes
Family=0x10 Model=0x08 Stepping=0x01: Patch=0x010000d9 Length=960 bytes
Family=0x10 Model=0x09 Stepping=0x01: Patch=0x010000d9 Length=960 bytes
Family=0x10 Model=0x08 Stepping=0x00: Patch=0x010000da Length=960 bytes
Family=0x10 Model=0x04 Stepping=0x02: Patch=0x010000db Length=960 bytes
Family=0x10 Model=0x05 Stepping=0x02: Patch=0x010000db Length=960 bytes
Family=0x10 Model=0x0a Stepping=0x00: Patch=0x010000dc Length=960 bytes
Family=0x11 Model=0x03 Stepping=0x01: Patch=0x02000032 Length=512 bytes
Family=0x12 Model=0x01 Stepping=0x00: Patch=0x03000027 Length=960 bytes
Family=0x14 Model=0x01 Stepping=0x00: Patch=0x05000029 Length=1568 bytes
Family=0x14 Model=0x02 Stepping=0x00: Patch=0x05000119 Length=1568 bytes
Microcode patches in kernel/x86/microcode/AuthenticAMD.bin+0x318c:
Family=0x15 Model=0x01 Stepping=0x02: Patch=0x0600063e Length=2592 bytes
Family=0x15 Model=0x02 Stepping=0x00: Patch=0x06000852 Length=2592 bytes
Family=0x15 Model=0x10 Stepping=0x01: Patch=0x06001119 Length=2592 bytes
Microcode patches in kernel/x86/microcode/AuthenticAMD.bin+0x5050:
Family=0x16 Model=0x00 Stepping=0x01: Patch=0x0700010f Length=3458 bytes
Microcode patches in kernel/x86/microcode/AuthenticAMD.bin+0x5e06:
Family=0x17 Model=0x01 Stepping=0x02: Patch=0x0800126f Length=3200 bytes
Family=0x17 Model=0x31 Stepping=0x00: Patch=0x0830107c Length=3200 bytes
Family=0x17 Model=0x08 Stepping=0x02: Patch=0x0800820d Length=3200 bytes
Family=0x17 Model=0xa0 Stepping=0x00: Patch=0x08a00008 Length=3200 bytes
Microcode patches in kernel/x86/microcode/AuthenticAMD.bin+0x9082:
Family=0x19 Model=0x01 Stepping=0x00: Patch=0x0a00107a Length=5568 bytes
Family=0x19 Model=0x11 Stepping=0x02: Patch=0x0a101248 Length=5568 bytes
Family=0x19 Model=0xa0 Stepping=0x02: Patch=0x0aa00215 Length=5568 bytes
Family=0x19 Model=0x01 Stepping=0x02: Patch=0x0a001238 Length=5568 bytes
Family=0x19 Model=0x11 Stepping=0x01: Patch=0x0a101148 Length=5568 bytes
Family=0x19 Model=0x01 Stepping=0x01: Patch=0x0a0011d5 Length=5568 bytes
Family=0x19 Model=0xa0 Stepping=0x01: Patch=0x0aa00116 Length=5568 bytes
| 14:52:33 |
hexa | this is the latest ucode for amd on nixpkgs master | 14:52:39 |
hexa | so 0x19 is family 25 | 14:53:59 |
hexa | and for the model you probably have to binary or your model with 0xa0 if it is > 17 | 14:54:27 |
Gaétan Lepage | Ok, so anyway the issues are not a problem of the python package then | 15:11:51 |
| 4 Dec 2024 |
Kevin Mittman (UTC-8) | Anyone planning to attend PlanetNix https://www.socallinuxexpo.org/scale/22x/events/planet-nix ? Looks like CFP is still open | 02:21:01 |
connor (he/him) | I probably will; I'm also planning to submit two talks | 04:38:33 |
| 6 Dec 2024 |
| vannagamma joined the room. | 00:01:17 |
connor (he/him) | Does anyone have a NixOS system they recommend using to test eval performance? | 05:49:07 |
connor (he/him) | Ideally something which takes on the order of 30s or so to eval | 05:51:25 |
| kaya 𖤐 changed their profile picture. | 21:17:32 |