Commit Graph

28 Commits

Author SHA1 Message Date
eugr
ae25d64ac0 Changed CUTLASS ref for mxfp4 build 2026-04-01 08:58:31 -07:00
Drew Botwinick
d6e76f8e2f add build metadata generation and include in Dockerfiles 2026-03-21 16:10:04 -05:00
Andrej V.
bdd2b10f54 Remove script copy and permission commands from Dockerfile
Removed script copying and permission setting for run-cluster-node.sh.
2026-03-18 21:57:56 +01:00
Eugene Rakhmatulin
ad662f9bab Changed MXFP4 CUTLASS SHA 2026-02-18 18:20:15 -08:00
Eugene Rakhmatulin
b959818536 MXFP4 fix cache bug 2026-02-18 16:53:57 -08:00
Eugene Rakhmatulin
bd3f45f920 Updated MXFP4 build to use fresh repo references 2026-02-18 13:35:09 -08:00
Eugene Rakhmatulin
4214d4fefe Caching cubins during build for reuse 2026-02-13 19:30:28 -08:00
Eugene Rakhmatulin
da4185cb12 Fixed an issue with fetching latest vLLM code 2026-02-11 22:35:49 -08:00
Eugene Rakhmatulin
3b1e49dcb0 Supporting other CUDA archs via --gpu-arch flag 2026-02-11 13:10:41 -08:00
Eugene Rakhmatulin
ace16f3a8f Applied new fastsafetensors fix to mxfp4 build; disabled wheel builds by default 2026-02-09 23:47:06 -08:00
Eugene Rakhmatulin
d845cd0401 changed arch to 12.1a again 2026-02-08 14:18:12 -08:00
Eugene Rakhmatulin
37953478f0 changed arch codes again to be in line with upcoming PR 2026-02-02 09:21:48 -08:00
Eugene Rakhmatulin
3c7f91081d changed arch flags 2026-02-01 16:37:01 -08:00
Eugene Rakhmatulin
c81edce091 bumped up MXFP4 base image version 2026-01-31 16:12:33 -08:00
Eugene Rakhmatulin
a6d6bafa69 Merge branch 'main' into pytorch-base 2026-01-30 17:06:29 -08:00
Eugene Rakhmatulin
57c890b10c Reduced MXFP4 container size 2026-01-30 15:18:42 -08:00
Eugene Rakhmatulin
008af21383 Merge branch 'main' into pytorch-base 2026-01-30 13:37:03 -08:00
Eugene Rakhmatulin
3a68e1ca46 Fixed #25 2026-01-30 11:20:29 -08:00
Eugene Rakhmatulin
34bd3ae39c Fixed fetching vllm source code in MXFP4 version. 2026-01-30 09:07:01 -08:00
Eugene Rakhmatulin
ef0f996df6 Bumped base image version; reverted Triton to 3.5.1 2026-01-29 23:14:43 -08:00
Eugene Rakhmatulin
0ac438b4dd Some optimizations 2026-01-29 22:08:05 -08:00
Eugene Rakhmatulin
9a907caffc mxfp4 dockerfile optimizations 2026-01-29 14:17:36 -08:00
Eugene Rakhmatulin
b58ba7b19a Added cubins and jit-cache 2026-01-29 11:42:04 -08:00
Eugene Rakhmatulin
36e3b7af27 Removed unnessesary dependencies 2026-01-29 09:58:44 -08:00
Eugene Rakhmatulin
e4b57633fe moved everything to uv 2026-01-29 08:34:49 -08:00
Eugene Rakhmatulin
cef3727f26 Updated SHA for repos 2026-01-28 13:20:03 -08:00
Eugene Rakhmatulin
564afc1f6b Working MXFP4 fork, updated build script 2026-01-26 22:31:46 -08:00
Eugene Rakhmatulin
aece2fad78 Initial import of MXFP4 branch 2026-01-24 22:40:36 -08:00