| Author | Commit | Message | Date |
|---|---|---|---|
| Eugene Rakhmatulin | ad662f9bab | Changed MXFP4 CUTLASS SHA | 2026-02-18 18:20:15 -08:00 |
| Eugene Rakhmatulin | b959818536 | MXFP4 fix cache bug | 2026-02-18 16:53:57 -08:00 |
| Eugene Rakhmatulin | bd3f45f920 | Updated MXFP4 build to use fresh repo references | 2026-02-18 13:35:09 -08:00 |
| Eugene Rakhmatulin | 4214d4fefe | Caching cubins during build for reuse | 2026-02-13 19:30:28 -08:00 |
| Eugene Rakhmatulin | da4185cb12 | Fixed an issue with fetching latest vLLM code | 2026-02-11 22:35:49 -08:00 |
| Eugene Rakhmatulin | 3b1e49dcb0 | Supporting other CUDA archs via --gpu-arch flag | 2026-02-11 13:10:41 -08:00 |
| Eugene Rakhmatulin | ace16f3a8f | Applied new fastsafetensors fix to mxfp4 build; disabled wheel builds by default | 2026-02-09 23:47:06 -08:00 |
| Eugene Rakhmatulin | d845cd0401 | changed arch to 12.1a again | 2026-02-08 14:18:12 -08:00 |
| Eugene Rakhmatulin | 37953478f0 | changed arch codes again to be in line with upcoming PR | 2026-02-02 09:21:48 -08:00 |
| Eugene Rakhmatulin | 3c7f91081d | changed arch flags | 2026-02-01 16:37:01 -08:00 |
| Eugene Rakhmatulin | c81edce091 | bumped up MXFP4 base image version | 2026-01-31 16:12:33 -08:00 |
| Eugene Rakhmatulin | a6d6bafa69 | Merge branch 'main' into pytorch-base | 2026-01-30 17:06:29 -08:00 |
| Eugene Rakhmatulin | 57c890b10c | Reduced MXFP4 container size | 2026-01-30 15:18:42 -08:00 |
| Eugene Rakhmatulin | 008af21383 | Merge branch 'main' into pytorch-base | 2026-01-30 13:37:03 -08:00 |
| Eugene Rakhmatulin | 3a68e1ca46 | Fixed #25 | 2026-01-30 11:20:29 -08:00 |
| Eugene Rakhmatulin | 34bd3ae39c | Fixed fetching vllm source code in MXFP4 version. | 2026-01-30 09:07:01 -08:00 |
| Eugene Rakhmatulin | ef0f996df6 | Bumped base image version; reverted Triton to 3.5.1 | 2026-01-29 23:14:43 -08:00 |
| Eugene Rakhmatulin | 0ac438b4dd | Some optimizations | 2026-01-29 22:08:05 -08:00 |
| Eugene Rakhmatulin | 9a907caffc | mxfp4 dockerfile optimizations | 2026-01-29 14:17:36 -08:00 |
| Eugene Rakhmatulin | b58ba7b19a | Added cubins and jit-cache | 2026-01-29 11:42:04 -08:00 |
| Eugene Rakhmatulin | 36e3b7af27 | Removed unnecessary dependencies | 2026-01-29 09:58:44 -08:00 |
| Eugene Rakhmatulin | e4b57633fe | moved everything to uv | 2026-01-29 08:34:49 -08:00 |
| Eugene Rakhmatulin | cef3727f26 | Updated SHA for repos | 2026-01-28 13:20:03 -08:00 |
| Eugene Rakhmatulin | 564afc1f6b | Working MXFP4 fork, updated build script | 2026-01-26 22:31:46 -08:00 |
| Eugene Rakhmatulin | aece2fad78 | Initial import of MXFP4 branch | 2026-01-24 22:40:36 -08:00 |