Updated README

This commit is contained in:
Eugene Rakhmatulin
2026-04-13 11:27:57 -07:00
parent cf4cb35356
commit c026c92bd0

View File

@@ -135,6 +135,18 @@ For periodic maintenance, I recommend using a filter: `docker builder prune --fi
## CHANGELOG ## CHANGELOG
### 2026-04-12
#### Drop-caches mod for Qwen3.5-397B
Updated Qwen3.5-397B recipe (for dual node configuration) to use the new mod `mods/drop-caches` which clears filesystem caches every minute while the container is running, resolving fastsafetensors getting stuck during loading and a few other bugs when operating close to max memory limit.
### 2026-04-11
#### Pinned PyTorch Version
Pinned PyTorch to version 2.11.0 (previously using nightly builds) to fix incompatibility with transformers 5.x and avoid torch version mismatch in builds.
### 2026-04-02 ### 2026-04-02
A new recipe for Gemma4-26B-A4B in "on-the-fly" FP8 quantization: A new recipe for Gemma4-26B-A4B in "on-the-fly" FP8 quantization: