Highlights for 2024-06-23
Following zero-day SD3 release, a 10 days later here’s a refresh with 10+ improvements
including full prompt attention, support for compressed weights, additional text-encoder quantization modes.
But there’s more than SD3:
- support for quantized T5 text encoder FP16/FP8/FP4/INT8 in all models that use T5: SD3, PixArt-Σ, etc.
- support for PixArt-Sigma in small/medium/large variants
- support for HunyuanDiT 1.1
- additional NNCF weights compression support: SD3, PixArt, ControlNet, Lora
- integration of MS Florence VLM/VQA Base and Large models
- (finally) new release of Torch-DirectML
- additional efficiencies for users with low VRAM GPUs
- over 20 overall fixes
You must log in or register to comment.