Kohya S.
c6bc632ec6
fix: metadata dataset degradation and make it work (#2186)
* fix: support dataset with metadata
* feat: support another tagger model
* fix: improve handling of image size and caption/tag processing in FineTuningDataset
* fix: enhance metadata loading to support JSONL format in FineTuningDataset
* feat: enhance image loading and processing in ImageLoadingPrepDataset with batch support and output options
* fix: improve image path handling and memory management in dataset classes
* Update finetune/tag_images_by_wd14_tagger.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* fix: add return type annotation for process_tag_replacement function and ensure tags are returned
* feat: add artist category threshold for tagging
* doc: add comment for clarification
---------
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2026-01-18 15:17:07 +09:00
urlesistiana
f7fc7ddda2
fix #2201: lumina 2 timesteps handling
2025-10-13 16:08:28 +08:00
Kohya S
5462a6bb24
Merge branch 'dev' into sd3
2025-09-29 21:02:02 +09:00
Kohya S
63711390a0
Merge branch 'main' into dev
2025-09-29 20:56:07 +09:00
Kohya S
60bfa97b19
fix: disable_mmap_safetensors not defined in SDXL TI training
2025-09-29 20:52:48 +09:00
Kohya S.
e7b89826c5
Update library/custom_offloading_utils.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-09-21 13:29:58 +09:00
Kohya S
806d535ef1
fix: block-wise scaling is overwritten by per-tensor scaling
2025-09-21 13:10:41 +09:00
Kohya S
3876343fad
fix: remove print statement for guidance rescale in AdaptiveProjectedGuidance
2025-09-21 13:09:38 +09:00
Kohya S
040d976597
feat: add guidance rescale options for Adaptive Projected Guidance in inference
2025-09-21 13:03:14 +09:00
Kohya S
9621d9d637
feat: add Adaptive Projected Guidance parameters and noise rescaling
2025-09-21 12:34:40 +09:00
Kohya S
f41e9e2b58
feat: add vae_chunk_size argument for memory-efficient VAE decoding and processing
2025-09-21 11:09:37 +09:00
Kohya S
b090d15f7d
feat: add multi backend attention and related update for HI2.1 models and scripts
2025-09-20 19:45:33 +09:00
Kohya S
f834b2e0d4
fix: --fp8_vl to work
2025-09-18 23:46:18 +09:00
Kohya S
f6b4bdc83f
feat: block-wise fp8 quantization
2025-09-18 21:20:54 +09:00
Kohya S
f5b004009e
fix: correct tensor indexing in HunyuanVAE2D class for blending and encoding functions
2025-09-17 21:54:25 +09:00
Kohya S
4e2a80a6ca
refactor: update imports to use safetensors_utils for memory-efficient operations
2025-09-13 21:07:11 +09:00
Kohya S
d831c88832
fix: sample generation doesn't work with block swap
2025-09-13 21:06:04 +09:00
Kohya S
bae7fa74eb
Merge branch 'sd3' into feat-hunyuan-image-2.1-inference
2025-09-13 20:13:58 +09:00
Kohya S.
e1c666e97f
Update library/safetensors_utils.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-09-13 20:03:55 +09:00
Kohya S
8783f8aed3
feat: faster safetensors load and split safetensor utils
2025-09-13 19:51:38 +09:00
Kohya S
209c02dbb6
feat: HunyuanImage LoRA training
2025-09-12 21:40:42 +09:00
Kohya S
a0f0afbb46
fix: revert constructor signature update
2025-09-11 22:27:00 +09:00
Kohya S
7f983c558d
feat: block swap for inference and initial impl for HunyuanImage LoRA (not working)
2025-09-11 22:15:22 +09:00
Kohya S
5149be5a87
feat: initial commit for HunyuanImage-2.1 inference
2025-09-11 12:54:12 +09:00
Kohya S
e836b7f66d
fix: chroma LoRA training without Text Encode caching
2025-08-30 09:30:24 +09:00
Kohya S
6edbe00547
feat: update libraries, remove warnings
2025-08-16 20:07:03 +09:00
Kohya S
351bed965c
fix model type handling in analyze_state_dict_state function for SD3
2025-08-13 21:38:51 +09:00
rockerBOO
9bb50c26c4
Set sai_model_spec to must
2025-08-03 00:43:09 -04:00
rockerBOO
10bfcb9ac5
Remove text model spec
2025-08-03 00:40:10 -04:00
rockerBOO
d24d733892
Update model spec to 1.0.1. Refactor model spec
2025-08-02 21:14:27 -04:00
Kohya S
96feb61c0a
feat: implement modulation vector extraction for Chroma and update related methods
2025-07-30 21:34:49 +09:00
Kohya S
6c8973c2da
doc: add reference link for input vector gradient requirement in Chroma class
2025-07-28 22:08:02 +09:00
Kohya S
9eda938876
Merge branch 'sd3' into feature-chroma-support
2025-07-21 13:32:22 +09:00
Kohya S.
d98400b06e
Merge pull request #2138 from kohya-ss/feature-lumina-image
Feature lumina image
2025-07-21 13:21:26 +09:00
Kohya S
0b763ef1f1
feat: fix timestep for input_vec for Chroma
2025-07-20 20:53:06 +09:00
Kohya S
b4e862626a
feat: add LoRA training support for Chroma
2025-07-20 19:00:09 +09:00
Kohya S
c4958b5dca
feat: change img/txt order for attention and single blocks
2025-07-20 16:30:43 +09:00
Kohya S
8fd0b12d1f
feat: update DoubleStreamBlock and SingleStreamBlock to handle text sequence lengths instead of mask
2025-07-20 16:00:58 +09:00
Kohya S
404ddb060d
fix: inference for Chroma model
2025-07-20 14:08:54 +09:00
kohya-ss
24d2ea86c7
feat: support Chroma model in loading and inference processes
2025-07-20 12:56:42 +09:00
Dave Lage
3adbbb6e33
Add note about why we are moving it
2025-07-16 16:09:20 -04:00
rockerBOO
a7b33f3204
Fix alphas cumprod after add_noise for DDIMScheduler
2025-07-15 22:36:46 -04:00
Kohya S
e0fcb5152a
feat: support Neta Lumina all-in-one weights
2025-07-15 21:34:35 +09:00
Kohya S
a96d684ffa
feat: add Chroma model implementation
2025-07-15 20:44:43 +09:00
Kohya S
30295c9668
fix: update parameter names for CFG truncate and Renorm CFG in documentation and code
2025-07-13 21:00:27 +09:00
Kohya S
999df5ec15
fix: update default values for timestep_sampling and model_prediction_type in training arguments
2025-07-13 20:52:00 +09:00
Kohya S
88960e6309
doc: update lumina LoRA training guide
2025-07-13 20:49:38 +09:00
Kohya S
b4d1152293
fix: sample generation with system prompt, without TE output caching
2025-07-09 21:55:36 +09:00
Kohya S
6731d8a57f
fix: update system prompt handling
2025-06-29 22:21:48 +09:00
Kohya S
884c1f37c4
fix: update to work with cache text encoder outputs (without disk)
2025-06-29 21:58:43 +09:00