Kohya S
02a75944b3
fix: keep latents 4D except DiT call
2026-02-10 21:26:20 +09:00
Kohya S
6a4e392445
fix: update logging message for regex match in LoRANetwork
2026-02-10 21:21:29 +09:00
Kohya S
f3b6e59900
fix: improve regex matching for module selection and learning rates in LoRANetwork
2026-02-10 08:25:20 +09:00
Kohya S
3d24736238
feat: add regex-based learning rates and dimensions handling in create_network
2026-02-10 08:17:43 +09:00
Kohya S
58db77a488
fix: update to work with fp8_scaled option
2026-02-09 23:31:33 +09:00
Kohya S
0f413974b7
feat: support LoRA format without net. prefix
2026-02-09 23:21:04 +09:00
kohya-ss
8f5b298906
feat: simplify Anima training script and update empty caption handling
2026-02-09 12:46:53 +09:00
kohya-ss
06dcb30016
fix: update argument names in anima_train_utils to align with other architectures
2026-02-09 12:46:04 +09:00
kohya-ss
f320c1b964
feat: update loss calculation to support 5d tensor
2026-02-09 12:44:47 +09:00
kohya-ss
bf9806648c
feat: support 5d tensor in get_noisy_model_input_and_timesteps
2026-02-09 12:43:37 +09:00
kohya-ss
2774e7757b
feat: add dtype property and all-zero mask handling in cross-attention in LLMAdapterTransformerBlock
2026-02-09 12:43:11 +09:00
kohya-ss
a1e3d02259
fix: use strategy instead of using tokenizers directly
2026-02-09 12:42:16 +09:00
kohya-ss
35161b044c
feat: update train_llm_adapter and verbose default values to string type
2026-02-09 12:41:36 +09:00
kohya-ss
4f6511bf28
feat: keep caption dropout rate in cache and handle it in training script
2026-02-08 15:35:53 +09:00
kohya-ss
c3556d455f
feat: simplify target module selection by regular expression patterns
2026-02-08 13:45:52 +09:00
kohya-ss
d992037984
format: format
2026-02-08 12:22:54 +09:00
kohya-ss
648c045cb0
feat: add anima minimal inference script (WIP)
2026-02-08 12:17:18 +09:00
kohya-ss
5c4643516c
feat: loading with dynamic fp8 optimization and LoRA support
2026-02-08 12:17:02 +09:00
kohya-ss
44b8d79577
feat: use unified attention module, add wrapper for state dict compatibility
2026-02-08 12:16:00 +09:00
kohya-ss
10445ff660
feat: simplify encode_tokens
2026-02-08 12:06:12 +09:00
kohya-ss
7b0ed3269a
feat: add Qwen-Image VAE
2026-02-08 11:13:09 +09:00
kohya-ss
9e2b4dc8a0
fix: update default value for --discrete_flow_shift in anima training guide
2026-02-08 10:56:34 +09:00
kohya-ss
3f3e2684c7
feat: add entry for 'temperal' in extend-words section of _typos.toml for Qwen-Image VAE
2026-02-08 10:41:13 +09:00
kohya-ss
65637517e8
fix: exclude anima tests from pytest
2026-02-08 10:36:33 +09:00
kohya-ss
402c468082
fix: update extend-exclude list in _typos.toml to include configs
2026-02-08 10:22:24 +09:00
duongve13112002
e21a7736f8
Support Anima model (#2260)
* Support Anima model
* Update document and fix bug
* Fix latent normalization
* Fix typo
* Fix cache embedding
* fix typo in tests/test_anima_cache.py
* Remove redundant argument apply_t5_attn_mask
* Improving caching with argument caption_dropout_rate
* Fix W&B logging bugs
* Fix discrete_flow_shift default value
2026-02-08 10:18:55 +09:00
Kohya S.
b996440c5f
Doc update sd3 branch documentation (#2253)
* doc: move sample prompt file documentation, and remove history for branch
* doc: remove outdated FLUX.1 and SD3 training information from README
* doc: update README and training documentation for clarity and structure
2026-01-19 21:38:46 +09:00
Kohya S.
a9af52692a
feat: add pyramid noise and noise offset options to generation script (#2252)
* feat: add pyramid noise and noise offset options to generation script
* fix: fix to work with SD1.5 models
* doc: update to match with latest gen_img.py
* doc: update README to clarify script capabilities and remove deprecated sections
2026-01-18 16:56:48 +09:00
Kohya S.
c6bc632ec6
fix: metadata dataset degradation and make it work (#2186)
* fix: support dataset with metadata
* feat: support another tagger model
* fix: improve handling of image size and caption/tag processing in FineTuningDataset
* fix: enhance metadata loading to support JSONL format in FineTuningDataset
* feat: enhance image loading and processing in ImageLoadingPrepDataset with batch support and output options
* fix: improve image path handling and memory management in dataset classes
* Update finetune/tag_images_by_wd14_tagger.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* fix: add return type annotation for process_tag_replacement function and ensure tags are returned
* feat: add artist category threshold for tagging
* doc: add comment for clarification
---------
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2026-01-18 15:17:07 +09:00
Kohya S.
f7f971f50d
Merge pull request #2251 from kohya-ss/fix-pytest-for-lumina
fix(tests): add ip_noise_gamma args for MockArgs in pytest
2026-01-18 15:09:47 +09:00
Kohya S
c4be615f69
fix(tests): add ip_noise_gamma args for MockArgs in pytest
2026-01-18 15:05:57 +09:00
Kohya S.
e06e063970
Merge pull request #2225 from urlesistiana/sd3_lumina2_ts_fix
fix: lumina 2 timesteps handling
2026-01-18 14:39:04 +09:00
Kohya S.
94e3dbebea
Merge pull request #2246 from kozistr/deps/pytorch-optimizer
Bump `pytorch-optimizer` version to v3.9.0
2025-12-21 22:51:32 +09:00
kozistr
95a65b89a5
build(deps): bump pytorch-optimizer to v3.9.0
2025-12-21 15:53:47 +09:00
Kohya S.
a5a162044c
Merge pull request #2226 from kohya-ss/fix-hunyuan-image-batch-gen-error
fix: error on batch generation (closes #2209)
2025-10-15 21:57:45 +09:00
Kohya S
a33cad714e
fix: error on batch generation (closes #2209)
2025-10-15 21:57:11 +09:00
urlesistiana
f7fc7ddda2
fix #2201: lumina 2 timesteps handling
2025-10-13 16:08:28 +08:00
Kohya S.
5e366acda4
Merge pull request #2003 from laolongboy/sd3-dev
Fix missing parameters in model conversion script
2025-10-01 21:03:12 +09:00
Kohya S
5462a6bb24
Merge branch 'dev' into sd3
2025-09-29 21:02:02 +09:00
Kohya S
63711390a0
Merge branch 'main' into dev
2025-09-29 20:56:07 +09:00
Kohya S.
206adb6438
Merge pull request #2216 from kohya-ss/fix-sdxl-textual-inversion-training-disable-mmap
fix: disable_mmap_safetensors not defined in SDXL TI training
2025-09-29 20:55:02 +09:00
Kohya S
60bfa97b19
fix: disable_mmap_safetensors not defined in SDXL TI training
2025-09-29 20:52:48 +09:00
Kohya S.
f0c767e0f2
Merge pull request #2213 from kohya-ss/doc-hunyuan-image-training-text-encoder-cpu-note
docs: enhance text encoder CPU usage instructions for HunyuanImage-2.…
2025-09-28 18:32:11 +09:00
kohya-ss
a0c26a0efa
docs: enhance text encoder CPU usage instructions for HunyuanImage-2.1 training
2025-09-28 18:21:25 +09:00
Kohya S.
67d0621313
Merge pull request #2212 from kohya-ss/fix-hunyuan-image-sample-generation
fix: HunyuanImage-2.1 sample generation fails
2025-09-28 18:12:04 +09:00
Kohya S
6a826d21b1
feat: add new parameters for sample image inference configuration
2025-09-28 18:06:17 +09:00
Kohya S.
4c197a538b
Merge pull request #2207 from kohya-ss/fix-flux-extract-lora-metadata-failed
fix: update metadata construction to include model_config for flux
2025-09-24 21:19:27 +09:00
Kohya S
4b79d73504
fix: update metadata construction to include model_config for flux
2025-09-24 21:15:37 +09:00
Kohya S.
121853ca2a
Merge pull request #2198 from kohya-ss/feat-hunyuan-image-2.1-inference
feat: support HunyuanImage-2.1
2025-09-23 19:11:50 +09:00
Kohya S
58df9dffa4
doc: update README with HunyuanImage-2.1 LoRA training details and requirements
2025-09-23 18:59:02 +09:00