Kohya-ss-sd-scripts

mirror of https://github.com/kohya-ss/sd-scripts.git synced 2026-04-15 08:36:41 +00:00

Author	SHA1	Message	Date
Kohya S	02a75944b3	fix: keep latents 4D except DiT call	2026-02-10 21:26:20 +09:00
Kohya S	6a4e392445	fix: update logging message for regex match in LoRANetwork	2026-02-10 21:21:29 +09:00
Kohya S	f3b6e59900	fix: improve regex matching for module selection and learning rates in LoRANetwork	2026-02-10 08:25:20 +09:00
Kohya S	3d24736238	feat: add regex-based learning rates and dimensions handling in create_network	2026-02-10 08:17:43 +09:00
Kohya S	58db77a488	fix: update to work fp8_scaled option	2026-02-09 23:31:33 +09:00
Kohya S	0f413974b7	feat: support LoRA format without `net.` prefix	2026-02-09 23:21:04 +09:00
kohya-ss	8f5b298906	feat: simplify Anima training script and update empty caption handling	2026-02-09 12:46:53 +09:00
kohya-ss	06dcb30016	fix: update argument names in anima_train_utils to align with other archtectures	2026-02-09 12:46:04 +09:00
kohya-ss	f320c1b964	feat: update loss calculation to support 5d tensor	2026-02-09 12:44:47 +09:00
kohya-ss	bf9806648c	feat: support 5d tensor in get_noisy_model_input_and_timesteps	2026-02-09 12:43:37 +09:00
kohya-ss	2774e7757b	feat: add dtype property and all-zero mask handling in cross-attention in LLMAdapterTransformerBlock	2026-02-09 12:43:11 +09:00
kohya-ss	a1e3d02259	fix: use strategy instead of using tokenizers directly	2026-02-09 12:42:16 +09:00
kohya-ss	35161b044c	feat: update train_llm_adapter and verbose default values to string type	2026-02-09 12:41:36 +09:00
kohya-ss	4f6511bf28	feat: kept caption dropout rate in cache and handle in training script	2026-02-08 15:35:53 +09:00
kohya-ss	c3556d455f	feat: simplify target module selection by regular expression patterns	2026-02-08 13:45:52 +09:00
kohya-ss	d992037984	format: format	2026-02-08 12:22:54 +09:00
kohya-ss	648c045cb0	feat: add anima minimal inference script (WIP)	2026-02-08 12:17:18 +09:00
kohya-ss	5c4643516c	feat: loading with dynamic fp8 optimization and LoRA support	2026-02-08 12:17:02 +09:00
kohya-ss	44b8d79577	feat: use unified attention module, add wrapper for state dict compatibility	2026-02-08 12:16:00 +09:00
kohya-ss	10445ff660	feat: simplify encode_tokens	2026-02-08 12:06:12 +09:00
kohya-ss	7b0ed3269a	feat: add Qwen-Image VAE	2026-02-08 11:13:09 +09:00
kohya-ss	9e2b4dc8a0	fix: update default value for --discrete_flow_shift in anima training guide	2026-02-08 10:56:34 +09:00
kohya-ss	3f3e2684c7	feat: add entry for 'temperal' in extend-words section of _typos.toml for Qwen-Image VAE	2026-02-08 10:41:13 +09:00
kohya-ss	65637517e8	fix: exclude anima tests from pytest	2026-02-08 10:36:33 +09:00
kohya-ss	402c468082	fix: update extend-exclude list in _typos.toml to include configs	2026-02-08 10:22:24 +09:00
duongve13112002	e21a7736f8	Support Anima model (#2260 ) * Support Anima model * Update document and fix bug * Fix latent normlization * Fix typo * Fix cache embedding * fix typo in tests/test_anima_cache.py * Remove redundant argument apply_t5_attn_mask * Improving caching with argument caption_dropout_rate * Fix W&B logging bugs * Fix discrete_flow_shift default value	2026-02-08 10:18:55 +09:00
Kohya S.	b996440c5f	Doc update sd3 branch documentation (#2253 ) * doc: move sample prompt file documentation, and remove history for branch * doc: remove outdated FLUX.1 and SD3 training information from README * doc: update README and training documentation for clarity and structure	2026-01-19 21:38:46 +09:00
Kohya S.	a9af52692a	feat: add pyramid noise and noise offset options to generation script (#2252 ) * feat: add pyramid noise and noise offset options to generation script * fix: fix to work with SD1.5 models * doc: update to match with latest gen_img.py * doc: update README to clarify script capabilities and remove deprecated sections	2026-01-18 16:56:48 +09:00
Kohya S.	c6bc632ec6	fix: metadata dataset degradation and make it work (#2186 ) * fix: support dataset with metadata * feat: support another tagger model * fix: improve handling of image size and caption/tag processing in FineTuningDataset * fix: enhance metadata loading to support JSONL format in FineTuningDataset * feat: enhance image loading and processing in ImageLoadingPrepDataset with batch support and output options * fix: improve image path handling and memory management in dataset classes * Update finetune/tag_images_by_wd14_tagger.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * fix: add return type annotation for process_tag_replacement function and ensure tags are returned * feat: add artist category threshold for tagging * doc: add comment for clarification --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-01-18 15:17:07 +09:00
Kohya S.	f7f971f50d	Merge pull request #2251 from kohya-ss/fix-pytest-for-lumina fix(tests): add ip_noise_gamma args for MockArgs in pytest	2026-01-18 15:09:47 +09:00
Kohya S	c4be615f69	fix(tests): add ip_noise_gamma args for MockArgs in pytest	2026-01-18 15:05:57 +09:00
Kohya S.	e06e063970	Merge pull request #2225 from urlesistiana/sd3_lumina2_ts_fix fix: lumina 2 timesteps handling	2026-01-18 14:39:04 +09:00
Kohya S.	94e3dbebea	Merge pull request #2246 from kozistr/deps/pytorch-optimizer Bump `pytorch-optimizer` version to v3.9.0	2025-12-21 22:51:32 +09:00
kozistr	95a65b89a5	build(deps): bump pytorch-optimizer to v3.9.0	2025-12-21 15:53:47 +09:00
Kohya S.	a5a162044c	Merge pull request #2226 from kohya-ss/fix-hunyuan-image-batch-gen-error fix: error on batch generation closes #2209	2025-10-15 21:57:45 +09:00
Kohya S	a33cad714e	fix: error on batch generation closes #2209	2025-10-15 21:57:11 +09:00
urlesistiana	f7fc7ddda2	fix #2201 : lumina 2 timesteps handling	2025-10-13 16:08:28 +08:00
Kohya S.	5e366acda4	Merge pull request #2003 from laolongboy/sd3-dev Fix missing parameters in model conversion script	2025-10-01 21:03:12 +09:00
Kohya S	5462a6bb24	Merge branch 'dev' into sd3	2025-09-29 21:02:02 +09:00
Kohya S	63711390a0	Merge branch 'main' into dev	2025-09-29 20:56:07 +09:00
Kohya S.	206adb6438	Merge pull request #2216 from kohya-ss/fix-sdxl-textual-inversion-training-disable-mmap fix: disable_mmap_safetensors not defined in SDXL TI training	2025-09-29 20:55:02 +09:00
Kohya S	60bfa97b19	fix: disable_mmap_safetensors not defined in SDXL TI training	2025-09-29 20:52:48 +09:00
Kohya S.	f0c767e0f2	Merge pull request #2213 from kohya-ss/doc-hunyuan-image-training-text-encoder-cpu-note docs: enhance text encoder CPU usage instructions for HunyuanImage-2.…	2025-09-28 18:32:11 +09:00
kohya-ss	a0c26a0efa	docs: enhance text encoder CPU usage instructions for HunyuanImage-2.1 training	2025-09-28 18:21:25 +09:00
Kohya S.	67d0621313	Merge pull request #2212 from kohya-ss/fix-hunyuan-image-sample-generation fix: HunyuanImage-2.1 sample generation fails	2025-09-28 18:12:04 +09:00
Kohya S	6a826d21b1	feat: add new parameters for sample image inference configuration	2025-09-28 18:06:17 +09:00
Kohya S.	4c197a538b	Merge pull request #2207 from kohya-ss/fix-flux-extract-lora-metadata-failed fix: update metadata construction to include model_config for flux	2025-09-24 21:19:27 +09:00
Kohya S	4b79d73504	fix: update metadata construction to include model_config for flux	2025-09-24 21:15:37 +09:00
Kohya S.	121853ca2a	Merge pull request #2198 from kohya-ss/feat-hunyuan-image-2.1-inference feat: support HunyuanImage-2.1	2025-09-23 19:11:50 +09:00
Kohya S	58df9dffa4	doc: update README with HunyuanImage-2.1 LoRA training details and requirements	2025-09-23 18:59:02 +09:00

1 2 3 4 5 ...

2499 Commits