Kohya-ss-sd-scripts

mirror of https://github.com/kohya-ss/sd-scripts.git synced 2026-04-10 15:00:23 +00:00

Author	SHA1	Message	Date
Kohya S.	34e7138b6a	Add/modify some implementation for anima (#2261 ) * fix: update extend-exclude list in _typos.toml to include configs * fix: exclude anima tests from pytest * feat: add entry for 'temperal' in extend-words section of _typos.toml for Qwen-Image VAE * fix: update default value for --discrete_flow_shift in anima training guide * feat: add Qwen-Image VAE * feat: simplify encode_tokens * feat: use unified attention module, add wrapper for state dict compatibility * feat: loading with dynamic fp8 optimization and LoRA support * feat: add anima minimal inference script (WIP) * format: format * feat: simplify target module selection by regular expression patterns * feat: kept caption dropout rate in cache and handle in training script * feat: update train_llm_adapter and verbose default values to string type * fix: use strategy instead of using tokenizers directly * feat: add dtype property and all-zero mask handling in cross-attention in LLMAdapterTransformerBlock * feat: support 5d tensor in get_noisy_model_input_and_timesteps * feat: update loss calculation to support 5d tensor * fix: update argument names in anima_train_utils to align with other archtectures * feat: simplify Anima training script and update empty caption handling * feat: support LoRA format without `net.` prefix * fix: update to work fp8_scaled option * feat: add regex-based learning rates and dimensions handling in create_network * fix: improve regex matching for module selection and learning rates in LoRANetwork * fix: update logging message for regex match in LoRANetwork * fix: keep latents 4D except DiT call * feat: enhance block swap functionality for inference and training in Anima model * feat: refactor Anima training script * feat: optimize VAE processing by adjusting tensor dimensions and data types * fix: wait all block trasfer before siwtching offloader mode * feat: update Anima training guide with new argument specifications and regex-based module selection. Thank you Claude! * feat: support LORA for Qwen3 * feat: update Anima SAI model spec metadata handling * fix: remove unused code * feat: split CFG processing in do_sample function to reduce memory usage * feat: add VAE chunking and caching options to reduce memory usage * feat: optimize RMSNorm forward method and remove unused torch_attention_op * Update library/strategy_anima.py Use torch.all instead of all. Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update library/safetensors_utils.py Fix duplicated new_key for concat_hook. Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update anima_minimal_inference.py Remove unused code. Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update anima_train.py Remove unused import. Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update library/anima_train_utils.py Remove unused import. Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * fix: review with Copilot * feat: add script to convert LoRA format to ComfyUI compatible format (WIP, not tested yet) * feat: add process_escape function to handle escape sequences in prompts * feat: enhance LoRA weight handling in model loading and add text encoder loading function * feat: improve ComfyUI conversion script with prefix constants and module name adjustments * feat: update caption dropout documentation to clarify cache regeneration requirement * feat: add clarification on learning rate adjustments * feat: add note on PyTorch version requirement to prevent NaN loss --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-02-13 08:15:06 +09:00
duongve13112002	e21a7736f8	Support Anima model (#2260 ) * Support Anima model * Update document and fix bug * Fix latent normlization * Fix typo * Fix cache embedding * fix typo in tests/test_anima_cache.py * Remove redundant argument apply_t5_attn_mask * Improving caching with argument caption_dropout_rate * Fix W&B logging bugs * Fix discrete_flow_shift default value	2026-02-08 10:18:55 +09:00
Kohya S.	b996440c5f	Doc update sd3 branch documentation (#2253 ) * doc: move sample prompt file documentation, and remove history for branch * doc: remove outdated FLUX.1 and SD3 training information from README * doc: update README and training documentation for clarity and structure	2026-01-19 21:38:46 +09:00
Kohya S.	a9af52692a	feat: add pyramid noise and noise offset options to generation script (#2252 ) * feat: add pyramid noise and noise offset options to generation script * fix: fix to work with SD1.5 models * doc: update to match with latest gen_img.py * doc: update README to clarify script capabilities and remove deprecated sections	2026-01-18 16:56:48 +09:00
Kohya S.	c6bc632ec6	fix: metadata dataset degradation and make it work (#2186 ) * fix: support dataset with metadata * feat: support another tagger model * fix: improve handling of image size and caption/tag processing in FineTuningDataset * fix: enhance metadata loading to support JSONL format in FineTuningDataset * feat: enhance image loading and processing in ImageLoadingPrepDataset with batch support and output options * fix: improve image path handling and memory management in dataset classes * Update finetune/tag_images_by_wd14_tagger.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * fix: add return type annotation for process_tag_replacement function and ensure tags are returned * feat: add artist category threshold for tagging * doc: add comment for clarification --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-01-18 15:17:07 +09:00
kohya-ss	a0c26a0efa	docs: enhance text encoder CPU usage instructions for HunyuanImage-2.1 training	2025-09-28 18:21:25 +09:00
Kohya S	31f7df3b3a	doc: add --network_train_unet_only option for HunyuanImage-2.1 training	2025-09-23 18:53:36 +09:00
Kohya S	040d976597	feat: add guidance rescale options for Adaptive Projected Guidance in inference	2025-09-21 13:03:14 +09:00
Kohya S	e7b8e9a778	doc: add --vae_chunk_size option for training and inference	2025-09-21 11:13:26 +09:00
Kohya S	8f20c37949	feat: add --text_encoder_cpu option to reduce VRAM usage by running text encoders on CPU for training	2025-09-20 20:26:20 +09:00
Kohya S	b090d15f7d	feat: add multi backend attention and related update for HI2.1 models and scripts	2025-09-20 19:45:33 +09:00
Kohya S	cbe2a9da45	feat: add conversion script for LoRA models to ComfyUI format with reverse option	2025-09-16 21:48:47 +09:00
kohya-ss	f318ddaeea	docs: update HunyuanImage-2.1 training guide with model download instructions and VRAM optimization settings (by Claude)	2025-09-16 21:18:01 +09:00
kohya-ss	e04b9f0497	docs: add LoRA training guide for HunyuanImage-2.1 model (by Gemini CLI)	2025-09-13 22:06:10 +09:00
kohya-ss	ee8e670765	Merge branch 'sd3' into doc-update-for-latest-features	2025-09-09 12:42:09 +09:00
rockerBOO	fe4c18934c	blocks_to_swap is supported for validation loss now	2025-09-08 14:28:55 -04:00
rockerBOO	78685b9c5f	Move general settings to top to make more clear the validation bits	2025-09-08 14:18:50 -04:00
rockerBOO	ef4397963b	Fix validation dataset documentation to not use subsets	2025-09-08 14:16:33 -04:00
kohya-ss	0bb0d91615	doc: update introduction and clarify command line option priorities in config README	2025-09-06 19:52:54 +09:00
Kohya S.	952f9ce7be	Update docs/train_textual_inversion.md Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-09-04 19:46:04 +09:00
kohya-ss	ddfb38e501	doc: add documentation for Textual Inversion training scripts	2025-09-04 18:39:52 +09:00
kohya-ss	6c82327dc8	doc: remove Japanese section on Gradual Latent options from gen_img README	2025-09-01 21:32:50 +09:00
kohya-ss	9984868154	doc: update README to include support for SDXL models and additional command-line options for gen_img.py	2025-09-01 21:32:24 +09:00
kohya-ss	142d0be180	doc: add comprehensive fine-tuning guide for various model architectures	2025-09-01 12:36:51 +09:00
kohya-ss	c38b07d0da	doc: add validation loss documentation for model training	2025-08-31 21:39:47 +09:00
kohya-ss	80710134d5	doc: add Sage Attention and sample batch size options to Lumina training guide	2025-08-31 21:19:28 +09:00
kohya-ss	fe81d40202	doc: refactor structure for improved readability and maintainability	2025-08-31 21:14:45 +09:00
kohya-ss	989448afdd	doc: enhance SD3/SDXL LoRA training guide	2025-08-31 19:19:10 +09:00
Dave Lage	0ad2cb854d	Update flux_train_network.md	2025-08-02 17:27:55 -04:00
Dave Lage	24c605ee3b	Update flux_train_network.md	2025-08-02 17:21:25 -04:00
Dave Lage	b9c091eafc	Fix validation documentation	2025-08-02 17:19:26 -04:00
Kohya S	250f0eb9b0	doc: update README and training guide with breaking changes for CFG scale and model download instructions	2025-07-30 22:08:51 +09:00
Kohya S	af14eab6d7	doc: update section number for regex-based rank and learning rate configuration in FLUX.1 LoRA guide	2025-07-26 19:37:15 +09:00
Kohya S	c28e7a47c3	feat: add regex-based rank and learning rate configuration for FLUX.1 LoRA	2025-07-26 19:35:42 +09:00
kohya-ss	32f06012a7	doc: update flux train document and add about breaking changes in sample generation prompts	2025-07-21 21:48:06 +09:00
Kohya S	c84a163b32	docs: update README for documentation	2025-07-21 13:40:03 +09:00
Kohya S	7de68c1eb1	Merge branch 'sd3' into update-docs	2025-07-21 13:32:43 +09:00
Kohya S	d300f19045	docs: update Lumina training guide to include inference script and options	2025-07-21 13:15:09 +09:00
Kohya S	30295c9668	fix: update parameter names for CFG truncate and Renorm CFG in documentation and code	2025-07-13 21:00:27 +09:00
Kohya S	999df5ec15	fix: update default values for timestep_sampling and model_prediction_type in training arguments	2025-07-13 20:52:00 +09:00
Kohya S	88960e6309	doc: update lumina LoRA training guide	2025-07-13 20:49:38 +09:00
Kohya S	8a72f56c9f	fix: clarify Flash Attention usage in lumina training guide	2025-07-11 22:14:16 +09:00
kohya-ss	d0b335d8cf	feat: add LoRA training guide for Lumina Image 2.0 (WIP)	2025-07-10 20:15:45 +09:00
Kohya S	a376fec79c	doc: add comprehensive README for image generation script with usage examples and options	2025-05-24 18:48:54 +09:00
Kohya S	e7e371c9ce	doc: update English translation for advanced SDXL LoRA training	2025-05-17 15:06:00 +09:00
Kohya S	08aed008eb	doc: update FLUX.1 for newer features from README.md	2025-05-17 14:42:19 +09:00
Kohya S.	19a180ff90	Add English versions with Japanese in details	2025-05-17 14:28:26 +09:00
Kohya S	176baa6b95	doc: update sd3 and sdxl training guides	2025-04-16 12:32:43 +09:00
Kohya S	b1bbd4576c	doc: update sd3 LoRA, sdxl LoRA advanced	2025-04-14 21:53:21 +09:00
Kohya S	ceb19bebf8	update docs. sdxl is transltaed, flux.1 is corrected	2025-04-13 22:06:58 +09:00

1 2 3

105 Commits