Kohya-ss-sd-scripts

mirror of https://github.com/kohya-ss/sd-scripts.git synced 2026-04-16 17:02:45 +00:00

Author	SHA1	Message	Date
rockerBOO	c8a4e99074	Add --cdc_debug flag and tqdm progress for CDC preprocessing - Add --cdc_debug flag to enable verbose bucket-by-bucket output - When debug=False (default): Show tqdm progress bar, concise logging - When debug=True: Show detailed bucket information, no progress bar - Improves user experience during CDC cache generation	2025-10-09 18:28:51 -04:00
rockerBOO	7a7110cdc6	Use logger instead of print for CDC loading messages	2025-10-09 18:28:51 -04:00
rockerBOO	1d4c4d4cb2	Fix: Replace CDC integer index lookup with image_key strings Fixes shape mismatch bug in multi-subset training where CDC preprocessing and training used different index calculations, causing wrong CDC data to be loaded for samples. Changes: - CDC cache now stores/loads data using image_key strings instead of integer indices - Training passes image_key list instead of computed integer indices - All CDC lookups use stable image_key identifiers - Improved device compatibility check (handles "cuda" vs "cuda:0") - Updated all 30 CDC tests to use image_key-based access Root cause: Preprocessing used cumulative dataset indices while training used sorted keys, resulting in mismatched lookups during shuffled multi-subset training.	2025-10-09 18:28:51 -04:00
rockerBOO	4bea582601	Fix: Prevent false device mismatch warnings for cuda vs cuda:0 - Treat cuda and cuda:0 as compatible devices - Only warn on actual device mismatches (cuda vs cpu) - Eliminates warning spam during multi-subset training	2025-10-09 18:28:51 -04:00
rockerBOO	ee8ceee178	Add device consistency validation for CDC transformation - Check that noise and CDC matrices are on same device - Automatically transfer noise if device mismatch detected - Warn user when device transfer occurs - Add tests to verify device handling	2025-10-09 18:28:51 -04:00
rockerBOO	ce17007e1a	Add warning throttling for CDC shape mismatches - Track warned samples in global set to prevent log spam - Each sample only warned once per training session - Prevents thousands of duplicate warnings during training - Add tests to verify throttling behavior	2025-10-09 18:28:50 -04:00
rockerBOO	88af20881d	Fix: Enable gradient flow through CDC noise transformation - Remove @torch.no_grad() decorator from compute_sigma_t_x() - Gradients now properly flow through CDC transformation during training - Add comprehensive gradient flow tests for fast/slow paths and fallback - All 25 CDC tests passing	2025-10-09 18:28:50 -04:00
rockerBOO	0d822b2f74	Refactor: Extract CDC noise transformation to separate function - Create apply_cdc_noise_transformation() for better modularity - Implement fast path for batch processing when all shapes match - Implement slow path for per-sample processing on shape mismatch - Clone noise tensors in fallback path for gradient consistency	2025-10-09 18:28:50 -04:00
rockerBOO	e03200bdba	Optimize: Cache CDC shapes in memory to eliminate I/O bottleneck - Cache all shapes during GammaBDataset initialization - Eliminates file I/O on every training step (9.5M accesses/sec) - Reduces get_shape() from file operation to dict lookup - Memory overhead: ~126 bytes/sample (~12.6 MB per 100k images)	2025-10-09 18:28:50 -04:00
rockerBOO	f552f9a3bd	Add CDC-FM (Carré du Champ Flow Matching) support Implements geometry-aware noise generation for FLUX training based on arXiv:2510.05930v1.	2025-10-09 18:28:47 -04:00
Kohya S.	5e366acda4	Merge pull request #2003 from laolongboy/sd3-dev Fix missing parameters in model conversion script	2025-10-01 21:03:12 +09:00
Kohya S	5462a6bb24	Merge branch 'dev' into sd3	2025-09-29 21:02:02 +09:00
Kohya S	63711390a0	Merge branch 'main' into dev	2025-09-29 20:56:07 +09:00
Kohya S.	206adb6438	Merge pull request #2216 from kohya-ss/fix-sdxl-textual-inversion-training-disable-mmap fix: disable_mmap_safetensors not defined in SDXL TI training	2025-09-29 20:55:02 +09:00
Kohya S	60bfa97b19	fix: disable_mmap_safetensors not defined in SDXL TI training	2025-09-29 20:52:48 +09:00
Kohya S.	f0c767e0f2	Merge pull request #2213 from kohya-ss/doc-hunyuan-image-training-text-encoder-cpu-note docs: enhance text encoder CPU usage instructions for HunyuanImage-2.…	2025-09-28 18:32:11 +09:00
kohya-ss	a0c26a0efa	docs: enhance text encoder CPU usage instructions for HunyuanImage-2.1 training	2025-09-28 18:21:25 +09:00
Kohya S.	67d0621313	Merge pull request #2212 from kohya-ss/fix-hunyuan-image-sample-generation fix: HunyuanImage-2.1 sample generation fails	2025-09-28 18:12:04 +09:00
Kohya S	6a826d21b1	feat: add new parameters for sample image inference configuration	2025-09-28 18:06:17 +09:00
Kohya S.	4c197a538b	Merge pull request #2207 from kohya-ss/fix-flux-extract-lora-metadata-failed fix: update metadata construction to include model_config for flux	2025-09-24 21:19:27 +09:00
Kohya S	4b79d73504	fix: update metadata construction to include model_config for flux	2025-09-24 21:15:37 +09:00
Kohya S.	121853ca2a	Merge pull request #2198 from kohya-ss/feat-hunyuan-image-2.1-inference feat: support HunyuanImage-2.1	2025-09-23 19:11:50 +09:00
Kohya S	58df9dffa4	doc: update README with HunyuanImage-2.1 LoRA training details and requirements	2025-09-23 18:59:02 +09:00
Kohya S	31f7df3b3a	doc: add --network_train_unet_only option for HunyuanImage-2.1 training	2025-09-23 18:53:36 +09:00
Kohya S.	753c794549	Update hunyuan_image_train_network.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-09-21 13:30:22 +09:00
Kohya S.	e7b89826c5	Update library/custom_offloading_utils.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-09-21 13:29:58 +09:00
Kohya S	806d535ef1	fix: block-wise scaling is overwritten by per-tensor scaling	2025-09-21 13:10:41 +09:00
Kohya S	3876343fad	fix: remove print statement for guidance rescale in AdaptiveProjectedGuidance	2025-09-21 13:09:38 +09:00
Kohya S	040d976597	feat: add guidance rescale options for Adaptive Projected Guidance in inference	2025-09-21 13:03:14 +09:00
Kohya S	9621d9d637	feat: add Adaptive Projected Guidance parameters and noise rescaling	2025-09-21 12:34:40 +09:00
Kohya S	e7b8e9a778	doc: add --vae_chunk_size option for training and inference	2025-09-21 11:13:26 +09:00
Kohya S	f41e9e2b58	feat: add vae_chunk_size argument for memory-efficient VAE decoding and processing	2025-09-21 11:09:37 +09:00
Kohya S	8f20c37949	feat: add --text_encoder_cpu option to reduce VRAM usage by running text encoders on CPU for training	2025-09-20 20:26:20 +09:00
Kohya S	b090d15f7d	feat: add multi backend attention and related update for HI2.1 models and scripts	2025-09-20 19:45:33 +09:00
Kohya S	f834b2e0d4	fix: --fp8_vl to work	2025-09-18 23:46:18 +09:00
Kohya S	f6b4bdc83f	feat: block-wise fp8 quantization	2025-09-18 21:20:54 +09:00
Kohya S	2ce506e187	fix: fp8 casting not working	2025-09-18 21:20:08 +09:00
Kohya S	f5b004009e	fix: correct tensor indexing in HunyuanVAE2D class for blending and encoding functions	2025-09-17 21:54:25 +09:00
Kohya S	cbe2a9da45	feat: add conversion script for LoRA models to ComfyUI format with reverse option	2025-09-16 21:48:47 +09:00
kohya-ss	f318ddaeea	docs: update HunyuanImage-2.1 training guide with model download instructions and VRAM optimization settings (by Claude)	2025-09-16 21:18:01 +09:00
kohya-ss	39458ec0e3	fix: update default values for guidance_scale, image_size, infer_steps, and flow_shift in argument parser	2025-09-16 21:17:21 +09:00
Kohya S	2732be0b29	Merge branch 'feat-hunyuan-image-2.1-inference' of https://github.com/kohya-ss/sd-scripts into feat-hunyuan-image-2.1-inference	2025-09-14 20:49:24 +09:00
Kohya S	1a73b5e8a5	feat: add script to convert LoRA format to ComfyUI format	2025-09-14 20:49:20 +09:00
kohya-ss	e04b9f0497	docs: add LoRA training guide for HunyuanImage-2.1 model (by Gemini CLI)	2025-09-13 22:06:10 +09:00
Kohya S	29b0500e70	fix: restore files section in _typos.toml for exclusion configuration	2025-09-13 21:18:50 +09:00
Kohya S	4e2a80a6ca	refactor: update imports to use safetensors_utils for memory-efficient operations	2025-09-13 21:07:11 +09:00
Kohya S	d831c88832	fix: sample generation doesn't work with block swap	2025-09-13 21:06:04 +09:00
Kohya S	bae7fa74eb	Merge branch 'sd3' into feat-hunyuan-image-2.1-inference	2025-09-13 20:13:58 +09:00
Kohya S.	f5d44fd487	Merge pull request #2200 from kohya-ss/feat-faster-safetensors-load feat: Speeding up loading .safetensors files	2025-09-13 20:09:03 +09:00
Kohya S	4568631b43	docs: update README to reflect improved loading speed of .safetensors files	2025-09-13 20:05:39 +09:00
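
Two of the CDC commits above (4bea582601 and ce17007e1a) describe treating "cuda" and "cuda:0" as compatible devices and warning only once per sample. The following is a minimal sketch of those two ideas in plain Python, not the code from the repository. The names normalize_device, devices_compatible, and warn_once are hypothetical, devices are handled as plain strings rather than torch.device objects, and mapping a bare "cuda" to index 0 is an assumption made for illustration.

```python
import logging

logger = logging.getLogger(__name__)

# Keys that have already triggered a warning in this session (hypothetical name).
_warned_keys: set[str] = set()


def normalize_device(device: str) -> tuple[str, int]:
    """Split 'cuda:1' into ('cuda', 1). Assumption: a bare 'cuda' means index 0."""
    kind, _, index = device.partition(":")
    return kind, int(index) if index else 0


def devices_compatible(a: str, b: str) -> bool:
    """Return True when both strings name the same device after normalization."""
    return normalize_device(a) == normalize_device(b)


def warn_once(image_key: str, message: str) -> bool:
    """Log a warning the first time image_key is seen; return whether it was logged."""
    if image_key in _warned_keys:
        return False
    _warned_keys.add(image_key)
    logger.warning("%s: %s", image_key, message)
    return True
```

Under these assumptions, devices_compatible("cuda", "cuda:0") is true while devices_compatible("cuda", "cpu") is false, and a second warn_once call with the same key logs nothing. Keying the set by image_key string rather than by integer index follows the direction of commit 1d4c4d4cb2.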

2472 Commits