Kohya-ss-sd-scripts

mirror of https://github.com/kohya-ss/sd-scripts.git synced 2026-04-16 17:02:45 +00:00

Author	SHA1	Message	Date
rockerBOO	8458a5696e	Add graceful fallback when FAISS is not installed - Make FAISS import optional with try/except - CDCPreprocessor raises helpful ImportError if FAISS unavailable - train_util.py catches ImportError and returns None - train_network.py checks for None and warns user - Training continues without CDC-FM if FAISS not installed - Remove benchmark file (not needed in repo) This allows users to run training without FAISS dependency. CDC-FM will be automatically disabled with a warning if FAISS is missing.	2025-10-09 23:50:07 -04:00
rockerBOO	7ca799ca26	Add adaptive k_neighbors support for CDC-FM - Add --cdc_adaptive_k flag to enable adaptive k based on bucket size - Add --cdc_min_bucket_size to set minimum bucket threshold (default: 16) - Fixed mode (default): Skip buckets with < k_neighbors samples - Adaptive mode: Use k=min(k_neighbors, bucket_size-1) for buckets >= min_bucket_size - Update CDCPreprocessor to support adaptive k per bucket - Add metadata tracking for adaptive_k and min_bucket_size - Add comprehensive pytest tests for adaptive k behavior This allows CDC-FM to work effectively with multi-resolution bucketing where bucket sizes may vary widely. Users can choose between strict paper methodology (fixed k) or pragmatic approach (adaptive k).	2025-10-09 23:16:44 -04:00
rockerBOO	c8a4e99074	Add --cdc_debug flag and tqdm progress for CDC preprocessing - Add --cdc_debug flag to enable verbose bucket-by-bucket output - When debug=False (default): Show tqdm progress bar, concise logging - When debug=True: Show detailed bucket information, no progress bar - Improves user experience during CDC cache generation	2025-10-09 18:28:51 -04:00
rockerBOO	1d4c4d4cb2	Fix: Replace CDC integer index lookup with image_key strings Fixes shape mismatch bug in multi-subset training where CDC preprocessing and training used different index calculations, causing wrong CDC data to be loaded for samples. Changes: - CDC cache now stores/loads data using image_key strings instead of integer indices - Training passes image_key list instead of computed integer indices - All CDC lookups use stable image_key identifiers - Improved device compatibility check (handles "cuda" vs "cuda:0") - Updated all 30 CDC tests to use image_key-based access Root cause: Preprocessing used cumulative dataset indices while training used sorted keys, resulting in mismatched lookups during shuffled multi-subset training.	2025-10-09 18:28:51 -04:00
rockerBOO	f552f9a3bd	Add CDC-FM (Carré du Champ Flow Matching) support Implements geometry-aware noise generation for FLUX training based on arXiv:2510.05930v1.	2025-10-09 18:28:47 -04:00
Kohya S	209c02dbb6	feat: HunyuanImage LoRA training	2025-09-12 21:40:42 +09:00
Kohya S	7f983c558d	feat: block swap for inference and initial impl for HunyuanImage LoRA (not working)	2025-09-11 22:15:22 +09:00
Kohya S	6edbe00547	feat: update libraries, remove warnings	2025-08-16 20:07:03 +09:00
rockerBOO	d24d733892	Update model spec to 1.0.1. Refactor model spec	2025-08-02 21:14:27 -04:00
Kohya S	9eda938876	Merge branch 'sd3' into feature-chroma-support	2025-07-21 13:32:22 +09:00
Kohya S.	d98400b06e	Merge pull request #2138 from kohya-ss/feature-lumina-image Feature lumina image	2025-07-21 13:21:26 +09:00
Kohya S	b4e862626a	feat: add LoRA training support for Chroma	2025-07-20 19:00:09 +09:00
Dave Lage	3adbbb6e33	Add note about why we are moving it	2025-07-16 16:09:20 -04:00
rockerBOO	a7b33f3204	Fix alphas cumprod after add_noise for DDIMScheduler	2025-07-15 22:36:46 -04:00
Kohya S	30295c9668	fix: update parameter names for CFG truncate and Renorm CFG in documentation and code	2025-07-13 21:00:27 +09:00
rockerBOO	0e929f97b9	Revert system_prompt for dataset config	2025-06-16 16:50:18 -04:00
rockerBOO	0145efc2f2	Merge branch 'sd3' into lumina	2025-06-09 18:13:06 -04:00
Kohya S.	7c075a9c8d	Merge pull request #2060 from saibit-tech/sd3 Fix: try aligning dtype of matrixes when training with deepspeed and mixed-precision is set to bf16 or fp16	2025-05-01 23:20:17 +09:00
Kohya S	64430eb9b2	Merge branch 'dev' into sd3	2025-04-29 21:30:57 +09:00
Kohya S	d8717a3d1c	Merge branch 'main' into dev	2025-04-29 21:30:33 +09:00
Kohya S	4625b34f4e	Fix mean image aspect ratio error calculation to avoid NaN values	2025-04-29 21:27:04 +09:00
sdbds	4fc917821a	fix bugs	2025-04-23 16:16:36 +08:00
sdbds	899f3454b6	update for init problem	2025-04-23 15:47:12 +08:00
saibit	7c61c0dfe0	Add autocast warpper for forward functions in deepspeed_utils.py to try aligning precision when using mixed precision in training process	2025-04-22 16:06:55 +08:00
Kohya S	629073cd9d	Add guidance scale for prompt param and flux sampling	2025-04-16 21:50:36 +09:00
sdbds	7f93e21f30	fix typo	2025-04-06 16:21:48 +08:00
青龍聖者@bdsqlsz	9f1892cc8e	Merge branch 'sd3' into lumina	2025-04-06 16:13:43 +08:00
Kohya S	f1423a7229	fix: add resize_interpolation parameter to FineTuningDataset constructor	2025-04-03 21:48:51 +09:00
Kohya S	b3c56b22bd	Merge branch 'dev' into sd3	2025-03-31 22:05:40 +09:00
Kohya S	1f432e2c0e	use PIL for lanczos and box	2025-03-30 20:40:29 +09:00
Kohya S.	93a4efabb5	Merge branch 'sd3' into resize-interpolation	2025-03-30 19:30:56 +09:00
Disty0	620a06f517	Check for uppercase file extension too	2025-03-17 17:44:29 +03:00
rockerBOO	1f22a94cfe	Update embedder_dims, add more flexible caption extension	2025-03-04 02:25:50 -05:00
rockerBOO	ce2610d29b	Change system prompt to inject Prompt Start special token	2025-02-27 02:47:04 -05:00
rockerBOO	7b83d50dc0	Merge branch 'sd3' into lumina	2025-02-26 22:13:56 -05:00
Disty0	9a415ba965	JPEG XL support	2025-02-27 00:21:57 +03:00
sdbds	fc772affbe	1、Implement cfg_trunc calculation directly using timesteps, without intermediate steps. 2、Deprecate and remove the guidance_scale parameter because it used in inference not train 3、Add inference command-line arguments --ct for cfg_trunc_ratio and --rc for renorm_cfg to control CFG truncation and renormalization during inference.	2025-02-24 14:10:24 +08:00
rockerBOO	42a801514c	Fix system prompt in datasets	2025-02-23 13:48:37 -05:00
rockerBOO	025cca699b	Fix samples, LoRA training. Add system prompt, use_flash_attn	2025-02-23 01:29:18 -05:00
Kohya S	efb2a128cd	fix wandb val logging	2025-02-21 22:07:35 +09:00
rockerBOO	7f2747176b	Use resize_image where resizing is required	2025-02-19 14:20:40 -05:00
rockerBOO	545425c13e	Typo	2025-02-19 14:20:40 -05:00
rockerBOO	d0128d18be	Add resize interpolation CLI option	2025-02-19 14:20:40 -05:00
rockerBOO	58e9e146a3	Add resize interpolation configuration	2025-02-19 14:20:40 -05:00
Kohya S	dc7d5fb459	Merge branch 'sd3' into val-loss-improvement	2025-02-18 21:34:30 +09:00
rockerBOO	9436b41061	Fix validation split and add test	2025-02-17 14:28:41 -05:00
rockerBOO	3ed7606f88	Clear sizes for validation reg images to be consistent	2025-02-17 12:07:23 -05:00
rockerBOO	3365cfadd7	Fix sizes for validation split	2025-02-17 12:07:23 -05:00
rockerBOO	f3a010978c	Clear sizes for validation reg images to be consistent	2025-02-16 22:28:34 -05:00
rockerBOO	3c7496ae3f	Fix sizes for validation split	2025-02-16 22:18:14 -05:00

1 2 3 4 5 ...

561 Commits