Kohya-ss-sd-scripts

mirror of https://github.com/kohya-ss/sd-scripts.git synced 2026-04-17 01:12:41 +00:00

Author	SHA1	Message	Date
rockerBOO	4f27c6a0c9	Add BPO, CPO, DDO, SDPO, SimPO. Refactor Preference Optimization. Refactor preference dataset. Add iterator support for ImageInfo and ImageSetInfo (supports iterating through either ImageInfo or ImageSetInfo to clean up the preference dataset implementation and support 2 or more images more cleanly without needing to duplicate code). Add tests for all PO functions. Add metrics for process_batch. Add losses for gradient manipulation of loss parts. Add normalizing gradient for stabilizing gradients. Args added: mapo_beta = 0.05, cpo_beta = 0.1, bpo_beta = 0.1, bpo_lambda = 0.2, sdpo_beta = 0.02, simpo_gamma_beta_ratio = 0.25, simpo_beta = 2.0, simpo_smoothing = 0.0, simpo_loss_type = "sigmoid", ddo_alpha = 4.0, ddo_beta = 0.05	2025-06-03 15:09:48 -04:00
Lex Song	b822b7e60b	Fix the interpolation logic error in resize_image(). The original code had the selection inverted: it used 'lanczos' when the image was downscaled (width > resized_width and height > resized_height) and 'area' when the image stayed the same size or was upscaled. 'area' is the better choice for large downscaling.	2025-04-02 22:04:37 +08:00
Lex Song	ede3470260	Ensure all size parameters are integers to prevent type errors	2025-04-02 03:50:33 +08:00
Kohya S	1f432e2c0e	use PIL for lanczos and box	2025-03-30 20:40:29 +09:00
Kohya S.	93a4efabb5	Merge branch 'sd3' into resize-interpolation	2025-03-30 19:30:56 +09:00
Kohya S	f4a0047865	feat: support metadata loading in MemoryEfficientSafeOpen	2025-02-26 20:50:44 +09:00
rockerBOO	7f2747176b	Use resize_image where resizing is required	2025-02-19 14:20:40 -05:00
Kohya S	aab943cea3	remove unused weight swapping functions from utils.py	2024-11-05 23:27:41 +09:00
Kohya S	81c0c965a2	faster block swap	2024-11-05 21:22:42 +09:00
Kohya S	623017f716	refactor SD3 CLIP to transformers etc.	2024-10-24 19:49:28 +09:00
Ed McManus	de4bb657b0	Update utils.py Cleanup	2024-09-19 14:38:32 -07:00
Ed McManus	3957372ded	Retain alpha in `pil_resize`. Currently the alpha channel is dropped by `pil_resize()` when `--alpha_mask` is supplied and the image width does not exceed the bucket. This codepath is entered on the last line, here: ``` def trim_and_resize_if_required( random_crop: bool, image: np.ndarray, reso, resized_size: Tuple[int, int] ) -> Tuple[np.ndarray, Tuple[int, int], Tuple[int, int, int, int]]: image_height, image_width = image.shape[0:2] original_size = (image_width, image_height) # size before resize if image_width != resized_size[0] or image_height != resized_size[1]: # resize if image_width > resized_size[0] and image_height > resized_size[1]: image = cv2.resize(image, resized_size, interpolation=cv2.INTER_AREA) # resize with cv2 because we want INTER_AREA else: image = pil_resize(image, resized_size) ```	2024-09-19 14:30:03 -07:00
Kohya S	ce144476cf	Merge branch 'dev' into sd3	2024-09-07 10:59:22 +09:00
Kohya S	0005867ba5	update README, format code	2024-09-07 10:45:18 +09:00
Kohya S	3be712e3e0	feat: Update direct loading fp8 ckpt for LoRA training	2024-08-27 21:40:02 +09:00
kohya-ss	98c91a7625	Fix bug in FLUX multi GPU training	2024-08-22 12:37:41 +09:00
Kohya S	486fe8f70a	feat: reduce memory usage and add memory efficient option for model saving	2024-08-19 22:30:24 +09:00
sdbds	9ca7a5b6cc	replace cv2 LANCZOS4 resize with PIL resize	2024-07-20 21:59:11 +08:00
Kohya S	93bed60762	fix `--console_log_xxx` options so they work	2024-02-12 14:49:29 +09:00
Kohya S	98f42d3a0b	Merge branch 'dev' into gradual_latent_hires_fix	2024-02-12 12:59:25 +09:00
Kohya S	5d9e2873f6	make rich output to stderr instead of stdout	2024-02-08 21:38:02 +09:00
Kohya S	9b8ea12d34	update log initialization without rich	2024-02-08 21:06:39 +09:00
Kohya S	efd3b58973	Add logging arguments and update logging setup	2024-02-04 20:44:10 +09:00
Kohya S	6279b33736	fallback to basic logging if rich is not installed	2024-02-04 18:28:54 +09:00
Yuta Hayashibe	5f6bf29e52	Replace print with logger if they are logs (#905): Add get_my_logger(); Use logger instead of print; Fix log level; Removed line-breaks for readability; Use setup_logging(); Add rich to requirements.txt; Make simple; Use logger instead of print. Co-authored-by: Kohya S <52813779+kohya-ss@users.noreply.github.com>	2024-02-04 18:14:34 +09:00
Kohya S	7a4e50705c	add target_x flag (not sure this impl is correct)	2023-12-03 17:59:41 +09:00
Kohya S	29b6fa6212	add unsharp mask	2023-11-28 22:33:22 +09:00
ddPn08	3f339cda6f	small fix	2023-04-02 23:21:17 +09:00
ddPn08	b5ff4e816f	resume from huggingface repository	2023-04-02 17:39:21 +09:00
ddPn08	45381b188c	small fix	2023-04-02 17:39:20 +09:00
ddPn08	054fb3308c	use access token	2023-04-02 17:39:19 +09:00
ddPn08	d42431d73a	Added feature to upload to huggingface	2023-04-02 17:39:10 +09:00

32 Commits
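
Several of the commits above (b822b7e60b, ede3470260, 1f432e2c0e, 3957372ded) concern how a resize picks its interpolation method. The rule described in those messages can be sketched roughly as follows. This is an illustrative sketch, not the repository's code: `choose_interpolation` and `resize_sketch` are hypothetical names, and PIL's `BOX` filter is assumed here as a stand-in for an area-style downscale.

```python
from typing import Tuple

import numpy as np
from PIL import Image


def choose_interpolation(size: Tuple[int, int], target: Tuple[int, int]) -> str:
    """Return 'area' when both dimensions shrink, otherwise 'lanczos'.

    Hypothetical helper mirroring the rule stated in commit b822b7e60b.
    """
    width, height = size
    target_width, target_height = target
    if width > target_width and height > target_height:
        return "area"
    return "lanczos"


def resize_sketch(image: np.ndarray, target: Tuple[int, int]) -> np.ndarray:
    """Resize an HxWxC uint8 array to target (width, height) with PIL.

    Hypothetical function; the channel count (including alpha) is preserved
    because PIL infers RGB or RGBA mode from the array shape.
    """
    # Coerce sizes to int, since PIL rejects float dimensions.
    target = (int(target[0]), int(target[1]))
    height, width = image.shape[:2]
    method = choose_interpolation((width, height), target)
    # Assumption: BOX approximates area averaging for downscaling.
    resample = Image.BOX if method == "area" else Image.LANCZOS
    pil_image = Image.fromarray(image)
    return np.array(pil_image.resize(target, resample))
```

For example, `choose_interpolation((1024, 1024), (512, 512))` selects `'area'`, while an image that is the same size as or smaller than the target in either dimension falls through to `'lanczos'`.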