Commit Graph

883 Commits

Author SHA1 Message Date
Dave Lage
336dd6a730 Merge 8b0a467bc0 into 2e0fcc50cb 2025-07-11 09:58:40 +08:00
Kohya S.
3e6935a07e Merge pull request #2115 from kohya-ss/fix-flux-sampling-accelerate-error
Fix unwrap_model handling for None text_encoders in sample_images
2025-06-15 21:14:09 +09:00
Kohya S
fc40a279fa Merge branch 'dev' into sd3 2025-06-15 21:05:57 +09:00
Disty0
bcd3a5a60a Update IPEX libs 2025-06-13 16:25:16 +03:00
rockerBOO
7c83ac4369 Add avg non-zero ratio metric 2025-06-10 13:17:04 -04:00
Kohya S
bb47f1ea89 Fix unwrap_model handling for None text_encoders in sample_images function 2025-06-08 18:00:24 +09:00
rockerBOO
9629853d15 Fix wavelet loss not separating levels. Refactor loss to be spatial 2025-06-05 22:03:52 -04:00
rockerBOO
0af0302c38 Metrics, energy, loss 2025-05-19 19:15:23 -04:00
rockerBOO
346790a996 Merge branch 'sd3' into network-wavelet-loss 2025-05-19 19:10:55 -04:00
rockerBOO
984472ca09 Fix metrics 2025-05-04 18:58:54 -04:00
rockerBOO
7be3c5dce1 Refactor SWT to work properly and faster. Add SWT tests 2025-05-04 18:57:53 -04:00
rockerBOO
964bfcb576 Fix padding for small latents. Add DWT tests 2025-05-04 18:57:42 -04:00
rockerBOO
d5f8f7de1f Add wavelet loss fn 2025-05-04 18:57:10 -04:00
rockerBOO
56dfdae7c5 Add QuaternionWaveletTransform. Update WaveletLoss 2025-05-04 18:55:50 -04:00
rockerBOO
19ce0ae61f Add wavelet_loss_band_level_weights 2025-05-04 18:39:51 -04:00
rockerBOO
7b9e92a8cc Fix band weights via toml. Add more logging 2025-05-04 18:39:48 -04:00
rockerBOO
20a99771bf Add back in padding 2025-05-04 18:39:45 -04:00
rockerBOO
f553b7bf31 Add wavelet loss recording 2025-05-04 18:39:36 -04:00
rockerBOO
6d42b95e2b Refactor transforms, fix loss calculations
- add full conditional_loss functionality to wavelet loss
- Transforms are separate and abstracted
- Loss now doesn't include LL except the lowest level
  - ll_level_threshold allows you to control the level the ll is
    used in the loss
- band weights can now be passed in
- rectified flow calculations can be bypassed for experimentation
- Fixed alpha to 1.0 with new weighted bands producing lower loss
2025-05-04 18:39:32 -04:00
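The loss structure described in the commit above (high-frequency bands contribute at every level, LL contributes only at one level, bands can be weighted) can be sketched roughly as follows. This is a minimal pure-Python illustration and not the repository's implementation: the names `haar_dwt2`, `mse`, and `wavelet_loss`, the Haar filter, the use of plain MSE, and the reading of "lowest level" as the deepest decomposition level are all assumptions. The `ll_level_threshold` option and the rectified flow handling are not modelled.

```python
def haar_dwt2(x):
    """One level of a 2D Haar transform on a list of lists with even sides.

    Returns (ll, lh, hl, hh), each half the size of the input.
    Band naming conventions vary between libraries; this one is an assumption.
    """
    ll, lh, hl, hh = [], [], [], []
    for i in range(0, len(x), 2):
        r_ll, r_lh, r_hl, r_hh = [], [], [], []
        for j in range(0, len(x[0]), 2):
            a, b = x[i][j], x[i][j + 1]
            c, d = x[i + 1][j], x[i + 1][j + 1]
            r_ll.append((a + b + c + d) / 2)  # local average
            r_lh.append((a + b - c - d) / 2)  # top minus bottom
            r_hl.append((a - b + c - d) / 2)  # left minus right
            r_hh.append((a - b - c + d) / 2)  # diagonal difference
        ll.append(r_ll)
        lh.append(r_lh)
        hl.append(r_hl)
        hh.append(r_hh)
    return ll, lh, hl, hh


def mse(p, q):
    """Mean squared error between two equally sized 2D lists."""
    count = len(p) * len(p[0])
    return sum((pv - qv) ** 2 for pr, qr in zip(p, q) for pv, qv in zip(pr, qr)) / count


def wavelet_loss(pred, target, levels=2, band_weights=None):
    """Hypothetical multi-level wavelet loss.

    Sides of the inputs must be divisible by 2 ** levels.
    High-frequency bands are compared at every level; LL is compared
    only at the deepest level (an assumption about "lowest level").
    """
    weights = {"ll": 1.0, "lh": 1.0, "hl": 1.0, "hh": 1.0}
    weights.update(band_weights or {})
    total = 0.0
    for level in range(1, levels + 1):
        p_ll, *p_high = haar_dwt2(pred)
        t_ll, *t_high = haar_dwt2(target)
        for name, p, t in zip(("lh", "hl", "hh"), p_high, t_high):
            total += weights[name] * mse(p, t)
        if level == levels:
            total += weights["ll"] * mse(p_ll, t_ll)
        # Recurse on the low-frequency band for the next level.
        pred, target = p_ll, t_ll
    return total


if __name__ == "__main__":
    target = [[0.0] * 4 for _ in range(4)]
    pred = [[1.0] * 4 for _ in range(4)]
    print(wavelet_loss(pred, target, levels=2))
    print(wavelet_loss(pred, target, levels=2, band_weights={"ll": 0.0}))
```

A constant offset between prediction and target only shows up in the LL band, so setting the LL weight to zero removes it from the loss entirely. That is one way to see why restricting LL to a single level changes how much low-frequency error is counted.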
rockerBOO
64422ff4a0 Suggest the right module 2025-05-04 18:39:25 -04:00
rockerBOO
837231a5c7 Add wavelet loss 2025-05-04 18:39:17 -04:00
rockerBOO
813942a967 Add wavelet loss 2025-05-04 18:38:47 -04:00
Kohya S.
7c075a9c8d Merge pull request #2060 from saibit-tech/sd3
Fix: try aligning dtype of matrixes when training with deepspeed and mixed-precision is set to bf16 or fp16
2025-05-01 23:20:17 +09:00
Kohya S
64430eb9b2 Merge branch 'dev' into sd3 2025-04-29 21:30:57 +09:00
Kohya S
d8717a3d1c Merge branch 'main' into dev 2025-04-29 21:30:33 +09:00
Kohya S
4625b34f4e Fix mean image aspect ratio error calculation to avoid NaN values 2025-04-29 21:27:04 +09:00
Kohya S
fd3a445769 fix: revert default emb guidance scale and CFG scale for FLUX.1 sampling 2025-04-27 22:50:27 +09:00
saibit
46ad3be059 update deepspeed wrapper 2025-04-24 11:26:36 +08:00
sharlynxy
abf2c44bc5 Dynamically set device in deepspeed wrapper (#2)
* get device type from model

* add logger warning

* format

* format

* format
2025-04-23 18:57:19 +08:00
Robert
f501209c37 Merge branch 'dev/xy/align_dtype_using_mixed_precision' of github.com:saibit-tech/sd-scripts into dev/xy/align_dtype_using_mixed_precision 2025-04-22 16:19:52 +08:00
Robert
c8af252a44 refactor 2025-04-22 16:19:14 +08:00
saibit
7f984f4775 # 2025-04-22 16:15:12 +08:00
saibit
d33d5eccd1 # 2025-04-22 16:12:06 +08:00
saibit
7c61c0dfe0 Add autocast warpper for forward functions in deepspeed_utils.py to try aligning precision when using mixed precision in training process 2025-04-22 16:06:55 +08:00
Kohya S
629073cd9d Add guidance scale for prompt param and flux sampling 2025-04-16 21:50:36 +09:00
Kohya S
06df0377f9 Merge branch 'sd3' into flux-sample-cfg 2025-04-16 21:27:08 +09:00
Kohya S.
c56dc90b26 Merge pull request #1992 from rockerBOO/flux-ip-noise-gamma
Add IP noise gamma for Flux
2025-04-06 21:29:26 +09:00
Kohya S.
606e6875d2 Merge pull request #2022 from LexSong/fix-resize-issue
Fix size parameter types and improve resize_image interpolation
2025-04-05 19:28:25 +09:00
Kohya S
f1423a7229 fix: add resize_interpolation parameter to FineTuningDataset constructor 2025-04-03 21:48:51 +09:00
Lex Song
b822b7e60b Fix the interpolation logic error in resize_image()
The original code had a mistake. It used 'lanczos' when the image got smaller (width > resized_width and height > resized_height) and 'area' when it stayed the same or got bigger. This was the wrong way. 'area' is better for big shrinking.
2025-04-02 22:04:37 +08:00
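The corrected selection rule described in the commit above can be sketched as a small pure function. This is an illustration only: `choose_interpolation` is a hypothetical helper, not the signature of `resize_image()`, and it returns the method name as a string rather than calling any image library.

```python
def choose_interpolation(width, height, resized_width, resized_height):
    """Pick an interpolation name for a resize (hypothetical helper).

    'area' is used when the image shrinks in both dimensions;
    'lanczos' is used when it stays the same size or grows.
    """
    shrinking = width > resized_width and height > resized_height
    return "area" if shrinking else "lanczos"


if __name__ == "__main__":
    print(choose_interpolation(1024, 768, 512, 384))   # downscale
    print(choose_interpolation(512, 384, 1024, 768))   # upscale
```

The bug described in the commit message amounts to the two return values being swapped, so the condition itself stays the same in the fix.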
Lex Song
ede3470260 Ensure all size parameters are integers to prevent type errors 2025-04-02 03:50:33 +08:00
Kohya S
b3c56b22bd Merge branch 'dev' into sd3 2025-03-31 22:05:40 +09:00
Kohya S
583ab27b3c doc: update license information in jpeg_xl_util.py 2025-03-31 22:02:25 +09:00
Kohya S
1f432e2c0e use PIL for lanczos and box 2025-03-30 20:40:29 +09:00
Kohya S.
93a4efabb5 Merge branch 'sd3' into resize-interpolation 2025-03-30 19:30:56 +09:00
rockerBOO
e8b3254858 Add flux_train_utils tests for get get_noisy_model_input_and_timesteps 2025-03-20 15:01:15 -04:00
rockerBOO
16cef81aea Refactor sigmas and timesteps 2025-03-20 14:32:56 -04:00
rockerBOO
f974c6b257 change order to match upstream 2025-03-19 14:27:43 -04:00
rockerBOO
5d5a7d2acf Fix IP noise calculation 2025-03-19 13:50:04 -04:00
rockerBOO
1eddac26b0 Separate random to a variable, and make sure on device 2025-03-19 00:49:42 -04:00