Kohya S.
7c075a9c8d
Merge pull request #2060 from saibit-tech/sd3
...
Fix: try aligning dtype of matrixes when training with deepspeed and mixed-precision is set to bf16 or fp16
2025-05-01 23:20:17 +09:00
Kohya S
64430eb9b2
Merge branch 'dev' into sd3
2025-04-29 21:30:57 +09:00
Kohya S
d8717a3d1c
Merge branch 'main' into dev
2025-04-29 21:30:33 +09:00
Kohya S
4625b34f4e
Fix mean image aspect ratio error calculation to avoid NaN values
2025-04-29 21:27:04 +09:00
Kohya S
fd3a445769
fix: revert default emb guidance scale and CFG scale for FLUX.1 sampling
2025-04-27 22:50:27 +09:00
saibit
46ad3be059
update deepspeed wrapper
2025-04-24 11:26:36 +08:00
sharlynxy
abf2c44bc5
Dynamically set device in deepspeed wrapper ( #2 )
...
* get device type from model
* add logger warning
* format
* format
* format
2025-04-23 18:57:19 +08:00
Robert
f501209c37
Merge branch 'dev/xy/align_dtype_using_mixed_precision' of github.com:saibit-tech/sd-scripts into dev/xy/align_dtype_using_mixed_precision
2025-04-22 16:19:52 +08:00
Robert
c8af252a44
refactor
2025-04-22 16:19:14 +08:00
saibit
7f984f4775
#
2025-04-22 16:15:12 +08:00
saibit
d33d5eccd1
#
2025-04-22 16:12:06 +08:00
saibit
7c61c0dfe0
Add autocast warpper for forward functions in deepspeed_utils.py to try aligning precision when using mixed precision in training process
2025-04-22 16:06:55 +08:00
Kohya S
629073cd9d
Add guidance scale for prompt param and flux sampling
2025-04-16 21:50:36 +09:00
Kohya S
06df0377f9
Merge branch 'sd3' into flux-sample-cfg
2025-04-16 21:27:08 +09:00
Kohya S.
c56dc90b26
Merge pull request #1992 from rockerBOO/flux-ip-noise-gamma
...
Add IP noise gamma for Flux
2025-04-06 21:29:26 +09:00
Kohya S.
606e6875d2
Merge pull request #2022 from LexSong/fix-resize-issue
...
Fix size parameter types and improve resize_image interpolation
2025-04-05 19:28:25 +09:00
Kohya S
f1423a7229
fix: add resize_interpolation parameter to FineTuningDataset constructor
2025-04-03 21:48:51 +09:00
Lex Song
b822b7e60b
Fix the interpolation logic error in resize_image()
...
The original code had a mistake. It used 'lanczos' when the image got smaller (width > resized_width and height > resized_height) and 'area' when it stayed the same or got bigger. This was the wrong way. 'area' is better for big shrinking.
2025-04-02 22:04:37 +08:00
Lex Song
ede3470260
Ensure all size parameters are integers to prevent type errors
2025-04-02 03:50:33 +08:00
Kohya S
b3c56b22bd
Merge branch 'dev' into sd3
2025-03-31 22:05:40 +09:00
Kohya S
583ab27b3c
doc: update license information in jpeg_xl_util.py
2025-03-31 22:02:25 +09:00
Kohya S
1f432e2c0e
use PIL for lanczos and box
2025-03-30 20:40:29 +09:00
Kohya S.
93a4efabb5
Merge branch 'sd3' into resize-interpolation
2025-03-30 19:30:56 +09:00
rockerBOO
e8b3254858
Add flux_train_utils tests for get get_noisy_model_input_and_timesteps
2025-03-20 15:01:15 -04:00
rockerBOO
16cef81aea
Refactor sigmas and timesteps
2025-03-20 14:32:56 -04:00
rockerBOO
f974c6b257
change order to match upstream
2025-03-19 14:27:43 -04:00
rockerBOO
5d5a7d2acf
Fix IP noise calculation
2025-03-19 13:50:04 -04:00
rockerBOO
1eddac26b0
Separate random to a variable, and make sure on device
2025-03-19 00:49:42 -04:00
rockerBOO
8e6817b0c2
Remove double noise
2025-03-19 00:45:13 -04:00
rockerBOO
d93ad90a71
Add perturbation on noisy_model_input if needed
2025-03-19 00:37:27 -04:00
rockerBOO
7197266703
Perturbed noise should be separate of input noise
2025-03-19 00:25:51 -04:00
rockerBOO
b81bcd0b01
Move IP noise gamma to noise creation to remove complexity and align noise for target loss
2025-03-18 21:36:55 -04:00
rockerBOO
6f4d365775
zeros_like because we are adding
2025-03-18 18:53:34 -04:00
rockerBOO
a4f3a9fc1a
Use ones_like
2025-03-18 18:44:21 -04:00
rockerBOO
b425466e7b
Fix IP noise gamma to use random values
2025-03-18 18:42:35 -04:00
rockerBOO
c8be141ae0
Apply IP gamma to noise fix
2025-03-18 15:42:18 -04:00
rockerBOO
0b25a05e3c
Add IP noise gamma for Flux
2025-03-18 15:40:40 -04:00
Disty0
620a06f517
Check for uppercase file extension too
2025-03-17 17:44:29 +03:00
Disty0
564ec5fb7f
use extend instead of +=
2025-03-17 17:41:03 +03:00
Disty0
7e90cdd47a
use bytearray and add typing hints
2025-03-17 17:26:08 +03:00
Kohya S
272f4c3775
Merge branch 'sd3' into sd3_safetensors_merge
2025-02-28 23:52:36 +09:00
Disty0
2f69f4dbdb
fix typo
2025-02-27 00:30:19 +03:00
Disty0
9a415ba965
JPEG XL support
2025-02-27 00:21:57 +03:00
Kohya S
ec350c83eb
Merge branch 'dev' into sd3
2025-02-26 21:17:29 +09:00
Kohya S
1fcac98280
Merge branch 'sd3' into val-loss-improvement
2025-02-26 21:09:10 +09:00
Kohya S
f4a0047865
feat: support metadata loading in MemoryEfficientSafeOpen
2025-02-26 20:50:44 +09:00
Disty0
f68702f71c
Update IPEX libs
2025-02-25 21:27:41 +03:00
Kohya S
67fde015f7
Merge branch 'dev' into sd3
2025-02-24 18:56:15 +09:00
Kohya S.
386b7332c6
Merge pull request #1918 from tsukimiya/fix_vperd_warning
...
Remove v-pred warning.
2025-02-24 18:55:25 +09:00
Kohya S
905f081798
Merge branch 'dev' into sd3
2025-02-24 18:54:28 +09:00