Kohya S
1286e00bb0
fix to call train/eval in schedulefree #1605
2024-09-18 21:31:54 +09:00
Kohya S
e74502117b
update README
2024-09-18 08:04:32 +09:00
Kohya S.
bbd160b4ca
sd3 schedule free opt ( #1605 )
...
* New ScheduleFree support for Flux (#1600 )
* init
* use no schedule
* fix typo
* update for eval()
* fix typo
* update
* Update train_util.py
* Update requirements.txt
* update sfwrapper WIP
* no need to check schedulefree optimizer
* remove debug print
* comment out schedulefree wrapper
* update readme
---------
Co-authored-by: 青龍聖者@bdsqlsz <865105819@qq.com >
2024-09-18 07:55:04 +09:00
Kohya S
a2ad7e5644
blocks_to_swap=0 means no swap
2024-09-17 21:42:14 +09:00
Kohya S
0cbe95bcc7
fix text_encoder_lr to work with int closes #1608
2024-09-17 21:21:28 +09:00
Kohya S
d8d15f1a7e
add support for specifying blocks in FLUX.1 LoRA training
2024-09-16 23:14:09 +09:00
Kohya S
96c677b459
fix to work lienar/cosine lr scheduler closes #1602 ref #1393
2024-09-16 10:42:09 +09:00
Kohya S
be078bdaca
fix typo
2024-09-15 13:59:17 +09:00
Kohya S
9f44ef1330
add diffusers to FLUX.1 conversion script
2024-09-15 13:52:23 +09:00
Kohya S
6445bb2bc9
update README
2024-09-14 22:37:26 +09:00
Kohya S
c9ff4de905
Add support for specifying rank for each layer in FLUX.1
2024-09-14 22:17:52 +09:00
Kohya S
2d8ee3c280
OFT for FLUX.1
2024-09-14 15:48:16 +09:00
Kohya S
0485f236a0
Merge branch 'dev' into sd3
2024-09-13 22:39:24 +09:00
Kohya S
93d9fbf607
improve OFT implementation closes #944
2024-09-13 22:37:11 +09:00
Kohya S
c15a3a1a65
Merge branch 'dev' into sd3
2024-09-13 21:30:49 +09:00
Kohya S
43ad73860d
Merge branch 'main' into dev
2024-09-13 21:29:51 +09:00
Kohya S
b755ebd0a4
add LBW support for SDXL merge LoRA
2024-09-13 21:29:31 +09:00
Kohya S
f4a0bea6dc
format by black
2024-09-13 21:26:06 +09:00
terracottahaniwa
734d2e5b2b
Support Lora Block Weight (LBW) to svd_merge_lora.py ( #1575 )
...
* support lora block weight
* solve license incompatibility
* Fix issue: lbw index calculation
2024-09-13 20:45:35 +09:00
Kohya S
f3ce80ef8f
Merge branch 'dev' into sd3
2024-09-13 19:49:16 +09:00
Kohya S
9d2860760d
Merge branch 'main' into dev
2024-09-13 19:48:53 +09:00
Kohya S
3387dc7306
formatting, update README
2024-09-13 19:45:42 +09:00
Kohya S
57ae44eb61
refactor to make safer
2024-09-13 19:45:00 +09:00
Maru-mee
1d7118a622
Support : OFT merge to base model ( #1580 )
...
* Support : OFT merge to base model
* Fix typo
* Fix typo_2
* Delete unused parameter 'eye'
2024-09-13 19:01:36 +09:00
Kohya S
cefe52629e
fix to work old notation for TE LR in .toml
2024-09-12 12:36:07 +09:00
Kohya S
237317fffd
update README
2024-09-11 22:23:43 +09:00
Plat
a823fd9fb8
Improve wandb logging ( #1576 )
...
* fix: wrong training steps were recorded to wandb, and no log was sent when logging_dir was not specified
* fix: checking of whether wandb is enabled
* feat: log images to wandb with their positive prompt as captions
* feat: logging sample images' caption for sd3 and flux
* fix: import wandb before use
2024-09-11 22:21:16 +09:00
Kohya S
c7c666b182
fix typo
2024-09-11 22:12:31 +09:00
Kohya S.
d83f2e92da
Merge pull request #1592 from cocktailpeanut/sd3
...
Critical typo fix
2024-09-11 22:10:37 +09:00
cocktailpeanut
8311e88225
typo fix
2024-09-11 09:02:29 -04:00
Kohya S
eaafa5c9da
Merge branch 'dev' into sd3
2024-09-11 21:46:21 +09:00
Kohya S
6dbfd47a59
Fix to work PIECEWISE_CONSTANT, update requirement.txt and README #1393
2024-09-11 21:44:36 +09:00
青龍聖者@bdsqlsz
fd68703f37
Add New lr scheduler ( #1393 )
...
* add new lr scheduler
* fix bugs and use num_cycles / 2
* Update requirements.txt
* add num_cycles for min lr
* keep PIECEWISE_CONSTANT
* allow use float with warmup or decay ratio.
* Update train_util.py
2024-09-11 21:25:45 +09:00
Kohya S
65b8a064f6
update README
2024-09-10 21:20:38 +09:00
Kohya S
d10ff62a78
support individual LR for CLIP-L/T5XXL
2024-09-10 20:32:09 +09:00
Kohya S
d29af146b8
add negative prompt for flux inference script
2024-09-09 23:01:15 +09:00
Kohya S
ce144476cf
Merge branch 'dev' into sd3
2024-09-07 10:59:22 +09:00
Kohya S
62ec3e6424
Merge branch 'main' into dev
2024-09-07 10:52:49 +09:00
Kohya S.
de25945a93
Merge pull request #1550 from kohya-ss/dependabot/github_actions/crate-ci/typos-1.24.3
...
Bump crate-ci/typos from 1.19.0 to 1.24.3
2024-09-07 10:50:46 +09:00
Kohya S
0005867ba5
update README, format code
2024-09-07 10:45:18 +09:00
Kohya S.
16bb5699ac
Merge pull request #1426 from sdbds/resize
...
Replacing CV2 resize to Pil resize
2024-09-07 10:22:52 +09:00
Kohya S.
319e4d9831
Merge pull request #1433 from millie-v/sample-image-without-cuda
...
Generate sample images without having CUDA (such as on Macs)
2024-09-07 10:19:55 +09:00
Kohya S
2889108d85
feat: Add --cpu_offload_checkpointing option to LoRA training
2024-09-05 20:58:33 +09:00
Kohya S
d9129522a6
set dtype before calling ae closes #1562
2024-09-05 12:20:07 +09:00
Kohya S
90ed2dfb52
feat: Add support for merging CLIP-L and T5XXL LoRA models
2024-09-05 08:39:29 +09:00
Kohya S
56cb2fc885
support T5XXL LoRA, reduce peak memory usage #1560
2024-09-04 23:15:27 +09:00
Kohya S
b7cff0a754
update README
2024-09-04 21:35:47 +09:00
Kohya S
b65ae9b439
T5XXL LoRA training, fp8 T5XXL support
2024-09-04 21:33:17 +09:00
Kohya S
6abacf04da
update README
2024-09-02 13:05:26 +09:00
Kohya S
4f6d915d15
update help and README
2024-09-01 19:12:29 +09:00