Kohya-ss-sd-scripts

mirror of https://github.com/kohya-ss/sd-scripts.git synced 2026-04-09 06:45:09 +00:00

Author	SHA1	Message	Date
Kohaku-Blueleaf	9cfa68c92f	[Experimental Feature] FP8 weight dtype for base model when running train_network (or sdxl_train_network) (#1057) * Add fp8 support * remove some debug prints * Better implementation for te * Fix some misunderstanding * as same as unet, add explicit convert * better impl for convert TE to fp8 * fp8 for not only unet * Better cache TE and TE lr * match arg name * Fix with list * Add timeout settings * Fix arg style * Add custom seperator * Fix typo * Fix typo again * Fix dtype error * Fix gradient problem * Fix req grad * fix merge * Fix merge * Resolve merge * arrangement and document * Resolve merge error * Add assert for mixed precision	2024-01-20 09:46:53 +09:00
Kohya S	09ef3ffa8b	Merge branch 'main' into dev	2024-01-14 21:49:25 +09:00
Nir Weingarten	ab716302e4	Added cli argument for wandb session name	2024-01-03 11:52:38 +02:00
Plat	62e7516537	feat: support torch.compile	2023-12-27 02:17:24 +09:00
Kohya S	3efd90b2ad	fix sampling in training with mutiple gpus ref #989	2023-12-15 22:35:54 +09:00
Kohya S	d309a27a51	change option names, add ddp kwargs if needed ref #1000	2023-12-13 21:02:26 +09:00
Kohya S	471d274803	Merge pull request #1000 from Isotr0py/dev Fix multi-gpu SDXL training	2023-12-13 20:52:11 +09:00
Kohya S	35f4c9b5c7	fix an error when keep_tokens_separator is not set ref #975	2023-12-12 21:43:21 +09:00
Kohya S	034a49c69d	Merge pull request #975 from Linaqruf/dev Add keep_tokens_separator as alternative for keep_tokens	2023-12-12 21:28:32 +09:00
Isotr0py	bb5ae389f7	fix DDP SDXL training	2023-12-12 19:58:44 +08:00
Kohya S	4a2cef887c	fix lllite training not working ref #913	2023-12-10 09:23:37 +09:00
Kohya S	42750f7846	fix error on pool_workaround in sdxl TE training ref #994	2023-12-10 09:18:33 +09:00
Isotr0py	db84530074	Fix gradients synchronization for multi-GPUs training (#989) * delete DDP wrapper * fix train_db vae and train_network * fix train_db vae and train_network unwrap * network grad sync --------- Co-authored-by: Kohya S <52813779+kohya-ss@users.noreply.github.com>	2023-12-07 22:01:42 +09:00
Kohya S	81a440c8e8	Merge pull request #955 from xzuyn/paged_adamw Add PagedAdamW	2023-12-03 21:22:38 +09:00
Kohya S	f24a3b5282	show seed in generating samples	2023-12-03 21:15:30 +09:00
Kohya S	383b4a2c3e	Merge pull request #907 from shirayu/add_option_sample_at_first Add option --sample_at_first	2023-12-03 21:00:32 +09:00
Kohya S	df59822a27	Merge pull request #906 from shirayu/accept_scheduler_designation_in_training Accept sampler designation in sampling of training	2023-12-03 20:46:16 +09:00
Furqanil Taqwa	1bdd83a85f	remove unnecessary debug print	2023-11-28 17:26:27 +07:00
Furqanil Taqwa	1624c239c2	added keep_tokens_separator to dynamically keep token for being shuffled	2023-11-28 17:23:55 +07:00
Kohya S	f312522cef	Merge pull request #913 from KohakuBlueleaf/custom-seperator Add custom seperator for shuffle caption	2023-11-19 21:32:01 +09:00
xzuyn	da5a144589	Add PagedAdamW	2023-11-18 07:47:27 -05:00
Kohya S	6231aa91e2	common lr logging, set default None to ddp_timeout	2023-11-05 19:09:17 +09:00
Kohaku-Blueleaf	583e2b2d01	Fix typo	2023-10-30 20:02:04 +08:00
Kohaku-Blueleaf	5dc2a0d3fd	Add custom seperator	2023-10-30 19:55:30 +08:00
Yuta Hayashibe	fea810b437	Added --sample_at_first to generate sample images before training	2023-10-29 21:44:57 +09:00
Yuta Hayashibe	40d917b0fe	Removed incorrect comments	2023-10-29 21:02:44 +09:00
Yuta Hayashibe	cf876fcdb4	Accept --ss to set sample_sampler dynamically	2023-10-29 20:15:04 +09:00
Yuta Hayashibe	291c29caaf	Added a function line_to_prompt_dict() and removed duplicated initializations	2023-10-29 19:57:25 +09:00
Yuta Hayashibe	01e00ac1b0	Make a function get_my_scheduler()	2023-10-29 19:46:02 +09:00
Kohya S	a9ed4ed8a8	Merge pull request #900 from xzuyn/paged_adamw_32bit Add PagedAdamW32bit	2023-10-29 15:01:55 +09:00
Kohya S	9d6a5a0c79	Merge pull request #899 from shirayu/use_moving_average Show moving average loss in the progress bar	2023-10-29 14:37:58 +09:00
Kohya S	fb97a7aab1	Merge pull request #898 from shirayu/update_repare_buckets_latents Fix a typo and add assertions in making buckets	2023-10-29 14:29:53 +09:00
Kohaku-Blueleaf	1cefb2a753	Better implementation for te autocast (#895) * Better implementation for te * Fix some misunderstanding * as same as unet, add explicit convert * Better cache TE and TE lr * Fix with list * Add timeout settings * Fix arg style	2023-10-28 15:49:59 +09:00
xzuyn	d8f68674fb	Update train_util.py	2023-10-27 07:05:53 -04:00
Yuta Hayashibe	0d21925bdf	Use @property	2023-10-27 18:14:27 +09:00
Yuta Hayashibe	3d2bb1a8f1	Add LossRecorder and use moving average in all places	2023-10-27 17:49:49 +09:00
Yuta Hayashibe	837a4dddb8	Added assertions	2023-10-26 13:34:36 +09:00
Isotr0py	592014923f	Support JPEG-XL on windows	2023-10-04 21:48:25 +08:00
Yuta Hayashibe	27f9b6ffeb	updated typos to v1.16.15 and fix typos	2023-10-01 21:51:24 +09:00
Kohya S	360af27749	fix ControlNetDataset not working	2023-09-03 12:27:58 +09:00
Kohya S	0ee75fd75d	fix typos, add comments etc.	2023-09-03 12:24:15 +09:00
Kohya S	2eae9b66d0	Merge pull request #798 from vvern999/vvern999-patch-1 add input perturbation noise	2023-09-03 10:51:23 +09:00
Kohya S	948cf17499	add caption_prefix/suffix to dataset	2023-09-02 16:17:12 +09:00
Kohya S	497051c14b	Merge pull request #786 from Isotr0py/jxl Support JPEG XL	2023-09-02 15:30:07 +09:00
vvern999	e0beb6a999	add input perturbation noise from https://arxiv.org/abs/2301.11706	2023-09-02 07:33:27 +03:00
Kohya S	7e850f3b7e	Merge branch 'main' into sdxl	2023-09-01 07:59:26 +09:00
Isotr0py	5d88351bb5	support jpeg xl	2023-08-25 11:07:02 +08:00
Kohya S	1161a5c6da	fix debug_dataset for controlnet dataset	2023-08-20 17:39:48 +09:00
Kohya S	e191892824	fix bucketing doesn't work in controlnet training	2023-08-20 12:24:40 +09:00
Kohya S	3f7235c36f	add lora controlnet train/gen temporarily	2023-08-17 10:08:02 +09:00

312 Commits