Kohya S
d1d7d432e9
print dataset index in making buckets
2023-03-01 21:30:12 +09:00
Kohya S
089a63c573
shuffle at debug_dataset
2023-03-01 21:12:33 +09:00
Kohya S
ed19a92bbe
fix typos
2023-03-01 21:01:10 +09:00
fur0ut0
8abb8645ae
add detail dataset config feature by extra config file ( #227 )
...
* add config file schema
* change config file specification
* refactor config utility
* unify batch_size to train_batch_size
* fix indent size
* use batch_size instead of train_batch_size
* make cache_latents configurable on subset
* rename options
* bucket_repo_range
* shuffle_keep_tokens
* update readme
* revert to min_bucket_reso & max_bucket_reso
* use subset structure in dataset
* format import lines
* split mode specific options
* use only valid subset
* change valid subsets name
* manage multiple datasets by dataset group
* update config file sanitizer
* prune redundant validation
* add comments
* update type annotation
* rename json_file_name to metadata_file
* ignore when image dir is invalid
* fix tag shuffle and dropout
* ignore duplicated subset
* add method to check latent cachability
* fix format
* fix bug
* update caption dropout default values
* update annotation
* fix bug
* add option to enable bucket shuffle across dataset
* update blueprint generate function
* use blueprint generator for dataset initialization
* delete duplicated function
* update config readme
* delete debug print
* print dataset and subset info as info
* enable bucket_shuffle_across_dataset option
* update config readme for clarification
* compensate quotes for string option example
* fix bug of bad usage of join
* conserve trained metadata backward compatibility
* enable shuffle in data loader by default
* delete resolved TODO
* add comment for image data handling
* fix reference bug
* fix undefined variable bug
* prevent raise overwriting
* assert image_dir and metadata_file validity
* add debug message for ignoring subset
* fix inconsistent import statement
* loosen too strict validation on float value
* sanitize argument parser separately
* make image_dir optional for fine tuning dataset
* fix import
* fix trailing characters in print
* parse flexible dataset config deterministically
* use relative import
* print supplementary message for parsing error
* add note about different methods
* add note of benefit of separate dataset
* add error example
* add note for english readme plan
---------
Co-authored-by: Kohya S <52813779+kohya-ss@users.noreply.github.com >
2023-03-01 20:58:08 +09:00
Kohya S
82707654ad
support sample generation in TI training
2023-02-28 22:05:31 +09:00
Kohya S
dd523c94ff
sample images in training (not fully tested)
2023-02-27 17:48:32 +09:00
Kohya S
a28f9ae7a3
support tokenizer caching for offline training/gen
2023-02-25 18:46:59 +09:00
Kohya S
9b13444b9c
raise error if options conflict
2023-02-23 21:35:47 +09:00
Kohya S
9ab964d0b8
Add Adafactor optimzier
2023-02-22 21:09:47 +09:00
Kohya S
663aad2b0d
refactor get_scheduler etc.
2023-02-20 22:47:43 +09:00
Kohya S
107fa754e5
Merge branch 'dev' into optimizer-expand-and-refactor
2023-02-20 20:12:42 +09:00
mgz-dev
b29c5a750c
expand optimizer options and refactor
...
Refactor code to make it easier to add new optimizers, and support alternate optimizer parameters
-move redundant code to train_util for initializing optimizers
- add SGD Nesterov optimizers as option (since they are already available)
- add new parameters which may be helpful for tuning existing and new optimizers
2023-02-19 17:45:09 -06:00
unknown
045a3dbe48
apply dadaptation
2023-02-19 18:37:07 +09:00
Kohya S
048e7cd428
add lion optimizer support
2023-02-19 15:26:14 +09:00
Kohya S
9d0f9736bf
Merge pull request #202 from vladmandic/main
...
fix git path
2023-02-19 15:01:21 +09:00
Vladimir Mandic
dac2bd163a
fix git path
2023-02-17 14:19:08 -05:00
Isotr0py
78d1fb5ce6
Add '--lowram' argument
2023-02-17 12:08:54 +08:00
Kohya S
43c0a69843
Add noise_offset
2023-02-14 21:15:48 +09:00
Kohya S
8f1e930bf4
Merge pull request #187 from space-nuko/add-commit-hash
...
Add commit hash to metadata
2023-02-14 19:52:30 +09:00
space-nuko
5471b0deb0
Add commit hash to metadata
2023-02-13 02:58:06 -08:00
Isotr0py
92a1af8024
Merge branch 'kohya-ss:main' into support-multi-gpu
2023-02-12 15:06:46 +08:00
Kohya S
4c561411aa
revert batch size limiting for bucket
2023-02-11 16:02:56 +09:00
Kohya S
2c5f5c324a
Fix crash TI train close #172 , tag drop wo shuffle
2023-02-11 14:41:44 +09:00
Kohya S
b03721b4d9
Add todo comment
2023-02-10 17:36:38 +09:00
Kohya S
c2e1d4b71b
fix typo
2023-02-09 21:38:01 +09:00
Kohya S
3a72e6f003
add tag dropout
2023-02-09 21:35:27 +09:00
Isotr0py
5e96e1369d
fix get_hidden_states expected scalar Error
2023-02-08 20:14:13 +08:00
Isotr0py
c0be52a773
ignore get_hidden_states expected scalar Error
2023-02-08 20:13:09 +08:00
Kohya S
e42b2f7aa9
conditional caption dropout (in progress)
2023-02-07 22:28:56 +09:00
Kohya S
f9478f0d47
Merge pull request #159 from forestsource/main
...
Add Conditional Dropout options
2023-02-07 21:50:26 +09:00
Kohya S
4fc9f1f8c5
Merge pull request #157 from shirayu/improve_tag_shuffle
...
Always join with ", "
2023-02-07 21:47:05 +09:00
forestsource
7db98baa86
Add dropout options
2023-02-07 00:01:30 +09:00
Kohya S
2aa27b7a4b
Update downsampling for larger image in no_upscale
2023-02-06 20:52:24 +09:00
Yuta Hayashibe
5ea5fefcd2
Always join with ", "
2023-02-06 12:29:41 +09:00
Kohya S
ea2dfd09ef
update bucketing features
2023-02-05 21:37:46 +09:00
Kohya S
b1635f4bf6
Merge pull request #144 from tsukimiya/debug_dataset_linux_support
...
Fixed --debug_dataset option to work in non-Windows environments
2023-02-04 18:19:04 +09:00
Kohya S
9fd7fb813d
Merge branch 'dev' into main
2023-02-04 18:16:03 +09:00
Kohya S
93134cdd15
Add tag freq for FinetuneDataset
2023-02-03 21:03:42 +09:00
Kohya S
57d8483eaf
add GIT captioning, refactoring, DataLoader
2023-02-03 08:45:33 +09:00
tsukimiya
949ee6fcc9
Fixed --debug_dataset option to work in non-Windows environments
2023-02-03 00:37:27 +09:00
hitomi
26a81d075c
add --persistent_data_loader_workers option
2023-02-01 16:02:15 +08:00
Kohya S
ed2e431950
Merge branch 'main' into caption-frequency-metadata
2023-01-29 17:50:23 +09:00
Kohya S
3fb12e41b7
Merge branch 'main' into textual_inversion
2023-01-26 17:50:20 +09:00
Kohya S
91a50ea637
Change img_ar_errors to mean because too many imgs
2023-01-24 20:17:15 +09:00
Kohya S
36dc97c841
Merge pull request #103 from space-nuko/bucketing-metadata
...
Add bucketing metadata
2023-01-24 19:06:21 +09:00
Kohya S
e6bad080cb
Merge pull request #102 from space-nuko/precalculate-hashes
...
Precalculate .safetensors model hashes after training
2023-01-24 19:03:45 +09:00
Kohya S
7f17237ada
Merge pull request #92 from forestsource/add_save_n_epoch_ratio
...
Add save_n_epoch_ratio
2023-01-24 18:59:47 +09:00
space-nuko
2e8a3d20dd
Add tag frequency metadata
2023-01-23 17:43:03 -08:00
space-nuko
66051883fb
Add bucketing metadata
2023-01-23 17:26:58 -08:00
space-nuko
f7fbdc4b2a
Precalculate .safetensors model hashes after training
2023-01-23 17:21:04 -08:00