Kohya S
b4d1152293
fix: sample generation with system prompt, without TE output caching
2025-07-09 21:55:36 +09:00
rockerBOO
2ba1cc7791
Fix max norms not applying to noise
2025-03-21 20:17:22 -04:00
青龍聖者@bdsqlsz
09c4710d1e
Merge pull request #22 from rockerBOO/sage_attn
Add Sage Attention for Lumina
2025-03-03 10:26:02 +08:00
青龍聖者@bdsqlsz
b6e4194ea5
Merge pull request #20 from rockerBOO/lumina-system-prompt-special-token
Lumina system prompt special token
2025-03-02 18:30:49 +08:00
青龍聖者@bdsqlsz
b5d1f1caea
Merge pull request #19 from rockerBOO/lumina-block-swap
Lumina block swap
2025-03-02 18:30:37 +08:00
rockerBOO
a69884a209
Add Sage Attention for Lumina
2025-03-01 20:37:45 -05:00
rockerBOO
a2daa87007
Add block swap for uncond (neg) for sample images
2025-02-28 14:22:47 -05:00
rockerBOO
1bba7acd9a
Add block swap in sample image timestep loop
2025-02-28 14:12:13 -05:00
rockerBOO
d6f7e2e20c
Fix block swap for sample images
2025-02-28 14:08:27 -05:00
rockerBOO
ce2610d29b
Change system prompt to inject Prompt Start special token
2025-02-27 02:47:04 -05:00
rockerBOO
542f980443
Fix sample norms in batches
2025-02-27 00:00:20 -05:00
sdbds
a1a5627b13
fix shift
2025-02-26 11:35:38 +08:00
sdbds
ce37c08b9a
clean code and add finetune code
2025-02-26 11:20:03 +08:00
sdbds
5f9047c8cf
add truncation when > max_length
2025-02-26 01:00:35 +08:00
sdbds
fc772affbe
1. Implement cfg_trunc calculation directly from timesteps, without intermediate steps.
2. Deprecate and remove the guidance_scale parameter because it is used in inference, not training.
3. Add inference command-line arguments --ct for cfg_trunc_ratio and --rc for renorm_cfg to control CFG truncation and renormalization during inference.
2025-02-24 14:10:24 +08:00
rockerBOO
2c94d17f05
Fix typo
2025-02-23 20:21:06 -05:00
rockerBOO
48e7da2d4a
Add sample batch size for Lumina
2025-02-23 20:19:24 -05:00
rockerBOO
ba725a84e9
Set default discrete_flow_shift to 6.0. Remove default system prompt.
2025-02-23 18:01:09 -05:00
rockerBOO
42a801514c
Fix system prompt in datasets
2025-02-23 13:48:37 -05:00
rockerBOO
6d7bec8a37
Remove unused code
2025-02-23 01:46:47 -05:00
rockerBOO
025cca699b
Fix samples, LoRA training. Add system prompt, use_flash_attn
2025-02-23 01:29:18 -05:00
rockerBOO
98efbc3bb7
Add documentation to model, use SDPA attention, sample images
2025-02-18 00:58:53 -05:00
sdbds
aa36c48685
update to always use gemma2 mask
2025-02-17 19:00:18 +08:00
sdbds
d154e76c45
init
2025-02-12 16:30:05 +08:00