cyber-meow commited on
Commit
ef28c28
1 Parent(s): 4e83663

update readme

Browse files
Files changed (1) hide show
  1. README.md +1 -0
README.md CHANGED
@@ -53,6 +53,7 @@ However, I estimate a model of similar quality can be trained with fewer than 20
53
 
54
  - The model was first trained for 18000 steps, at batch size 8, lr 1e-6, resolution 640, and conditional dropping rate of 15%.
55
  - After this, I modified a little the captions and trained the model for another 22000 steps, at batch size 8, lr 1e-6, reslution 704, and conditional dropping rate of 15%.
 
56
 
57
  Note that as a consequence of the weighting scheme which translates into a number of different multiply for each image,
58
  the count of repeat and epoch has a quite different meaning here.
 
53
 
54
  - The model was first trained for 18000 steps, at batch size 8, lr 1e-6, resolution 640, and conditional dropping rate of 15%.
55
  - After this, I modified a little the captions and trained the model for another 22000 steps, at batch size 8, lr 1e-6, reslution 704, and conditional dropping rate of 15%.
56
+ (Intermediate checkpoints can be found in the branch `all`)
57
 
58
  Note that as a consequence of the weighting scheme which translates into a number of different multiply for each image,
59
  the count of repeat and epoch has a quite different meaning here.