Nekochu committed on
Commit
9418c03
1 Parent(s): dc0510a

Update README.md

  1. README.md +27 -4
README.md CHANGED
@@ -6,7 +6,30 @@ license: cc-by-nc-4.0
  - Base model: [zeroscope_v2_576w](https://huggingface.co/cerspense/zeroscope_v2_576w).
  - Trained on the [potat1](https://huggingface.co/camenduru/potat1#dataset--config) [dataset](https://huggingface.co/camenduru/potat1_dataset) for 50000 steps over 10 hours on a single GPU with 24 GB of VRAM.
 
- ---
- widget:
- - text: "0"
- ---
+ # Install on Windows
+ ```batch
+ git clone https://github.com/ExponentialML/Text-To-Video-Finetuning.git
+ cd Text-To-Video-Finetuning
+ git lfs install
+ git clone https://huggingface.co/damo-vilab/text-to-video-ms-1.7b ./models/model_scope_diffusers/
+ py -m venv --clear venv && venv\Scripts\activate
+ pip install -r requirements.txt
+ pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118 --no-cache-dir --force-reinstall --isolated --ignore-installed
+
+ git clone https://github.com/ExponentialML/Video-BLIP2-Preprocessor.git
+ cd Video-BLIP2-Preprocessor
+ pip install -r requirements.txt
+
+ :: Fix 1: require accelerate 0.20.3 or newer
+ echo accelerate^>=0.20.3>>requirements-dev.txt
+ pip install -r requirements-dev.txt
+
+ :: Captioning
+ python preprocess.py --video_directory C:\Video-BLIP2-Preprocessor\videos --config_name "My Videos" --config_save_name "my_videos"
+
+ :: Training (run from the Text-To-Video-Finetuning root)
+ venv\Scripts\activate && python train.py --config ./configs/v2/train_config.yaml
+
+ :: Inference at 1024x576
+ python inference.py --model zeroscope_v2_576w_potat1\zeroscope_v2_576w-checkpoint-50000 --prompt "a fast moving fancy sports car" --fps 24 --num-frames 30 --window-size 12 --width 1024 --height 576 --sdp
+ ```
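
The inference call above can also be assembled from a script, which may help when trying several prompts or resolutions. The following is a minimal sketch, not part of the commit: `build_inference_command` and `clip_seconds` are hypothetical helper names, and the sketch assumes `inference.py` accepts exactly the flags shown in the README command and nothing else. The duration estimate further assumes the output video is written at the requested frame rate.

```python
import subprocess


def build_inference_command(model, prompt, fps=24, num_frames=30,
                            window_size=12, width=1024, height=576, sdp=True):
    """Return an argument list for inference.py.

    Only the flags that appear in the README command are used; the
    defaults mirror the values given there (hypothetical helper).
    """
    cmd = [
        "python", "inference.py",
        "--model", model,
        "--prompt", prompt,
        "--fps", str(fps),
        "--num-frames", str(num_frames),
        "--window-size", str(window_size),
        "--width", str(width),
        "--height", str(height),
    ]
    if sdp:
        cmd.append("--sdp")
    return cmd


def clip_seconds(num_frames, fps):
    """Approximate clip length, assuming playback at the given fps."""
    return num_frames / fps


if __name__ == "__main__":
    cmd = build_inference_command(
        r"zeroscope_v2_576w_potat1\zeroscope_v2_576w-checkpoint-50000",
        "a fast moving fancy sports car",
    )
    # list2cmdline quotes arguments the way the Windows shell expects,
    # so the printed line can be pasted into cmd.exe.
    print(subprocess.list2cmdline(cmd))
    print(f"approx. {clip_seconds(30, 24)} s of video")
    # To actually launch it from the repository root (assumption):
    # subprocess.run(cmd, check=True)
```

With the README values, 30 frames at 24 fps corresponds to roughly 1.25 seconds of video, so longer clips need a higher `--num-frames`.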