How to finetune the models on GPU

#44
by Jonathanlin - opened
Jonathanlin changed discussion status to closed
Jonathanlin changed discussion title from How to finetune the model's perplexity. to How to finetune the models on GPU

Hello,

I have a couple of questions that I hope someone can help with.

  1. I have set up a CUDA environment, but no matter which command I run in Anaconda Prompt, training still runs on the CPU. How can I make it run on the GPU? This is the command I am using:
    python.exe protGPT2finetune.py --model_name_or_path nferruz/ProtGPT2 --train_file modelTrain.txt --validation_file modelTest.txt --tokenizer_name nferruz/ProtGPT2 --do_train True --do_eval True --output_dir model --num_train_epochs 4 --local_rank -1 --per_device_train_batch_size=1 --no_cuda False --learning_rate 1e-06

  2. The accuracy I got after fine-tuning is quite low. Is there any way to improve it, or are there recommended values for the learning rate or the number of epochs?
    Thanks a lot~
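
For the first question, one check that may help narrow things down is whether the PyTorch install in the active conda environment can see the GPU at all. The sketch below assumes the fine-tuning script runs on PyTorch, which I have not confirmed for protGPT2finetune.py; `describe_device` is a hypothetical helper name used only for illustration.

```python
import importlib.util


def describe_device():
    """Return a short description of where PyTorch would run in this environment."""
    # Assumption: the training script depends on PyTorch.
    if importlib.util.find_spec("torch") is None:
        return "torch is not installed in this environment"

    import torch

    # torch.version.cuda is None when the installed build is CPU-only.
    build = f"torch {torch.__version__}, CUDA build {torch.version.cuda}"
    if torch.cuda.is_available():
        return f"GPU visible: {torch.cuda.get_device_name(0)} ({build})"
    return f"CPU only ({build})"


print(describe_device())
```

If this reports a CPU-only result with `CUDA build None`, the installed PyTorch was probably built without CUDA support, and in that case command-line flags alone are unlikely to move training to the GPU.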
