How to speed up model generation

#13
by LiPengtao12138 - opened


No description provided.
Meta Llama org

@LiPengtao12138 please make sure you are using a GPU during inference

What I like to do is explicitly set the device to "cuda", e.g. device = torch.device("cuda") and then model.to(device), where model is an instance of, say, AutoModelForCausalLM (I'm using Hugging Face's transformers library).
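A minimal sketch of that device placement. Note this uses a tiny stand-in torch.nn.Linear module instead of an actual Llama checkpoint (loading one would require downloading weights), and adds a CPU fallback so the script still runs on machines without a GPU:

```python
import torch

# Prefer the GPU when available; fall back to CPU so the script still runs.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Stand-in for a real model. In practice you would instead do:
#   model = AutoModelForCausalLM.from_pretrained("<your-checkpoint>")
model = torch.nn.Linear(4, 2).to(device)

# Inputs must live on the same device as the model.
x = torch.randn(1, 4, device=device)
out = model(x)
print(out.shape)  # torch.Size([1, 2])
```

The same pattern applies to a real AutoModelForCausalLM: call .to(device) on the model once after loading, and make sure the tokenized input tensors are moved to the same device before calling generate.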
