zero-gpu-explorers/README · use authentication in huggingface Gradio API!!!(hosting on ZeroGPU)

27 days ago

Guys.

I have already hosted my code on ZeroGPU(for that i subscribe the PRO)

When I visited him on the webpage (logged in as my PRO user), I did receive 5x usage quota compared to free users.

But when I use it in Python code, I use the gradio_client, and I can indeed post requests to the Gradio API that I host on HF Space using ZeroGPU.I found that my quota is when I am not logged in.

By the way, why i know the quota is when i am not logged in?

I do some test, finally i get some information:

NOT LOGIN: the quota is about 180s
LOGIN: the quota is 300s
PRO USER: the quota is 1500s.....

So i just want find some way to solve this problem, i want use my PRO user in my code!!!

I have tried carrying HF tokens or headers (including cookies), but they have not worked and I am still logged in.

The error just like:
gradio_client.exceptions.AppError: The upstream Gradio app has raised an exception: You have exceeded your GPU quota (150s requested vs. 149s left). Create a free account to get more usage quota.

John6666

26 days ago

It's not the crux of this issue, but if the durarion designation exceeds 120 seconds, it's basically buggy.

Nerva1228

13 days ago

It's not the crux of this issue, but if the durarion designation exceeds 120 seconds, it's basically buggy.

NO, I just do a test to find the quota.
Actually, the duration maybe 5~10s

John6666

13 days ago

Actually, the duration maybe 5~10s

Oh. If so, it's definitely a bug. 😅
I'm also using Zero GPU space, and sometimes the login quota doesn't work properly.
However, in my case, for some reason it's a different pattern from yours, and the login status doesn't work properly in the web browser. If I sign in explicitly using the OAuth sign-in button on the space, it works.
I've never used the Gradio Client...

Moibe

8 days ago

I have exactly the same need as Nerva1228, and thanks John666 but it seems that, sound like a meme but, it's not a bug it's a feature. Not being able to use my PRO user quota programatically is not good, I want to test my own spaces with different combinations from within my code and I'm as limited as a normal user, is this intended to be that way or there is a solution, or maybe a feature for the future.

frostbyte07

5 days ago

I got it to work!!!!

I duplicated the black-forest-labs/FLUX.1-dev (!!!Important!!!! Go to their Model Card Page and get access granted to their gated model first. Then you can duplicate the model)
Set the environment variables:
HF_TOKEN =Use your HF_token (I used read and write credentials for this)
ZEROGPU_V2=true
ZERO_GPU_PATCH_TORCH_DEVICE=1

Then in your own (duplicated) space:

Navigate to the Files
Click on app.py
Change

@spaces.GPU(duration=75)  # The duration max value can be 120 but this wasn't enough and still didn't work for me

to

@spaces.GPU()

Make sure in your python code that uses the gradio_client python library your HF_TOKEN is set in the environment or you set the parameter hf_token when creating the client
example:

from gradio_client import Client 
 client = CLIENT("your_duplicated_space/FLUX.1-dev", hf_token=os.getenv("HF_TOKEN"))

Moibe

5 days ago

I got it to work!!!!

I duplicated the black-forest-labs/FLUX.1-dev (!!!Important!!!! Go to their Model Card Page and get access granted to their gated model first. Then you can duplicate the model)

Set the environment variables:
HF_TOKEN =Use your HF_token (I used read and write credentials for this)
ZEROGPU_V2=true
ZERO_GPU_PATCH_TORCH_DEVICE=1

Then in your own (duplicated) space:

Navigate to the Files

Click on app.py

Change
@spaces.GPU(duration=75)  # The duration max value can be 120 but this wasn't enough and still didn't work for me
to
@spaces.GPU()
Make sure in your python code that uses the gradio_client python library your HF_TOKEN is set in the environment or you set the parameter hf_token when creating the client
example:
from gradio_client import Client 
 client = CLIENT("your_duplicated_space/FLUX.1-dev", hf_token=os.getenv("HF_TOKEN"))

And what about the usage of the quota, you are able to use your PRO User quota??

frostbyte07

5 days ago

I got it to work!!!!

I duplicated the black-forest-labs/FLUX.1-dev (!!!Important!!!! Go to their Model Card Page and get access granted to their gated model first. Then you can duplicate the model)

Set the environment variables:
HF_TOKEN =Use your HF_token (I used read and write credentials for this)
ZEROGPU_V2=true
ZERO_GPU_PATCH_TORCH_DEVICE=1

Then in your own (duplicated) space:

Navigate to the Files

Click on app.py

Change
@spaces.GPU(duration=75)  # The duration max value can be 120 but this wasn't enough and still didn't work for me
to
@spaces.GPU()
Make sure in your python code that uses the gradio_client python library your HF_TOKEN is set in the environment or you set the parameter hf_token when creating the client
example:
from gradio_client import Client 
 client = CLIENT("your_duplicated_space/FLUX.1-dev", hf_token=os.getenv("HF_TOKEN"))
And what about the usage of the quota, you are able to use your PRO User quota??

Yes. Try it out and see if it works for you.