meta-llama
/

Llama-3.1-405B-Instruct

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Resources

View closed (14)

Access request FAQ

#10 opened 4 months ago by

cannot get 405B-model to run

#30 opened 3 days ago by

Llama 3.1 models continuously unavailable

#28 opened 3 months ago by

potential of 405b model

#27 opened 3 months ago by

Update tokenizer_config.json

#26 opened 3 months ago by

Model inference giving 503 error

#25 opened 3 months ago by

Num KV heads changed from 16 to 8?

#21 opened 4 months ago by

This repo is huge!

#19 opened 4 months ago by

Please reply, why am I not allowed to apply for approval? Aren't you open-source?

#18 opened 4 months ago by

Inference Endpoint (dedicated) not available

#16 opened 4 months ago by

why "num_key_value_heads": 16,

#14 opened 4 months ago by

GGUF version request

#13 opened 4 months ago by

🚀 LMDeploy support Llama3.1 and its Tool Calling. An example of calling "Wolfram Alpha" to perform complex mathematical calculations can be found from here!

#11 opened 4 months ago by

TGI available only for pro subscriptions?

#7 opened 4 months ago by

Max output tokens for Llama 3.1

#6 opened 4 months ago by

abhirup-sainapse

Please move PTH/original into new model/repo.

#5 opened 4 months ago by