meta-llama/Llama-3.1-8B-Instruct

#96 opened 3 months ago by

clearcash

llama3.1 gguf format

#95 opened 3 months ago by

davidomars

Crashes

#94 opened 3 months ago by

wing1x

how can i use git clone Meta-Llama-3.1-8B-Instruct

#93 opened 3 months ago by

xiangsuyu

Asking for Pro subscription

#92 opened 3 months ago by

Mayo133

update rope_scaling

#91 opened 3 months ago by

Arunjith

Update for correct tool use system prompt

#90 opened 3 months ago by

ricklamers

What call() function parameters besides "query" can be used by the model when doing brave_search and wolfram_alpha tool calls?

#89 opened 3 months ago by

sszymczyk

What form of the built-in brave_search and wolfram_alpha tool call output is expected by the model?

#88 opened 3 months ago by

sszymczyk

ValueError

#87 opened 3 months ago by

Bmurug3

Request: DOI

#86 opened 3 months ago by

sanjeev929

Request: DOI

#85 opened 3 months ago by

moh996

The model repeatedly outputs a large amount of text and does not comply with the instructs.

9

#84 opened 3 months ago by

baremetal

Llama repo access not aproved yet

#83 opened 3 months ago by

APaul1

Throwing Error for AutoModelForSequence Classification

#82 opened 3 months ago by

deshwalmahesh

GSM8K Evaluation Result: 84.5 vs. 76.95

17

#81 opened 3 months ago by

tanliboy

Deploying Llama3.1 to Nvidia T4 instance (sagemaker endpoints)

#80 opened 3 months ago by

mleiter

Variable answer is getting predicted for same prompt

#79 opened 3 months ago by

sjainlucky

Efficiency low after adding the adapter_model.safetensors with base model

#78 opened 3 months ago by

antony-pk

Minimum gpu ram capacity

10

#77 opened 3 months ago by

bob-sj

Tokenizer padding token

#76 opened 3 months ago by

Rish1

new tokenizer contains the cutoff date and today date by default

#74 opened 3 months ago by

yuchenlin

New bee questions

#73 opened 3 months ago by

rkapuaala

Add `base_model` metadata

#72 opened 3 months ago by

sbrandeis

Full SFT training caused lose its foundational capabilities

10

#71 opened 3 months ago by

sinlew

Wrong number of tensors; expected 292, got 291

#69 opened 3 months ago by

KingBadger

Fine tuned Meta-Llama-3.1-8B-Instruct deployment on AWS Sagemaker fails

#68 opened 3 months ago by

byamasuwhatnowis

Quick Fix: Rope Scaling or Rope Type Error

#67 opened 3 months ago by

deepaksiloka

Can't reproduce MATH performance

#66 opened 3 months ago by

jpiabrantes

Banned for Iranian People

13

#65 opened 3 months ago by

MustafaLotfi

Inference endpoint deployment for 'meta-llama/Meta-Llama-3.1-8B-Instruct' fails

#62 opened 3 months ago by

Keertiraj

Meta-Llama-3.1-8B-Instruct deployment on AWS Sagemaker fails

#61 opened 3 months ago by

Keertiraj

Error Loading the original model file consolidated.00.pth from local

#60 opened 3 months ago by

chanduvkp

vdl

#59 opened 3 months ago by

danakin1

Unable to deploy Meta-Llama-3.1-8B-Instruct model on Sagemaker

#58 opened 3 months ago by

axs531622

CUDA out of memory on RTX A5000 inference.

#57 opened 3 months ago by

RoberyanL

Update README.md to reflect correct transformers version

#56 opened 3 months ago by

priyakhandelwal

Update README.md to reflect correct transformers version

#55 opened 3 months ago by

priyakhandelwal

NotImplementedError: Could not run 'aten::_local_scalar_dense' with arguments from the 'Meta' backend.

#54 opened 4 months ago by

duccio84

Some of you might be interested in my 'silly' experiment.

#52 opened 4 months ago by

ZeroWw

Updated config.json

#51 opened 4 months ago by

WestM

🚀 LMDeploy support Llama3.1 and its Tool Calling. An example of calling "Wolfram Alpha" to perform complex mathematical calculations can be found from here!

#50 opened 4 months ago by

vansin

HF pro subscription for llama 3.1-8b

#49 opened 4 months ago by

ostoslista

Significant bias

#48 opened 4 months ago by

stutteringp0et

`rope_scaling` must be a dictionary with two fields

#46 opened 4 months ago by

thunderdagger

Unable to load Llama 3.1 to Text-Genration WebUI

#45 opened 4 months ago by

keeeeesz

BUG Chat template doesn't respect `add_generation_prompt`flag from transformers tokenizer

#44 opened 4 months ago by

ilu000

How to use the ASR on LLama3.1