Falcon 40B Inference at 4bit in Google Colab
pinned
27
#38 opened over 1 year ago
by
serin32
Custom 4-bit Finetuning 5-7 times faster inference than QLora
pinned
6
#25 opened over 1 year ago
by
rmihaylov
remove-extra-parentheses
#115 opened 4 months ago
by
ZennyKenny
Could not locate the configuration_RW.py inside tiiuae/falcon-40b-instruct.
#114 opened 7 months ago
by
cosmino
[AUTOMATED] Model Memory Requirements
#113 opened 7 months ago
by
model-sizer-bot
Adding Evaluation Results
#111 opened 8 months ago
by
leaderboard-pr-bot
Could someone upload a tokenizer.model file? to allow for making ggufs
#110 opened about 1 year ago
by
RonanMcGovern
Add chat_template so that it can be used for chat out-of-box
#109 opened about 1 year ago
by
chujiezheng
pb when testing the model
#108 opened about 1 year ago
by
louvivien
Update generation_config.json
1
#106 opened about 1 year ago
by
nkasmanoff
Gradio interface
#105 opened about 1 year ago
by
sequentialsystems
Optimizing Inference Time for Chat Conversations on Falcon
2
#104 opened about 1 year ago
by
humza-sami
Finetuned Falcon40 is not working with pipeline (text-generation)
#103 opened about 1 year ago
by
chelouche9
Advice on inference over a large-ish dataset in Databricks?
#102 opened about 1 year ago
by
archonlith
Use input attention mask instead of casual mask in attention
#101 opened about 1 year ago
by
CyberZHG
Inference
4
#99 opened over 1 year ago
by
davidhung
Best Practice for Handling Variable-Length Sequences in Training an LLM Model on a Chatbot Dataset
#98 opened over 1 year ago
by
humza-sami
Request: DOI
#97 opened over 1 year ago
by
waelTalan
Getting HTTP Error Code: 422 when using Inference API
2
#96 opened over 1 year ago
by
reetkat
Run falcon on Mac
2
#95 opened over 1 year ago
by
corin9122
Unable to use all cores.
2
#94 opened over 1 year ago
by
armx40
Bug: the model's head dimensionality is hardcoded
#93 opened over 1 year ago
by
danieldk-explosion
Fine-tune on model response only?
1
#92 opened over 1 year ago
by
mkserge
Finetuning Base Falcon on Unseen Language/New data (non instruct/RLHF)
2
#91 opened over 1 year ago
by
AshBam
Slow response time for 7b and 40b
6
#89 opened over 1 year ago
by
kartik99
configuration_RW.py Missing in the latest commit
#88 opened over 1 year ago
by
ravikiran3690
Update README.md
2
#87 opened over 1 year ago
by
FelixMildon
Falcon breaks after the second prompt of code.
#86 opened over 1 year ago
by
thecowmilk
Changes in modelling_RW.py to be able to handle past_key_values for faster model generations
8
#85 opened over 1 year ago
by
puru22
@TII Falcon is stunning but will you continue or is the majestic bird destined to starve ?
#84 opened over 1 year ago
by
cmp-nct
Finetune Error using the notebook referred on the model page
#83 opened over 1 year ago
by
hamad
Nvidia H100 Finetuning Error on BitsandBytes
2
#82 opened over 1 year ago
by
ashmitbhattarai
new here, confused which .bin file to download?
#80 opened over 1 year ago
by
kingofdelphi
Update generation_config.json
#77 opened over 1 year ago
by
psinger
Request: DOI
#76 opened over 1 year ago
by
winter6below618
Seeking insights on integrating RAG with Falcon for Domain Specific requirements
#75 opened over 1 year ago
by
rahul2008d
Prevent Hallucinations
1
#74 opened over 1 year ago
by
Zhaoqiong
Deployment on Azure ML
1
#73 opened over 1 year ago
by
Eliahu551818
Access To Hidden States
#72 opened over 1 year ago
by
DJT777
Were special tokens trained?
#71 opened over 1 year ago
by
Tron2060
Example code from README output is nonsense
1
#70 opened over 1 year ago
by
amitgurintecom
New language
2
#69 opened over 1 year ago
by
mindplay
GPU requirements
7
#68 opened over 1 year ago
by
GuySerk
Cuda out of memory error.
2
#67 opened over 1 year ago
by
ibrim
ValueError: The following model_kwargs are not used by the model: ['token_type_ids'] (note: typos in the generate arguments will also show up in this list)
1
#66 opened over 1 year ago
by
yiz4869
How to fine tune falcon for summarization on xsum?
1
#65 opened over 1 year ago
by
uzumakiusa
Need claritiy about the adjustable model hyperparameters
#64 opened over 1 year ago
by
Someshfengde
Update README.md
#63 opened over 1 year ago
by
Gage888
Borken docs link Use in transformers
1
#62 opened over 1 year ago
by
natika1
Hello, may I know where can I get the embeddings for falcon-40b?
3
#61 opened over 1 year ago
by
kurtgan