Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
25
1
Fahadh
fahadh4ilyas
Follow
jmjzz's profile picture
SinclairSchneider's profile picture
Qubitium's profile picture
4 followers
·
1 following
fahadh4ilyas
AI & ML interests
None yet
Recent Activity
New activity
about 2 months ago
google/datagemma-rig-27b-it:
Why the example prompt doesn't include prompt format?
View all activity
Organizations
None yet
fahadh4ilyas
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
google/datagemma-rig-27b-it
about 2 months ago
Why the example prompt doesn't include prompt format?
3
#8 opened about 2 months ago by
fahadh4ilyas
New activity in
defog/llama-3-sqlcoder-8b
5 months ago
Example prompt?
2
#17 opened 5 months ago by
fahadh4ilyas
New activity in
LargeWorldModel/ultrachat_qa_mix_128K
6 months ago
What does it mean to pre-pack UltraChat data?
#1 opened 6 months ago by
fahadh4ilyas
New activity in
gradientai/Llama-3-8B-Instruct-Gradient-1048k
6 months ago
Rope Theta Value Difference?
#24 opened 6 months ago by
fahadh4ilyas
New activity in
CohereForAI/aya-23-8B
6 months ago
What Does `elif false == true` means in chat template?
#4 opened 6 months ago by
fahadh4ilyas
upvoted
a
collection
8 months ago
Hermes 2
Collection
Nous' Flagship LLM Series
•
23 items
•
Updated
Aug 15
•
101
New activity in
mistralai/Mistral-7B-Instruct-v0.2
8 months ago
What is the max. content length of Mistral-7B-Instruct-v0.2?
17
#43 opened 10 months ago by
hanshupe
New activity in
databricks/dbrx-instruct
8 months ago
The fused expert parameters means load_in_4bit doesn't work properly, nor does LoRA
31
#10 opened 8 months ago by
tdrussell
New activity in
LnL-AI/dbrx-base-converted-v2
8 months ago
Ready for Testing...
9
#1 opened 8 months ago by
Qubitium
Fix import typo
#2 opened 8 months ago by
fahadh4ilyas
New activity in
databricks/dbrx-instruct
8 months ago
Failing to 4-bit quantize with BitsAndBytes
1
#16 opened 8 months ago by
simsim314
New activity in
microsoft/phi-2
9 months ago
Target modules {'out_proj', 'Wqkv'} is not found in the phi-2 model how can I fix this error?
2
#115 opened 9 months ago by
roy1109
New activity in
liuhaotian/llava-v1.6-mistral-7b
9 months ago
Some value in config is not used?
#7 opened 9 months ago by
fahadh4ilyas
New activity in
sshh12/Mistral-7B-LoRA-AudioWhisper
9 months ago
Where is the adapter_model.bin?
1
#1 opened 9 months ago by
fahadh4ilyas
New activity in
microsoft/phi-2
10 months ago
Model token size is bigger than tokenizer size?
2
#97 opened 10 months ago by
fahadh4ilyas
Why inside `modeling_phi.py`, the output from Self Attention is not becoming the input of MLP?
1
#94 opened 10 months ago by
fahadh4ilyas
New activity in
openchat/openchat_sharegpt_v3
about 1 year ago
-100 vs 0 in label?
#2 opened about 1 year ago by
fahadh4ilyas
New activity in
Yukang/Llama-2-13b-chat-longlora-32k-sft
about 1 year ago
Why this model kept generating \n when loaded with text generation web ui?
4
#2 opened about 1 year ago by
fahadh4ilyas
New activity in
TheBloke/falcon-40b-instruct-GPTQ
over 1 year ago
Offloading to cpu not working?
1
#21 opened over 1 year ago by
fahadh4ilyas
New activity in
TheBloke/falcon-40b-sft-mix-1226-GGML
over 1 year ago
Can it be loaded using text generation web ui?
1
#2 opened over 1 year ago by
fahadh4ilyas
Load more