wing lian
PRO
winglian
2324 followers · 14 following
AI & ML interests
None yet
Recent Activity
liked
a model
10 days ago
nvidia/Hymba-1.5B-Base
updated
a model
17 days ago
axolotl-ai-co/SmolLM2-135M-bnb-nf4-bf16
liked
a dataset
17 days ago
microsoft/orca-agentinstruct-1M-v1
winglian's activity
New activity in
axolotl-ai-co/romulus-mistral-nemo-12b-simpo
2 months ago
Update README.md
#2 opened 2 months ago by
CombinHorizon
New activity in
deepseek-ai/DeepSeek-Prover-V1.5-Base
3 months ago
the config class and config.json uses DeepseekConfig, not v2
1
#5 opened 3 months ago by
winglian
Match the config class name to what the modeling code expects
1
#4 opened 3 months ago by
winglian
New activity in
microsoft/Phi-3.5-mini-instruct
3 months ago
trust_remote_code=True
1
#9 opened 3 months ago by
winglian
New activity in
NousResearch/Hermes-2-Pro-Llama-3-8B
7 months ago
add axolotl tag
#1 opened 7 months ago by
winglian
New activity in
mattshumer/Llama-3-8B-16K
7 months ago
add axolotl tag
#3 opened 7 months ago by
winglian
New activity in
cognitivecomputations/dolphin-2.9-llama3-8b
8 months ago
add axolotl tag
#12 opened 8 months ago by
winglian
New activity in
openbmb/Eurus-RM-7b
8 months ago
Enable flash_attention_2 support since the underlying Mistral model supports it
#3 opened 8 months ago by
winglian
New activity in
meta-llama/Meta-Llama-3-8B
8 months ago
Rename original/tokenizer.model to tokenizer.model
3
#6 opened 8 months ago by
winglian
commented
a paper
8 months ago
Octopus v2: On-device language model for super agent
Paper • 2404.01744 • Published Apr 2 • 57 • 8
New activity in
PrunaAI/dbrx-base-bnb-4bit
8 months ago
invalid weights doesn't match modeling code
1
#3 opened 8 months ago by
winglian
New activity in
SinclairSchneider/dbrx-base-quantization-fixed
8 months ago
reduce verbosity of logging
#1 opened 8 months ago by
winglian
New activity in
databricks/dbrx-instruct
8 months ago
The fused expert parameters means load_in_4bit doesn't work properly, nor does LoRA
31
#10 opened 8 months ago by
tdrussell
New activity in
LnL-AI/dbrx-base-converted-v2
8 months ago
reduce logging verbosity
1
#3 opened 8 months ago by
winglian
New activity in
SinclairSchneider/dbrx-instruct-quantization-fixed
8 months ago
dbrx-base
1
#2 opened 8 months ago by
winglian
New activity in
ai21labs/Jamba-v0.1
8 months ago
finetuning issues
2
#9 opened 8 months ago by
winglian
Fix bias logic to enable QLoRA finetuning
3
#5 opened 8 months ago by
winglian
New activity in
cerebras/SlimPajama-627B
12 months ago
Trouble with streaming
7
#5 opened over 1 year ago by
andersonbcdefg
New activity in
open-llm-leaderboard/open_llm_leaderboard
about 1 year ago
latest commit breaks ability to submit mistral finetunes
4
#410 opened about 1 year ago by
winglian
New activity in
Open-Orca/Mistral-7B-OpenOrca
about 1 year ago
Can you share the training configuration of Axolotl?
3
#24 opened about 1 year ago by
timlim123