Fine-tuning the Llama-3.2-3B-Instruct foundation model on medical Q&A using differential attention (in progress). Paper: https://arxiv.org/pdf/2410.05258
Ali Janati
Na0s
AI & ML interests
NLP, Speech Recognition, Computer Vision, Time Series Forecasting.
models
17
Na0s/Mixtral-8x7B-Instruct-v0.1-exhaustive-LoRA • Text Generation • 13
Na0s/Mixtral-8x7B-v0.1-instruct-pruned-random-3-experts • Text Generation • 10
Na0s/Mixtral-8x7B-v0.1-instruct-pruned-random-4-experts • Text Generation • 16
Na0s/Mixtral-8x7B-v0.1-instruct-pruned-random-2-experts • Text Generation • 10
Na0s/Mixtral-8x7B-v0.1-instruct-pruned-random-1-experts • Text Generation • 11
Na0s/Mixtral-8x7B-v0.1-instruct-l2-norm-post-Gates-SFT-pruned-1-experts • Text Generation • 7
Na0s/Mixtral-8x7B-v0.1-instruct-l2-norm-post-Gates-SFT-pruned-3-experts • Text Generation • 11
Na0s/Mixtral-8x7B-v0.1-instruct-l2-norm-post-Gates-SFT-pruned-4-experts • Text Generation • 13
Na0s/Medical-Whisper-Large-v3 • Automatic Speech Recognition • 221 • 1
Na0s/Mixtral-8x7B-v0.1-instruct-l2-norm-post-Gates-SFT-pruned-2-experts • Text Generation • 12
datasets
37
Na0s/sft-ready-garage-bAInd-Open-Platypus • 24.9k • 44
Na0s/Next_Token_Prediction_dataset • 5.5M • 107
Na0s/sft-ready-neulab-conala • 2.38k • 42
Na0s/sft-ready-HuggingFaceH4-ultrachat-200k • 658k • 50
Na0s/sft-ready-Text-Generation-Augmented-Data • 7.67M • 70
Na0s/sft-ready-Teknium-OpenHermes • 243k • 39
Na0s/sft-ready-google-boolq • 9.43k • 39
Na0s/sft-ready-allenai-WildChat-1M • 1.96M • 44
Na0s/sft-ready-toughdata-quora-question-answer-dataset • 56.4k • 40
Na0s/sft-ready-nvidia-HelpSteer2 • 10.2k • 43