Martial Terran
MartialTerran
AI & ML interests
I, Martial Terran am leading a Group to build solar-powered TimeCapsuleTeacher(TM} GPT-powered laptop computers, to provide Language, Math and Science Education to Non-English-Speaking people of the future in a Post-Apophis World.
Recent Activity
liked
a model
3 days ago
MartialTerran/Toy_GPTs_LLMs_for_CPU_Educational
New activity
5 days ago
HuggingFaceTB/SmolLM2-135M:Size Mismatch in safetensors file
New activity
5 days ago
mradermacher/DragonAI-Python-SmolLM2-1.7B-Instruct-GGUF:DragonAI-Python-SmolLM2_model.py???
Organizations
MartialTerran's activity
Size Mismatch in safetensors file
6
#3 opened 11 days ago
by
MartialTerran
DragonAI-Python-SmolLM2_model.py???
3
#1 opened 11 days ago
by
MartialTerran
Under-100M Parameter for detecting 20 Marathi numbers?
3
#1 opened 12 days ago
by
MartialTerran
Error. Crash. "The attention mask is not set and cannot be inferred from input
1
#8 opened 10 days ago
by
MartialTerran
Qwen2 sample model.py does not work.
7
#7 opened 10 days ago
by
MartialTerran
B/c Size Mismatch, Cant use from transformers import LlamaForCausalLM as workaround.
1
#5 opened 10 days ago
by
MartialTerran
GPT2_model.py
#1 opened 11 days ago
by
MartialTerran
Where is SmolLM2_model.py???
#1 opened 11 days ago
by
MartialTerran
Where is SmolLM2_model.py????
#1 opened 11 days ago
by
MartialTerran
Safetensors size mismatch.
5
#4 opened 11 days ago
by
MartialTerran
Sample Model Script for bfloat16 downloads safetensors parameters files then declares mismatch in their dimensions.
1
#3 opened 11 days ago
by
MartialTerran
Need Help to build a SmolLM2_360M_model.py
1
#2 opened 11 days ago
by
MartialTerran
Distinguishing between speech and non speech
3
#74 opened over 1 year ago
by
CarelessWhisperer
Phoneme recognition
5
#86 opened over 1 year ago
by
dg96
Whisper Finetuning - Validation loss is increasing but WER is Decreasing
2
#107 opened 11 months ago
by
anahar
Storing Spelling information in LLMs
6
#2 opened about 1 month ago
by
MartialTerran
Pad Token not uniquely defined?
#3 opened 19 days ago
by
MartialTerran
Optimizing Qwen Coder Models (1.5B & 3B) for Python and Edge Deployment
#6 opened 21 days ago
by
MartialTerran
Duplicates in Train set
1
#12 opened about 1 year ago
by
Qilex