Martial Terran

MartialTerran

AI & ML interests

I, Martial Terran, am leading a group to build solar-powered, GPT-powered TimeCapsuleTeacher(TM) laptop computers to provide language, math, and science education to non-English-speaking people of the future in a Post-Apophis World.

Recent Activity

liked a model 3 days ago

MartialTerran/Toy_GPTs_LLMs_for_CPU_Educational

New activity 5 days ago

HuggingFaceTB/SmolLM2-135M:Size Mismatch in safetensors file

New activity 5 days ago

mradermacher/DragonAI-Python-SmolLM2-1.7B-Instruct-GGUF:DragonAI-Python-SmolLM2_model.py???

MartialTerran's activity

New activity in HuggingFaceTB/SmolLM2-135M 5 days ago

Size Mismatch in safetensors file

#3 opened 11 days ago by

MartialTerran

New activity in mradermacher/DragonAI-Python-SmolLM2-1.7B-Instruct-GGUF 5 days ago

DragonAI-Python-SmolLM2_model.py???

#1 opened 11 days ago by

MartialTerran

New activity in shripadbhat/whisper-tiny-mr 5 days ago

Under-100M Parameter for detecting 20 Marathi numbers?

#1 opened 12 days ago by

MartialTerran

New activity in Qwen/Qwen2-1.5B-Instruct 10 days ago

Error. Crash. "The attention mask is not set and cannot be inferred from input

#8 opened 10 days ago by

MartialTerran

Qwen2 sample model.py does not work.

#7 opened 10 days ago by

MartialTerran

New activity in HuggingFaceTB/SmolLM2-360M 10 days ago

B/c Size Mismatch, Cant use from transformers import LlamaForCausalLM as workaround.

#5 opened 10 days ago by

MartialTerran

New activity in HuggingFaceTB/SmolLM2-135M 10 days ago

Also cant use from transformers import LlamaForCausalLM as a workaround, because of size mismatch.

#4 opened 10 days ago by

MartialTerran

New activity in karpathy/gpt2_1558M_final4_hf 11 days ago

GPT2_model.py

#1 opened 11 days ago by

MartialTerran

New activity in vonjack/SmolLM2-1.7B-Merged 11 days ago

Where is SmolLM2_model.py???

#1 opened 11 days ago by

MartialTerran

New activity in bunnycore/SmolLM2-1.7B-SmallBig 11 days ago

Where is SmolLM2_model.py????

#1 opened 11 days ago by

MartialTerran

New activity in HuggingFaceTB/SmolLM2-360M 11 days ago

Safetensors size mismatch.

#4 opened 11 days ago by

MartialTerran

Sample Model Script for bfloat16 downloads safetensors parameters files then declares mismatch in their dimensions.

#3 opened 11 days ago by

MartialTerran

Need Help to build a SmolLM2_360M_model.py

#2 opened 11 days ago by

MartialTerran

New activity in openai/whisper 12 days ago

Distinguishing between speech and non speech

#74 opened over 1 year ago by

CarelessWhisperer

New activity in openai/whisper 14 days ago

Phoneme recognition

#86 opened over 1 year ago by

dg96

Whisper Finetuning - Validation loss is increasing but WER is Decreasing

#107 opened 11 months ago by

anahar

New activity in Corianas/llama-tiny-reactor 19 days ago

Storing Spelling information in LLMs

#2 opened about 1 month ago by

MartialTerran

Pad Token not uniquely defined?

#3 opened 19 days ago by

MartialTerran

New activity in Qwen/Qwen2.5-Coder-1.5B 21 days ago

Optimizing Qwen Coder Models (1.5B & 3B) for Python and Edge Deployment

#6 opened 21 days ago by

MartialTerran

New activity in roneneldan/TinyStories 27 days ago

Duplicates in Train set

#12 opened about 1 year ago by

Qilex