On October 2nd, a really cool paper was released called "Were RNNs All We Needed?" https://arxiv.org/abs/2410.01201
This paper introduces the minGRU model, a simplified version of the traditional Gated Recurrent Unit (GRU) designed to improve efficiency by removing the hidden-state dependencies from its gates. Because the gate and the candidate state depend only on the current input, training can be parallelized across the sequence (e.g., with a parallel scan), making it significantly faster to train than a conventional GRU. Additionally, minGRU drops the tanh non-linearity, further streamlining the computation.
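To make that concrete, here is a minimal sketch of the minGRU recurrence in PyTorch. This is my own illustration of the equations described above (the class and variable names are mine, not from the paper's code), and it uses the plain sequential loop rather than the parallel scan the paper relies on for fast training:

```python
import torch
import torch.nn as nn

class MinGRU(nn.Module):
    """Sketch of the minGRU recurrence: gates depend only on x_t, no tanh."""

    def __init__(self, input_size: int, hidden_size: int):
        super().__init__()
        self.linear_z = nn.Linear(input_size, hidden_size)  # update gate (input-only)
        self.linear_h = nn.Linear(input_size, hidden_size)  # candidate state (no tanh)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, input_size)
        batch, seq_len, _ = x.shape
        h = torch.zeros(batch, self.linear_h.out_features, device=x.device)
        outputs = []
        for t in range(seq_len):
            z = torch.sigmoid(self.linear_z(x[:, t]))  # z_t = sigmoid(Linear_z(x_t))
            h_tilde = self.linear_h(x[:, t])           # h~_t = Linear_h(x_t)
            h = (1 - z) * h + z * h_tilde              # h_t = (1 - z_t) * h_{t-1} + z_t * h~_t
            outputs.append(h)
        return torch.stack(outputs, dim=1)             # (batch, seq_len, hidden_size)
```

Since neither z_t nor the candidate h~_t looks at h_{t-1}, the whole sequence of h_t values can instead be computed in parallel during training, which is where the speedup over a standard GRU comes from.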
So I read the paper and tried training the model myself, and it seems to be doing quite well. You can check out the pre-trained model on Hugging Face Spaces:
- damerajee/mingru-stories