Hemanth-thunder
/

Tamil-Mistral-7B-Instruct-v0.1

Text Generation

function calling

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Hemanth-thunder commited on Apr 14

Commit

398e635

•

1 Parent(s): 39ce5b8

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -36,7 +36,7 @@ The Tamil-Mistral-7B-Instruct-v0.2 Large Language Model (LLM) is an improved ins
 Tamil LLM: A Breakthrough in Tamil Language Understanding In the realm of language models, the fine-tuned Tamil Mistral model represents a significant advancement. Unlike its English counterpart, the Tamil Mistral model is specifically tailored to comprehend and generate text in the Tamil language. This innovation addresses a critical gap, as the English Mistral model fails to effectively engage with Tamil, a language rich in culture and heritage. Through extensive fine-tuning with a base Tamil Mistral model, this iteration has been meticulously enhanced to grasp the nuances and intricacies of the Tamil language. As a result, we are delighted to present a revolutionary model that enables seamless interaction through text. Welcome to the future of conversational Tamil language processing with our instructive model.
 # Dataset
-alpaca dataset (400k) instruction google translated
 # Training time
 18 hrs to train on NVIDIA RTX A6000 48GB with batch size of 30

 Tamil LLM: A Breakthrough in Tamil Language Understanding In the realm of language models, the fine-tuned Tamil Mistral model represents a significant advancement. Unlike its English counterpart, the Tamil Mistral model is specifically tailored to comprehend and generate text in the Tamil language. This innovation addresses a critical gap, as the English Mistral model fails to effectively engage with Tamil, a language rich in culture and heritage. Through extensive fine-tuning with a base Tamil Mistral model, this iteration has been meticulously enhanced to grasp the nuances and intricacies of the Tamil language. As a result, we are delighted to present a revolutionary model that enables seamless interaction through text. Welcome to the future of conversational Tamil language processing with our instructive model.
 # Dataset
+Tamil open instruct dataset (400k) instruction google translated
 # Training time
 18 hrs to train on NVIDIA RTX A6000 48GB with batch size of 30