LeroyDyer
/

SpydazWeb_AI_HumanAI_008_ChatQA

Text Generation

text-generation-inference

Question-Answer

Token-Classification

Sequence-Classification

LCARS_AI_StarTrek_Computer

chain-of-thought

tree-of-knowledge

forest-of-thoughts

visual-spacial-sketchpad

knowledge-graph

entity-detection

mega-transformers

Mulit-Mega-Merge

Inference Endpoints

Model card Files Files and versions Community

LeroyDyer commited on Nov 18, 2024

Commit

e245501

•

1 Parent(s): 56db493

Update README.md

Files changed (1) hide show

README.md +16 -0

README.md CHANGED Viewed

@@ -17,6 +17,22 @@ language:
 - **License:** apache-2.0
 - **Finetuned from model :** LeroyDyer/SpydazWeb_AI_HumanAI_007
 To create a pipeline for encoding and decoding files (sound or images) to and from Base64, we need to account for the following:
 Generalized File Handling:

 - **License:** apache-2.0
 - **Finetuned from model :** LeroyDyer/SpydazWeb_AI_HumanAI_007
+## The textvision model Works ! the sound/Vision Text model Works !
+In the creation of models for multimodality is it suggested to use a different architecture ?
+Is it for thier pretraining ?
+So is it for just cutting the corner of the expensive training that the people are using a Vision Transformer ?
+Well In fact a simple transformer model can do ALL modalitys ! It is Neural network after all !
+the problem did not change , its only how to frame the question into a text based format : Here with the spydazweb models we use BASE64 Encoding !
+enabling for encoding and decoding of an image ! .. So a model CAN generate a Image using base64 as a representation ! ( yes Its large context! )
+Lets GO !
 To create a pipeline for encoding and decoding files (sound or images) to and from Base64, we need to account for the following:
 Generalized File Handling: