sberbank-ai committed
Commit • db5dfd1
Parent(s): 4c4e756
Update README.md
README.md
CHANGED
@@ -45,7 +45,7 @@ RUDOLPH 350M is a Transformer-based decoder model with the following parameters:
 
 The primary proposed method is to modify the sparse transformer's attention mask to better control modalities. It allows us to model transitions between modalities in both directions, unlike the similar DALL-E Transformer, which uses only one direction, "text to image". The proposed "image to right text" direction is achieved by extending the sparse attention mask to the right, enabling auto-regressive text generation conditioned on both the image and the left text.
 
-<img src="https://raw.githubusercontent.com/sberbank-ai/ru-dolph/master/pics/
+<img src="https://raw.githubusercontent.com/sberbank-ai/ru-dolph/master/pics/attention_masks_350m.png" height="40" border="2"/>
 
 # Authors
 
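For intuition, here is a minimal sketch in PyTorch of the rightward extension described above. It is not RUDOLPH's actual sparse attention mask; the dense causal mask, the block sizes, and the function name are assumptions made purely for illustration.

```python
import torch

def modality_causal_mask(n_left: int, n_image: int, n_right: int) -> torch.Tensor:
    """Toy dense mask over a [left text | image | right text] sequence.

    True means "may attend". A causal (lower-triangular) mask already
    realizes both transition directions from the paragraph above:
    image tokens see the left text ("text to image"), and right-text
    tokens see the image plus the left text ("image to right text"),
    so right text can be generated auto-regressively conditioned on both.
    """
    n = n_left + n_image + n_right
    return torch.tril(torch.ones(n, n, dtype=torch.bool))

# Example: 4 left-text tokens, 9 image tokens, 4 right-text tokens.
mask = modality_causal_mask(4, 9, 4)
first_right = 4 + 9  # index of the first right-text token
assert mask[first_right, : first_right + 1].all()  # sees left text and image
assert not mask[0, first_right]                     # left text never sees right text
```

The model's actual mask is sparse rather than this dense triangle; the sketch only shows why extending the mask to the right lets text be generated after, and conditioned on, the image.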