Spaces: hsienchen/gemini-mm-cot
hsienchen committed on Jan 16

Commit c6232d6 • 1 Parent(s): 5786d40

Update app.py

Files changed (1)

app.py

@@ -72,6 +72,14 @@ with gr.Blocks(theme='snehilsanyal/scikit-learn') as app:
                                 [chatbot,text_box,image_box],
                                 chatbot
                                 )
-    gr.Markdown("## Examples")
+    gr.Markdown("""
+    # Multimodal Chain-of-Thought Reasoning in Language Models
+    <h5 align="center"><i>"Imagine learning a textbook without figures or tables."</i></h5>
+    Multimodal-CoT incorporates vision features in a decoupled training framework. The framework consists of two training stages: (i) rationale generation and (ii) answer inference. Both stages share the same model architecture but differ in the input and output.
+    """)
 app.queue()
 app.launch()
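
The Markdown text added in this commit describes a two-stage framework: (i) rationale generation and (ii) answer inference, with both stages sharing one architecture but differing in input and output. The sketch below is only an illustration of that data flow and is not part of `app.py`. The names `Example`, `two_stage_answer`, and the toy models are hypothetical, and feeding the generated rationale back into the second stage's text input is an assumption about how the stage inputs differ.

```python
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple


@dataclass
class Example:
    """Hypothetical container for one multimodal question."""
    question: str
    vision_features: Sequence[float]  # stand-in for real image features


# Assumption: a "model" is any callable mapping (text, vision features) to text.
Model = Callable[[str, Sequence[float]], str]


def two_stage_answer(example: Example,
                     rationale_model: Model,
                     answer_model: Model) -> Tuple[str, str]:
    # Stage (i): rationale generation from the question and vision features.
    rationale = rationale_model(example.question, example.vision_features)

    # Stage (ii): answer inference. Assumed input: the question with the
    # generated rationale appended, plus the same vision features.
    answer_input = f"{example.question}\nRationale: {rationale}"
    answer = answer_model(answer_input, example.vision_features)
    return rationale, answer


# Toy stand-ins so the sketch runs without any trained model.
def toy_rationale_model(text: str, vision: Sequence[float]) -> str:
    return f"the image has {len(vision)} features"


def toy_answer_model(text: str, vision: Sequence[float]) -> str:
    return "A" if "Rationale:" in text else "unknown"


example = Example("Which option is correct?", [0.1, 0.2, 0.3])
rationale, answer = two_stage_answer(example, toy_rationale_model, toy_answer_model)
print(rationale)
print(answer)
```

Both stages are passed in as callables with the same signature, which mirrors the "same architecture, different input and output" description; in a real setup each callable would wrap a separately trained model.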