HuggingFaceM4/idefics2-8b

Update README.md

#1

by HugoLaurencon - opened Apr 15

base: refs/heads/main ← from: refs/pr/1

README.md +3 -0

@@ -38,6 +38,7 @@ Idefics2 is an open multimodal model that accepts arbitrary sequences of image a
 We release under the Apache 2.0 license 2 checkpoints:
 - [idefics2-8b-base](https://huggingface.co/HuggingFaceM4/idefics2-8b-base): the base model
 - [idefics2-8b](https://huggingface.co/HuggingFaceM4/idefics2-8b): the base model fine-tuned on a mixture of supervised and instruction datasets (text-only and multimodal datasets)
+- idefics2-8b-chatty (coming soon): `idefics2-8b` further fine-tuned on long conversations
 # Model Summary
@@ -59,6 +60,8 @@ We release under the Apache 2.0 license 2 checkpoints:
 For optimal results, we recommend fine-tuning `idefics2-8b` on one's specific use-case and data. In fact, the instruction-fine-tuned model (`idefics2-8b`) is significantly better at following instructions from users and thus should be preferred when using the models out-of-the-box or as a starting point for fine-tuning.
+`idefics2-8b` usually generates very short answers. For long generations, use `idefics2-8b-chatty`, which was further fine-tuned on long conversations.
 As a starting point, we provide fine-tuning codes that can be adapted for one's particular scenario:
 - With the [TRL library](https://github.com/huggingface/trl): [Script](https://gist.github.com/edbeeching/228652fc6c2b29a1641be5a5778223cb)
 - With the [Hugging Face Trainer](https://huggingface.co/docs/transformers/main/en/main_classes/trainer#api-reference%20][%20transformers.Trainer): [Tutorial notebook](https://colab.research.google.com/drive/1NtcTgRbSBKN7pYD3Vdx1j9m8pt3fhFDB?usp=sharing)
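As a quick sanity check, the two hunk headers can be reconciled with the `README.md +3 -0` summary. The sketch below is illustrative only: the helper names `parse_hunk` and `net_line_change` are hypothetical and not part of any Hub tooling, and it assumes standard unified-diff hunk headers of the form `@@ -old_start,old_len +new_start,new_len @@`. The page shows only two `+` lines, so the third added line is presumably a blank line that the rendering dropped.

```python
import re

# Hunk headers copied verbatim from this diff (trailing context text omitted).
HUNK_HEADERS = [
    "@@ -38,6 +38,7 @@",
    "@@ -59,6 +60,8 @@",
]

# Unified-diff hunk header; a missing length defaults to 1.
HUNK_RE = re.compile(r"^@@ -(\d+)(?:,(\d+))? \+(\d+)(?:,(\d+))? @@")


def parse_hunk(header):
    """Return (old_start, old_len, new_start, new_len) for one hunk header."""
    match = HUNK_RE.match(header)
    if match is None:
        raise ValueError(f"not a unified-diff hunk header: {header!r}")
    old_start, old_len, new_start, new_len = match.groups()
    return (
        int(old_start),
        int(old_len) if old_len is not None else 1,
        int(new_start),
        int(new_len) if new_len is not None else 1,
    )


def net_line_change(headers):
    """Sum of (new_len - old_len) over all hunks: additions minus deletions."""
    return sum(
        new_len - old_len
        for _, old_len, _, new_len in (parse_hunk(h) for h in headers)
    )


if __name__ == "__main__":
    # (7 - 6) + (8 - 6) = 3, matching "+3 -0" since nothing was deleted.
    print(net_line_change(HUNK_HEADERS))
```

The second hunk starts at line 59 in the old file and line 60 in the new one, a shift of one line, which matches the single line added by the first hunk.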