Théo Gigant

gigant

https://giganttheo.github.io/

AI & ML interests

multimodal summarization, generative models

Recent Activity

updated a dataset about 8 hours ago

gigant/cnn_dailymail_oreo_jinacolbertv2_10k

updated a dataset about 17 hours ago

gigant/cnn_dailymail_oreo_10k

updated a dataset 2 days ago

gigant/reddit_tifu_oreo

Articles

Design choices for Vision Language Models in 2024

Apr 16

• 25

gigant's activity

upvoted a paper 2 months ago

EuroLLM: Multilingual Language Models for Europe

Paper • 2409.16235 • Published Sep 24 • 24

upvoted 2 papers 3 months ago

Contextual Position Encoding: Learning to Count What's Important

Paper • 2405.18719 • Published May 29 • 5

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22 • 122

upvoted 3 papers 4 months ago

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3 • 78

Harvesting Textual and Structured Data from the HAL Publication Repository

Paper • 2407.20595 • Published Jul 30 • 21

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

Paper • 2406.11271 • Published Jun 17 • 20

upvoted an article 4 months ago

Article

Docmatix - a huge dataset for Document Visual Question Answering

Jul 18

• 68

upvoted 2 articles 5 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 279

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Jul 5

• 169

upvoted 5 papers 5 months ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10 • 68

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3 • 92

upvoted 6 articles 6 months ago

Article

An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct

Jun 11

• 48

Article

Vision Language Models Explained

Apr 11

• 218

Article

Uncensor any LLM with abliteration

Jun 13

• 375

Article

Explaining the SDXL latent space

May 20

• 33

Article

AI has a problem with objectifying women

May 24

• 55

Article

MobileNet-V4 (now in timm)

Jun 17

• 39