Resources

View closed (16)

RuntimeError: cu_seqlens_q must have dtype int32

#59 opened 3 days ago by

ginnyyk

Update README.md

#58 opened 4 days ago by

mobi

testing for inference endpoints

#57 opened 9 days ago by

nbroad

transformers requirement

#53 opened 9 days ago by

nbroad

如果利用VL模型获取视觉层的Embedding

#52 opened 10 days ago by

weiminw

Updated README for GPU configuration.

#51 opened 11 days ago by

aliasgerovs

Anyone can prompt input to show the exactly of image size?

#50 opened 14 days ago by

xJohn

Stable transformer version

#49 opened 17 days ago by

Jkppp

Is visual grounding possible on multiple images?

#48 opened 23 days ago by

echooooooooo

How many tokens is one image?

#47 opened about 1 month ago by

MoritzLaurer

RuntimeError: CUDA error: operation not permitted when stream is capturing

#46 opened about 1 month ago by

yuyanggo

Adding Evaluation Results

#45 opened about 1 month ago by

leaderboard-pr-bot

CUDA error: CUBLAS_STATUS_EXECUTION_FAILED

#44 opened about 1 month ago by

yuyanggo

KeyError: 'qwen2_vl' loading from Transformers

#42 opened about 2 months ago by

KevalRx

Batch inference on many images

#41 opened about 2 months ago by

yadavsaakash

Handling multiple images in a pdf to preserve context during processing.

#40 opened about 2 months ago by

ananthv

Questions about Naive Dynamic Resolution and the vision mask

#39 opened about 2 months ago by

YaYaGeGe

it run on cpu

#38 opened about 2 months ago by

sdyy

Request for Help: Passing an Image in cURL with vLLM

#36 opened 2 months ago by

ananthv

Ollama api setup for Qwen2

#35 opened 3 months ago by

RagulMahendran

Neto discussion

#34 opened 3 months ago by

Neto1780

An error occurred: shape mismatch

#33 opened 3 months ago by

VeeP

Finetuning script using HuggingFace (No llama-factory)

#32 opened 3 months ago by

2U1

Able to successfully deploy as Inference Endpoint?

#31 opened 3 months ago by

philglazer

GGUF models

#30 opened 3 months ago by

mariahelenass

可以用来做多模态检索吗

#29 opened 3 months ago by

Lecheal

OCR on image

#28 opened 3 months ago by

glitchyordis

Update chat_template.json to incorporate `generation` tag

#27 opened 3 months ago by

linyueqian

Request: DOI

#26 opened 3 months ago by

samzong

Value of fps for video inference

#25 opened 3 months ago by

shivanis14

Are GGUF models available?

#24 opened 3 months ago by

smcleod

support in ollama

#21 opened 3 months ago by

Goekdeniz-Guelmez

when i use torch.float16，i face this problem probability tensor contains either `inf`, `nan` or element < 0

#20 opened 3 months ago by

als-991011

Can it be run on a 3090 with 24gb VRAM?

#18 opened 3 months ago by

mnemic

Nerfed with people

#17 opened 3 months ago by

spawn99

ValueError: Unrecognized configuration class <class 'transformers.models.qwen2_vl.configuration_qwen2_vl.Qwen2VLConfig'> for this kind of AutoModel: AutoModelForSeq2SeqLM.

#16 opened 3 months ago by

vinz1396

Arabic

#15 opened 3 months ago by

MubashshirMohammad

When extracting text from an image, some text is missing.

#14 opened 3 months ago by

wol2001

Support for multi-round question answering in Qwen2-VL-7B-Instruct

#12 opened 3 months ago by

zhanchao019

Working sample for mac

#11 opened 3 months ago by

spawn99

RuntimeError: MPS backend out of memory.

#8 opened 3 months ago by

TahaZk

LoRA Finetuning Tool for Qwen2-VL-7B in Web UI (DPO updated)

#2 opened 3 months ago by

hiyouga

🍭 Fine-tuning support for Qwen2-VL-7B-Instruct

#1 opened 3 months ago by

study-hjt