Confusion with the parameter fixed in visual ViT merger in source code.

#52
by Bytes-Lin - opened

The context_dim parameter seems wrong.

image.png

See details in: https://github.com/huggingface/transformers/issues/38889

Sign up or log in to comment