LeroyDyer
/

SpydazWeb_VisonEncoderDecoder_Project

Image-Text-to-Text

vision-encoder-decoder

text-generation

image-text-to-image-text


LeroyDyer committed on Apr 8, 2024

Commit 15519b0 · verified · 1 Parent(s): df24a60

Update README.md

Files changed (1)

README.md +10 -0


@@ -82,6 +82,13 @@ Encoder:
 ## How to Get Started with the Model
 ```python
 from transformers import AutoProcessor, VisionEncoderDecoderModel
 import requests
@@ -110,6 +117,9 @@ loss = outputs.loss
 generated_ids = model.generate(pixel_values)
 generated_text = processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
 ```
 [More Information Needed]

 ## How to Get Started with the Model
+### VisionEncoderDecoderModel
+#### As a vision encoder model:
+The tensors are combined into the original Mistral model, so it can be accessed by instantiating the correct model class, `VisionEncoderDecoderModel`:
 ```python
 from transformers import AutoProcessor, VisionEncoderDecoderModel
 import requests
 generated_ids = model.generate(pixel_values)
 generated_text = processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
 ```
+### As a standard LLM:
+It can also still be used as a normal `AutoModelForCausalLM` or `MistralForCausalLM`.
 [More Information Needed]
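
The two loading paths described in this change could be sketched roughly as follows. This is a minimal sketch, not the author's code: the repo id is taken from the page header, `LOADER_CLASSES` and `load_model` are hypothetical names introduced here for illustration, and it assumes the checkpoint really does load under both classes as the README states.

```python
# Repo id as shown in the page header; assumed to be the checkpoint to load.
REPO_ID = "LeroyDyer/SpydazWeb_VisonEncoderDecoder_Project"

# Hypothetical mapping from usage mode to the transformers class named in the README.
LOADER_CLASSES = {
    "vision": "VisionEncoderDecoderModel",  # image -> text path
    "text": "AutoModelForCausalLM",         # plain LLM path
}


def load_model(mode: str, repo_id: str = REPO_ID):
    """Load the checkpoint with the class matching `mode` ("vision" or "text")."""
    if mode not in LOADER_CLASSES:
        raise ValueError(f"mode must be one of {sorted(LOADER_CLASSES)}, got {mode!r}")
    # Imported lazily so the mapping can be inspected without transformers installed.
    import transformers

    model_cls = getattr(transformers, LOADER_CLASSES[mode])
    return model_cls.from_pretrained(repo_id)
```

Calling `load_model("text")` or `load_model("vision")` downloads the weights from the Hub, so it needs network access and enough memory for the model.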