Model Card for Model ID
FineTuned version of qwen2.5vl
Model Details
Model Description
This model is a fine-tuned version of the Qwen2.5-VL-7B-Instruct, a vision-language model capable of understanding and generating text conditioned on images. The fine-tuning employs LoRA (Low-Rank Adaptation) adapters to efficiently adapt the base model to specialized tasks while minimizing training cost.
- Base Model: Qwen2.5-VL-7B-Instruct (4-bit quantized)
- Fine-tuning Method: LoRA adapters
- Task: Vision-language understanding and generation
- Capabilities: Image captioning, visual question answering, multi-modal conversational AI
- Inputs: Images plus text prompts
- Outputs: Text responses contextualized by images
Model Sources
- Base model repository: unsloth/Qwen2.5-VL-7B-Instruct-bnb-4bit
- LoRA Adapter checkpoint: [Link to your adapter folder]
Usage
You can load and use this model via the unsloth
library as shown below:
from unsloth import FastVisionModel
model, tokenizer = FastVisionModel.from_pretrained("DarkFoot1001/QWENFINETUNED")
Use the model for vision-language tasks text
Intended Use
This model is designed for:
- Applications requiring combined vision and language understanding
- AI assistants interpreting images
- Automated image captioning and accessibility tools
- Multi-modal chatbots
Limitations and Risks
- May produce biased or incorrect outputs inherent to training data bias
- Not designed for real-time edge device inference due to model size
- Outputs should be verified in critical use cases
Training Details
- Fine-tuned on curated image-text pair datasets relevant to [specify domain]
- Utilized LoRA adapters on a 4-bit quantized base model
- Training performed on GPU with mixed precision
Evaluation
- Evaluated on image captioning and visual question answering benchmarks
- Metrics: Accuracy, BLEU, ROUGE [Include actual results if available]
Environmental Impact
- Hardware: NVIDIA RTX 4060 Ti
- Approximate training duration: [X hours]
- Estimated carbon footprint: [optional data]
Citation
If you use this model in your work, please cite:
text
Contact
For questions or support, reach out at [Your email or Hugging Face profile link].
- Downloads last month
- 25
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for DarkFoot1001/QWENFINETUNED
Base model
Qwen/Qwen2.5-VL-7B-Instruct