Model Card for Model ID

FineTuned version of qwen2.5vl

Model Details

Model Description

This model is a fine-tuned version of the Qwen2.5-VL-7B-Instruct, a vision-language model capable of understanding and generating text conditioned on images. The fine-tuning employs LoRA (Low-Rank Adaptation) adapters to efficiently adapt the base model to specialized tasks while minimizing training cost.

  • Base Model: Qwen2.5-VL-7B-Instruct (4-bit quantized)
  • Fine-tuning Method: LoRA adapters
  • Task: Vision-language understanding and generation
  • Capabilities: Image captioning, visual question answering, multi-modal conversational AI
  • Inputs: Images plus text prompts
  • Outputs: Text responses contextualized by images

Model Sources

Usage

You can load and use this model via the unsloth library as shown below:

from unsloth import FastVisionModel

model, tokenizer = FastVisionModel.from_pretrained("DarkFoot1001/QWENFINETUNED")

Use the model for vision-language tasks text

Intended Use

This model is designed for:

  • Applications requiring combined vision and language understanding
  • AI assistants interpreting images
  • Automated image captioning and accessibility tools
  • Multi-modal chatbots

Limitations and Risks

  • May produce biased or incorrect outputs inherent to training data bias
  • Not designed for real-time edge device inference due to model size
  • Outputs should be verified in critical use cases

Training Details

  • Fine-tuned on curated image-text pair datasets relevant to [specify domain]
  • Utilized LoRA adapters on a 4-bit quantized base model
  • Training performed on GPU with mixed precision

Evaluation

  • Evaluated on image captioning and visual question answering benchmarks
  • Metrics: Accuracy, BLEU, ROUGE [Include actual results if available]

Environmental Impact

  • Hardware: NVIDIA RTX 4060 Ti
  • Approximate training duration: [X hours]
  • Estimated carbon footprint: [optional data]

Citation

If you use this model in your work, please cite:

text

Contact

For questions or support, reach out at [Your email or Hugging Face profile link].

Downloads last month
25
Safetensors
Model size
8.29B params
Tensor type
BF16
·
F32
·
U8
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for DarkFoot1001/QWENFINETUNED

Quantized
(91)
this model