Running Object Detection And Understanding With VLMs 📉 Object Detection & Understanding with VLMs ft. Qwen vs. Gemm