YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Dynamic Tool Orchestration for Iterative Visual Reasoning

🔔 Important Note on Model Status

The models released on this page belong to the AdaReasoner-TC series and are not the final RL-fine-tuned models. They are trained using Tool Cold Start (TC) supervised fine-tuning only, and are intended for analysis, ablation, and reproducibility purposes.

For RL fine-tuned version, please refer to Data & models

📋 Model Description

AdaReasoner-7B is a vision-language model trained with dynamic tool orchestration capabilities for iterative visual reasoning.

AdaReasoner-TC series are trained through TC (Tool Cold Start) supervised fine-tuning only, without subsequent RL fine-tuning.

We provide three variants of AdaReasoner-TC-7B, each optimized for different use cases:

Model	Description	Hugging Face
AdaReasoner-TC-7B-Randomized	Trained with the adaptive learning method, enabling strong generalization to unseen tools and tasks. Designed for open-ended and evolving tool environments where adaptability is required.	🤗 Link
AdaReasoner-TC-7B-Non-Randomized	Trained without adaptive learning, providing more stable and reliable performance on known tools and tasks, but limited generalization to unseen tools or task settings.	🤗 Link

Key Differences:

Randomized: Trained with adaptive learning method, enabling zero-shot generalization to novel tools and task configurations
Non-Randomized: Trained without adaptive learning, offering more predictable behavior on familiar tools but lacking generalization

📊 Performance

Please refer to our paper for detailed benchmark results across multiple visual reasoning tasks.

📚 Citation

If you use this model in your research, please cite:

@article{adareasoner2024,
  title={Dynamic Tool Orchestration for Iterative Visual Reasoning},
  author={AdaReasoner Team},
  journal={arXiv preprint arXiv:XXXX.XXXXX},
  year={2024}
}

📄 License

Apache 2.0

🤝 Acknowledgments

This model is part of the AdaReasoner project. For more information, visit our GitHub repository.

📧 Contact

For questions and feedback, please open an issue in our GitHub repository.

Downloads last month: 7

Safetensors

Model size

8B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for AdaReasoner/AdaReasoner-TC-7B-Randomized

Quantizations

2 models