YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)
Logo

Dynamic Tool Orchestration for Iterative Visual Reasoning

Paper Docs Data & Model Homepage Demo Video

🔔 Important Note on Model Status

The models released on this page belong to the AdaReasoner-TC series and are not the final RL-fine-tuned models. They are trained using Tool Cold Start (TC) supervised fine-tuning only, and are intended for analysis, ablation, and reproducibility purposes.

For RL fine-tuned version, please refer to Data & models

📋 Model Description

AdaReasoner-7B is a vision-language model trained with dynamic tool orchestration capabilities for iterative visual reasoning.

AdaReasoner-TC series are trained through TC (Tool Cold Start) supervised fine-tuning only, without subsequent RL fine-tuning.

We provide three variants of AdaReasoner-TC-7B, each optimized for different use cases:

Model Description Hugging Face
AdaReasoner-TC-7B-Randomized Trained with the adaptive learning method, enabling strong generalization to unseen tools and tasks. Designed for open-ended and evolving tool environments where adaptability is required. 🤗 Link
AdaReasoner-TC-7B-Non-Randomized Trained without adaptive learning, providing more stable and reliable performance on known tools and tasks, but limited generalization to unseen tools or task settings. 🤗 Link

Key Differences:

  • Randomized: Trained with adaptive learning method, enabling zero-shot generalization to novel tools and task configurations
  • Non-Randomized: Trained without adaptive learning, offering more predictable behavior on familiar tools but lacking generalization

📊 Performance

Please refer to our paper for detailed benchmark results across multiple visual reasoning tasks.

📚 Citation

If you use this model in your research, please cite:

@article{adareasoner2024,
  title={Dynamic Tool Orchestration for Iterative Visual Reasoning},
  author={AdaReasoner Team},
  journal={arXiv preprint arXiv:XXXX.XXXXX},
  year={2024}
}

📄 License

Apache 2.0

🤝 Acknowledgments

This model is part of the AdaReasoner project. For more information, visit our GitHub repository.

📧 Contact

For questions and feedback, please open an issue in our GitHub repository.

Downloads last month
7
Safetensors
Model size
8B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for AdaReasoner/AdaReasoner-TC-7B-Randomized

Quantizations
2 models