Data-Juicer-Sandbox#

💡 ICML 2025 Spotlight (Top 2.6% of all submissions)

A Feedback-Driven Suite for Multimodal Data-Model Co-development.

Documentation#

Detailed documentation can be found here.

Reference#

@inproceedings{chendata,
  title={Data-Juicer Sandbox: A Feedback-Driven Suite for Multimodal Data-Model Co-development},
  author={Chen, Daoyuan and Wang, Haibin and Huang, Yilun and Ge, Ce and Li, Yaliang and Ding, Bolin and Zhou, Jingren},
  booktitle={Forty-second International Conference on Machine Learning},
  year={2025}
}