Data-Juicer-Sandbox#
💡 ICML 2025 Spotlight (Top 2.6% of all submissions)
A Feedback-Driven Suite for Multimodal Data-Model Co-development.
Documentation#
Detailed documentation can be found here.
Reference#
@inproceedings{chendata,
title={Data-Juicer Sandbox: A Feedback-Driven Suite for Multimodal Data-Model Co-development},
author={Chen, Daoyuan and Wang, Haibin and Huang, Yilun and Ge, Ce and Li, Yaliang and Ding, Bolin and Zhou, Jingren},
booktitle={Forty-second International Conference on Machine Learning},
year={2025}
}
- API
- data_juicer_sandbox
- data_juicer_sandbox.context_infos module
- data_juicer_sandbox.data_pool_manipulators module
- data_juicer_sandbox.env_manager module
- data_juicer_sandbox.evaluators module
- data_juicer_sandbox.factories module
- data_juicer_sandbox.helper_funcs module
- data_juicer_sandbox.hooks module
- data_juicer_sandbox.model_executors module
- data_juicer_sandbox.pipelines module
- data_juicer_sandbox.utils module
- data_juicer_sandbox