DOCS
- Data Recipe Gallery
- 1. Data-Juicer Minimal Example Recipe
- 2. Reproduce Open Source Text Datasets
- 3. Improved Open Source Pre-training Text Datasets
- 4. Improved Open Source Post-tuning Text Dataset
- 5. Synthetic Contrastive Learning Image-text Datasets
- 6. Improved Open Source Image-text Datasets
- 7. Basic Example Recipes for Video Data
- 8. Synthesize Human-centric Video Benchmarks
- 9. Improve Existing Open Source Video Datasets
- Refine Alpaca-CoT Config Files
- Notification System
- BLOOM Config Files
- RedPajama Config Files