DOCS
Tutorial
docs
- Operator Schemas (Operator Overview)
- Dataset Configuration Guide
- “Bad” Data Exhibition
- Cache Management
- DJ-SORA
- DJ_service
- How-to Guide for Developers
- Distributed Data Processing in Data-Juicer
- Dataset Export
- Job Management
- Partitioned Processing with Checkpointing
- Data Tracing
- Awesome Data-Model Co-Development of MLLMs
tools
- Distributed Fuzzy Deduplication Tools
- Auto Evaluation Toolkit
- GPT EVAL: Evaluate your model with OpenAI API
- Evaluation Results Recorder
- Format Conversion Tools
- Multimodal Tools
- Post Tuning Tools
- Label Studio Service Utility
- Metrics for video generation
- VBench metrics
- Postprocess tools
- Preprocess Tools