Skip to main content
Back to top
Ctrl
+
K
Data Juicer
DOCS
API
Sandbox
Hub
Agents
GitHub
🌐
en
English
简体中文
📦
v1.4.3
main
v1.4.4
v1.4.3
v1.4.2
v1.4.1
v1.4.0
DOCS
API
Sandbox
Hub
Agents
GitHub
🌐
en
English
简体中文
📦
v1.4.3
main
v1.4.4
v1.4.3
v1.4.2
v1.4.1
v1.4.0
Section Navigation
Tutorial
DJ-Cookbook
Installation Guide
Quick Start
docs
Operator Schemas 算子提要
Dataset Configuration Guide
“Bad” Data Exhibition
DJ-SORA
DataJuicer-Agent
DJ_service
How-to Guide for Developers
Distributed Data Processing in Data-Juicer
Data Recipe Gallery
Sandbox
Awesome Data-Model Co-Development of MLLMs
operators
Aggregator
Deduplicator
Filter
Mapper
Formatter
Grouper
key_value_grouper
naive_grouper
naive_reverse_grouper
Selector
Op
demos
Demos
Note for dataset path
tools
Distributed Fuzzy Deduplication Tools
Auto Evaluation Toolkit
GPT EVAL: Evaluate your model with OpenAI API
Evaluation Results Recorder
Format Conversion Tools
Multimodal Tools
Post Tuning Tools
Hyper-parameter Optimization for Data Recipe
Label Studio Service Utility
Metrics for video generation
VBench metrics
Postprocess tools
Preprocess Tools
Data Scoring
thirdparty
LLM Ecosystems
Third-party Model Library
DOCS
Grouper
Grouper
#
key_value_grouper
naive_grouper
naive_reverse_grouper
This Page
Show Source