Skip to main content
Back to top
Ctrl
+
K
Data Juicer
DOCS
API
Sandbox
Hub
Agents
GitHub
🌐
en
English
简体中文
📦
v1.4.4
main
v1.4.4
v1.4.3
v1.4.2
v1.4.1
v1.4.0
DOCS
API
Sandbox
Hub
Agents
GitHub
🌐
en
English
简体中文
📦
v1.4.4
main
v1.4.4
v1.4.3
v1.4.2
v1.4.1
v1.4.0
Section Navigation
Tutorial
DJ-Cookbook
Installation Guide
Quick Start
docs
Operator Schemas 算子提要
Dataset Configuration Guide
“Bad” Data Exhibition
DJ-SORA
DataJuicer Agents
DJ_service
How-to Guide for Developers
Distributed Data Processing in Data-Juicer
Sandbox
Awesome Data-Model Co-Development of MLLMs
operators
Aggregator
Deduplicator
Filter
Mapper
Formatter
Grouper
key_value_grouper
naive_grouper
naive_reverse_grouper
Selector
Op
demos
Demos
Note for dataset path
tools
Distributed Fuzzy Deduplication Tools
Auto Evaluation Toolkit
GPT EVAL: Evaluate your model with OpenAI API
Evaluation Results Recorder
Format Conversion Tools
Multimodal Tools
Post Tuning Tools
Label Studio Service Utility
Metrics for video generation
VBench metrics
Postprocess tools
Preprocess Tools
thirdparty
LLM Ecosystems
Third-party Model Library
DOCS
Grouper
Grouper
#
key_value_grouper
naive_grouper
naive_reverse_grouper
This Page
Show Source