data_juicer.utils.job.common module#
DataJuicer Job Utilities - Common Functions
Shared utilities for job stopping and monitoring operations.
- class data_juicer.utils.job.common.JobUtils(job_id: str, work_dir: str = None, base_dir: str = None)[source]#
Bases:
objectCommon utilities for DataJuicer job operations.
- __init__(job_id: str, work_dir: str = None, base_dir: str = None)[source]#
Initialize job utilities.
- Parameters:
job_id – The job ID to work with
work_dir – Work directory that already includes job_id (preferred)
base_dir – Base directory containing job outputs (deprecated, use work_dir instead)
- extract_process_thread_ids() Dict[str, Set[int]][source]#
Extract process and thread IDs from event logs. Returns a dict with ‘process_ids’ and ‘thread_ids’ sets.
- find_processes_by_ids(process_ids: Set[int]) List[Process][source]#
Find running processes by their PIDs.