parq_tools.utils.progress.get_batch_progress_bar

parq_tools.utils.progress.get_batch_progress_bar(datasets, batch_size, desc=None, disable=False)

Utility to create a tqdm progress bar for batch processing of pyarrow datasets.

Parameters:
  • datasets (Sequence) – Sequence of datasets; each must provide a .count_rows() method.

  • batch_size (int) – Number of rows per batch.

  • desc (str, optional) – Description for the progress bar.

  • disable (bool, optional) – If True, disables the progress bar.

Returns:

A tqdm progress bar instance, or a dummy stand-in if tqdm is not installed.

Return type:

tqdm
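
Example:

The following is a minimal sketch of how a helper with this signature might behave. It is not the parq_tools implementation. It assumes the bar's total is the number of batches, computed as the sum of ceil(count_rows / batch_size) over the datasets; the real function may count rows instead. The names FakeDataset, DummyBar, total_batches, and sketch_batch_progress_bar are hypothetical and exist only for illustration. FakeDataset stands in for a pyarrow dataset so that the sketch runs without pyarrow.

```python
import math

try:
    from tqdm import tqdm
except ImportError:  # tqdm is optional in this sketch
    tqdm = None


class DummyBar:
    """Hypothetical no-op stand-in used when tqdm is not installed."""

    def __init__(self, total=None, desc=None, disable=False):
        self.total = total
        self.desc = desc
        self.disable = disable

    def update(self, n=1):
        pass

    def close(self):
        pass


class FakeDataset:
    """Hypothetical stand-in for any object exposing .count_rows()."""

    def __init__(self, n_rows):
        self._n_rows = n_rows

    def count_rows(self):
        return self._n_rows


def total_batches(datasets, batch_size):
    """Assumed total: batches per dataset, rounded up, summed over datasets."""
    if batch_size <= 0:
        raise ValueError("batch_size must be a positive integer")
    return sum(math.ceil(ds.count_rows() / batch_size) for ds in datasets)


def sketch_batch_progress_bar(datasets, batch_size, desc=None, disable=False):
    """Return a tqdm bar sized to the assumed batch count, or a DummyBar."""
    total = total_batches(datasets, batch_size)
    if tqdm is None:
        return DummyBar(total=total, desc=desc, disable=disable)
    return tqdm(total=total, desc=desc, disable=disable)


# Usage: advance the bar once per processed batch, then close it.
datasets = [FakeDataset(250), FakeDataset(100)]
batch_size = 100
bar = sketch_batch_progress_bar(datasets, batch_size, desc="Processing", disable=True)
for ds in datasets:
    for _ in range(math.ceil(ds.count_rows() / batch_size)):
        bar.update(1)
bar.close()
```

With 250 and 100 rows and a batch size of 100, this sketch sizes the bar to 3 + 1 = 4 batches. Because each dataset is batched separately, a partial final batch in one dataset still counts as a full step.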