
ImageDataModule

Lightning data module for image classification datasets.

Overview

ImageDataModule is a PyTorch Lightning data module that supports:

- Built-in torchvision datasets (CIFAR10, CIFAR100, MNIST, FashionMNIST)
- Custom `ImageFolder` datasets
- Torchvision and albumentations transform backends
- Automatic validation splits
- Balanced sampling for imbalanced datasets

API Reference

autotimm.ImageDataModule

Bases: LightningDataModule

Lightning data module for image classification.

Supports three modes:

1. **Folder mode** -- point `data_dir` at a directory with `train/`, `val/`, and optionally `test/` subdirectories, each containing one sub-folder per class (ImageFolder layout).
2. **Built-in dataset mode** -- set `dataset_name` to a torchvision dataset (`"CIFAR10"`, `"CIFAR100"`, `"FashionMNIST"`, `"MNIST"`) and `data_dir` to the download root.
3. **CSV mode** -- provide `train_csv` pointing to a CSV file with `image_path,label` columns. Optionally provide `val_csv` and `test_csv` for separate splits.
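The three modes are mutually exclusive; when several arguments are given, the precedence follows the dispatch in `setup()`: CSV mode wins over built-in dataset mode, which wins over folder mode. A stdlib-only sketch of that selection (the helper name is illustrative, not part of the API):

```python
# Mirrors the mode dispatch in ImageDataModule.setup():
# CSV mode > built-in dataset mode > folder mode.
BUILTIN_DATASETS = {"CIFAR10", "CIFAR100", "FashionMNIST", "MNIST"}

def resolve_mode(train_csv=None, dataset_name=None):
    if train_csv is not None:
        return "csv"
    if dataset_name in BUILTIN_DATASETS:
        return "builtin"
    return "folder"

print(resolve_mode(train_csv="train.csv"))   # csv
print(resolve_mode(dataset_name="CIFAR10"))  # builtin
print(resolve_mode())                        # folder
```

Note that an unrecognized `dataset_name` falls through to folder mode rather than raising, matching the `setup()` source below.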

Parameters:

| Name | Type | Description | Default |
| --- | --- | --- | --- |
| `data_dir` | `str \| Path` | Root directory for image data or download root. | `'./data'` |
| `dataset_name` | `str \| None` | Optional name of a torchvision dataset class. | `None` |
| `train_csv` | `str \| Path \| None` | Path to training CSV file (CSV mode). | `None` |
| `val_csv` | `str \| Path \| None` | Path to validation CSV file (CSV mode). | `None` |
| `test_csv` | `str \| Path \| None` | Path to test CSV file (CSV mode). | `None` |
| `image_dir` | `str \| Path \| None` | Root directory for resolving image paths in CSV mode. Defaults to the parent directory of `train_csv`. | `None` |
| `image_column` | `str \| None` | Name of the CSV column containing image paths. | `None` |
| `label_column` | `str \| None` | Name of the CSV column containing class labels. | `None` |
| `image_size` | `int` | Target image size (square). | `224` |
| `batch_size` | `int` | Batch size for all dataloaders. | `32` |
| `num_workers` | `int` | Number of data-loading workers; capped at 4 by default. | `min(cpu_count() or 4, 4)` |
| `val_split` | `float` | Fraction of training data used for validation when no explicit val set exists. | `0.1` |
| `train_transforms` | `Callable \| None` | Custom training transforms; defaults used if `None`. Mutually exclusive with `augmentation_preset`. | `None` |
| `eval_transforms` | `Callable \| None` | Custom eval transforms; defaults used if `None`. | `None` |
| `augmentation_preset` | `str \| None` | Name of a built-in augmentation preset. For torchvision: `"default"`, `"autoaugment"`, `"randaugment"`, `"trivialaugment"`. For albumentations: `"default"`, `"strong"`. Ignored when `train_transforms` is provided. | `None` |
| `transform_backend` | `str` | `"torchvision"` (PIL-based) or `"albumentations"` (OpenCV-based). When `"albumentations"` is selected, folder-mode datasets load images with OpenCV and built-in datasets convert PIL images to numpy for the augmentation pipeline. | `'torchvision'` |
| `transform_config` | `TransformConfig \| None` | Optional `TransformConfig` for unified transform configuration. When provided along with `backbone`, uses model-specific normalization from timm. Takes precedence over individual transform args. | `None` |
| `backbone` | `str \| Module \| None` | Optional backbone name or module. Used with `transform_config` to resolve model-specific normalization (mean, std, input_size). | `None` |
| `pin_memory` | `bool` | Pin memory for GPU transfer. | `True` |
| `persistent_workers` | `bool` | Keep worker processes alive between epochs. Reduces overhead when `num_workers > 0`. | `False` |
| `prefetch_factor` | `int \| None` | Number of batches prefetched per worker. | `None` |
| `balanced_sampling` | `bool` | Use `WeightedRandomSampler` to counter class imbalance in the training set. | `False` |
Source code in src/autotimm/data/datamodule.py
class ImageDataModule(pl.LightningDataModule):
    """Lightning data module for image classification.

    Supports three modes:

    1. **Folder mode** -- point ``data_dir`` at a directory with ``train/``,
       ``val/``, and optionally ``test/`` subdirectories, each containing one
       sub-folder per class (ImageFolder layout).
    2. **Built-in dataset mode** -- set ``dataset_name`` to a torchvision
       dataset (``"CIFAR10"``, ``"CIFAR100"``, ``"FashionMNIST"``, ``"MNIST"``)
       and ``data_dir`` to the download root.
    3. **CSV mode** -- provide ``train_csv`` pointing to a CSV file with
       ``image_path,label`` columns. Optionally provide ``val_csv`` and
       ``test_csv`` for separate splits.

    Parameters:
        data_dir: Root directory for image data or download root.
        dataset_name: Optional name of a torchvision dataset class.
        train_csv: Path to training CSV file (CSV mode).
        val_csv: Path to validation CSV file (CSV mode).
        test_csv: Path to test CSV file (CSV mode).
        image_dir: Root directory for resolving image paths in CSV mode.
            Defaults to the parent directory of ``train_csv``.
        image_column: Name of the CSV column containing image paths.
        label_column: Name of the CSV column containing class labels.
        image_size: Target image size (square).
        batch_size: Batch size for all dataloaders.
        num_workers: Number of data-loading workers. Defaults to
            ``min(os.cpu_count() or 4, 4)``.
        val_split: Fraction of training data used for validation when
            no explicit val set exists.
        train_transforms: Custom training transforms; defaults used if None.
            Mutually exclusive with ``augmentation_preset``.
        eval_transforms: Custom eval transforms; defaults used if None.
        augmentation_preset: Name of a built-in augmentation preset.
            For ``torchvision``: ``"default"``, ``"autoaugment"``,
            ``"randaugment"``, ``"trivialaugment"``.
            For ``albumentations``: ``"default"``, ``"strong"``.
            Ignored when ``train_transforms`` is provided.
        transform_backend: ``"torchvision"`` (PIL-based) or
            ``"albumentations"`` (OpenCV-based). Defaults to
            ``"torchvision"``. When ``"albumentations"`` is selected,
            folder-mode datasets load images with OpenCV and built-in
            datasets convert PIL images to numpy for the augmentation
            pipeline.
        transform_config: Optional :class:`TransformConfig` for unified transform
            configuration. When provided along with ``backbone``, uses model-specific
            normalization from timm. Takes precedence over individual transform args.
        backbone: Optional backbone name or module. Used with ``transform_config``
            to resolve model-specific normalization (mean, std, input_size).
        pin_memory: Pin memory for GPU transfer.
        persistent_workers: Keep worker processes alive between epochs.
            Reduces overhead when ``num_workers > 0``.
        prefetch_factor: Number of batches prefetched per worker.
        balanced_sampling: Use ``WeightedRandomSampler`` to counter
            class imbalance in the training set.
    """

    BUILTIN_DATASETS: dict[str, type] = {
        "CIFAR10": datasets.CIFAR10,
        "CIFAR100": datasets.CIFAR100,
        "FashionMNIST": datasets.FashionMNIST,
        "MNIST": datasets.MNIST,
    }

    def __init__(
        self,
        data_dir: str | Path = "./data",
        dataset_name: str | None = None,
        train_csv: str | Path | None = None,
        val_csv: str | Path | None = None,
        test_csv: str | Path | None = None,
        image_dir: str | Path | None = None,
        image_column: str | None = None,
        label_column: str | None = None,
        image_size: int = 224,
        batch_size: int = 32,
        num_workers: int = min(os.cpu_count() or 4, 4),
        val_split: float = 0.1,
        train_transforms: Callable | None = None,
        eval_transforms: Callable | None = None,
        augmentation_preset: str | None = None,
        transform_backend: str = "torchvision",
        transform_config: TransformConfig | None = None,
        backbone: str | nn.Module | None = None,
        pin_memory: bool = True,
        persistent_workers: bool = False,
        prefetch_factor: int | None = None,
        balanced_sampling: bool = False,
    ):
        super().__init__()
        self.save_hyperparameters(ignore=["backbone"])
        self.data_dir = Path(data_dir)
        self.dataset_name = dataset_name
        self.train_csv = Path(train_csv) if train_csv else None
        self.val_csv = Path(val_csv) if val_csv else None
        self.test_csv = Path(test_csv) if test_csv else None
        self.image_dir = Path(image_dir) if image_dir else None
        self.image_column = image_column
        self.label_column = label_column
        self.image_size = image_size
        self.batch_size = batch_size
        self.num_workers = num_workers
        self.val_split = val_split
        self.pin_memory = pin_memory
        self.persistent_workers = persistent_workers and num_workers > 0
        self.prefetch_factor = prefetch_factor
        self.balanced_sampling = balanced_sampling

        if transform_backend not in ("torchvision", "albumentations"):
            raise ValueError(
                f"Unknown transform_backend '{transform_backend}'. "
                f"Choose from: torchvision, albumentations."
            )
        self.transform_backend = transform_backend
        self.transform_config = transform_config
        self.backbone = backbone

        # Resolve transforms - TransformConfig takes precedence
        if transform_config is not None and backbone is not None:
            from autotimm.data.timm_transforms import get_transforms_from_backbone

            self.train_transforms = get_transforms_from_backbone(
                backbone=backbone,
                transform_config=transform_config,
                is_train=True,
                task="classification",
            )
            self.eval_transforms = get_transforms_from_backbone(
                backbone=backbone,
                transform_config=transform_config,
                is_train=False,
                task="classification",
            )
        elif train_transforms is not None:
            self.train_transforms = train_transforms
        elif augmentation_preset is not None:
            self.train_transforms = get_train_transforms(
                augmentation_preset,
                backend=transform_backend,
                image_size=image_size,
            )
        else:
            self.train_transforms = self._default_train_transforms()

        if eval_transforms is not None:
            self.eval_transforms = eval_transforms
        else:
            self.eval_transforms = self._default_eval_transforms()

        self.train_dataset = None
        self.val_dataset = None
        self.test_dataset = None
        self.num_classes: int | None = None
        self.class_names: list[str] | None = None
        self._train_targets: list[int] | None = None

    def _default_train_transforms(self) -> Callable:
        if self.transform_backend == "albumentations":
            return albu_default_train_transforms(self.image_size)
        return default_train_transforms(self.image_size)

    def _default_eval_transforms(self) -> Callable:
        if self.transform_backend == "albumentations":
            return albu_default_eval_transforms(self.image_size)
        return default_eval_transforms(self.image_size)

    def prepare_data(self) -> None:
        if self.dataset_name and self.dataset_name in self.BUILTIN_DATASETS:
            cls = self.BUILTIN_DATASETS[self.dataset_name]
            cls(str(self.data_dir), train=True, download=True)
            cls(str(self.data_dir), train=False, download=True)

    def setup(self, stage: str | None = None) -> None:
        if self.train_csv is not None:
            self._setup_csv(stage)
        elif self.dataset_name and self.dataset_name in self.BUILTIN_DATASETS:
            self._setup_builtin(stage)
        elif self.transform_backend == "albumentations":
            self._setup_folder_cv2(stage)
        else:
            self._setup_folder(stage)

    def _setup_builtin(self, stage: str | None) -> None:
        cls = self.BUILTIN_DATASETS[self.dataset_name]

        if self.transform_backend == "albumentations":
            wrapper_train = _AlbumentationsBuiltinWrapper(self.train_transforms)
            wrapper_eval = _AlbumentationsBuiltinWrapper(self.eval_transforms)
        else:
            wrapper_train = self.train_transforms
            wrapper_eval = self.eval_transforms

        if stage in ("fit", None):
            full_train = cls(str(self.data_dir), train=True, transform=wrapper_train)
            n_val = int(len(full_train) * self.val_split)
            n_train = len(full_train) - n_val
            self.train_dataset, self.val_dataset = random_split(
                full_train, [n_train, n_val]
            )
            self.num_classes = (
                len(full_train.classes) if hasattr(full_train, "classes") else 10
            )
            self.class_names = (
                list(full_train.classes) if hasattr(full_train, "classes") else None
            )
            self._train_targets = [
                full_train.targets[i] for i in self.train_dataset.indices
            ]
        if stage in ("test", None):
            self.test_dataset = cls(
                str(self.data_dir), train=False, transform=wrapper_eval
            )

    def _setup_folder(self, stage: str | None) -> None:
        train_dir = self.data_dir / "train"
        val_dir = self.data_dir / "val"
        test_dir = self.data_dir / "test"

        if stage in ("fit", None):
            self.train_dataset = datasets.ImageFolder(
                str(train_dir), transform=self.train_transforms
            )
            self.num_classes = len(self.train_dataset.classes)
            self.class_names = list(self.train_dataset.classes)
            self._train_targets = [s[1] for s in self.train_dataset.samples]

            if val_dir.exists():
                self.val_dataset = datasets.ImageFolder(
                    str(val_dir), transform=self.eval_transforms
                )
            else:
                n_val = int(len(self.train_dataset) * self.val_split)
                n_train = len(self.train_dataset) - n_val
                self.train_dataset, self.val_dataset = random_split(
                    self.train_dataset, [n_train, n_val]
                )
        if stage in ("test", None) and test_dir.exists():
            self.test_dataset = datasets.ImageFolder(
                str(test_dir), transform=self.eval_transforms
            )

    def _setup_folder_cv2(self, stage: str | None) -> None:
        from autotimm.data.dataset import ImageFolderCV2

        train_dir = self.data_dir / "train"
        val_dir = self.data_dir / "val"
        test_dir = self.data_dir / "test"

        if stage in ("fit", None):
            self.train_dataset = ImageFolderCV2(
                str(train_dir), transform=self.train_transforms
            )
            self.num_classes = len(self.train_dataset.classes)
            self.class_names = list(self.train_dataset.classes)
            self._train_targets = [s[1] for s in self.train_dataset.samples]

            if val_dir.exists():
                self.val_dataset = ImageFolderCV2(
                    str(val_dir), transform=self.eval_transforms
                )
            else:
                n_val = int(len(self.train_dataset) * self.val_split)
                n_train = len(self.train_dataset) - n_val
                self.train_dataset, self.val_dataset = random_split(
                    self.train_dataset, [n_train, n_val]
                )
        if stage in ("test", None) and test_dir.exists():
            self.test_dataset = ImageFolderCV2(
                str(test_dir), transform=self.eval_transforms
            )

    def _setup_csv(self, stage: str | None) -> None:
        from autotimm.data.dataset import CSVImageDataset

        use_albu = self.transform_backend == "albumentations"
        # Default image_dir to parent of train_csv
        img_dir = self.image_dir or self.train_csv.parent

        if stage in ("fit", None):
            self.train_dataset = CSVImageDataset(
                csv_path=self.train_csv,
                image_dir=img_dir,
                image_column=self.image_column,
                label_column=self.label_column,
                transform=self.train_transforms,
                use_albumentations=use_albu,
            )
            self.num_classes = self.train_dataset.num_classes
            self.class_names = list(self.train_dataset.classes)
            self._train_targets = [s[1] for s in self.train_dataset.samples]

            if self.val_csv is not None:
                self.val_dataset = CSVImageDataset(
                    csv_path=self.val_csv,
                    image_dir=img_dir,
                    image_column=self.image_column,
                    label_column=self.label_column,
                    transform=self.eval_transforms,
                    use_albumentations=use_albu,
                )
            else:
                n_val = int(len(self.train_dataset) * self.val_split)
                n_train = len(self.train_dataset) - n_val
                self.train_dataset, self.val_dataset = random_split(
                    self.train_dataset, [n_train, n_val]
                )

        if stage in ("test", None) and self.test_csv is not None:
            self.test_dataset = CSVImageDataset(
                csv_path=self.test_csv,
                image_dir=img_dir,
                image_column=self.image_column,
                label_column=self.label_column,
                transform=self.eval_transforms,
                use_albumentations=use_albu,
            )

    def _make_sampler(self) -> WeightedRandomSampler | None:
        if not self.balanced_sampling or self._train_targets is None:
            return None

        counts = Counter(self._train_targets)
        weight_per_class = {cls: 1.0 / cnt for cls, cnt in counts.items()}
        sample_weights = [weight_per_class[t] for t in self._train_targets]
        return WeightedRandomSampler(
            weights=sample_weights,
            num_samples=len(sample_weights),
            replacement=True,
        )

    def _loader_kwargs(self) -> dict:
        kwargs: dict = {
            "batch_size": self.batch_size,
            "num_workers": self.num_workers,
            "pin_memory": self.pin_memory,
            "persistent_workers": self.persistent_workers,
        }
        if self.prefetch_factor is not None and self.num_workers > 0:
            kwargs["prefetch_factor"] = self.prefetch_factor
        return kwargs

    def train_dataloader(self) -> DataLoader:
        sampler = self._make_sampler()
        return DataLoader(
            self.train_dataset,
            shuffle=sampler is None,
            sampler=sampler,
            **self._loader_kwargs(),
        )

    def val_dataloader(self) -> DataLoader:
        return DataLoader(
            self.val_dataset,
            shuffle=False,
            **self._loader_kwargs(),
        )

    def test_dataloader(self) -> DataLoader:
        if self.test_dataset is None:
            raise RuntimeError(
                "No test split found. Provide a 'test/' directory or use a built-in dataset."
            )
        return DataLoader(
            self.test_dataset,
            shuffle=False,
            **self._loader_kwargs(),
        )

__init__

```python
__init__(
    data_dir: str | Path = './data',
    dataset_name: str | None = None,
    train_csv: str | Path | None = None,
    val_csv: str | Path | None = None,
    test_csv: str | Path | None = None,
    image_dir: str | Path | None = None,
    image_column: str | None = None,
    label_column: str | None = None,
    image_size: int = 224,
    batch_size: int = 32,
    num_workers: int = min(os.cpu_count() or 4, 4),
    val_split: float = 0.1,
    train_transforms: Callable | None = None,
    eval_transforms: Callable | None = None,
    augmentation_preset: str | None = None,
    transform_backend: str = 'torchvision',
    transform_config: TransformConfig | None = None,
    backbone: str | Module | None = None,
    pin_memory: bool = True,
    persistent_workers: bool = False,
    prefetch_factor: int | None = None,
    balanced_sampling: bool = False,
)
```

prepare_data

```python
prepare_data() -> None
```

setup

```python
setup(stage: str | None = None) -> None
```

train_dataloader

```python
train_dataloader() -> DataLoader
```

val_dataloader

```python
val_dataloader() -> DataLoader
```

test_dataloader

```python
test_dataloader() -> DataLoader
```

Usage Examples

Built-in Dataset

```python
from autotimm import ImageDataModule

data = ImageDataModule(
    data_dir="./data",
    dataset_name="CIFAR10",
    image_size=224,
    batch_size=64,
)
```

Custom Folder Dataset

```python
data = ImageDataModule(
    data_dir="./my_dataset",
    image_size=384,
    batch_size=32,
)
data.setup("fit")
print(f"Classes: {data.num_classes}")
print(f"Class names: {data.class_names}")
```

With Albumentations

```python
data = ImageDataModule(
    data_dir="./data",
    dataset_name="CIFAR10",
    transform_backend="albumentations",
    augmentation_preset="strong",
)
```

With Augmentation Preset

```python
data = ImageDataModule(
    data_dir="./data",
    dataset_name="CIFAR10",
    augmentation_preset="randaugment",
)
```

With Custom Transforms

```python
from torchvision import transforms

custom_train = transforms.Compose([
    transforms.RandomResizedCrop(224),
    transforms.RandAugment(),
    transforms.ToTensor(),
    transforms.Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225]),
])

data = ImageDataModule(
    data_dir="./dataset",
    train_transforms=custom_train,
)
```

With Balanced Sampling

```python
data = ImageDataModule(
    data_dir="./imbalanced_dataset",
    balanced_sampling=True,
)
```
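Under the hood, each training sample is weighted by the inverse frequency of its class before the weights are handed to torch's `WeightedRandomSampler` (see `_make_sampler` in the source above). The weight computation itself is plain Python and can be sketched standalone (the helper name is illustrative):

```python
from collections import Counter

def sample_weights(targets):
    """Inverse-frequency weight per sample, as used for balanced sampling."""
    counts = Counter(targets)
    weight_per_class = {cls: 1.0 / cnt for cls, cnt in counts.items()}
    return [weight_per_class[t] for t in targets]

# 3:1 imbalanced toy labels: each majority sample weighs 1/3,
# the single minority sample weighs 1.0, so classes are drawn
# with equal total probability under replacement sampling.
print(sample_weights([0, 0, 0, 1]))
```

Because the sampler draws with replacement, minority-class images repeat within an epoch; the epoch length stays equal to the dataset size.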

Performance Optimization

```python
data = ImageDataModule(
    data_dir="./dataset",
    batch_size=64,
    num_workers=8,
    pin_memory=True,
    persistent_workers=True,
    prefetch_factor=4,
)
```
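Note that `persistent_workers` and `prefetch_factor` only take effect when `num_workers > 0`; PyTorch's `DataLoader` rejects a non-default `prefetch_factor` for single-process loading. A sketch of the kwarg assembly, mirroring the module's `_loader_kwargs` (the standalone function here is illustrative):

```python
def loader_kwargs(batch_size, num_workers, pin_memory=True,
                  persistent_workers=False, prefetch_factor=None):
    # Worker-only options are gated on num_workers: DataLoader raises if
    # prefetch_factor is set with num_workers == 0, and persistent_workers
    # is meaningless without worker processes.
    kwargs = {
        "batch_size": batch_size,
        "num_workers": num_workers,
        "pin_memory": pin_memory,
        "persistent_workers": persistent_workers and num_workers > 0,
    }
    if prefetch_factor is not None and num_workers > 0:
        kwargs["prefetch_factor"] = prefetch_factor
    return kwargs

print(loader_kwargs(64, 0, prefetch_factor=4))  # prefetch_factor omitted
```

This is why the same configuration works unchanged for quick CPU debugging runs with `num_workers=0`.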

With TransformConfig (Model-Specific Normalization)

Use TransformConfig with a backbone to get model-specific normalization:

```python
from autotimm import ImageDataModule, TransformConfig

# Create shared config
config = TransformConfig(
    preset="randaugment",
    image_size=384,
    use_timm_config=True,  # Use model's pretrained mean/std
)

data = ImageDataModule(
    data_dir="./dataset",
    transform_config=config,
    backbone="efficientnet_b4",  # Required for model-specific normalization
)
```

Shared Config Between Model and Data

```python
from autotimm import ImageClassifier, ImageDataModule, TransformConfig, MetricConfig

# Shared config ensures same preprocessing
config = TransformConfig(preset="randaugment", image_size=384)
backbone_name = "efficientnet_b4"

# DataModule uses model's normalization
data = ImageDataModule(
    data_dir="./data",
    dataset_name="CIFAR10",
    transform_config=config,
    backbone=backbone_name,
)
data.setup("fit")

# Model uses same config for inference preprocessing
metrics = [MetricConfig(
    name="accuracy",
    backend="torchmetrics",
    metric_class="Accuracy",
    params={"task": "multiclass"},
    stages=["val"],
)]

model = ImageClassifier(
    backbone=backbone_name,
    num_classes=data.num_classes,
    metrics=metrics,
    transform_config=config,
)
```

Parameters

| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| `data_dir` | `str \| Path` | `"./data"` | Root directory |
| `dataset_name` | `str \| None` | `None` | Built-in dataset name |
| `image_size` | `int` | `224` | Target image size |
| `batch_size` | `int` | `32` | Batch size |
| `num_workers` | `int` | `min(cpu_count() or 4, 4)` | Data-loading workers |
| `val_split` | `float` | `0.1` | Validation split fraction |
| `train_transforms` | `Callable \| None` | `None` | Custom train transforms |
| `eval_transforms` | `Callable \| None` | `None` | Custom eval transforms |
| `augmentation_preset` | `str \| None` | `None` | Preset name |
| `transform_backend` | `str` | `"torchvision"` | `"torchvision"` or `"albumentations"` |
| `transform_config` | `TransformConfig \| None` | `None` | Unified transform configuration |
| `backbone` | `str \| nn.Module \| None` | `None` | Backbone for model-specific normalization |
| `pin_memory` | `bool` | `True` | Pin memory for GPU |
| `persistent_workers` | `bool` | `False` | Keep workers alive |
| `prefetch_factor` | `int \| None` | `None` | Batches prefetched per worker |
| `balanced_sampling` | `bool` | `False` | Weighted sampling |

Attributes

| Attribute | Type | Description |
| --- | --- | --- |
| `num_classes` | `int \| None` | Number of classes (after `setup`) |
| `class_names` | `list[str] \| None` | Class names (after `setup`) |
| `train_dataset` | `Dataset \| None` | Training dataset (after `setup`) |
| `val_dataset` | `Dataset \| None` | Validation dataset (after `setup`) |
| `test_dataset` | `Dataset \| None` | Test dataset (after `setup`) |

Built-in Datasets

| Name | Classes | Image Size |
| --- | --- | --- |
| CIFAR10 | 10 | 32x32 |
| CIFAR100 | 100 | 32x32 |
| MNIST | 10 | 28x28 |
| FashionMNIST | 10 | 28x28 |

Augmentation Presets

Torchvision

| Preset | Description |
| --- | --- |
| `default` | RandomResizedCrop, HorizontalFlip, ColorJitter |
| `autoaugment` | AutoAugment (ImageNet policy) |
| `randaugment` | RandAugment (2 ops, magnitude 9) |
| `trivialaugment` | TrivialAugmentWide |

Albumentations

| Preset | Description |
| --- | --- |
| `default` | RandomResizedCrop, HorizontalFlip, ColorJitter |
| `strong` | Affine, blur/noise, ColorJitter, CoarseDropout |

Folder Structure

```
dataset/
├── train/
│   ├── class_a/
│   │   ├── img1.jpg
│   │   └── img2.jpg
│   └── class_b/
│       └── img3.jpg
├── val/           # Optional (uses val_split if missing)
│   ├── class_a/
│   │   └── img4.jpg
│   └── class_b/
│       └── img5.jpg
└── test/          # Optional
    ├── class_a/
    │   └── img6.jpg
    └── class_b/
        └── img7.jpg
```
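Class discovery in folder mode follows the `ImageFolder` convention: each sub-directory of `train/` is one class, and classes are sorted alphabetically to fix their indices. A stdlib sketch against a temporary copy of this layout (the helper is illustrative, not the library's implementation):

```python
import tempfile
from pathlib import Path

def discover_classes(train_dir):
    """Each sub-directory of train/ is one class; sorting fixes the indices."""
    classes = sorted(p.name for p in Path(train_dir).iterdir() if p.is_dir())
    return {name: idx for idx, name in enumerate(classes)}

with tempfile.TemporaryDirectory() as root:
    # Create the directories out of order to show that sorting decides indices.
    for cls in ("class_b", "class_a"):
        (Path(root) / "train" / cls).mkdir(parents=True)
    print(discover_classes(Path(root) / "train"))
    # {'class_a': 0, 'class_b': 1}
```

Because indices come from sorted names, renaming or adding a class folder shifts the mapping, so keep `train/`, `val/`, and `test/` class folders consistent.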

CSV Data Loading

For CSV-based data loading, ImageDataModule supports train_csv, val_csv, and test_csv parameters for single-label classification.

For multi-label classification from CSV files, see MultiLabelImageDataModule.

For direct CSV dataset usage (without DataModules), see the CSV Data Loading API documentation.

CSV Classification Example

from autotimm import ImageDataModule

```python
data = ImageDataModule(
    train_csv="train.csv",
    val_csv="val.csv",
    image_dir="./images",
    image_size=224,
    batch_size=32,
)
```

CSV Format:

```csv
image_path,label
img001.jpg,cat
img002.jpg,dog
```
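The rows above map to (image path, class index) samples, with one index assigned per distinct string label. A stdlib sketch of that parsing (sorted-order label indexing is an assumption about `CSVImageDataset`, and the helper name is illustrative):

```python
import csv
import io

CSV_TEXT = """image_path,label
img001.jpg,cat
img002.jpg,dog
"""

def read_samples(fh, image_column="image_path", label_column="label"):
    # Parse the CSV, collect the distinct labels, and assign each a
    # stable integer index in sorted order.
    rows = list(csv.DictReader(fh))
    classes = sorted({row[label_column] for row in rows})
    class_to_idx = {name: i for i, name in enumerate(classes)}
    samples = [(row[image_column], class_to_idx[row[label_column]])
               for row in rows]
    return samples, classes

samples, classes = read_samples(io.StringIO(CSV_TEXT))
print(samples)   # [('img001.jpg', 0), ('img002.jpg', 1)]
print(classes)   # ['cat', 'dog']
```

Relative paths in `image_path` are resolved against `image_dir` (or the parent directory of `train_csv` when `image_dir` is not given).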

See CSV Data API for detailed CSV format specification.

See Also