DetectionDataModule

Lightning data module for object detection datasets in COCO format.

Overview

DetectionDataModule is a PyTorch Lightning data module for object detection that supports:

  • COCO format datasets with automatic annotation loading
  • Torchvision and albumentations transform backends
  • Built-in augmentation presets optimized for detection
  • Efficient collation for variable-sized objects per image
  • Multi-worker data loading with prefetching

API Reference

autotimm.DetectionDataModule

Bases: LightningDataModule

Lightning data module for object detection.

Supports two modes:

  1. COCO mode (default) -- expects COCO-style directory structure::

    data_dir/
      train2017/           # Training images
      val2017/             # Validation images
      annotations/
        instances_train2017.json
        instances_val2017.json

  2. CSV mode -- provide train_csv pointing to a CSV file with columns image_path,x_min,y_min,x_max,y_max,label.
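In CSV mode each row describes one bounding box, so an image with several objects spans several rows. The sketch below shows the expected layout and groups rows per image with the stdlib csv module; the column names follow the documented defaults, and the grouping logic is illustrative, not the module's implementation:

```python
import csv
import io
from collections import defaultdict

# One row per box; an image with multiple objects repeats across rows.
csv_text = """image_path,x_min,y_min,x_max,y_max,label
images/001.jpg,10,20,110,220,person
images/001.jpg,150,30,300,200,dog
images/002.jpg,5,5,60,80,person
"""

# Group boxes by image, as a detection dataset would when building samples.
boxes_per_image = defaultdict(list)
for row in csv.DictReader(io.StringIO(csv_text)):
    bbox = [float(row[k]) for k in ("x_min", "y_min", "x_max", "y_max")]
    boxes_per_image[row["image_path"]].append((bbox, row["label"]))

print(len(boxes_per_image["images/001.jpg"]))  # 2 boxes for the first image
```

Pass such a file via `train_csv=` (and optionally `image_dir=` to resolve relative image paths) to select CSV mode.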

Parameters:

Name Type Description Default
data_dir str | Path

Root directory containing images and annotations.

'./coco'
train_images_dir str | Path | None

Path to training images. Defaults to data_dir/train2017.

None
val_images_dir str | Path | None

Path to validation images. Defaults to data_dir/val2017.

None
train_ann_file str | Path | None

Path to train annotations. Defaults to data_dir/annotations/instances_train2017.json.

None
val_ann_file str | Path | None

Path to val annotations. Defaults to data_dir/annotations/instances_val2017.json.

None
test_images_dir str | Path | None

Optional path to test images.

None
test_ann_file str | Path | None

Optional path to test annotations.

None
train_csv str | Path | None

Path to training CSV file (CSV mode).

None
val_csv str | Path | None

Path to validation CSV file (CSV mode).

None
test_csv str | Path | None

Path to test CSV file (CSV mode).

None
image_dir str | Path | None

Root directory for resolving image paths in CSV mode.

None
image_column str

CSV column name for image paths.

'image_path'
bbox_columns list[str] | None

CSV column names for bbox coordinates.

None
label_column str

CSV column name for class labels.

'label'
image_size int

Target image size (square).

640
batch_size int

Batch size for all dataloaders.

16
num_workers int

Number of data-loading workers. Defaults to min(os.cpu_count() or 4, 4), i.e. at most four workers.

min(cpu_count() or 4, 4)
train_transforms Callable | None

Custom training transforms. Must include bbox_params.

None
eval_transforms Callable | None

Custom eval transforms. Must include bbox_params.

None
augmentation_preset str

Preset name ("default", "strong"). Ignored when train_transforms is provided.

'default'
transform_config TransformConfig | None

Optional TransformConfig for unified transform configuration. When provided along with backbone, uses model-specific normalization from timm. Takes precedence over individual transform args.

None
backbone str | Module | None

Optional backbone name or module. Used with transform_config to resolve model-specific normalization (mean, std, input_size).

None
pin_memory bool

Pin memory for GPU transfer.

True
persistent_workers bool

Keep worker processes alive between epochs.

False
prefetch_factor int | None

Number of batches prefetched per worker.

None
min_bbox_area float

Minimum bbox area to include in training.

0.0
class_ids list[int] | None

Optional list of class IDs to filter.

None
Source code in src/autotimm/data/detection_datamodule.py
class DetectionDataModule(pl.LightningDataModule):
    """Lightning data module for object detection.

    Supports two modes:

    1. **COCO mode** (default) -- expects COCO-style directory structure::

        data_dir/
          train2017/           # Training images
          val2017/             # Validation images
          annotations/
            instances_train2017.json
            instances_val2017.json

    2. **CSV mode** -- provide ``train_csv`` pointing to a CSV file with
       columns ``image_path,x_min,y_min,x_max,y_max,label``.

    Parameters:
        data_dir: Root directory containing images and annotations.
        train_images_dir: Path to training images. Defaults to data_dir/train2017.
        val_images_dir: Path to validation images. Defaults to data_dir/val2017.
        train_ann_file: Path to train annotations. Defaults to
            data_dir/annotations/instances_train2017.json.
        val_ann_file: Path to val annotations. Defaults to
            data_dir/annotations/instances_val2017.json.
        test_images_dir: Optional path to test images.
        test_ann_file: Optional path to test annotations.
        train_csv: Path to training CSV file (CSV mode).
        val_csv: Path to validation CSV file (CSV mode).
        test_csv: Path to test CSV file (CSV mode).
        image_dir: Root directory for resolving image paths in CSV mode.
        image_column: CSV column name for image paths.
        bbox_columns: CSV column names for bbox coordinates.
        label_column: CSV column name for class labels.
        image_size: Target image size (square).
        batch_size: Batch size for all dataloaders.
        num_workers: Number of data-loading workers. Defaults to
            ``min(os.cpu_count() or 4, 4)``.
        train_transforms: Custom training transforms. Must include bbox_params.
        eval_transforms: Custom eval transforms. Must include bbox_params.
        augmentation_preset: Preset name (``"default"``, ``"strong"``).
            Ignored when train_transforms is provided.
        transform_config: Optional :class:`TransformConfig` for unified transform
            configuration. When provided along with ``backbone``, uses model-specific
            normalization from timm. Takes precedence over individual transform args.
        backbone: Optional backbone name or module. Used with ``transform_config``
            to resolve model-specific normalization (mean, std, input_size).
        pin_memory: Pin memory for GPU transfer.
        persistent_workers: Keep worker processes alive between epochs.
        prefetch_factor: Number of batches prefetched per worker.
        min_bbox_area: Minimum bbox area to include in training.
        class_ids: Optional list of class IDs to filter.
    """

    def __init__(
        self,
        data_dir: str | Path = "./coco",
        train_images_dir: str | Path | None = None,
        val_images_dir: str | Path | None = None,
        train_ann_file: str | Path | None = None,
        val_ann_file: str | Path | None = None,
        test_images_dir: str | Path | None = None,
        test_ann_file: str | Path | None = None,
        train_csv: str | Path | None = None,
        val_csv: str | Path | None = None,
        test_csv: str | Path | None = None,
        image_dir: str | Path | None = None,
        image_column: str = "image_path",
        bbox_columns: list[str] | None = None,
        label_column: str = "label",
        image_size: int = 640,
        batch_size: int = 16,
        num_workers: int = min(os.cpu_count() or 4, 4),
        train_transforms: Callable | None = None,
        eval_transforms: Callable | None = None,
        augmentation_preset: str = "default",
        transform_config: TransformConfig | None = None,
        backbone: str | nn.Module | None = None,
        pin_memory: bool = True,
        persistent_workers: bool = False,
        prefetch_factor: int | None = None,
        min_bbox_area: float = 0.0,
        class_ids: list[int] | None = None,
    ):
        super().__init__()
        self.save_hyperparameters(ignore=["backbone"])

        self.data_dir = Path(data_dir)
        self.train_csv = Path(train_csv) if train_csv else None
        self.val_csv = Path(val_csv) if val_csv else None
        self.test_csv = Path(test_csv) if test_csv else None
        self.image_dir = Path(image_dir) if image_dir else None
        self.image_column = image_column
        self.bbox_columns = bbox_columns
        self.label_column = label_column
        self.image_size = image_size
        self.batch_size = batch_size
        self.num_workers = num_workers
        self.pin_memory = pin_memory
        self.persistent_workers = persistent_workers and num_workers > 0
        self.prefetch_factor = prefetch_factor
        self.min_bbox_area = min_bbox_area
        self.class_ids = class_ids

        # Set default paths
        self.train_images_dir = (
            Path(train_images_dir) if train_images_dir else self.data_dir / "train2017"
        )
        self.val_images_dir = (
            Path(val_images_dir) if val_images_dir else self.data_dir / "val2017"
        )
        self.train_ann_file = (
            Path(train_ann_file)
            if train_ann_file
            else self.data_dir / "annotations" / "instances_train2017.json"
        )
        self.val_ann_file = (
            Path(val_ann_file)
            if val_ann_file
            else self.data_dir / "annotations" / "instances_val2017.json"
        )
        self.test_images_dir = Path(test_images_dir) if test_images_dir else None
        self.test_ann_file = Path(test_ann_file) if test_ann_file else None
        self.transform_config = transform_config
        self.backbone = backbone

        # Resolve transforms - TransformConfig takes precedence
        if transform_config is not None and backbone is not None:
            from autotimm.data.timm_transforms import get_transforms_from_backbone

            self.train_transforms = get_transforms_from_backbone(
                backbone=backbone,
                transform_config=transform_config,
                is_train=True,
                task="detection",
            )
            self.eval_transforms = get_transforms_from_backbone(
                backbone=backbone,
                transform_config=transform_config,
                is_train=False,
                task="detection",
            )
        elif train_transforms is not None:
            self.train_transforms = train_transforms
        else:
            self.train_transforms = get_detection_transforms(
                preset=augmentation_preset,
                image_size=image_size,
                is_train=True,
            )

        if eval_transforms is not None:
            self.eval_transforms = eval_transforms
        else:
            self.eval_transforms = detection_eval_transforms(image_size=image_size)

        self.train_dataset = None
        self.val_dataset = None
        self.test_dataset = None
        self.num_classes: int | None = None
        self.class_names: list[str] | None = None

    def setup(self, stage: str | None = None) -> None:
        if self.train_csv is not None:
            self._setup_csv(stage)
        else:
            self._setup_coco(stage)

    def _setup_csv(self, stage: str | None) -> None:
        img_dir = self.image_dir or self.train_csv.parent

        if stage in ("fit", None):
            self.train_dataset = CSVDetectionDataset(
                csv_path=self.train_csv,
                image_dir=img_dir,
                image_column=self.image_column,
                bbox_columns=self.bbox_columns,
                label_column=self.label_column,
                transform=self.train_transforms,
                min_bbox_area=self.min_bbox_area,
            )
            self.num_classes = self.train_dataset.num_classes
            self.class_names = self.train_dataset.class_names

            if self.val_csv is not None:
                self.val_dataset = CSVDetectionDataset(
                    csv_path=self.val_csv,
                    image_dir=img_dir,
                    image_column=self.image_column,
                    bbox_columns=self.bbox_columns,
                    label_column=self.label_column,
                    transform=self.eval_transforms,
                    min_bbox_area=0.0,
                )

        if stage in ("test", None) and self.test_csv is not None:
            self.test_dataset = CSVDetectionDataset(
                csv_path=self.test_csv,
                image_dir=img_dir,
                image_column=self.image_column,
                bbox_columns=self.bbox_columns,
                label_column=self.label_column,
                transform=self.eval_transforms,
                min_bbox_area=0.0,
            )
            if self.num_classes is None:
                self.num_classes = self.test_dataset.num_classes
                self.class_names = self.test_dataset.class_names

    def _setup_coco(self, stage: str | None) -> None:
        if stage in ("fit", None):
            self.train_dataset = COCODetectionDataset(
                images_dir=self.train_images_dir,
                annotations_file=self.train_ann_file,
                transform=self.train_transforms,
                min_bbox_area=self.min_bbox_area,
                class_ids=self.class_ids,
            )
            self.val_dataset = COCODetectionDataset(
                images_dir=self.val_images_dir,
                annotations_file=self.val_ann_file,
                transform=self.eval_transforms,
                min_bbox_area=0.0,  # Keep all boxes for evaluation
                class_ids=self.class_ids,
            )
            self.num_classes = self.train_dataset.num_classes
            self.class_names = self.train_dataset.class_names

        if stage in ("test", None) and self.test_images_dir and self.test_ann_file:
            self.test_dataset = COCODetectionDataset(
                images_dir=self.test_images_dir,
                annotations_file=self.test_ann_file,
                transform=self.eval_transforms,
                min_bbox_area=0.0,
                class_ids=self.class_ids,
            )
            if self.num_classes is None:
                self.num_classes = self.test_dataset.num_classes
                self.class_names = self.test_dataset.class_names

        if stage == "validate" and self.val_dataset is None:
            self.val_dataset = COCODetectionDataset(
                images_dir=self.val_images_dir,
                annotations_file=self.val_ann_file,
                transform=self.eval_transforms,
                min_bbox_area=0.0,
                class_ids=self.class_ids,
            )
            self.num_classes = self.val_dataset.num_classes
            self.class_names = self.val_dataset.class_names

    def _loader_kwargs(self) -> dict:
        kwargs: dict = {
            "batch_size": self.batch_size,
            "num_workers": self.num_workers,
            "pin_memory": self.pin_memory,
            "persistent_workers": self.persistent_workers,
            "collate_fn": detection_collate_fn,
        }
        if self.prefetch_factor is not None and self.num_workers > 0:
            kwargs["prefetch_factor"] = self.prefetch_factor
        return kwargs

    def train_dataloader(self) -> DataLoader:
        return DataLoader(
            self.train_dataset,
            shuffle=True,
            **self._loader_kwargs(),
        )

    def val_dataloader(self) -> DataLoader:
        return DataLoader(
            self.val_dataset,
            shuffle=False,
            **self._loader_kwargs(),
        )

    def test_dataloader(self) -> DataLoader:
        if self.test_dataset is None:
            raise RuntimeError(
                "No test dataset configured. Provide test_images_dir and test_ann_file."
            )
        return DataLoader(
            self.test_dataset,
            shuffle=False,
            **self._loader_kwargs(),
        )

__init__

__init__(data_dir: str | Path = './coco', train_images_dir: str | Path | None = None, val_images_dir: str | Path | None = None, train_ann_file: str | Path | None = None, val_ann_file: str | Path | None = None, test_images_dir: str | Path | None = None, test_ann_file: str | Path | None = None, train_csv: str | Path | None = None, val_csv: str | Path | None = None, test_csv: str | Path | None = None, image_dir: str | Path | None = None, image_column: str = 'image_path', bbox_columns: list[str] | None = None, label_column: str = 'label', image_size: int = 640, batch_size: int = 16, num_workers: int = min(os.cpu_count() or 4, 4), train_transforms: Callable | None = None, eval_transforms: Callable | None = None, augmentation_preset: str = 'default', transform_config: TransformConfig | None = None, backbone: str | Module | None = None, pin_memory: bool = True, persistent_workers: bool = False, prefetch_factor: int | None = None, min_bbox_area: float = 0.0, class_ids: list[int] | None = None)

setup

setup(stage: str | None = None) -> None

train_dataloader

train_dataloader() -> DataLoader

val_dataloader

val_dataloader() -> DataLoader

test_dataloader

test_dataloader() -> DataLoader

Usage Examples

Basic COCO Dataset

from autotimm import DetectionDataModule

data = DetectionDataModule(
    data_dir="./coco",
    image_size=640,
    batch_size=16,
)

With Augmentation Preset

data = DetectionDataModule(
    data_dir="./coco",
    image_size=640,
    batch_size=16,
    augmentation_preset="strong",  # Enhanced augmentation
)

With Custom Transforms (Albumentations)

Custom train/eval transforms must include bbox_params so boxes are transformed together with the image; this is albumentations-style, as required by the train_transforms/eval_transforms docs. The label_fields key below is illustrative and must match the key the dataset passes to the transform:

import albumentations as A
from albumentations.pytorch import ToTensorV2

custom_train = A.Compose(
    [
        A.HorizontalFlip(p=0.5),
        A.ColorJitter(brightness=0.2, contrast=0.2, p=0.5),
        A.Resize(640, 640),
        A.Normalize(mean=(0.485, 0.456, 0.406), std=(0.229, 0.224, 0.225)),
        ToTensorV2(),
    ],
    bbox_params=A.BboxParams(format="pascal_voc", label_fields=["labels"]),
)

data = DetectionDataModule(
    data_dir="./coco",
    image_size=640,
    batch_size=16,
    train_transforms=custom_train,
)

Performance Optimization

data = DetectionDataModule(
    data_dir="./coco",
    image_size=640,
    batch_size=16,
    num_workers=8,
    pin_memory=True,
    persistent_workers=True,
    prefetch_factor=4,
)

With TransformConfig (Model-Specific Normalization)

Use TransformConfig with a backbone to get model-specific normalization:

from autotimm import DetectionDataModule, TransformConfig

# Create shared config
config = TransformConfig(
    preset="default",
    image_size=640,
    use_timm_config=True,  # Use model's pretrained mean/std
)

data = DetectionDataModule(
    data_dir="./coco",
    transform_config=config,
    backbone="resnet50",  # Required for model-specific normalization
)

Shared Config Between Model and Data

from autotimm import ObjectDetector, DetectionDataModule, TransformConfig, MetricConfig

# Shared config ensures same preprocessing
config = TransformConfig(preset="default", image_size=640)
backbone_name = "resnet50"

# DataModule uses model's normalization
data = DetectionDataModule(
    data_dir="./coco",
    transform_config=config,
    backbone=backbone_name,
)
data.setup("fit")

# Model uses same config for inference preprocessing
metrics = [MetricConfig(
    name="mAP",
    backend="torchmetrics",
    metric_class="MeanAveragePrecision",
    params={"box_format": "xyxy"},
    stages=["val"],
)]

model = ObjectDetector(
    backbone=backbone_name,
    num_classes=data.num_classes,
    metrics=metrics,
    transform_config=config,
)

Parameters

Parameter Type Default Description
data_dir str \| Path "./coco" Root directory
image_size int 640 Target image size
batch_size int 16 Batch size
num_workers int min(cpu_count() or 4, 4) Data loading workers
train_transforms Callable \| None None Custom train transforms
eval_transforms Callable \| None None Custom eval transforms
augmentation_preset str "default" Preset name ("default", "strong")
pin_memory bool True Pin memory for GPU
persistent_workers bool False Keep workers alive
prefetch_factor int \| None None Prefetch batches
transform_config TransformConfig \| None None Unified transform configuration
backbone str \| nn.Module \| None None Backbone for model-specific normalization

Attributes

Attribute Type Description
num_classes int \| None Number of object classes (after setup)
class_names list[str] \| None Class names from annotations (after setup)
train_dataset Dataset \| None Training dataset (after setup)
val_dataset Dataset \| None Validation dataset (after setup)
test_dataset Dataset \| None Test dataset (after setup)

COCO Format

The data directory should follow this structure:

coco/
├── annotations/
│   ├── instances_train2017.json
│   ├── instances_val2017.json
│   └── instances_test2017.json  # Optional
├── train2017/
│   ├── 000000000001.jpg
│   ├── 000000000002.jpg
│   └── ...
├── val2017/
│   ├── 000000000001.jpg
│   └── ...
└── test2017/          # Optional
    └── ...

Annotation Format

COCO annotations should have this structure:

{
  "images": [
    {
      "id": 1,
      "file_name": "000000000001.jpg",
      "height": 480,
      "width": 640
    }
  ],
  "annotations": [
    {
      "id": 1,
      "image_id": 1,
      "category_id": 1,
      "bbox": [x, y, width, height],
      "area": 12345,
      "iscrowd": 0
    }
  ],
  "categories": [
    {
      "id": 1,
      "name": "person",
      "supercategory": "person"
    }
  ]
}
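A minimal annotation file matching this schema can be produced and sanity-checked with the json stdlib before handing it to the datamodule. This is a sketch with one image, one box, and one category; real files carry one entry per image and per box:

```python
import json

coco = {
    "images": [
        {"id": 1, "file_name": "000000000001.jpg", "height": 480, "width": 640}
    ],
    "annotations": [
        {
            "id": 1,
            "image_id": 1,
            "category_id": 1,
            "bbox": [10.0, 20.0, 100.0, 200.0],  # [x, y, width, height]
            "area": 100.0 * 200.0,
            "iscrowd": 0,
        }
    ],
    "categories": [{"id": 1, "name": "person", "supercategory": "person"}],
}

parsed = json.loads(json.dumps(coco))

# Basic schema checks: required top-level keys and positive box sizes.
assert {"images", "annotations", "categories"} <= parsed.keys()
for ann in parsed["annotations"]:
    x, y, w, h = ann["bbox"]
    assert w > 0 and h > 0, "COCO bboxes are [x, y, width, height]"
```

Note that COCO stores boxes as [x, y, width, height], while the batches produced by the module use [x1, y1, x2, y2].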

Augmentation Presets

Torchvision

Preset Description
default RandomHorizontalFlip, ColorJitter, ToTensor
strong Default + RandomPhotometricDistort

Albumentations

Preset Description
default HorizontalFlip, ColorJitter
strong HorizontalFlip, RandomBrightnessContrast, HueSaturationValue, Blur, Noise

Data Output

Each batch contains:

batch = {
    "image": Tensor,      # Shape: (B, 3, H, W)
    "boxes": List[Tensor],   # List of (N, 4) tensors in [x1, y1, x2, y2] format
    "labels": List[Tensor],  # List of (N,) tensors with class indices
}
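Because each image carries a different number of boxes, boxes and labels cannot be stacked into a single tensor; only the fixed-size images are. The sketch below shows that collation pattern for the batch layout above. It is illustrative, not autotimm's detection_collate_fn, and assumes each sample is a dict with the keys shown:

```python
import torch

def collate_detection(samples: list[dict]) -> dict:
    """Stack fixed-size images; keep variable-length boxes/labels as lists."""
    return {
        "image": torch.stack([s["image"] for s in samples]),
        "boxes": [s["boxes"] for s in samples],
        "labels": [s["labels"] for s in samples],
    }

samples = [
    {
        "image": torch.zeros(3, 640, 640),
        "boxes": torch.tensor([[0.0, 0.0, 10.0, 10.0]]),
        "labels": torch.tensor([1]),
    },
    {
        "image": torch.zeros(3, 640, 640),
        "boxes": torch.zeros((0, 4)),  # image with no objects
        "labels": torch.zeros((0,), dtype=torch.long),
    },
]

batch = collate_detection(samples)
print(batch["image"].shape)  # torch.Size([2, 3, 640, 640])
```

A function like this is what gets passed as collate_fn to the DataLoader; the module wires its own collate function in automatically.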

See Also