User Guide¶

Comprehensive guides for using AutoTimm across all computer vision tasks — from data loading to deployment.

Guide Organization¶

graph LR
    A[<b>User Guide</b>] --> B[<b>Data Loading</b><br/>Datasets, transforms,<br/>presets]
    A --> C[<b>Models</b><br/>Classifier, detector,<br/>segmentor]
    A --> D[<b>Training</b><br/>AutoTrainer, losses,<br/>customization]
    A --> E[<b>Evaluation</b><br/>Metrics, benchmarks,<br/>selection]
    A --> F[<b>Interpretation</b><br/>GradCAM, attention,<br/>visualization]
    A --> G[<b>Integration</b><br/>HuggingFace Hub,<br/>Transformers]
    A --> H[<b>Deployment</b><br/>TorchScript, C++,<br/>mobile]
    A --> I[<b>Inference</b><br/>Classification,<br/>detection, segmentation]

    style A fill:#1565C0,stroke:#0D47A1
    style B fill:#1976D2,stroke:#1565C0
    style C fill:#1565C0,stroke:#0D47A1
    style D fill:#1976D2,stroke:#1565C0
    style E fill:#1565C0,stroke:#0D47A1
    style F fill:#1976D2,stroke:#1565C0
    style G fill:#1565C0,stroke:#0D47A1
    style H fill:#1976D2,stroke:#1565C0
    style I fill:#1565C0,stroke:#0D47A1

Browse by Category¶

Data Loading¶

Prepare and load data for any computer vision task.

Overview - Data module overview and architecture
Image Classification Data - ImageDataModule for classification tasks
Multi-Label Classification Data - Multi-label image datasets
Object Detection Data - Detection dataset formats (COCO, VOC)
Segmentation Data - Semantic segmentation masks
CSV Data Loading - Load datasets from CSV files
Transforms - Augmentation and preprocessing pipelines
Preset Manager - Pre-configured transform presets

Models¶

Task-specific model architectures built on timm.

Overview - Model architecture overview
ImageClassifier - Image classification with any timm backbone
ObjectDetector - Object detection with FCOS
SemanticSegmentor - Semantic segmentation with DeepLabV3+/FCN
InstanceSegmentor - Instance segmentation with Mask R-CNN
YOLOXDetector - Real-time detection with YOLOX

Training¶

Configure and run training with AutoTrainer.

Overview - AutoTrainer setup and configuration
Dataset Preparation - Preparing data for training
Loss Comparison - Choosing the right loss function
Advanced Customization - Custom callbacks, schedulers, and more

Evaluation¶

Measure and compare model performance.

Metrics - MetricConfig and MetricManager
Metric Selection - Choosing metrics per task
Benchmarks - Performance benchmarks and comparisons

Interpretation¶

Understand what your models learn.

Overview - Interpretation tools overview
Interpretation Methods - GradCAM, attention, and more
Quality Metrics - Quantitative interpretation metrics
Interactive Visualizations - Plotly-based interactive views
Performance Optimization - Efficient interpretation at scale
Feature Visualization - Visualize learned features
Training Callbacks - Log interpretations during training
Task-Specific Adapters - Adapters for detection and segmentation

Integration¶

Connect with HuggingFace ecosystem.

Comparison Overview - Compare integration approaches
HF Hub Integration - Push/pull models from Hub
HF Transformers Integration - Use Transformers models

Deployment¶

Deploy models to production.

Production Deployment - Full production deployment guide
TorchScript Export - Export models to TorchScript
C++ Deployment - Deploy with LibTorch in C++
Mobile Deployment - Deploy on iOS and Android

Inference¶

Run predictions with trained models.

Overview - Inference workflow overview
Classification Inference - Classify images
Object Detection Inference - Detect objects
Segmentation Inference - Segment images
Model Export - Export for optimized inference

Guides¶

Best practices and reference guides.

Reproducibility - Achieve reproducible results
Logging - Configure experiment logging
Migration Guide - Migrate from other frameworks
YOLOX Quick Reference - YOLOX cheat sheet