Export and Inference Issues

Problems with model export and inference.

ONNX Export Failures

from autotimm import ImageClassifier, export_to_onnx
import torch

# 1. Use AutoTimm's export function (handles Lightning modules automatically)
model = ImageClassifier.load_from_checkpoint("checkpoint.ckpt", compile_model=False)
model.eval()

example_input = torch.randn(1, 3, 224, 224)
export_to_onnx(model, "model.onnx", example_input)

# 2. Or use the convenience method
model.to_onnx("model.onnx")

# 3. If export fails, try a lower opset version
export_to_onnx(model, "model.onnx", example_input, opset_version=14)

# 4. Validate the export
from autotimm.export import validate_onnx_export
is_valid = validate_onnx_export(model, "model.onnx", example_input)

# 5. If ONNX export fails, try TorchScript instead
model.to_torchscript("model.pt")
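
After a successful export, you can sanity-check the model by running it with ONNX Runtime directly. A minimal sketch (the 1x3x224x224 input shape is an assumption; match your model's expected input):

import numpy as np
import onnxruntime as ort

# CPUExecutionProvider works everywhere; use CUDAExecutionProvider with onnxruntime-gpu
session = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])
input_name = session.get_inputs()[0].name
dummy = np.random.randn(1, 3, 224, 224).astype(np.float32)
outputs = session.run(None, {input_name: dummy})
print(outputs[0].shape)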

Missing ONNX Dependencies

# Install required packages
pip install onnx onnxruntime onnxscript

# For GPU inference
pip install onnxruntime-gpu
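
To confirm the install, list the execution providers the runtime can see (CUDAExecutionProvider should appear if the GPU build is working):

python -c "import onnxruntime; print(onnxruntime.get_available_providers())"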

TorchScript Issues

# Some operations aren't supported by TorchScript scripting;
# tracing often works where scripting fails
import torch

dummy_input = torch.randn(1, 3, 224, 224)
traced_model = torch.jit.trace(model, dummy_input)
traced_model.save("model_traced.pt")

# Or freeze the traced model for further optimization (it must be in eval mode)
frozen_model = torch.jit.freeze(traced_model)
frozen_model.save("model_frozen.pt")
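
A saved TorchScript module can be loaded and run without the original Python class definition. A quick sanity check (the input shape is an assumption):

import torch

loaded = torch.jit.load("model_traced.pt")
loaded.eval()
with torch.inference_mode():
    out = loaded(torch.randn(1, 3, 224, 224))
print(out.shape)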

Inference Optimization

# 1. Compile model for faster inference (PyTorch 2.0+)
import torch
model = torch.compile(model, mode="reduce-overhead")

# 2. Use half precision for inference
model = model.half()
input_tensor = input_tensor.half()

# 3. Disable gradient tracking and use automatic mixed precision
with torch.inference_mode():
    with torch.autocast("cuda"):  # torch.cuda.amp.autocast() is deprecated
        outputs = model(inputs)
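
These options compose. A minimal sketch of an optimized inference pass combining them (the batch and input sizes are assumptions):

import torch

model = model.to("cuda").eval()
model = torch.compile(model, mode="reduce-overhead")

inputs = torch.randn(8, 3, 224, 224, device="cuda")
with torch.inference_mode():
    with torch.autocast("cuda"):
        outputs = model(inputs)

Note that the first call after torch.compile is slow because the graph is compiled on the fly; warm the model up before timing it.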

Classification Inference Issues

Out of Memory During Inference

Solutions:

# 1. Reduce batch size
batch_size = 16

# 2. Use a smaller image size
from torchvision import transforms
transform = transforms.Resize(224)

# 3. Clear cache between batches
torch.cuda.empty_cache()

# 4. Use CPU for very large images
device = torch.device("cpu")

Slow Inference

Solutions:

# 1. Use GPU
device = torch.device("cuda")

# 2. Increase batch size
batch_size = 64

# 3. Use half precision
model = model.half()

# 4. Load the model once and keep it in memory (don't reload it per request)
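
To verify a change actually helps, measure throughput with a warm-up pass and explicit CUDA synchronization (without the sync, asynchronous kernel launches make timings meaningless). A minimal sketch:

import time
import torch

def benchmark(model, input_tensor, n_iters=50):
    with torch.inference_mode():
        for _ in range(5):  # warm-up: lazy init, cuDNN autotuning
            model(input_tensor)
        torch.cuda.synchronize()
        start = time.perf_counter()
        for _ in range(n_iters):
            model(input_tensor)
        torch.cuda.synchronize()  # wait for queued kernels before stopping the clock
    elapsed = time.perf_counter() - start
    print(f"{n_iters * input_tensor.shape[0] / elapsed:.1f} images/sec")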

Detection Inference Issues

No Detections

Solutions:

# 1. Lower score threshold
from autotimm.inference import DetectionPipeline
pipeline = DetectionPipeline(score_threshold=0.1)

# 2. Make sure the image is RGB
from PIL import Image
image = Image.open("img.jpg").convert("RGB")

# 3. Verify model classes
print(f"Model num_classes: {model.num_classes}")

Too Many Duplicate Detections

Solutions:

# 1. Lower the NMS IoU threshold (stricter suppression of overlapping boxes)
from autotimm import ObjectDetector  # import path assumed to mirror ImageClassifier
model = ObjectDetector(nms_thresh=0.3)

# 2. Increase score threshold
pipeline = DetectionPipeline(score_threshold=0.5)
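
What a stricter NMS threshold does, shown standalone with torchvision's nms (boxes are (x1, y1, x2, y2)):

import torch
from torchvision.ops import nms

boxes = torch.tensor([[10., 10., 100., 100.],
                      [12., 12., 102., 102.],    # near-duplicate of the first box
                      [200., 200., 300., 300.]])
scores = torch.tensor([0.9, 0.8, 0.7])

# any box overlapping a higher-scoring box with IoU > 0.3 is suppressed
keep = nms(boxes, scores, iou_threshold=0.3)
print(keep)  # tensor([0, 2]) -- the near-duplicate is gone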

Missing Small Objects

Solutions:

# 1. Use larger image size
pipeline = DetectionPipeline(image_size=800)

# 2. Use multi-scale inference (see the sketch below)
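
A minimal multi-scale sketch: run the pipeline at several sizes and merge the detections with NMS. The call signature and the dict-style output with "boxes" and "scores" tensors are assumptions; adapt them to the pipeline's actual format:

import torch
from torchvision.ops import nms
from autotimm.inference import DetectionPipeline

def multiscale_detect(image, sizes=(640, 800, 1024), iou_thresh=0.5):
    all_boxes, all_scores = [], []
    for size in sizes:
        result = DetectionPipeline(image_size=size)(image)  # assumed call signature
        all_boxes.append(result["boxes"])                   # assumed output format
        all_scores.append(result["scores"])
    boxes, scores = torch.cat(all_boxes), torch.cat(all_scores)
    keep = nms(boxes, scores, iou_thresh)  # merge overlaps across scales
    return boxes[keep], scores[keep]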

Segmentation Inference Issues

Mask Size Mismatch

Explanation: AutoTimm automatically resizes masks back to the original image size using nearest-neighbor (NEAREST) interpolation, which keeps class indices discrete instead of blending them.

If you still see a size mismatch:

# Manually resize the mask (note: cv2.resize expects (width, height))
import cv2
mask_resized = cv2.resize(
    mask,
    (original_width, original_height),
    interpolation=cv2.INTER_NEAREST
)
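
If you'd rather avoid the OpenCV dependency, the same nearest-neighbor resize works in PyTorch (assuming mask is a 2-D numpy array of class indices):

import torch
import torch.nn.functional as F

mask_t = torch.from_numpy(mask)[None, None].float()  # (1, 1, H, W)
mask_resized = F.interpolate(
    mask_t, size=(original_height, original_width), mode="nearest"
)
mask_resized = mask_resized[0, 0].long().numpy()  # back to (H, W) labels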