Detectron2’s image augmentation system
Detectron2’s image augmentation system has three main groups of classes: Transform, Augmentation, and AugInput. These components help augment images and their related annotations (for example, bounding boxes, segmentation masks, and keypoints). Additionally, this system lets you apply a sequence of declarative augmentation statements and supports augmenting custom data types and defining custom operations. Figure 8.4 shows a simplified class diagram of Detectron2’s augmentation system:
Figure 8.4: Simplified class diagram of Detectron2’s augmentation system
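To make the division of labor among the three groups more concrete, the following is a minimal pure-Python sketch. It is not Detectron2's implementation. The class and method names echo the library's naming, but the bodies are simplified stand-ins written for illustration: images are nested lists rather than arrays, boxes are tuples of (x_min, y_min, x_max, y_max) in absolute pixels, and the apply_augmentations helper is hypothetical. The idea it tries to show is that a transform is a fixed operation applied to an image and its boxes, an augmentation is a policy that chooses which transform to use, and an input object carries the data that is augmented together.

```python
import random


class NoOpTransform:
    """Toy transform that leaves the image and boxes unchanged."""

    def apply_image(self, image):
        return image

    def apply_box(self, boxes):
        return boxes


class HFlipTransform:
    """Toy deterministic horizontal flip for an image of a known width."""

    def __init__(self, width):
        self.width = width

    def apply_image(self, image):
        # The image is a list of rows; reversing each row mirrors it.
        return [row[::-1] for row in image]

    def apply_box(self, boxes):
        # Boxes are (x_min, y_min, x_max, y_max) in absolute pixels.
        # Mirroring swaps the roles of the two x coordinates.
        w = self.width
        return [(w - x1, y0, w - x0, y1) for (x0, y0, x1, y1) in boxes]


class RandomFlip:
    """Toy augmentation: a policy that decides which transform to return."""

    def __init__(self, prob=0.5, rng=None):
        self.prob = prob
        self.rng = rng or random.Random()

    def get_transform(self, image):
        if self.rng.random() < self.prob:
            return HFlipTransform(width=len(image[0]))
        return NoOpTransform()


class AugInput:
    """Toy container that keeps an image and its boxes in sync."""

    def __init__(self, image, boxes):
        self.image = image
        self.boxes = boxes

    def transform(self, tfm):
        self.image = tfm.apply_image(self.image)
        self.boxes = tfm.apply_box(self.boxes)


def apply_augmentations(augs, aug_input):
    """Hypothetical helper: run each policy in order, return the transforms used."""
    applied = []
    for aug in augs:
        tfm = aug.get_transform(aug_input.image)
        aug_input.transform(tfm)
        applied.append(tfm)
    return applied


image = [[0, 1, 2, 3], [4, 5, 6, 7]]  # 2 rows x 4 columns
data = AugInput(image, boxes=[(0, 0, 1, 2)])
apply_augmentations([RandomFlip(prob=1.0)], data)  # prob=1.0 always flips
print(data.image)  # [[3, 2, 1, 0], [7, 6, 5, 4]]
print(data.boxes)  # [(3, 0, 4, 2)]
```

Because the random decision is made once inside get_transform and the returned transform is then fixed, the same change reaches both the image and its boxes, and the list of applied transforms can be reused on other data that belongs to the same image.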
The Transform and Augmentation classes are the bases for all the classes in their respective groups. Notably, the data format for boxes is XYXY_ABS mode, which requires boxes to be specified as (x_min, y_min, x_max, y_max) in absolute pixels. Generally, subclasses of the Transform base class perform the deterministic changes of the...