|
1 |
| -| Argument | Type | Default | Range | Description | |
2 |
| -| ----------------- | ------- | --------------- | ------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | |
3 |
| -| `hsv_h` | `float` | `0.015` | `0.0 - 1.0` | Adjusts the hue of the image by a fraction of the color wheel, introducing color variability. Helps the model generalize across different lighting conditions. | |
4 |
| -| `hsv_s` | `float` | `0.7` | `0.0 - 1.0` | Alters the saturation of the image by a fraction, affecting the intensity of colors. Useful for simulating different environmental conditions. | |
5 |
| -| `hsv_v` | `float` | `0.4` | `0.0 - 1.0` | Modifies the value (brightness) of the image by a fraction, helping the model to perform well under various lighting conditions. | |
6 |
| -| `degrees` | `float` | `0.0` | `-180 - +180` | Rotates the image randomly within the specified degree range, improving the model's ability to recognize objects at various orientations. | |
7 |
| -| `translate` | `float` | `0.1` | `0.0 - 1.0` | Translates the image horizontally and vertically by a fraction of the image size, aiding in learning to detect partially visible objects. | |
8 |
| -| `scale` | `float` | `0.5` | `>=0.0` | Scales the image by a gain factor, simulating objects at different distances from the camera. | |
9 |
| -| `shear` | `float` | `0.0` | `-180 - +180` | Shears the image by a specified degree, mimicking the effect of objects being viewed from different angles. | |
10 |
| -| `perspective` | `float` | `0.0` | `0.0 - 0.001` | Applies a random perspective transformation to the image, enhancing the model's ability to understand objects in 3D space. | |
11 |
| -| `flipud` | `float` | `0.0` | `0.0 - 1.0` | Flips the image upside down with the specified probability, increasing the data variability without affecting the object's characteristics. | |
12 |
| -| `fliplr` | `float` | `0.5` | `0.0 - 1.0` | Flips the image left to right with the specified probability, useful for learning symmetrical objects and increasing dataset diversity. | |
13 |
| -| `bgr` | `float` | `0.0` | `0.0 - 1.0` | Flips the image channels from RGB to BGR with the specified probability, useful for increasing robustness to incorrect channel ordering. | |
14 |
| -| `mosaic` | `float` | `1.0` | `0.0 - 1.0` | Combines four training images into one, simulating different scene compositions and object interactions. Highly effective for complex scene understanding. | |
15 |
| -| `mixup` | `float` | `0.0` | `0.0 - 1.0` | Blends two images and their labels, creating a composite image. Enhances the model's ability to generalize by introducing label noise and visual variability. | |
16 |
| -| `copy_paste` | `float` | `0.0` | `0.0 - 1.0` | Copies and pastes objects across images, useful for increasing object instances and learning object occlusion. Requires segmentation labels. | |
17 |
| -| `copy_paste_mode` | `str` | `'flip'` | - | Copy-Paste augmentation method selection among the options of (`"flip"`, `"mixup"`). | |
18 |
| -| `auto_augment` | `str` | `'randaugment'` | - | Automatically applies a predefined augmentation policy (`randaugment`, `autoaugment`, `augmix`), optimizing for classification tasks by diversifying the visual features. | |
19 |
| -| `erasing` | `float` | `0.4` | `0.0 - 0.9` | Randomly erases a portion of the image during classification training, encouraging the model to focus on less obvious features for recognition. | |
20 |
| -| `crop_fraction` | `float` | `1.0` | `0.1 - 1.0` | Crops the classification image to a fraction of its size to emphasize central features and adapt to object scales, reducing background distractions. | |
| 1 | +| Argument | Type | Default | Range | Description | |
| 2 | +| ----------------------------------------------------------------------------------------- | ------- | --------------- | ------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | |
| 3 | +| [`hsv_h`](../guides/yolo-data-augmentation.md/#hue-adjustment-hsv_h) | `float` | `0.015` | `0.0 - 1.0` | Adjusts the hue of the image by a fraction of the color wheel, introducing color variability. Helps the model generalize across different lighting conditions. | |
| 4 | +| [`hsv_s`](../guides/yolo-data-augmentation.md/#saturation-adjustment-hsv_s) | `float` | `0.7` | `0.0 - 1.0` | Alters the saturation of the image by a fraction, affecting the intensity of colors. Useful for simulating different environmental conditions. | |
| 5 | +| [`hsv_v`](../guides/yolo-data-augmentation.md/#brightness-adjustment-hsv_v) | `float` | `0.4` | `0.0 - 1.0` | Modifies the value (brightness) of the image by a fraction, helping the model to perform well under various lighting conditions. | |
| 6 | +| [`degrees`](../guides/yolo-data-augmentation.md//#rotation-degrees) | `float` | `0.0` | `0.0 - 180` | Rotates the image randomly within the specified degree range, improving the model's ability to recognize objects at various orientations. | |
| 7 | +| [`translate`](../guides/yolo-data-augmentation.md/#translation-translate) | `float` | `0.1` | `0.0 - 1.0` | Translates the image horizontally and vertically by a fraction of the image size, aiding in learning to detect partially visible objects. | |
| 8 | +| [`scale`](../guides/yolo-data-augmentation.md/#scale-scale) | `float` | `0.5` | `>=0.0` | Scales the image by a gain factor, simulating objects at different distances from the camera. | |
| 9 | +| [`shear`](../guides/yolo-data-augmentation.md/#shear-shear) | `float` | `0.0` | `-180 - +180` | Shears the image by a specified degree, mimicking the effect of objects being viewed from different angles. | |
| 10 | +| [`perspective`](../guides/yolo-data-augmentation.md/#perspective-perspective) | `float` | `0.0` | `0.0 - 0.001` | Applies a random perspective transformation to the image, enhancing the model's ability to understand objects in 3D space. | |
| 11 | +| [`flipud`](../guides/yolo-data-augmentation.md/#flip-up-down-flipud) | `float` | `0.0` | `0.0 - 1.0` | Flips the image upside down with the specified probability, increasing the data variability without affecting the object's characteristics. | |
| 12 | +| [`fliplr`](../guides/yolo-data-augmentation.md/#flip-left-right-fliplr) | `float` | `0.5` | `0.0 - 1.0` | Flips the image left to right with the specified probability, useful for learning symmetrical objects and increasing dataset diversity. | |
| 13 | +| [`bgr`](../guides/yolo-data-augmentation.md/#bgr-channel-swap-bgr) | `float` | `0.0` | `0.0 - 1.0` | Flips the image channels from RGB to BGR with the specified probability, useful for increasing robustness to incorrect channel ordering. | |
| 14 | +| [`mosaic`](../guides/yolo-data-augmentation.md/#mosaic-mosaic) | `float` | `1.0` | `0.0 - 1.0` | Combines four training images into one, simulating different scene compositions and object interactions. Highly effective for complex scene understanding. | |
| 15 | +| [`mixup`](../guides/yolo-data-augmentation.md/#mixup-mixup) | `float` | `0.0` | `0.0 - 1.0` | Blends two images and their labels, creating a composite image. Enhances the model's ability to generalize by introducing label noise and visual variability. | |
| 16 | +| [`copy_paste`](../guides/yolo-data-augmentation.md/#copy-paste-copy_paste) | `float` | `0.0` | `0.0 - 1.0` | _Segmentation only_. Copies and pastes objects across images to increase object instances. | |
| 17 | +| [`copy_paste_mode`](../guides/yolo-data-augmentation.md/#copy-paste-mode-copy_paste_mode) | `str` | `'flip'` | - | _Segmentation only_. Specifies the `copy-paste` strategy to use. Options include `'flip'` and `'mixup'`. | |
| 18 | +| [`auto_augment`](../guides/yolo-data-augmentation.md/#auto-augment-auto_augment) | `str` | `'randaugment'` | - | _Classification only_. Applies a predefined augmentation policy (`'randaugment'`, `'autoaugment'`, or `'augmix'`) to enhance model performance through visual diversity. | |
| 19 | +| [`erasing`](../guides/yolo-data-augmentation.md/#random-erasing-erasing) | `float` | `0.4` | `0.0 - 0.9` | _Classification only_. Randomly erases regions of the image during training to encourage the model to focus on less obvious features. | |
0 commit comments