Custom model training #21244
-
Hi! Thank you for the great library. It is much easier to work with compared to others. The built in yolo models and configs work great, however, i wanted to try with some different and custom models including swin, deformable detr and more niche ones. I tried to go through the docs and code, howeer was not fully able to understand the strcuture expected of new backbones or necks for optimal integration. Would greatly appreciate if you could help me with:
Sorry if these are repeats. I tried my best to read through the discussions an only include parts that are not explained. |
Beta Was this translation helpful? Give feedback.
Replies: 5 comments 17 replies
-
👋 Hello @someuser2026, thank you for your kind words and interest in Ultralytics 🚀! We’re glad to hear you’ve had a positive experience so far. If this is a ❓ custom model integration or training question, please provide as much information as possible about your use case, including any config files, code snippets, or error messages. If you encounter any issues or bugs, please include a minimum reproducible example to help us assist you effectively. For your questions about integrating custom backbones or modules, we recommend reviewing the Docs—especially the sections on Python and CLI usage, as well as our Tips for Best Training Results. You can also check out existing model definitions in the repository to see how backbones and necks are structured. Join the Ultralytics community wherever you prefer! For real-time chat, visit Discord 🎧. For more in-depth discussions, check out Discourse. Or connect with others on our Subreddit. UpgradePlease make sure you are using the latest pip install -U ultralytics EnvironmentsRun YOLO in any of these up-to-date verified environments (with dependencies including CUDA/CUDNN, Python, and PyTorch preinstalled):
StatusIf this badge is green, all Ultralytics CI tests are currently passing. CI tests verify correct operation of all YOLO Modes and Tasks on macOS, Windows, and Ubuntu every 24 hours and on every commit. This is an automated response 🤖. An Ultralytics engineer will review your questions and provide more detailed guidance soon! |
Beta Was this translation helpful? Give feedback.
-
@someuser2026 , Some of the above functionalities are supported in mmyolo or mmdetection in case if you want to explore. |
Beta Was this translation helpful? Give feedback.
-
You can check the explanations here to understand the YAML configs: |
Beta Was this translation helpful? Give feedback.
-
Hello, I have a question. When my training folders are named images and labels with s The yaml files are all correct What will be the impact of this? |
Beta Was this translation helpful? Give feedback.
-
Hi, I tried using a swin_t backbone with yolo11x neack and obb head. However, it appears that the tensors returned by the swin bakcbone have channels in the last dimension wheras the yolo layers accpet it in 2nd dimension. Is there a layer to help reshape the input? Attached is the current yaml file:
|
Beta Was this translation helpful? Give feedback.
You can check the explanations here to understand the YAML configs:
#19615
#18564 (comment)