-
Notifications
You must be signed in to change notification settings - Fork 267
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question regarding ch. 14 Semantic SLAM #28
Comments
I may have answered a part of my question with a google search on "deep learning for local features". But I would still be very glad if you can provide some comments on my questions based on your experience |
OFFTOP |
Hi, @aboarya here is just my opinions on your questions:
About the donation, well, I think it is just fine like this. I'm glad to know you love this book. I have some economic income from the Chinese version in recent years (about 30,000 copies I think). Springer will publish the English version someday, but I'm not sure how much time it will take. Maybe you can buy it on amazon someday in the future, but for now, I only have the Chinese version in the paper form. Also if you find some interesting books in SLAM/robotics, let me know and I'm willing to do some translation works. Thanks for your support! |
Firstly, I would like to extend my gratitude for your excellent work, your dedication and efforts. This book is truly a great resource for many people of different backgrounds.
I have just finished the first reading of the VO chapters, after a deep-dive on the Lie Algebra and Linear Optimization chapters.
My first instincts regarding ORB (Oriented FAST) corners is why not use object detection similar to YOLO v4 or v5.
I skimmed ahead to Chapter 14 and found the section on Semantic SLAM, so my instincts are not very far off. Now my questions are as follows:
Since YOLO object detection is quite fast even on mobile devices now, while also being accurate, do you think that the center of a bounding box detection can serve as a replacement to an Oriented FAST feature point? Maybe the descriptor can be even smaller than 128 bit values binary since there's some semantic meaning?
I was looking at ORB_SLAM2 paper and Github and noticed that the latest code commit was 4 years ago, and the more recent OpenVSLAM was terminated yesterday (Feb 25th 2021). This leads me to believe that the robotics industry (the vacuum robots, drones) are not necessarily using these techniques? Can you please offer some insight on this? Do you believe the trend in industry has shifted towards deep learning already? Or are they possibly using more in-house systems?
Your book was written in 2016, and since then there have been several papers published on Unsupervised Learning of Depth and Ego-Motion, like SfMLearner and Monodepth2. Do you know if there are any SLAM systems that are based on such techniques for the VO step? In your view, is this something the industry has adopted or is it not yet mature enough?
Finally, please allow me to contribute to you work in some way, either financially or with time and effort. I don't see a link for financial contribution, and since I can't read the Chinese version I would not be able to purchase it. Can you please advise on how I can support your work by contributing either my time or some minor funding.
Thanks and keep up the good work!
The text was updated successfully, but these errors were encountered: