Open
Description
The components of this system is
- Speech recognition
- Find moving object in pixel-level
- Track object in pixel-level or roi-level
For 1. @furushchev Do you know good ros node for speech recognition?
For 3. @iKrishneel Do you know good ros node for tracking in pixel-wise? I expect it uses point cloud, because roi-level tracking is common for 2D image: ex ConsensusTracking
tl;dr
I and @inabajsk talked about the importance of human-friendly data collection interface for object segmentation.
For example:
- Human says to Pepper "Now I teach you this object".
- Pepper says "Ok, what is that object name?"
- Human says "wallet"
- Pepper says "Ok, please show me"
- Human moves object in front of Pepper.
- Pepper do
- Find moving object
- Track it
- Record it
Metadata
Metadata
Assignees
Labels
No labels