Thank you for open-sourcing this work and the dataset! I noticed that the mask annotation files in VPData contain segmentation results for multiple masks, distinguished by the mask_id field. How was this field generated? Does each numerical value correspond to a real-world category, similar to how detection models assign category IDs?