Abstract:
In this paper, we introduce a mask-based grasping method that discerns multiple objects within the scene regard-less of transparency or specularity and finds the optimal ...Show MoreMetadata
Abstract:
In this paper, we introduce a mask-based grasping method that discerns multiple objects within the scene regard-less of transparency or specularity and finds the optimal grasp position avoiding clutter. Conventional vision-based robotic grasping approaches often fail to extend to the scenes containing transparent objects due to their different visual appearance. To handle the different visual characteristics, we first segment both transparent and opaque objects into instance masks, which serve as the domain-agnostic intermediate representation of both object types, using a neural network. While there exists no labelled training dataset that strongly represents both object types, we overcome the limitation by augmenting transparent objects on an existing large-scale dataset. Then, given the object instance masks, our method selects the top K discrete masks and robustly estimates grasp poses avoiding clutter. Through experiments, we verify that the instance masks are light-weight yet provide sufficient information for vision-based grasping agnostic of various appearances. On an unseen real-world test environment with complex objects, our method substantially outperforms previous methods without fine-tuning.
Date of Conference: 23-27 October 2022
Date Added to IEEE Xplore: 26 December 2022
ISBN Information:
ISSN Information:
Funding Agency:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Multiple Objects ,
- Real-world Objects ,
- Training Dataset ,
- Large-scale Datasets ,
- Real-world Environments ,
- Complex Objects ,
- Intermediate Representation ,
- Object Instances ,
- Convolutional Neural Network ,
- Data Augmentation ,
- Image Object ,
- RGB Images ,
- Number Of Objects ,
- Depth Map ,
- Pose Estimation ,
- Domain Adaptation ,
- Depth Measurements ,
- Instance Segmentation ,
- Autonomous Agents ,
- Image Coordinates ,
- Unseen Objects ,
- MS COCO Dataset ,
- RGB-D Images ,
- Input RGB Image ,
- Mask R-CNN ,
- Appearance Variations ,
- Human Pose Estimation ,
- Large-scale Image Datasets ,
- Depth Images ,
- 3D Pose
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Multiple Objects ,
- Real-world Objects ,
- Training Dataset ,
- Large-scale Datasets ,
- Real-world Environments ,
- Complex Objects ,
- Intermediate Representation ,
- Object Instances ,
- Convolutional Neural Network ,
- Data Augmentation ,
- Image Object ,
- RGB Images ,
- Number Of Objects ,
- Depth Map ,
- Pose Estimation ,
- Domain Adaptation ,
- Depth Measurements ,
- Instance Segmentation ,
- Autonomous Agents ,
- Image Coordinates ,
- Unseen Objects ,
- MS COCO Dataset ,
- RGB-D Images ,
- Input RGB Image ,
- Mask R-CNN ,
- Appearance Variations ,
- Human Pose Estimation ,
- Large-scale Image Datasets ,
- Depth Images ,
- 3D Pose