I. Introduction
Multi-person pose estimation [1], [2], [3], [65], [66] aims to simultaneously locate persons and their corresponding body keypoints in images. It is a fundamental task in computer vision, and has been widely applied in action recognition [54], human computer interaction [55], virtual fitting [58], etc.