1. Introduction
Recently, extensive research efforts have been devoted to clothes classification [11], [1], [29], attribute prediction [3], [13], [4], [24], and clothing item retrieval [17], [6], [10], [27], [15], because of their potential values to the industry. However, clothes recognition algorithms are often confronted with three fundamental challenges when adopted in realworld applications [12]. First, clothes often have large variations in style, texture, and cutting, which confuse existing systems. Second, clothing items are frequently subject to deformation and occlusion. Third, clothing images often exhibit serious variations when they are taken under different scenarios, such as selfies vs. online shopping photos.