I. Introduction
In recent years, deep learning has demonstrated excellent inference capabilities in many applications such as smart livestock farming [1], [2], [3]. The existing deep learning models usually have millions of parameters and rely on powerful computational capability. Edge computing can place the inference close to end devices to meet the high computation requirements of deep learning and reduce data transmission delay. By combining deep learning and edge computing, it is possible to achieve an accurate and real-time inference on specific tasks in an IoT framework, e.g., monitoring the health of dairy cows.