Facial expression recognition system using visible, IR, and MSX Images shows that early fusion deep learning models shows better performance than late fusion ones and the...
Abstract:
Facial expression recognition (FER) is one of the best non-intrusive methods for understanding and tracking mood and mental states. In this study, we propose early and la...Show MoreMetadata
Abstract:
Facial expression recognition (FER) is one of the best non-intrusive methods for understanding and tracking mood and mental states. In this study, we propose early and late fusion methods to recognize five facial expressions (angry, happy, neutral, sad, and surprised) using different combinations from a publicly available database (VIRI) with visible, infrared, and multispectral dynamic imaging (MSX) images and the (NVIE) database. A distinctive feature is the use of concatenation and combining techniques to combine ResNet-18 with transfer learning (TL) to create a model that is significantly more accurate than individual models. In the early fusion, we concatenated features from the modalities and classified facial expressions (FEs). In the late fusion, we combined the outputs of the modalities using weighted sums. For this purpose, we used different weighting factors depending on the accuracy of the individual models. The experimental results demonstrated that the proposed model outperformed the previous works by providing an accuracy of 83.33% when we trained the model (1-step training). Through further fine-tuning (3-step training), we obtained an improved performance of 84.44%. We conducted additional experiments by combining them with another modality (MSX) available in the database. By performing experiments with an additional modality (MSX), we obtained improved performance, which confirms that the additional modality combined with existing modalities can help improve the performance of fusion models for facial expression recognition. We also experimented by changing the backbones (Vgg-16, ShuffleNetv2, MobileNetv2, and GhostNet) in addition to ResNet-18 for visible and MSX data. ResNet-18 outperformed the other backbones in facial expression recognition for visible and MSX data.
Facial expression recognition system using visible, IR, and MSX Images shows that early fusion deep learning models shows better performance than late fusion ones and the...
Published in: IEEE Access ( Volume: 12)
Funding Agency:

Research Institute of Human Ecology, Yeungnam University, Gyeongsan-si, Republic of Korea
Muhammad Tahir Naseem (Member, IEEE) was born in Sargodha, Pakistan, in 1983. He received the B.S. degree (Hons.) in computer science from the University of the Punjab, Lahore, Pakistan, in 2005, the M.S. degree in electronic engineering from International Islamic University, Islamabad, Pakistan, in 2008, and the Ph.D. degree in electronic engineering from Isra University, Hyderabad, Pakistan, in 2015. From 2017 to 2021, ...Show More
Muhammad Tahir Naseem (Member, IEEE) was born in Sargodha, Pakistan, in 1983. He received the B.S. degree (Hons.) in computer science from the University of the Punjab, Lahore, Pakistan, in 2005, the M.S. degree in electronic engineering from International Islamic University, Islamabad, Pakistan, in 2008, and the Ph.D. degree in electronic engineering from Isra University, Hyderabad, Pakistan, in 2015. From 2017 to 2021, ...View more

Department of Electronic Engineering, Yeungnam University, Gyeongsan-si, Republic of Korea
Chan-Su Lee (Member, IEEE) received the B.S. degree in electronics engineering from Yonsei University, in 1995, the M.S. degree in electrical engineering from the Korea Advanced Institute of Science and Technology, in 1997, and the Ph.D. degree in computer science from Rutgers, The State University of New Jersey, in May 2007. From 1997 to 2001, he was a Member Research Engineer with the Electronics and Telecommunications ...Show More
Chan-Su Lee (Member, IEEE) received the B.S. degree in electronics engineering from Yonsei University, in 1995, the M.S. degree in electrical engineering from the Korea Advanced Institute of Science and Technology, in 1997, and the Ph.D. degree in computer science from Rutgers, The State University of New Jersey, in May 2007. From 1997 to 2001, he was a Member Research Engineer with the Electronics and Telecommunications ...View more

Department of Electronic Engineering, Yeungnam University, Gyeongsan-si, Republic of Korea
Na-Hyun Kim (Student Member, IEEE) received the B.S. degree in electronics engineering from Yeungnam University, in 2022, where she is currently pursuing the master’s degree with the Department of Electronics Engineering. Her research interests include human action recognition, facial expression recognition, and sequence modeling of dynamic human motions.
Na-Hyun Kim (Student Member, IEEE) received the B.S. degree in electronics engineering from Yeungnam University, in 2022, where she is currently pursuing the master’s degree with the Department of Electronics Engineering. Her research interests include human action recognition, facial expression recognition, and sequence modeling of dynamic human motions.View more

Research Institute of Human Ecology, Yeungnam University, Gyeongsan-si, Republic of Korea
Muhammad Tahir Naseem (Member, IEEE) was born in Sargodha, Pakistan, in 1983. He received the B.S. degree (Hons.) in computer science from the University of the Punjab, Lahore, Pakistan, in 2005, the M.S. degree in electronic engineering from International Islamic University, Islamabad, Pakistan, in 2008, and the Ph.D. degree in electronic engineering from Isra University, Hyderabad, Pakistan, in 2015. From 2017 to 2021, he was a Faculty Member of the Riphah School of Computing and Innovation (RSCI), Riphah International University, Lahore. Since 2022, he has been a Research Professor with the Research Institute of Human Ecology, Yeungnam University, South Korea. His research interests include computer vision, facial expressions, infrared thermography, sensor fusion, gaits, and machine and deep learning.
Muhammad Tahir Naseem (Member, IEEE) was born in Sargodha, Pakistan, in 1983. He received the B.S. degree (Hons.) in computer science from the University of the Punjab, Lahore, Pakistan, in 2005, the M.S. degree in electronic engineering from International Islamic University, Islamabad, Pakistan, in 2008, and the Ph.D. degree in electronic engineering from Isra University, Hyderabad, Pakistan, in 2015. From 2017 to 2021, he was a Faculty Member of the Riphah School of Computing and Innovation (RSCI), Riphah International University, Lahore. Since 2022, he has been a Research Professor with the Research Institute of Human Ecology, Yeungnam University, South Korea. His research interests include computer vision, facial expressions, infrared thermography, sensor fusion, gaits, and machine and deep learning.View more

Department of Electronic Engineering, Yeungnam University, Gyeongsan-si, Republic of Korea
Chan-Su Lee (Member, IEEE) received the B.S. degree in electronics engineering from Yonsei University, in 1995, the M.S. degree in electrical engineering from the Korea Advanced Institute of Science and Technology, in 1997, and the Ph.D. degree in computer science from Rutgers, The State University of New Jersey, in May 2007. From 1997 to 2001, he was a Member Research Engineer with the Electronics and Telecommunications Research Institute (ETRI). He is currently a Professor with the Department of Electronic Engineering, Yeungnam University, South Korea. His research interests include computer vision, pattern recognition, machine learning, biometrics, gesture and facial expression analysis, smart lighting control, and human visual perception. He is a member of the IEEE Computer Society.
Chan-Su Lee (Member, IEEE) received the B.S. degree in electronics engineering from Yonsei University, in 1995, the M.S. degree in electrical engineering from the Korea Advanced Institute of Science and Technology, in 1997, and the Ph.D. degree in computer science from Rutgers, The State University of New Jersey, in May 2007. From 1997 to 2001, he was a Member Research Engineer with the Electronics and Telecommunications Research Institute (ETRI). He is currently a Professor with the Department of Electronic Engineering, Yeungnam University, South Korea. His research interests include computer vision, pattern recognition, machine learning, biometrics, gesture and facial expression analysis, smart lighting control, and human visual perception. He is a member of the IEEE Computer Society.View more

Department of Electronic Engineering, Yeungnam University, Gyeongsan-si, Republic of Korea
Na-Hyun Kim (Student Member, IEEE) received the B.S. degree in electronics engineering from Yeungnam University, in 2022, where she is currently pursuing the master’s degree with the Department of Electronics Engineering. Her research interests include human action recognition, facial expression recognition, and sequence modeling of dynamic human motions.
Na-Hyun Kim (Student Member, IEEE) received the B.S. degree in electronics engineering from Yeungnam University, in 2022, where she is currently pursuing the master’s degree with the Department of Electronics Engineering. Her research interests include human action recognition, facial expression recognition, and sequence modeling of dynamic human motions.View more