Caching Policy Optimization for D2D Communications by Learning User Preference | IEEE Conference Publication | IEEE Xplore