How scenes imply actions in realistic videos? | IEEE Conference Publication | IEEE Xplore