Vision Transformers with Cross-Attention Pyramids for Class-Agnostic Counting | IEEE Conference Publication | IEEE Xplore