Find Details in Long Videos: Tower-of-Thoughts and Self-Retrieval Augmented Generation for Video Understanding | IEEE Conference Publication | IEEE Xplore