DTHV-PPO: A UAV Control Method with Dynamic Task Goal Adaptation and High- Value Experience-Guided Replay Based on PPO | IEEE Conference Publication | IEEE Xplore