Automatically exploiting implicit Pipeline Parallelism from multiple dependent kernels for GPUs | IEEE Conference Publication | IEEE Xplore