Local scheduling techniques for memory coherence in a clustered VLIW processor with a distributed data cache | IEEE Conference Publication | IEEE Xplore