A software engineering methodology to optimize caching in multi-processor DSP architectures: TMS320C80 results towards the real-time execution of low level image processing | IEEE Conference Publication