Skip to Main Content
Heterogeneous multi-core architectures have become an integral component of high performance systems and high performance scientific computing (HPC). The use of these systems has been vital for research applications but until recently have not been a factor in the consumer level experience. However, with new technologies such as AMD's Accelerated Processing Unit (APU) which combines the Central Processing Unit and Graphics Processing Unit onto a single die, consumers now have an affordable high performance system at their disposal. AMD's APUs are aimed at providing good performance and low power consumption for all markets. Financial applications can benefit from this heterogeneous architecture for real time processing. However, to obtain good performance, algorithms must be coded to efficiently utilize the APU architecture. In this paper, we have optimized two option pricing algorithms on the APU making use of vectorization and loop unrolling for improved performance. Our algorithms are tested on both an ATI Mobility Radeon 5870 and an AMD E-350 APU which use the VLIW5 architecture. We also study the power consumption of these architectures to determine how they compare to traditional CPU- and GPU- based systems.