As multi-core CPUs are widely used in SMP clusters, parallel programming should pay more attention on shared memory parallelization inside single node. Hybrid MPI+OpenMP programming is naturally a good model that combines the distributed memory parallelization between nodes in clusters and the shared memory parallelization on each node. In this paper, we propose a parallel MLMFA (multilevel fast multipole algorithm) approach based on hybrid MPI+OpenMP model. Performance of hybrid implementation is studied compared with our previous pure MPI version. Time cost for computation and communication and memory consumption are analyzed in detail. As most modern HPC systems are clusters of SMP, the implementation is relevant.