Most Software Transactional Memory (STM) research has focused on multi-core processors and small SMP machines, and a limited amount has targeted clusters, leaving large SMP machines unexplored. Large SMP machines usually use Non-Uniform Memory Access (NUMA) to relieve the bottleneck between the CPUs and memory. In this paper, we evaluate several STM implementations on a large SMP machine with cache-coherent NUMA (ccNUMA). We find that remote memory access latency is the key factor influencing STM performance. We also analyze the different design choices of RSTM. Finally, we identify a specific design that achieves high performance in this domain.