High locality and increased intra-node parallelism for solving finite element models on GPUs by novel element-by-element implementation | IEEE Conference Publication | IEEE Xplore