AiDE: Attention-FFN Disaggregated Execution for Cost-Effective LLM Decoding on CXL-PNM | IEEE Journals & Magazine | IEEE Xplore