Benchmarking AI: Toward Inclusive Evaluation of Language Models | IEEE Journals & Magazine | IEEE Xplore