By Topic

Cooperation in wireless networks: a game-theoretic framework with reinforcement learning

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $31
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

1 Author(s)
Baidas, M.W. ; Electr. Eng. Dept., Kuwait Univ., Safat, Kuwait

A game-theoretic framework based on the iterated prisoner's dilemma (IPD) is proposed to model the repeated dynamic interactions of multiple source nodes when communicating with multiple destinations in an ad hoc wireless network. In such networks where nodes are autonomous, selfish and not familiar with other nodes' strategies, fully cooperative behaviours cannot be assumed. Therefore reinforcement learning is studied to relate the utility function of each source node to actions previously taken in order to learn a strategy that maximises their expected future reward. Particularly, a Q-learning algorithm is proposed to allow network nodes to adapt to and play the IPD game against opponents with a variety of known and unknown strategies. Simulation results illustrate that the proposed Q-learning algorithm allows network nodes to play optimally and achieve their maximum expected return values.

Published in:

Communications, IET  (Volume:8 ,  Issue: 5 )