Abstract:
Many multiagent Q-learning algorithms have been proposed to date, and most of them aim to converge to a Nash equilibrium, which is not desirable in games like the Prisoner's Dilemma (PD). In a previous paper, the author proposed utility-based Q-learning for PD, which used utilities as rewards in order to maintain mutual cooperation once it had occurred. However, since an agent's action depends on the relative magnitudes of its Q-values, mutual cooperation can also be maintained by adjusting the learning rate of Q-learning. Thus, in this paper, we deal with the learning rate directly and introduce a new Q-learning method called learning-rate adjusting Q-learning, or LRA-Q.
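The abstract does not spell out LRA-Q's actual update rule, but the idea it describes, preserving mutual cooperation by controlling how strongly new payoffs overwrite the Q-values that order the agent's actions, can be illustrated with a minimal stateless Q-learning sketch on the iterated PD. Everything below is a hypothetical reading of that idea, not the paper's algorithm: the halving adjustment rule, the payoff matrix, the parameter values, and the LearningRateAdjustingAgent class are all assumptions introduced for illustration.

# Illustrative sketch only: the abstract gives no concrete update rule, so
# the adjustment policy here (shrink the learning rate once mutual
# cooperation occurs) is a hypothetical reading of LRA-Q, not the author's
# method. Payoffs and parameters are likewise assumptions.
import random

# Standard PD payoffs for the row player: (my_action, opp_action) -> reward.
PAYOFF = {("C", "C"): 3, ("C", "D"): 0, ("D", "C"): 5, ("D", "D"): 1}
ACTIONS = ["C", "D"]

class LearningRateAdjustingAgent:
    """Stateless Q-learner for PD whose learning rate is adjusted online."""

    def __init__(self, alpha=0.5, min_alpha=0.01, epsilon=0.1):
        self.q = {a: 0.0 for a in ACTIONS}  # one Q-value per action
        self.alpha = alpha                  # adjustable learning rate
        self.min_alpha = min_alpha
        self.epsilon = epsilon              # exploration rate

    def act(self):
        if random.random() < self.epsilon:
            return random.choice(ACTIONS)
        return max(ACTIONS, key=self.q.get)  # greedy on current Q-values

    def update(self, action, opp_action, reward):
        # Plain stateless Q-learning update (no next-state bootstrap).
        self.q[action] += self.alpha * (reward - self.q[action])
        # Hypothetical adjustment rule: after mutual cooperation, halve
        # alpha so that an occasional defection payoff (5) can no longer
        # overturn the ordering Q(C) > Q(D) that sustains cooperation.
        if action == "C" and opp_action == "C":
            self.alpha = max(self.min_alpha, self.alpha * 0.5)

if __name__ == "__main__":
    a, b = LearningRateAdjustingAgent(), LearningRateAdjustingAgent()
    for _ in range(10_000):
        xa, xb = a.act(), b.act()
        a.update(xa, xb, PAYOFF[(xa, xb)])
        b.update(xb, xa, PAYOFF[(xb, xa)])
    print("Agent A Q-values:", a.q, "alpha:", round(a.alpha, 4))
    print("Agent B Q-values:", b.q, "alpha:", round(b.alpha, 4))

The point of the sketch is the mechanism the abstract names: because action choice depends only on the relation between Q-values, driving the learning rate toward zero once cooperation is established freezes that relation in place.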
Published in: 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology
Date of Conference: 09-12 December 2008
Date Added to IEEE Xplore: 06 January 2009
Print ISBN: 978-0-7695-3496-1