By Topic

Maximize user rewards in distributed generation environments using reinforcement learning

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

4 Author(s)
Bei Li ; Telecommunication Engineering Program in the School of Electrical and Computer Engineering, The University of Oklahoma, Tulsa, 74135, USA ; Siddharth Gangadhar ; Samuel Cheng ; Pramode K. Verma

In Smart Grid environments, with distributed generation, homes are encouraged to generate power and sell it back to utilities. Time of Use pricing techniques and the introduction of storage devices would greatly influence a user in deciding when to sell back power and how much to sell. Therefore, a study of sequential decision making algorithms that can optimize the total pay off for the user is necessary. In this paper, Reinforcement Learning is used to solve this optimization problem. The problem of determining when to sell back power is formulated as a Markov decision process and a near optimal strategy is chosen using policy iteration. The results show a significant increase of total rewards from selling back power with the proposed approach.

Published in:

Energytech, 2011 IEEE

Date of Conference:

25-26 May 2011