Dynamic routing and wavelength assignment using first policy iteration | IEEE Conference Publication | IEEE Xplore