Combination of actor/critic algorithm with the goal-directed reasoning | IEEE Conference Publication | IEEE Xplore