<?xml version="1.0" ?>
<rss version="2.0">
	<channel>
		<title><![CDATA[ Systems, Man, and Cybernetics, Part B, IEEE Transactions on - new TOC ]]></title>
		<link>http://ieeexplore.ieee.org</link>
		<description>TOC Alert for Publication# 3477 </description>
		<year>2008</year>
		<month>July     </month>
		<day>24</day>
		<item>
			<title><![CDATA[Ensemble Algorithms in Reinforcement Learning]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4509588]]></link>
			<description><![CDATA[<para> This paper describes several ensemble methods that combine multiple different reinforcement learning (RL) algorithms in a single agent. The aim is to enhance learning speed and final performance by combining the chosen actions or action probabilities of different RL algorithms. We designed and implemented four different ensemble methods combining the following five different RL algorithms: <formula formulatype="inline"><tex>$Q$</tex></formula> -learning, Sarsa, actor&#x2013;critic (AC), <formula formulatype="inline"><tex>$QV$</tex></formula>-learning, and AC learning automaton. The intuitively designed ensemble methods, namely, majority voting (MV), rank voting, Boltzmann multiplication (BM), and Boltzmann addition, combine the policies derived from the value functions of the different RL algorithms, in contrast to previous work where ensemble methods have been used in RL for representing and learning a single value function. We show experiments on five maze problems of varying complexity; the first problem is simple, but the other four maze tasks are of a dynamic or partially observable nature. The results indicate that the BM and MV ensembles significantly outperform the single RL algorithms. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4509588]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>930</startPage>
			<endPage>936</endPage>
			<fileSize>252</fileSize>
			<authors><![CDATA[Wiering, M. A.;van Hasselt, H.;]]></authors>
		</item>
		<item>
			<title><![CDATA[An Evolutionary Approach Toward Dynamic Self-Generated Fuzzy Inference Systems]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4509589]]></link>
			<description><![CDATA[<para> An evolutionary approach toward automatic generation of fuzzy inference systems (FISs), termed evolutionary dynamic self-generated fuzzy inference systems (EDSGFISs), is proposed in this paper. The structure and parameters of an FIS are generated through reinforcement learning, whereas an action set for training the consequents of the FIS is evolved via genetic algorithms (GAs). The proposed EDSGFIS algorithm can automatically create, delete, and adjust fuzzy rules according to the performance of the entire system, as well as evaluation of individual fuzzy rules. Simulation studies on a wall-following task by a mobile robot show that the proposed EDSGFIS approach is superior to other related methods. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4509589]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>963</startPage>
			<endPage>969</endPage>
			<fileSize>473</fileSize>
			<authors><![CDATA[Zhou, Y.;Er, M. J.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Improving Iris Recognition Performance Using Segmentation, Quality Enhancement, Match Score Fusion, and Indexing]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4510759]]></link>
			<description><![CDATA[<para> This paper proposes algorithms for iris segmentation, quality enhancement, match score fusion, and indexing to improve both the accuracy and the speed of iris recognition. A curve evolution approach is proposed to effectively segment a nonideal iris image using the modified Mumford&#x2013;Shah functional. Different enhancement algorithms are concurrently applied on the segmented iris image to produce multiple enhanced versions of the iris image. A support-vector-machine-based learning algorithm selects locally enhanced regions from each globally enhanced image and combines these good-quality regions to create a single high-quality iris image. Two distinct features are extracted from the high-quality iris image. The global textural feature is extracted using the 1-D log polar Gabor transform, and the local topological feature is extracted using Euler numbers. An intelligent fusion algorithm combines the textural and topological matching scores to further improve the iris recognition performance and reduce the false rejection rate, whereas an indexing algorithm enables fast and accurate iris identification. The verification and identification performance of the proposed algorithms is validated and compared with other algorithms using the CASIA Version 3, ICE 2005, and UBIRIS iris databases. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4510759]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1021</startPage>
			<endPage>1035</endPage>
			<fileSize>1283</fileSize>
			<authors><![CDATA[Vatsa, M.;Singh, R.;Noore, A.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Exponential Stability Analysis for Neural Networks With Time-Varying Delay]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4510760]]></link>
			<description><![CDATA[<para> This correspondence paper focuses on the problem of exponential stability for neural networks with a time-varying delay. The relationship among the time-varying delay, its upper bound, and their difference is taken into account. As a result, an improved linear-matrix-inequality-based delay-dependent exponential stability criterion is obtained without ignoring any terms in the derivative of Lyapunov&#x2013;Krasovskii functional. Two numerical examples are given to demonstrate its effectiveness. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4510760]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1152</startPage>
			<endPage>1156</endPage>
			<fileSize>208</fileSize>
			<authors><![CDATA[Wu, M.;Liu, F.;Shi, P.;He, Y.;Yokoyama, R.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Instruction-Matrix-Based Genetic Programming]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4510842]]></link>
			<description><![CDATA[<para> In genetic programming (GP), evolving tree nodes separately would reduce the huge solution space. However, tree nodes are highly interdependent with respect to their fitness. In this paper, we propose a new GP framework, namely, instruction-matrix (IM)-based GP (IMGP), to handle their interactions. IMGP maintains an IM to evolve tree nodes and subtrees separately. IMGP extracts program trees from an IM and updates the IM with the information of the extracted program trees. As the IM actually keeps most of the information of the schemata of GP and evolves the schemata directly, IMGP is effective and efficient. Our experimental results on benchmark problems have verified that IMGP is not only better than those of canonical GP in terms of the qualities of the solutions and the number of program evaluations, but they are also better than some of the related GP algorithms. IMGP can also be used to evolve programs for classification problems. The classifiers obtained have higher classification accuracies than four other GP classification algorithms on four benchmark classification problems. The testing errors are also comparable to or better than those obtained with well-known classifiers. Furthermore, an extended version, called condition matrix for rule learning, has been used successfully to handle multiclass classification problems. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4510842]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1036</startPage>
			<endPage>1049</endPage>
			<fileSize>530</fileSize>
			<authors><![CDATA[Li, G.;Wang, J. F.;Lee, K. H.;Leung, K.-S.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Decentralized Learning in Markov Games]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4539483]]></link>
			<description><![CDATA[<para> Learning automata (LA) were recently shown to be valuable tools for designing multiagent reinforcement learning algorithms. One of the principal contributions of the LA theory is that a set of decentralized independent LA is able to control a finite Markov chain with unknown transition probabilities and rewards. In this paper, we propose to extend this algorithm to Markov games&#x2014;a straightforward extension of single-agent Markov decision problems to distributed multiagent decision problems. We show that under the same ergodic assumptions of the original theorem, the extended algorithm will converge to a pure equilibrium point between agent policies. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4539483]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>976</startPage>
			<endPage>981</endPage>
			<fileSize>267</fileSize>
			<authors><![CDATA[Vrancx, P.;Verbeeck, K.;Nowe, A.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Comparison of Adaptive Critic-Based and Classical Wide-Area Controllers for Power Systems]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4539644]]></link>
			<description><![CDATA[<para> An adaptive critic design (ACD)-based damping controller is developed for a thyristor-controlled series capacitor (TCSC) installed in a power system with multiple poorly damped interarea modes. The performance of this ACD computational intelligence-based method is compared with two classical techniques, which are observer-based state-feedback (SF) control and linear matrix inequality <formula formulatype="inline"><tex>$hbox{LMI-H}^{infty}$ </tex></formula> robust control. Remote measurements are used as feedback signals to the wide-area damping controller for modulating the compensation of the TCSC. The classical methods use a linearized model of the system whereas the ACD method is purely measurement-based, leading to a nonlinear controller with fixed parameters. A comparative analysis of the controllers' performances is carried out under different disturbance scenarios. The ACD-based design has shown promising performance with very little knowledge of the system compared to classical model-based controllers. This paper also discusses the advantages and disadvantages of ACDs, SF, and <formula formulatype="inline"><tex>$hbox{LMI-H}^{infty}$</tex></formula>. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4539644]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1002</startPage>
			<endPage>1007</endPage>
			<fileSize>689</fileSize>
			<authors><![CDATA[Ray, S.;Venayagamoorthy, G. K.;Chaudhuri, B.;Majumder, R.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Dynamic Output Feedback Control Synthesis for Continuous-Time T&#x2013;S Fuzzy Systems via a Switched Fuzzy Control Scheme]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554030]]></link>
			<description><![CDATA[<para> This correspondence paper is concerned with the problem of designing switched dynamic output feedback <formula formulatype="inline"><tex>$H_{infty}$</tex></formula> controllers for continuous-time Takagi&#x2013;Sugeno (T&#x2013;S) fuzzy systems. A new type of dynamic output feedback controllers, namely, switched dynamic parallel distributed compensation (SDPDC) controllers, is proposed, which are switched by basing on the values of membership functions. A new method for designing SDPDC controllers for guaranteeing stabilities and <formula formulatype="inline"><tex>$H_{infty}$</tex></formula> performances of closed-loop nonlinear systems is presented, where the design conditions are given in terms of the solvability of a set of linear matrix inequalities. It is shown that the new method provides better or at least the same results of the existing design methods via a pure DPDC scheme. A numerical example is given to illustrate the effectiveness of the proposed method. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554030]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1166</startPage>
			<endPage>1175</endPage>
			<fileSize>692</fileSize>
			<authors><![CDATA[Dong, J.;Yang, G.-H.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Incoherent Control of Quantum Systems With Wavefunction-Controllable Subspaces via Quantum Reinforcement Learning]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554031]]></link>
			<description><![CDATA[<para> In this paper, an incoherent control scheme for accomplishing the state control of a class of quantum systems which have wavefunction-controllable subspaces is proposed. This scheme includes the following two steps: projective measurement on the initial state and learning control in the wavefunction-controllable subspace. The first step probabilistically projects the initial state into the wavefunction-controllable subspace. The probability of success is sensitive to the initial state; however, it can be greatly improved through multiple experiments on several identical initial states even in the case with a small probability of success for an individual measurement. The second step finds a local optimal control sequence via quantum reinforcement learning and drives the controlled system to the objective state through a set of suitable controls. In this strategy, the initial states can be unknown identical states, the quantum measurement is used as an effective control, and the controlled system is not necessarily unitarily controllable. This incoherent control scheme provides an alternative quantum engineering strategy for locally controllable quantum systems. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554031]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>957</startPage>
			<endPage>962</endPage>
			<fileSize>572</fileSize>
			<authors><![CDATA[Dong, D.Y.;Chen, C.;Tarn, T.-J.;Pechen, A.;Rabitz, H.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Higher Level Application of ADP: A Next Phase for the Control Field?]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554032]]></link>
			<description><![CDATA[<para> Two distinguishing features of humanlike control vis-&#x00C0;-vis current technological control are the ability to make use of experience while selecting a control policy for distinct situations and the ability to do so faster and faster as more experience is gained (in contrast to current technological implementations that slow down as more knowledge is stored). The notions of context and context discernment are important to understanding this human ability. Whereas methods known as adaptive control and learning control focus on modifying the design of a controller as changes in context occur, experience-based (EB) control entails <emphasis emphasistype="boldital"> selecting</emphasis> a previously designed controller that is appropriate to the current situation. Developing the EB approach entails a shift of the technologist's focus &#x201C;up a level&#x201D; away from designing individual (optimal) controllers to that of developing online algorithms that efficiently and effectively select designs from a repository of existing controller solutions. A key component of the notions presented here is that of higher level learning algorithm. This is a new application of reinforcement learning and, in particular, approximate dynamic programming, with its focus shifted to the posited higher level, and is employed, with very promising results. The author's hope for this paper is to inspire and guide future work in this promising area. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554032]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>901</startPage>
			<endPage>912</endPage>
			<fileSize>604</fileSize>
			<authors><![CDATA[Lendaris, G. G.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Adaptive Critic Learning Techniques for Engine Torque and Air&#x2013;Fuel Ratio Control]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554033]]></link>
			<description><![CDATA[<para> A new approach for engine calibration and control is proposed. In this paper, we present our research results on the implementation of adaptive critic designs for self-learning control of automotive engines. A class of adaptive critic designs that can be classified as (model-free) action-dependent heuristic dynamic programming is used in this research project. The goals of the present learning control design for automotive engines include improved performance, reduced emissions, and maintained optimum performance under various operating conditions. Using the data from a test vehicle with a V8 engine, we developed a neural network model of the engine and neural network controllers based on the idea of approximate dynamic programming to achieve optimal control. We have developed and simulated self-learning neural network controllers for both engine torque (TRQ) and exhaust air&#x2013;fuel ratio (AFR) control. The goal of TRQ control and AFR control is to track the commanded values. For both control problems, excellent neural network controller transient performance has been achieved. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554033]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>988</startPage>
			<endPage>993</endPage>
			<fileSize>212</fileSize>
			<authors><![CDATA[Liu, D.;Javaherian, H.;Kovalenko, O.;Huang, T.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Direct Heuristic Dynamic Programming for Damping Oscillations in a Large Power System]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554034]]></link>
			<description><![CDATA[<para> This paper applies a neural-network-based approximate dynamic programming method, namely, the direct heuristic dynamic programming (direct HDP), to a large power system stability control problem. The direct HDP is a learning- and approximation-based approach to addressing nonlinear coordinated control under uncertainty. One of the major design parameters, the controller learning objective function, is formulated to directly account for network-wide low-frequency oscillation with the presence of nonlinearity, uncertainty, and coupling effect among system components. Results include a novel learning control structure based on the direct HDP with applications to two power system problems. The first case involves static var compensator supplementary damping control, which is used to provide a comprehensive evaluation of the learning control performance. The second case aims at addressing a difficult complex system challenge by providing a new solution to a large interconnected power network oscillation damping control problem that frequently occurs in the China Southern Power Grid. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554034]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1008</startPage>
			<endPage>1013</endPage>
			<fileSize>271</fileSize>
			<authors><![CDATA[Lu, C.;Si, J.;Xie, X.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Binary Two-Dimensional PCA]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554035]]></link>
			<description><![CDATA[<para> Fast training and testing procedures are crucial in biometrics recognition research. Conventional algorithms, e.g., <emphasis emphasistype="boldital">principal component analysis</emphasis> (PCA), fail to efficiently work on large-scale and high-resolution image data sets. By incorporating merits from both <emphasis emphasistype="boldital"> two-dimensional PCA</emphasis> (2DPCA)-based image decomposition and fast numerical calculations based on Haarlike bases, this technical correspondence first proposes <emphasis emphasistype="boldital">binary 2DPCA</emphasis> (B-2DPCA). Empirical studies demonstrated the advantages of B-2DPCA compared with 2DPCA and binary PCA. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554035]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1176</startPage>
			<endPage>1180</endPage>
			<fileSize>363</fileSize>
			<authors><![CDATA[Pang, Y.;Tao, D.;Yuan, Y.;Li, X.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Discrete-Time Nonlinear HJB Solution Using Approximate Dynamic Programming: Convergence Proof]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554208]]></link>
			<description><![CDATA[<para> Convergence of the value-iteration-based heuristic dynamic programming (HDP) algorithm is proven in the case of general nonlinear systems. That is, it is shown that HDP converges to the optimal control and the optimal value function that solves the Hamilton&#x2013;Jacobi&#x2013;Bellman equation appearing in infinite-horizon discrete-time (DT) nonlinear optimal control. It is assumed that, at each iteration, the value and action update equations can be exactly solved. The following two standard neural networks (NN) are used: a critic NN is used to approximate the value function, whereas an action network is used to approximate the optimal control policy. It is stressed that this approach allows the implementation of HDP without knowing the internal dynamics of the system. The exact solution assumption holds for some classes of nonlinear systems and, specifically, in the specific case of the DT linear quadratic regulator (LQR), where the action is linear and the value quadratic in the states and NNs have zero approximation error. It is stressed that, for the LQR, HDP may be implemented without knowing the system <formula formulatype="inline"><tex>$A$</tex></formula> matrix by using two NNs. This fact is not generally appreciated in the folklore of HDP for the DT LQR, where only one critic NN is generally used. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554208]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>943</startPage>
			<endPage>949</endPage>
			<fileSize>233</fileSize>
			<authors><![CDATA[Al-Tamimi, A.;Lewis, F. L.;Abu-Khalaf, M.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Issues on Stability of ADP Feedback Controllers for Dynamical Systems]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554210]]></link>
			<description><![CDATA[<para> This paper traces the development of neural-network (NN)-based feedback controllers that are derived from the principle of adaptive/approximate dynamic programming (ADP) and discusses their closed-loop stability. Different versions of NN structures in the literature, which embed mathematical mappings related to solutions of the ADP-formulated problems called &#x201C;adaptive critics&#x201D; or &#x201C;action-critic&#x201D; networks, are discussed. Distinction between the two classes of ADP applications is pointed out. Furthermore, papers in &#x201C;model-free&#x201D; development and model-based neurocontrollers are reviewed in terms of their contributions to stability issues. Recent literature suggests that work in ADP-based feedback controllers with assured stability is growing in diverse forms. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554210]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>913</startPage>
			<endPage>917</endPage>
			<fileSize>145</fileSize>
			<authors><![CDATA[Balakrishnan, S. N.;Ding, J.;Lewis, F. L.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Fast Diagnosis With Sensors of Uncertain Quality]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554211]]></link>
			<description><![CDATA[<para> This correspondence presents an approach to the detection and isolation of component failures in large-scale systems. In the case of sensors that report at rates of 1 Hz or less, the algorithm can be considered real time. The input is a set of observed test results from multiple sensors, and the algorithm's main task is to deal with sensor errors. The sensors are assumed to be of threshold test (pass/fail) type, but to be vulnerable to noise, in that occasionally true failures are missed, and likewise, there can be false alarms. These errors are further assumed to be independent conditioned on the system's diagnostic state. Their probabilities, of missed detection and of false alarm, are not known <emphasis emphasistype="boldital">a priori</emphasis> and must be estimated (ideally along with the accuracies of these estimates) online, within the inference engine. Further, recognizing a practical concern in most real systems, a sparsely instantiated observation vector must not be a problem. The key ingredients to our solution include the multiple-hypothesis tracking philosophy to complexity management, a Beta prior distribution on the sensor errors, and a quickest detection overlay to detect changes in these error rates when the prior is violated. We provide results illustrating performance in terms of both computational needs and error rate, and show its application both as a filter (i.e., used to &#x201C;clean&#x201D; sensor reports) and as a standalone state estimator. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554211]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1157</startPage>
			<endPage>1165</endPage>
			<fileSize>817</fileSize>
			<authors><![CDATA[Erdinc, O.;Brideau, C.;Willett, P.;Kirubarajan, T.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Hamilton&#x2013;Jacobi&#x2013;Bellman Equations and Approximate Dynamic Programming on Time Scales]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554212]]></link>
			<description><![CDATA[<para> The time scales calculus is a key emerging area of mathematics due to its potential use in a wide variety of multidisciplinary applications. We extend this calculus to approximate dynamic programming (ADP). The core backward induction algorithm of dynamic programming is extended from its traditional discrete case to all isolated time scales. Hamilton&#x2013;Jacobi&#x2013;Bellman equations, the solution of which is the fundamental problem in the field of dynamic programming, are motivated and proven on time scales. By drawing together the calculus of time scales and the applied area of stochastic control via ADP, we have connected two major fields of research. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554212]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>918</startPage>
			<endPage>923</endPage>
			<fileSize>236</fileSize>
			<authors><![CDATA[Seiffertt, J.;Sanyal, S.;Wunsch, D. C.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Improved Adaptive&#x2013;Reinforcement Learning Control for Morphing Unmanned Air Vehicles]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554213]]></link>
			<description><![CDATA[<para> This paper presents an improved Adaptive&#x2013;Reinforcement Learning Control methodology for the problem of unmanned air vehicle morphing control. The reinforcement learning morphing control function that learns the optimal shape change policy is integrated with an adaptive dynamic inversion control trajectory tracking function. An episodic unsupervised learning simulation using the Q-learning method is developed to replace an earlier and less accurate Actor-Critic algorithm. Sequential Function Approximation, a Galerkin-based scattered data approximation scheme, replaces a K-Nearest Neighbors (KNN) method and is used to generalize the learning from previously experienced quantized states and actions to the continuous state-action space, all of which may not have been experienced before. The improved method showed smaller errors and improved learning of the optimal shape compared to the KNN. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4554213]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1014</startPage>
			<endPage>1020</endPage>
			<fileSize>497</fileSize>
			<authors><![CDATA[Valasek, J.;Doebbler, J.;Tandale, M. D.;Meade, A. J.;]]></authors>
		</item>
		<item>
			<title><![CDATA[A Novel Infinite-Time Optimal Tracking Control Scheme for a Class of Discrete-Time Nonlinear Systems via the Greedy HDP Iteration Algorithm]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4555646]]></link>
			<description><![CDATA[<para> In this paper, we aim to solve the infinite-time optimal tracking control problem for a class of discrete-time nonlinear systems using the greedy heuristic dynamic programming (HDP) iteration algorithm. A new type of performance index is defined because the existing performance indexes are very difficult in solving this kind of tracking problem, if not impossible. Via system transformation, the optimal tracking problem is transformed into an optimal regulation problem, and then, the greedy HDP iteration algorithm is introduced to deal with the regulation problem with rigorous convergence analysis. Three neural networks are used to approximate the performance index, compute the optimal control policy, and model the nonlinear system for facilitating the implementation of the greedy HDP iteration algorithm. An example is given to demonstrate the validity of the proposed optimal tracking control scheme. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4555646]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>937</startPage>
			<endPage>942</endPage>
			<fileSize>385</fileSize>
			<authors><![CDATA[Zhang, H.;Wei, Q.;Luo, Y.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Adaptive Lyapunov-Based Control of a Robot and Mass&#x2013;Spring System Undergoing an Impact Collision]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4555743]]></link>
			<description><![CDATA[<para> The control of dynamic systems that undergo an impact collision is both theoretically challenging and of practical importance. An appeal of studying systems that undergo an impact is that short-duration effects such as high stresses, rapid dissipation of energy, and fast acceleration and deceleration may be achieved from low-energy sources. However, colliding systems present a difficult control challenge because the equations of motion are different when the system suddenly transitions from a noncontact state to a contact state. In this paper, an adaptive nonlinear controller is designed to regulate the states of two dynamic systems that collide. The academic example of a planar robot colliding with an unactuated mass&#x2013;spring system is used to represent a broader class of such systems. The control objective is defined as the desire to command a robot to collide with an unactuated system and regulate the mass to a desired compressed state while compensating for the unknown constant system parameters. Lyapunov-based methods are used to develop a continuous adaptive controller that yields asymptotic regulation of the mass and robot links. It is interesting to note that one controller is responsible for achieving the control objective when the robot is in free motion (i.e., decoupled from the mass&#x2013;spring system), when the systems collide, and when the system dynamics are coupled. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4555743]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1050</startPage>
			<endPage>1061</endPage>
			<fileSize>532</fileSize>
			<authors><![CDATA[Dupree, K.;Liang, C.-H.;Hu, G.;Dixon, W. E.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Adaptive Feedback Control by Constrained Approximate Dynamic Programming]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4556643]]></link>
			<description><![CDATA[<para> A constrained approximate dynamic programming (ADP) approach is presented for designing adaptive neural network (NN) controllers with closed-loop stability and performance guarantees. Prior knowledge of the linearized equations of motion is used to guarantee that the closed-loop system meets performance and stability objectives when the plant operates in a linear parameter-varying (LPV) regime. In the presence of unmodeled dynamics or failures, the NN controller adapts to optimize its performance online, whereas constrained ADP guarantees that the LPV baseline performance is preserved at all times. The effectiveness of an adaptive NN flight controller is demonstrated for simulated control failures, parameter variations, and near-stall dynamics. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4556643]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>982</startPage>
			<endPage>987</endPage>
			<fileSize>274</fileSize>
			<authors><![CDATA[Ferrari, S.;Steck, J. E.;Chandramohan, R.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Global Synchronization Control of General Delayed Discrete-Time Networks With Stochastic Coupling and Disturbances]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4558034]]></link>
			<description><![CDATA[<para> In this paper, the synchronization control problem is considered for two coupled discrete-time complex networks with time delays. The network under investigation is quite general to reflect the reality, where the state delays are allowed to be time varying with given lower and upper bounds, and the stochastic disturbances are assumed to be Brownian motions that affect not only the network coupling but also the overall networks. By utilizing the Lyapunov functional method combined with linear matrix inequality (LMI) techniques, we obtain several sufficient delay-dependent conditions that ensure the coupled networks to be globally exponentially synchronized in the mean square. A control law is designed to synchronize the addressed coupled complex networks in terms of certain LMIs that can be readily solved using the Matlab LMI toolbox. Two numerical examples are presented to show the validity of our theoretical analysis results. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4558034]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1073</startPage>
			<endPage>1083</endPage>
			<fileSize>377</fileSize>
			<authors><![CDATA[Liang, J.;Wang, Z.;Liu, Y.;Liu, X.;]]></authors>
		</item>
		<item>
			<title><![CDATA[A Multifaceted Perspective at Data Analysis: A Study in Collaborative Intelligent Agents]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4558035]]></link>
			<description><![CDATA[<para> Multiagent systems are inherently associated with their distributivity, which enforces a great deal of communication mechanisms. To effectively arrive at meaningful solutions in a vast array of problem-solving tasks, it becomes imperative to establish a sound machinery of reconciling findings which might form partial solutions to an overall problem. In this paper, we focus on a broad category of problems of collaborative data analysis realized by a collection of agents having access to their individual data and exchanging findings through their collaboration activities. Such problems of data analysis arise in the context of building a global view at a certain phenomenon (process) by viewing it from different perspectives (and thus engaging various collections of attributes by various agents). Our goal is to develop some interaction between the agents so that they could form an overall perspective, where the knowledge available locally is shared and reconciled. The underlying format of knowledge built by the agents is that of information granules and fuzzy sets in particular. We develop a comprehensive optimization scheme and discuss its two-phase nature in which the communication phase of the granular findings intertwines with the local optimization being realized by the agents at the level of the individual datasite and exploits the evidence collected from other sites. We show how the mechanism of fuzzy granulation realized in the form of a well-known fuzzy c-means (FCM) clustering can be augmented to support collaborative activities required by the agents. For this purpose, we introduce augmented versions of the original objective function used in the FCM and derive algorithmic details. We also discuss an issue of optimizing the strength of collaborative linkages, so that the reconciled findings attain the highest level of consistency (agreement). The presented experimental studies include some synthetic data and selected data sets coming from the Machine L-
earning repository. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4558035]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1062</startPage>
			<endPage>1072</endPage>
			<fileSize>883</fileSize>
			<authors><![CDATA[Pedrycz, W.;Rai, P.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Random Sampling of States in Dynamic Programming]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4559368]]></link>
			<description><![CDATA[<para> We combine three threads of research on approximate dynamic programming: sparse random sampling of states, value function and policy approximation using local models, and using local trajectory optimizers to globally optimize a policy and associated value function. Our focus is on finding steady-state policies for deterministic time-invariant discrete time control problems with continuous states and actions often found in robotics. In this paper, we describe our approach and provide initial results on several simulated robotics problems. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4559368]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>924</startPage>
			<endPage>929</endPage>
			<fileSize>410</fileSize>
			<authors><![CDATA[Atkeson, C. G.;Stephens, B.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Reinforcement Learning in Continuous Time and Space: Interference and Not Ill Conditioning Is the Main Problem When Using Distributed Function Approximators]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567536]]></link>
			<description><![CDATA[<para> Many interesting problems in reinforcement learning (RL) are continuous and/or high dimensional, and in this instance, RL techniques require the use of function approximators for learning value functions and policies. Often, local linear models have been preferred over distributed nonlinear models for function approximation in RL. We suggest that one reason for the difficulties encountered when using distributed architectures in RL is the problem of negative interference, whereby learning of new data disrupts previously learned mappings. The continuous temporal difference (TD) learning algorithm <formula formulatype="inline"><tex>$TD(lambda)$</tex></formula> was used to learn a value function in a limited-torque pendulum swing-up task using a multilayer perceptron (MLP) network. Three different approaches were examined for learning in the MLP networks; 1) simple gradient descent; 2) vario-eta; and 3) a <emphasis emphasistype="boldital">pseudopattern</emphasis> rehearsal strategy that attempts to reduce the effects of interference. Our results show that MLP networks can be used for value function approximation in this task but require long training times. We also found that vario-eta destabilized learning and resulted in a failure of the learning process to converge. Finally, we showed that the <emphasis emphasistype="boldital">pseudopattern </emphasis> rehearsal strategy drastically improved the speed of learning. The results indicate that interference is a greater problem than ill conditioning for this task. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567536]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>950</startPage>
			<endPage>956</endPage>
			<fileSize>645</fileSize>
			<authors><![CDATA[Baddeley, B.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Uncertainty Modeling of Improved Fuzzy Functions With Evolutionary Systems]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567537]]></link>
			<description><![CDATA[<para> This paper introduce a type-2 fuzzy function system for uncertainty modeling using evolutionary algorithms (ET2FF). The type-1 fuzzy inference systems (FISs) with fuzzy functions, which do not entail <emphasis emphasistype="boldital">if</emphasis> <formula formulatype="inline"><tex>$ldots$</tex></formula> <emphasis emphasistype="boldital">then</emphasis> rule bases, have demonstrated better performance compared to traditional FIS. Nonetheless, the performance of these approaches is usually affected by their uncertain parameters. The proposed method implements a three-phase learning strategy to capture the uncertainties in fuzzy function systems induced by learning parameters, as well as fuzzy function structures. The improved fuzzy clustering initially finds hidden structures, and the genetic learning algorithm optimizes interval type-2 fuzzy sets to capture their optimum uncertainty interval. The proposed ET2FF architecture is evaluated using an extensive suite of real-life applications such as manufacturing process and financial market modeling. The results show that the proposed ET2FF method is comparable&#x2014;if not superior&#x2014;to earlier FIS in terms of generalization performance and robustness. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567537]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1098</startPage>
			<endPage>1110</endPage>
			<fileSize>1284</fileSize>
			<authors><![CDATA[Celikyilmaz, A.;Turksen, I. B.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Table of Contents]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567538]]></link>
			<description><![CDATA[ ]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567538]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>C1</startPage>
			<endPage>895</endPage>
			<fileSize>164</fileSize>
			<authors><![CDATA[]]></authors>
		</item>
		<item>
			<title><![CDATA[IEEE Transactions on Systems, Man, and Cybernetics&mdash;Part B: Cybernetics publication information]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567539]]></link>
			<description><![CDATA[ ]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567539]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>C2</startPage>
			<endPage>C2</endPage>
			<fileSize>38</fileSize>
			<authors><![CDATA[]]></authors>
		</item>
		<item>
			<title><![CDATA[IEEE Systems, Man, and Cybernetics Society Information]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567540]]></link>
			<description><![CDATA[ ]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567540]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>C3</startPage>
			<endPage>C3</endPage>
			<fileSize>28</fileSize>
			<authors><![CDATA[]]></authors>
		</item>
		<item>
			<title><![CDATA[IEEE Transactions on Systems, Man, and Cybernetics&mdash;Part B: Cybernetics information for authors]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567541]]></link>
			<description><![CDATA[ ]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567541]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>C4</startPage>
			<endPage>C4</endPage>
			<fileSize>33</fileSize>
			<authors><![CDATA[]]></authors>
		</item>
		<item>
			<title><![CDATA[Foreword<newline/>ADP: The Key Direction for Future Research in Intelligent Control and Understanding Brain Intelligence]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567542]]></link>
			<description><![CDATA[ ]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567542]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>898</startPage>
			<endPage>900</endPage>
			<fileSize>199</fileSize>
			<authors><![CDATA[Werbos, P. J.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Guest Editorial<newline/>Special Issue on Adaptive Dynamic Programming and Reinforcement Learning in Feedback Control]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567543]]></link>
			<description><![CDATA[ ]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567543]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>896</startPage>
			<endPage>897</endPage>
			<fileSize>129</fileSize>
			<authors><![CDATA[Lewis, F. L.;Liu, D.;Lendaris, G. G.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Stability Analysis of Swarms With General Topology]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567544]]></link>
			<description><![CDATA[<para> This paper investigates the stability and boundary of a swarm with a general directed and weighted topology. The stability of a swarm is generally considered as cohesiveness. We construct a symmetric eigenmatrix and define an orthogonal eigenparameter which reflects the degree of orthogonality between the left eigenvector of the coupling matrix corresponding to its zero eigenvalue and the eigenvectors of the eigenmatrix corresponding to its nonzero eigenvalues. We prove that, if the topology of the underlying swarm is strongly connected, the swarm is then stable in the sense that all agents will globally and exponentially converge to a hyperellipsoid in finite time, both in open space and profiles, whether the center of the hyperellipsoid is moving or not. The swarm boundary and convergence rate are characterized by the eigenparameters of the swarm, which reveals the quantitative relationship between the swarming behavior and characteristics of the coupling topology. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567544]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1084</startPage>
			<endPage>1097</endPage>
			<fileSize>1196</fileSize>
			<authors><![CDATA[Li, W.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Nonlinear Dimensionality Reduction of Data Lying on the Multicluster Manifold]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567545]]></link>
			<description><![CDATA[<para> A new method, which is called decomposition&#x2013;composition (D&#x2013;C) method, is proposed for the nonlinear dimensionality reduction (NLDR) of data lying on the multicluster manifold. The main idea is first to decompose a given data set into clusters and independently calculate the low-dimensional embeddings of each cluster by the decomposition procedure. Based on the intercluster connections, the embeddings of all clusters are then composed into their proper positions and orientations by the composition procedure. Different from other NLDR methods for multicluster data, which consider associatively the intracluster and intercluster information, the D&#x2013;C method capitalizes on the separate employment of the intracluster neighborhood structures and the intercluster topologies for effective dimensionality reduction. This, on one hand, isometrically preserves the rigid-body shapes of the clusters in the embedding process and, on the other hand, guarantees the proper locations and orientations of all clusters. The theoretical arguments are supported by a series of experiments performed on the synthetic and real-life data sets. In addition, the computational complexity of the proposed method is analyzed, and its efficiency is theoretically analyzed and experimentally demonstrated. Related strategies for automatic parameter selection are also examined. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567545]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1111</startPage>
			<endPage>1122</endPage>
			<fileSize>1442</fileSize>
			<authors><![CDATA[Meng, D.;Leung, Y.;Fung, T.;Xu, Z.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Introducing ieee.tv]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567546]]></link>
			<description><![CDATA[ ]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567546]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1181</startPage>
			<endPage>1181</endPage>
			<fileSize>203</fileSize>
			<authors><![CDATA[]]></authors>
		</item>
		<item>
			<title><![CDATA[Order Form for Reprints]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567547]]></link>
			<description><![CDATA[ ]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567547]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1182</startPage>
			<endPage>1182</endPage>
			<fileSize>354</fileSize>
			<authors><![CDATA[]]></authors>
		</item>
		<item>
			<title><![CDATA[Estimating Object Proper Motion Using Optical Flow, Kinematics, and Depth Information]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567548]]></link>
			<description><![CDATA[<para> For the interaction of a mobile robot with a dynamic environment, the estimation of object motion is desired while the robot is walking and/or turning its head. In this paper, we describe a system which manages this task by combining depth from a stereo camera and computation of the camera movement from robot kinematics in order to stabilize the camera images. Moving objects are detected by applying optical flow to the stabilized images followed by a filtering method, which incorporates both prior knowledge about the accuracy of the measurement and the uncertainties of the measurement process itself. The efficiency of this system is demonstrated in a dynamic real-world scenario with a walking humanoid robot. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567548]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1139</startPage>
			<endPage>1151</endPage>
			<fileSize>1248</fileSize>
			<authors><![CDATA[Schmudderich, J.;Willert, V.;Eggert, J.;Rebhan, S.;Goerick, C.;Sagerer, G.;Korner, E.;]]></authors>
		</item>
		<item>
			<title><![CDATA[A Novel Gaze Estimation System With One Calibration Point]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567549]]></link>
			<description><![CDATA[<para> The design of robust and high-performance gaze-tracking systems is one of the most important objectives of the eye-tracking community. In general, a subject calibration procedure is needed to learn system parameters and be able to estimate the gaze direction accurately. In this paper, we attempt to determine if subject calibration can be eliminated. A geometric analysis of a gaze-tracking system is conducted to determine user calibration requirements. The eye model used considers the offset between optical and visual axes, the refraction of the cornea, and Donder's law. This paper demonstrates the minimal number of cameras, light sources, and user calibration points needed to solve for gaze estimation. The underlying geometric model is based on glint positions and pupil ellipse in the image, and the minimal hardware needed for this model is one camera and multiple light-emitting diodes. This paper proves that subject calibration is compulsory for correct gaze estimation and proposes a model based on a single point for subject calibration. The experiments carried out show that, although two glints and one calibration point are sufficient to perform gaze estimation (error <formula formulatype="inline"><tex>$sim!!1^{circ}$</tex> </formula>), using more light sources and calibration points can result in lower average errors. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567549]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>1123</startPage>
			<endPage>1138</endPage>
			<fileSize>1386</fileSize>
			<authors><![CDATA[Villanueva, A.;Cabeza, R.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Control of Nonaffine Nonlinear Discrete-Time Systems Using Reinforcement-Learning-Based Linearly Parameterized Neural Networks]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567550]]></link>
			<description><![CDATA[<para> A nonaffine discrete-time system represented by the nonlinear autoregressive moving average with eXogenous input (NARMAX) representation with unknown nonlinear system dynamics is considered. An equivalent affinelike representation in terms of the tracking error dynamics is first obtained from the original nonaffine nonlinear discrete-time system so that reinforcement-learning-based near-optimal neural network (NN) controller can be developed. The control scheme consists of two linearly parameterized NNs. One NN is designated as the critic NN, which approximates a predefined long-term cost function, and an action NN is employed to derive a near-optimal control signal for the system to track a desired trajectory while minimizing the cost function simultaneously. The NN weights are tuned online. By using the standard Lyapunov approach, the stability of the closed-loop system is shown. The net result is a supervised actor-critic NN controller scheme which can be applied to a general nonaffine nonlinear discrete-time system without needing the affinelike representation. Simulation results demonstrate satisfactory performance of the controller. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567550]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>994</startPage>
			<endPage>1001</endPage>
			<fileSize>336</fileSize>
			<authors><![CDATA[Yang, Q.;Vance, J. B.;Jagannathan, S.;]]></authors>
		</item>
		<item>
			<title><![CDATA[Decentralized Bayesian Search Using Approximate Dynamic Programming Methods]]></title>
			<link><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567551]]></link>
			<description><![CDATA[<para> We consider decentralized Bayesian search problems that involve a team of multiple autonomous agents searching for targets on a network of search points operating under the following constraints: 1) interagent communication is limited; 2) the agents do not have the opportunity to agree in advance on how to resolve equivalent but incompatible strategies; and 3) each agent lacks the ability to control or predict with certainty the actions of the other agents. We formulate the multiagent search-path-planning problem as a decentralized optimal control problem and introduce approximate dynamic heuristics that can be implemented in a decentralized fashion. After establishing some analytical properties of the heuristics, we present computational results for a search problem involving two agents on a 5 <formula formulatype="inline"> <tex>$times$</tex></formula> 5 grid. </para>]]></description>
			<pubDate><![CDATA[Aug.  2008]]></pubDate>
			<guid><![CDATA[http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4567535&arnumber=4567551]]></guid>
			<volume>38</volume>
			<issue>4</issue>
			<startPage>970</startPage>
			<endPage>975</endPage>
			<fileSize>310</fileSize>
			<authors><![CDATA[Zhao, Y.;Patek, S. D.;Beling, P. A.;]]></authors>
		</item>
	</channel>
</rss>