• Abstract

# A New VLSI 2-D Fourfold-Rotational-Symmetry Filter Architecture Design

In this paper, we propose two new two-dimensional (2-D) IIR and FIR filter architectures for 2-D transfer function using fourfold rotational symmetry. The presented type-I structure with fourfold rotational symmetry has the lowest number of multipliers, and zero latency. Importantly, the proposed type-II IIR filter possesses high speed, local broadcast, and the same number of multipliers and latency as the type I shows at expense of a slight increment of number of delay elements.

SECTION I

## Introduction

Recently, 2-D digital filters are widely applied in a variety of digital signal processing (DSP) systems such as image restoration [1] obtained through a 2-D low-pass intraframe filter, image enhancement [2], [3] performed by a 2-D high-pass filter and bio-medical signal processing [4]. Several 2-D filter VLSI architectures have been existed in [7], [8], [9], [10], [11], and an existing ASIC approach has been applied to design 2-D diagonal symmetry transfer function [12]. However, the VLSI architecture and hardware evaluation metrics using other frequency symmetry have not been discussed. Thus, we are motivated to propose one new 2-D IIR and FIR filter with less number of multipliers using fourfold rotational symmetry property [5], [6]. The comparison results among the structures of [8], [10], [11] and the proposed one in terms of hardware evaluation metrics are also debated in this paper. Note that when the frequency response has no symmetry, there is a technique to decompose that frequency response into components each of which has the desired symmetry. As a consequence, there is a motivation to derive the 2-D fourfold-rotational-symmetry IIR and FIR filter architecture designs.

The structure of this paper is organized as follows. The brief review of fourfold rotational symmetries is presented in Section II. The proposed 2-D fourfold-rotational-symmetry IIR and FIR filter architectures are discussed in Section III. In Section IV, the comparison results are tabulated in terms of the number of multipliers, critical period, latency, and whether the local broadcast exists. In the last section, concise statements conclude this presentation.

SECTION II

## Reviw of Fourfold Rotational Symmetry

2-D frequency responses possess many types of symmetries and the presence of these symmetries can be used to reduce the design complexity of these filters. Symmetry in a frequency response induces certain constraints on the coefficients of filter transfer function which in turn reduces the filter design complexity. In the following, the fourfold rotational symmetry constraints on polynomials are reviewed.

For a 2-D z-domain polynomial Q(z1, z2) we can represent the magnitude squared function of the frequency response in the form TeX Source \eqalignno{F(\theta_1, \theta_2) &= Q(e^{j\theta_1}, e^{j\theta_2}) \cdot Q \ast (e^{-j\theta_1}, e^{-j\theta_2}) \cr&= Q(z_1, z_2) \cdot Q^{\ast} (z_1^{-1}, z_2^{-1}) \vert_{z_1 = z^{j\theta_i}}, for i = 1,2&\hbox{(1)}}Where Q* is obtained by complex conjugating the coefficients of Q. A magnitude squared function of the frequency response with fourfold rotational symmetry can be defined as TeX Source \eqalignno{F(\theta_1,\theta_2) &= F(-\theta_2,\theta_1) \cr&= F(-\theta_1,-\theta_2) = F(\theta_2,-\theta_1), \quad \forall (\theta_1,\theta_2)&\hbox{(2)}}

Since real polynomials always satisfy F(θ1, θ2) = F(−θ1, −θ2), F(θ1, θ2) = F(−θ2, θ1) is enough to ensure the fourfold rotational symmetry. Using the magnitude squared function in (1) and fourfold rotational symmetry definition in (2), we can conclude the following factors [5] with fourfold rotational symmetry in their magnitude responses: TeX Source \eqalignno{Factor1&= Q_1(x_1, x_2) + y_1y_2 \cdot (x_1 - x_2) \cdot Q_2 (x_1, x_2) &\hbox{(3a)}\cr Factor2 &= (x_1 - x_2)Q_1(x_1, x_2) + y_1y_2Q_2 (x_1, x_2) &\hbox{(3b)}\cr Factor3 &= Q (z_1, z_2)\cdot Q\left(z_2^{-1}, z_1\right) &\hbox{(3c)} \cr Factor4 &= Q (z_1, z_2)\cdot Q\left(z_2, z_1^{-1}\right) &\hbox{(3d)}\cr Factor5 &= Q (z_1, z_2)\cdot Q\left(z_2^{-1}, z_1\right), Q\left(z_1^{-1}, z_2^{-1}\right)\cdot Q\left(z_2, z_1^{-1}\right)&\hbox{(3e)}}Where Qk (x1, x2) = Qk (x2, x1) for k = 1, 2, and xi = zi + zi−1, yi = zizi−1 for i = 1, 2.

Using the fourfold rotational symmetry constraints on polynomial factors, a procedure to design a 2-D IIR and FIR digital filter architectures is presented in the next section.

SECTION III

## 2-D Fourfold-Rotational Symmetry Digital Filter Architecture

In this section, we propose two new 2-D fourfold rotational-symmetry IIR and FIR digital filter architectures. The general transfer function of a 2-D IIR digital filter can be represented as TeX Source $$H(z_1, z_2) = {\sum\limits_{i=0}^{N_1} \sum\limits_{j=0}^{N_2} a_{i,j} z_1^{-i} z_2^{-j} \over \sum\limits_{i=0}^{N_1} \sum\limits_{j=0}^{N_2} b_{i,j} z_1^{-i} z_2^{-j}} = {N(z_1,z_2) \over D(z_1,z_2)}\eqno{\hbox{(4)}}$$where b0,0 = 1, ai,j as well as bi,j and N1 × N2 are coefficients and the order of the IIR digital filter, respectively. Throughout this paper, a square image M × M is fed to the following structures in raster-scan mode, and thus the delay z2−1 = z−1 and z1−1 = zM where z−1 and M denote a unit delay element and the width of an image, respectively. Since each N(z1,z2) possesses fourfold rotational symmetry property discussed above for Q, we select the numerator to be N(z1, z2) = N(z2−1, z1)z2N2 which is the fourfold rotational symmetry condition in (3a). Here z2N2 is included to ensure the polynomial nature of N(z1, z2). Consequently, the coefficients have to satisfy the condition of ai,j = aj, N2−1 [5]. Besides the numerator, the denominator is chosen to be D(z1, z2) = D1(z1)D1(z2), which is separable such that the stability can be easily assured. At the same time, the denominator also possesses fourfold rotational symmetry property. As a representative example, we perform and deduce the order of structures under an assumption N1 = N2 = 2. The following forms for the numerator and denominator are expressed in (5a) and (5b). In (5a), it can be seen that only the boxed coefficients are independent, and the other coefficients can be obtained by rotating the boxed coefficients in steps of 90° around the mid-point of the whole matrix. Owing to the separability of the denominator, we can obtain bi,j = bj,i as shown in (5b). TeX Source Initially, we can express (4) for N1 = N2 = 2 as follows. TeX Source $$Y = \sum_{i=0}^{2}\sum_{j=0}^{2} a_{i,j} z_1^{-i} z_2^{-j} (X) - \sum_{i=0}^{2} \sum_{j=0 \atop i+j\neq 0}^{2}b_{i,j} z_1^{-i} z_2^{-j} (Y)\eqno{\hbox{(6)}}$$where X = X (z1, z2) and Y = Y(z1, z2) are defined as input and output of the digital filter, respectively. Due to the fourfold rotational symmetry property in (5a) and (5b), we recast (6) using register reordering in (7) TeX Source \eqalignno{Y &= ka_{l,l} \left[z_1^{-l} z_2^{l\times P}\left(z_2^{-l\times P} z_2^{-l}\right)\right] X\cr&\quad - \sum_{i=1}^2 b_{i,i} z_1^{-i} z_2^{i\times P} \left(z_2^{-i\times P} z_2^{-i}\right) Y \cr&\quad + \sum_{i=0}^{l-k} \sum_{j=0}^l a_{i,j} \left[z_1^{-i} z_2^{i\times P} \left(z_2^{-i\times P} z_2^{-j}\right) \right.\cr&\quad + z_1^{-j} z_2^{j\times P} \left(z_2^{-j\times P} z_2^{-(2-i)}\right) \cr&\quad + z_1^{-(2-i)} z_2^{(2-i)\times P} \left(z_2^{-(2-i)\times P} z_2^{-(2-j)}\right) \cr&\quad + z_1^{-(2-j)} z_2^{(2-j)\times P} \left(z_2^{-(2-j)\times P} z_2^{-i}\right)X \cr&\quad - \sum_{i=0}^{2} \sum_{j=i+1}^2 b_{i,j} \left[z_1^{-i} z_2^{i\times P} \left(z_2^{-i\times P} z_2^{-j}\right) \right.\cr&\quad +\left. z_1^{-j} z_2^{j\times P} \left(z_2^{-j\times P} z_2^{-i}\right)\right]Y&\hbox{(7)}}Where for l and al,l, and integer variable P is restricted in range of 0 and M. Since the image is in the raster scan of this paper, the delay z1−1 = zM are realized by a shift-register (SR) with size of M. Thus, the terms of can be implemented by the number of (MP) shift registers. Next, we deduce two VLSI architectures with respect to P = 0 and P = 1.

♦ Type I with P = 0:

Equation (7) can be expressed in (8). TeX Source \eqalignno{Y &= ka_{l,l} \left(z_1^{-l}, z_2^{-l}\right)X - \sum_{i=1}^2 b_{i,i} z_1^{-i} z_2^{-i} Y \cr&\quad + \sum_{i=0}^{l-k} \sum_{j=0}^{l} a_{i,j} \left[z_1^{-i} z_2^{-j} + z_1^{-j} z_2^{-(2-i)} \right.\cr&\quad + \left. z_1^{-(2-i)} z_2^{-(2-j)} + z_1^{-(2-j)} z_2^{-i}\right]X \cr&\quad -\sum_{i=0}^{2} \sum_{j=i+1}^{2} b_{i,j} \left[z_1^{-i} z_2^{-j} + z_1^{-j} z_2^{-i}\right]y&\hbox{(8)}}Equation (8) can be mapped onto the fourfold-rotational-symmetry IIR filter architecture as depicted in Fig. 1. In addition, the users merely set bi,j to zero except b0,0 = 1 such that a new fourfold-rotational-symmetry FIR filter architecture as shown in Fig. 2 can be obtained. In this case, although the proposed type-I IIR filter has the lowest number of multipliers, the larger critical period and global broadcast are incurred. Thus, we are motivated to derive another type filter with less critical period and local broadcast.

Fig. 1. Proposed 2-D fourfold rotational symmetry IIR filter architecture with P = 0.
Fig. 2. Proposed 2-D fourfold-rotational-symmetry FIR filter architecture with P = 0.

♦ Type II with P = 1:

Observing the architectures in Figs. 1 and 2, we find that the critical period is larger and the input and output signals are globally broadcast. It is harmful to the limited fanout VLSI design. In order to enhance this architecture with higher speed and local broadcast, we set P = 1 in (9). TeX Source \eqalignno{Y &= ka_{l,l} \left[z_1^{-l}, z_2^{l}\left(z_2^{-l}, z_2^{-l}\right)\right]X \cr&\quad - \sum_{i=1}^2 b_{i,i} z_1^{-i} z_2^{i} \left(z_2^{-i} z_2^{-i}\right) Y \cr&\quad + \sum_{i=0}^{l-k} \sum_{j=0}^{l} a_{i,j} \left[z_1^{-i} z_2^{i} \left(z_2^{-i} z_2^{-j}\right) \right.\cr&\quad + z_1^{-j} z_2^{j} \left(z_2^{-j} z_2^{-(2-i)}\right)\cr&\quad + z_1^{-(2-i)} z_2^{(2-j)} \left(z_2^{-(2-i)} z_2^{-(2-j)}\right) \cr&\quad +\left. z_1^{-(2-j)} z_2^{2-j}\left(z_2^{-(2-j)} z_2^{-i}\right)\right]X \cr&\quad -\sum_{i=0}^{2} \sum_{j=i+1}^{2} b_{i,j} \left[z_1^{-i} z_2^{i} \left(z_2^{-i} z_2^{-j}\right) \right.\cr&\quad +\left. z_1^{-j} z_2^{j} \left(z_2^{-j} z_2^{-i}\right)\right]y&\hbox{(9)}}In similar behaviors, the resulting fourfold-rotational-symmetry IIR architecture corresponding to (9) is shown in Fig. 3. In addition, we can also obtain a new fourfold-rotational-symmetry FIR filter architecture in Fig. 4 by setting bi,j to zero except b0,0 = 1

Fig. 3. Proposed 2-D fourfold-rotational-symmetry IIR filter architecture with P = 1.
Fig. 4. Proposed 2-D fourfold-rotational-symmetry FIR filter architecture with P = 1.
SECTION IV

## Comparison Results of IIR and FIR Digital Filters

In this section, the comprehensive comparison results of IIR and FIR filter architectures are tabulated in Tables I and II, respectively. The evaluation metrics are in terms of number of multipliers, critical period, latency, and local broadcast, where each critical period is calculated by tree method. Here k is the same as the definition in (7). In Table I, it is shown that the proposed type I and type II architectures lead to the lowest number of multipliers than that of [8], [10], [11]. Among the proposed two type IIR filter architectures, the former has less number of delay elements and the latter has higher throughput and local broadcast. Analogously, we compare the performance of these FIR filters as listed in Table II after setting bi,j to zero except b0,0 = 1. For the proposed second-order IIR and FIR filters, the reduction of number of multipliers can be achieved 52.9% and 66.7%, respectively. From Tables I and II, we guarantee that the presented type-I structure with fourfold rotational symmetry has the lowest number of multipliers, and zero latency. Importantly, the proposed type-II IIR filter possesses high speed, local broadcast, and the same number of multipliers and latency as the type I shows at expense of a slight increment of number of delay elements.

TABLE I Comparison Results Among the IIR Digital Filter Architecture
TABLE II Comparison Results Among the FIR Digital Filter Architecture
SECTION V

## Conclusion

Two new implementation of 2-D fourfold-rotational-symmetry IIR and FIR digital filter architectures have been presented. The proposed structures have the lowest number of multipliers, high throughput, and zero latency. In addition, the percentage of the reduction of number of multipliers for the proposed IIR and FIR filters are nearly 62.5% and 75%, respectively, as N is large enough.

## Footnotes

Manuscript received ; revised ; accepted . Date of publication ; date of current version . This work was supported in part by the National Science Council (NSC) Grant NSC-96-2220-E-009-038, NSC-97-2220-E-009-055, and MOEA-96-EC-17-A-01-S1-048.

Pei-Yu Chen, Lan-Da Van are with the Department of Computer Science, National Chiao Tung University, Hsinchu, Taiwan e-mail: ldvan@cs.nctu.edu.tw

Hari C. Reddy is with the Department of Elecrical Engineering, California State University, Long Beach, USA and National Chiao Tung University, Hsinchu, Taiwan e-mail: hreddy@csulb.edu

Chin-Teng Lin is with the Department of Computer Science, National Chiao Tung University, Hsinchu, Taiwan e-mail: ctlin@mail.nctu.edu.tw

## References

1. Digital Video Processing

A. M. Tekalp

Englewood Cliffs, NJ
Digital Video Processing, Prentice-Hall, 1995, 14

2. Image Processing: Theory, Algorithms, and Architectures

M. A. Sid-Ahmed

NY
Image Processing: Theory, Algorithms, and Architectures, McGraw-Hill, 1995

3. Multidimensional Digital Signal Processing

D. E. Dudgeon, R. M. Mersereau

Englewood Cliffs, NJ
Multidimensional Digital Signal Processing, Prentice-Hall, 1984, 4

4. A simultaneous div-curl 2D clifford Fourier transform filter for enhancing vortices, sinks and sources in sampled 2D vector field images

Proc. IEEE International Symposium on Circuits and Systems, 2007-05, 821–824

5. 2-D symmetry: Theory and filter design applications

H. C. Reddy, I. H. Khoo, P. K. Rajan

IEEE Circuits and Systems Magazine, vol. 3, p. 4–33, 2003

6. Symmetry in 2-D filters and its application

M. N. S. Swamy, P. K. Rajan

New York
Multidimensional Systems: Techniques and Applications, S. G., Tzafestas, Marcel Dekkar, 1986, 9

7. Realization of two-demensinal recursive digital filters

S. K. Mitra, A. D. Sagar, N. A. Pendergrass

IEEE Trans. Circuits Syst., vol. 22, p. 171–184, 1975-03

8. Decision methods and realization of 2-D digital filters using minimum number of delay elements

S. Chakrabarti, S. K. Mitra

IEEE Trans. Circuits Syst., vol. 27, p. 657–666, 1980-08

9. A systolic realization for 2-D digital filters

M. A. Sid-Ahmed

IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-37, p. 560–565, 1989-04

10. Systolic implementations of two-dimensional recursive digital filters

S. Sunder, F. El-Guibaly, A. Antoniou

Proc. IEEE Int. Symp. Circuits Syst., 1990-05, 1034–1037

11. A new 2-D systolic digital filter architecture without global broadcast

L. D. Van

IEEE Trans. VLSI Syst., vol. 10, issue (4), p. 477–486, 2002-08

12. A new VLSI 2-D diagonal-symmetry filter architecture design

P. Y. Chen, L. D. Van, H. C. Reddy, C. T. Lin

Proc. IEEE APCCAS, Macao, China, 2008-12, accepted

## Cited By

No Citations Available

## Keywords

### INSPEC: Controlled Indexing

VLSI, FIR filters, IIR filters, transfer functions

### INSPEC: Non-Controlled Indexing

No Keywords Available

### Authors Keywords

No Keywords Available

### More Keywords

No Keywords Available

No Corrections

## Media

No Content Available
This paper appears in:
International Symposium on Circuits and Systems
Issue Date:
2009
On page(s):
93 - 96
ISBN:
N/A
Print ISBN:
978-1-4244-3827-3
INSPEC Accession Number:
10760314
Digital Object Identifier:
10.1109/ISCAS.2009.5117693
Date of Current Version:
26 Jun, 2009