Modern Rounding Based Approximate (MROBA) Multiplier for the Design of FIR Filter

Karanam.Anusha, Ponnampalli Sudhakar Rao

Abstract— For better power utilization, area, speed which are important criteria in the design of DSP processors multiplier design is main issue. The work of multiplier is rounding the numbers both signed and unsigned. In few applications speed is more important than accuracy so to improve the performance approximate circuits are introduced with reduce energy consumption and increase speed. In this paper, we propose an FIR filter based on Modern Rounding Based Approximate (MROBA). In this modified rounding based approximate multiplier the numbers are rounded to the nearest exponent irrespective of $2^n$. The proposed MROBA multiplier based FIR filter was compared with Rounding Based Approximate (ROBA) Multiplier. In this multiplier the operands are rounded to the nearest exponent of two. This approximation will lead to simplification of multiplication operation thus reducing area and increasing speed. MROBA gives better results with the MROBA MAC unit is implemented.

Keywords: Approximate multiplier, Digital Signal Processing, Finite Impulse Response (FIR), MROBA Multiplier

I. INTRODUCTION

In the applications of DSP digital filters are used broadly to reduce noise compared to analog filters. In our the optimization of bit width and hardware resources without any impact on the frequency response and output signal precision. Addition (or subtraction), Multiplication (normally of a signal by a constant) Time Delay i.e. delaying a digital signal by one or more sample periods are three basic mathematical operations used in digital filters. The coefficients are multiplied by fixed-point constants using additions, subtractions and shifts in a multiplier block. In VLSI Signal Processing two types of digital filters are most widely used one is FIR (finite impulse response) and the other is IIR (infinite impulse response). FIR as indicates that the impulses are finite in this filter and phase is kept linear in order to noise distortions and no feedback is used for such a filters. As compared to IIR, FIR is very simple to design. Such type of FIR filters are used in DSP processors for high speed. In Digital Signal Processing Multiplication and addition is of times required. The main idea of ROBA approximate multiplier is to simplify the multiplication operation thus reducing area and increasing speed. MROBA gives better results with the MROBA MAC unit is implemented.

In IIR filter due to feedback problems will occur where as in FIR filters limited bits are efficient in which there is no feedback. Using fractional arithmetic we can implement FIR filters. The multipliers play a significant role in arithmetic operations in DSP filtering applications. The multipliers can be used in most of the DSP applications as vector product, matrix multiplication, convolution and filtering etc. In general, DSP applications requires only efficient result than accuracy. In the design we have two significant conflicts are occurring which are power consumption and propagation delay. Considering these requirements, the design of low power multiplier is of great interest. In this paper, we focus on proposing a high-speed low power energy yet approximate multiplier appropriate for error resilient DSP applications. The contributions of this paper can be summarized as follows:

Presenting a new structure for MROBA multiplication by modifying the ROBA multiplication approach;

Describing three hardware architectures of the proposed approximate multiplication scheme for sign and unsigned operations. The proposed FIR filter consists of approximate multiplier, which is also area efficient, is constructed by modifying the conventional ROBA multiplication approximate at the algorithm level assuming rounded input values.

The main idea of ROBA approximate multiplier is to round the numbers for easy of operations as two to the power n (2n). To elaborate on the operation of the approximate multiplier, first, let the rounded numbers of the input of A and B are denoted by Ar and Br, respectively. The multiplication of A by B can be rewritten as:

$$A \times B = (Ar \times A) + (Ar \times Br)$$

(1)

The key observation is that the multiplications of $Ar \times A$, $Ar \times Br$, and $Br \times A$ are implemented with shift operation. The hardware implementation of $(Ar - A) \times (Br - B)$, which is complex and depends on differences of the exact numbers from their rounded ones, is typically small. Hence, we propose to omit this part from (2), helping simplify the multiplication operation. Hence, to perform the multiplication process, the following expression is used:

$$A \times B = (Ar \times B) + (Br \times A) - (Ar \times Br)$$

(2)

Thus, to perform the multiplication operation using three shift and two addition/subtraction operations. In this approach, the nearest
values for A and B in the form of 2n should be determined. When the value of A (or B) is equal to the $3 \times 2^p$ (where $p$ is an arbitrary positive integer larger than one), it has two nearest values in the form of 2n with equal absolute differences that are $2p$ and $2p-1$. While both values lead to the same effect on the accuracy of the proposed multiplier, selecting the larger one (except for the case of $p = 2$) leads to a smaller hardware implementation for determining the nearest rounded value and hence, it is considered in this paper. It originates from the fact that the numbers in the form of $3 \times 2^p$ are considered as do not care in both rounding up and down simplifying the process, and smaller logic expressions may be achieved if they are used in the rounding up. It should be noted that contrary to the previous work where the approximate result is smaller than the exact result, the final result calculated by the ROBA multiplier may be either larger or smaller than the exact result depending on the magnitudes of Ar and Br compared with those of A and B, respectively. Note that if one of the operands (say A) is smaller than its corresponding rounded value while the other operand (say B) is larger than its corresponding rounded value, then the approximate result will be larger than the exact result. This is due to the fact that, in this case, the multiplication result of $(A_r - A) \times (B_r - B)$ will be negative. Since the difference between (2) and (3) is precisely this product, the approximate result becomes larger than the exact one. Finally, it should be noted the advantage of the proposed ROBA multiplier exists only for positive inputs because in the two’s complement representation, the rounded values of negative inputs are not in the form of 2n. Hence, we suggest that, before the multiplication operation starts, the absolute values of both inputs and the output sign of the multiplication result based on the inputs signs be determined and then the operation be performed for unsigned numbers and, at the last stage, the proper sign be applied to the unsigned result.

II. PROPOSED MODIFIED ROUNDED BASED APPROXIMATE MULTIPLIER (MROBA)

2.1. Algorithm for MROBA multiplier

The main concept of the proposed modified rounding based approximate multiplier is to design the multiplier so that it takes all values which are irrespective of 2n. The detailed explanation of the multiplier is described further. Initially let us consider A as the rounded input value of input A and BR as the rounded input value of input B. Now the multiplication of the A * B is written has follows:

$$A * B = (A_r * B) + (B_r * A) + (A_r * B) - (A_r * B)$$

[1] ------- (a)

The basic key point to be considered is the product of $(A_r * B)$ and $(B_r * A)$ is complex and the weight of the term would result in small values when compared with the exact numbers so the product of this term can be omitted and it also leads to complex hardware design approach. Hence the multiplication can be performed by the following expression as
2.2 Hardware implementation of the proposed modified rounding based approximate multiplier

Based on fig 3 we have the block diagram for the modified rounding based approximate multiplier. Initially we have a sign detector block where the inputs A and B are given. As in the case of the unsigned numbers the sign detector block can be applied to positive numbers. But if the signed numbers are considered the absolute values are to be generated and then inputs are to be given. The input of A can have its rounded value has the larger value (N+1) or the smaller value (N). Similarly for input B the rounded value can either be the larger value or the smaller value. This condition is applicable for both A greater than B (A > B) and for A less than B (A < B) Condition. But for the A equal to B (A = B) the rounded values are considered according to (2N+1)/2.

NOTE: The rounded values of both the inputs should be compulsory differ from other rounded value eg: for input A its rounded value Ar should be larger value and for the input B the rounded value Br should be smaller value or vice versa for both inputs. Expect for (A=B) the rounded value should be the middle value or (2N+1)/2 which is same for both the inputs.

Fig3: Block diagram of the modified rounding based approximate multiplier

III. COMPARISON OF MROBA MULTIPLIER WITH ROBA MULTIPLIER COMPARED WITH THE ROBA MULTIPLIER THE MROBA GIVES THE EXACT OUTPUT FOR THE GIVEN INPUTS IN ALL THE POSSIBLE CASES.

Table 3: Comparison of MROBA multiplier with ROBA multiplier conditions ROBA MROBA multiplier

<table>
<thead>
<tr>
<th>conditions</th>
<th>ROBA multiplier</th>
<th>MROBA multiplier</th>
</tr>
</thead>
<tbody>
<tr>
<td>A=B</td>
<td>Small value than the exact</td>
<td>Exact result</td>
</tr>
<tr>
<td>A&gt;B</td>
<td>Smaller value than the exact</td>
<td>Exact result</td>
</tr>
<tr>
<td>A&lt;B</td>
<td>Larger value than the exact</td>
<td>Exact result</td>
</tr>
</tbody>
</table>

Examples 1:
ROBA multiplier: (unsigned)
When: A=12 (3*22), B=6 (3*21)
Ar=24, Br=23
A*B= (Ar*A) + (Br*B)
Here Ar=16, Br=8
(16*6) + (8*12) = 16*8
72 ≠ 64

In the conventional approach the inputs should be in the 3 x 2p-1 form and we have two rounded values in the form of 2p and 2p-1 both the values lead to the same effect but here the larger value is considered as the rounded value for both the inputs because larger values leads in smaller hardware implementation.

Examples 2:
ROBA multiplier: (signed)
When: A= -12 (3*22), B= -12 (3*22)
Ar=24, Br=24
A*B= (Ar*A) + (Br*B)
Here Ar=16, Br=16
(16*-12) + (16*-12) = 16*16
144 ≠ 256

Here -256 is the value since we have a sign set block at the end only positive value is obtained with the ROBA multiplier. Most of the parameters are considered by considering (A=B) condition.

Examples 3:
MROBA multiplier: (unsigned)
1. Maximum error:
When: A= 7, B= 5
Max{error(A,B)} = ((2N+1/2)-N) *(2N+1/2)- N
A*B= (Ar*A) + (Br*B)
Here Ar=7, Br=6
(7*5) + (6*7) - (7*6) =1/4N2
35 = 35

2. Approximate signed error:
Examples 4:
MROBA multiplier: (signed)
Error (A, B)= (Ar-A) (Br-B) + A+B+1
Considering: A= -35, B= 47
Ar=36, Br=47
A*B= (Ar*A) + (Br*B)
Here Ar=36, Br=47
(36*47) + (47*-35) - (36*47)
1645 =1645

Here -1645 is the value since we have a sign set block at the end only positive value is obtained results compared with ROBA. So the main concept here is accuracy.

So, from the above examples of MROBA multiplier and ROBA multiplier we can say that the MROBA is more accurate than ROBA and the main advantage is that we can perform multiplication for all the numbers irrespective of 2n since in the ROBA multiplier only the inputs and rounded values are strictly 2n but in this MROBA we can perform multiplication for all N inputs (N=1, 2, 3, ……).
IV. FIR FILTER IMPLEMENTATION

In the FIR system, the impulse response is of finite duration, this means that it has a finite number of nonzero terms. The response of the FIR filter depends only on the present and past input samples.

A. Normal FIR filter:

The conventional design of the FIR filter is shown in Figure 8. The implementation of an FIR requires three basic building blocks: Multiplication, Addition and Signal delay. Where N represents the filter order, y[n] is the output signal and bk represents the set of filter coefficients. If x[n] is the input signal applied, x[n - k] terms are referred as taps or tapped delay lines. In the conventional design, present a simple structure of multiplier for FIR filters. It performs multiplication by generating partial products. If the multiplier digit is a 1, the multiplicand is simply copied down and represents the product. If the multiplier digit is a 0 the product is also 0. Therefore the area and delay will increase. It affects the performance of the FIR filter.

\[
x[n] \xrightarrow{z^{-1}} b_0 \xrightarrow{z^{-1}} b_1 \xrightarrow{z^{-1}} b_2 \rightarrow \Sigma \rightarrow \Sigma \rightarrow \Sigma \rightarrow y[n]
\]

Fig 8. Block diagram of FIR filter.

V. FIR FILTER USING ROBA MULTIPLIER

As the multiplier is the slowest element in the system, it will affect the performance of the FIR filter. So, ROBA multiplier is suggested since it reduces area and it is faster than other conventional multipliers.

KS adder is a fast adder design as it generate carry signal in \(O(\log_2 n)\) time and has the best performance in VLSI implementations. KS adder has large area with minimum fan-out which increases its performance. Kogge-Stone adder is widely used in high performance 32-bit, 64-bit, and 128-bit adders as it reduces the critical path to great extent. In fig.3 each vertical stage produce propagate and generate bits. Generate bits are produced in the last stage and XORed with initial propagate and generate bits to produce sum.

VI. CONCLUSION

An approximate multiplier(MROBA) based FIR filter implementation was proposed in the paper. The results may show that in MROBA multiplier may gives better performance in area, power and delay.

REFERENCES

6. ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu
7. www.iosrjournals.org Issue 1 (September 2012), PP.031-038 www.irjes.com