DESIGNING OF MODIFIED BOOTH ENCODER WITH POWER SUPPRESSION TECHNIQUE

Vasudeva G¹ and Venkatesh S N²

Abstract—The project titled “Designing of Modified Booth Encoder with power suppression technique” is useful technique in reducing the area, delay and power consumption. Recent studies in designing of DSP systems have revealed loss of high performance due to huge area utilization and delay. In Radix-4 Modified Booth encoder the partial products has been reduced to half of the earlier one. Hence the number of adders are reduced, so the area consumption will be reduced and delay in the output also reduced by a factor. The power Suppression technique used along with the Modified booth Encoder is SPST (Spurious Power Suppression Technique). By using this technique the glitches can be reduced and avoids unwanted operation such as addition of repeated zeros. Therefore the power consumption will be less. Hence this project is useful in the field of signal processing and also very useful in portable systems.

I. INTRODUCTION

As integrated circuit technology has improved to allow more and more components on a chip, digital systems have continued to grow in complexity. As digital systems have become more complex, detailed design of the systems at the gate and flip-flop level has become very tedious and time consuming. For this reason, use of hardware description languages in the digital design process continues to grow in importance. A hardware description language allows a digital system to be designed and debugged at a higher level before conversion to the gate and flip-flop level. Digital signal processing is one of the core technologies in rapidly growing application areas such as wireless communications, audio and video processing, and industrial control. Digital signal processing (DSP) applications constitute the critical operations which usually involve many multiplications.

Fast multipliers are essential parts of digital signal processing systems. The speed of multiply operation is of great importance in digital signal processing as well as in the general purpose processors today, especially since the media processing took off. In the past multiplication was generally implemented via a sequence of addition, subtraction, and shift operations. Multiplication can be considered as a series of repeated additions. The number to be added is the multiplicand, the number of times that it is added is the multiplier, and the result is the product. Each step of addition generates a partial product. In most computers, the operand usually contains the same number of bits. When the operands are interpreted as integers, the product is generally twice the length of operands in order to preserve the information content. This repeated addition method that is suggested by the arithmetic definition is slow that it is
almost always replaced by an algorithm that makes use of positional representation. It is possible to decompose multipliers into two parts. The first part is dedicated to the generation of partial products, and the second one collects and adds them. The basic multiplication principle is twofold i.e., evaluation of partial products and accumulation of the shifted partial products. It is performed by the successive additions of the columns of the shifted partial product matrix. The „multiplier” is successfully shifted and gates the appropriate bit of the „multiplicand”. The delayed, gated instance of the multiplicand must all be in the same column of the shifted partial product matrix. They are then added to form the product bit for the particular form. Multiplication is therefore a multi operand operation. To extend the multiplication to both signed and unsigned numbers, a convenient number system would be the representation of numbers in two's complement format.

II. NORMAL MULTIPLICATION METHOD

Generally in multiplication there will be an two terms name multiplicand and multiplier. The multiplicand each terms will be multiplied with each terms of multiplier and produces partial product for each term and finally all the partial products are added to get the final result. In this the partial products will the terms of multiplicand values. Example If the multiplier has 8 bits and multiplicand will have 5 bits then there will be 5 partial products and hence the five adders required.

III. MODIFIED BOOTH ALGORITHM

Booth's multiplication algorithm is a multiplication algorithm that multiplies two signed binary numbers in two's complement notation. The algorithm was invented by Andrew Donald Booth in 1950. In booth multiplication, partial products generation is done based on recoding scheme. e.g., radix 4 encoding. Bits of Multiplicand (Y) are grouped from left to right and corresponding operation on Multiplier (X) is done in order to generate the partial product. In radix-4 booth multiplication partial product generation is done on based on encoding which is as given by Table1. Parallel multiplication using Booth’s recording algorithm technique based on the fact that partial product can be generated for group of consecutive 0’s and 1’s which is called Booth’s recording. This recording algorithm is used to generate efficient partial product. These partial products always have large number of bits than the input number of bits. This increase in the width of partial products usually depends upon the radix scheme used for recording. So, these scheme uses less partial product
generation which in turn provides low power and are in the Modified Booth encoding multiplier.

Table 1: Booth’s encoding Table

<table>
<thead>
<tr>
<th>Blocks C&lt;sub&gt;k&lt;/sub&gt;</th>
<th>Recoding Digit S&lt;sub&gt;k&lt;/sub&gt;</th>
<th>Operation on M</th>
<th>Partial product PP&lt;sub&gt;k&lt;/sub&gt;</th>
</tr>
</thead>
<tbody>
<tr>
<td>000</td>
<td>0</td>
<td>0*M</td>
<td>S&lt;sub&gt;0&lt;/sub&gt;*M</td>
</tr>
<tr>
<td>001</td>
<td>+1</td>
<td>+1*M</td>
<td>S&lt;sub&gt;1&lt;/sub&gt;*M</td>
</tr>
<tr>
<td>010</td>
<td>+1</td>
<td>+1*M</td>
<td>S&lt;sub&gt;2&lt;/sub&gt;*M</td>
</tr>
<tr>
<td>011</td>
<td>+2</td>
<td>+2*M</td>
<td>S&lt;sub&gt;3&lt;/sub&gt;*M</td>
</tr>
<tr>
<td>100</td>
<td>-2</td>
<td>-2*M</td>
<td>S&lt;sub&gt;4&lt;/sub&gt;*M</td>
</tr>
<tr>
<td>101</td>
<td>-1</td>
<td>-1*M</td>
<td>S&lt;sub&gt;5&lt;/sub&gt;*M</td>
</tr>
<tr>
<td>110</td>
<td>-1</td>
<td>-1*M</td>
<td>S&lt;sub&gt;6&lt;/sub&gt;*M</td>
</tr>
<tr>
<td>111</td>
<td>0</td>
<td>0*M</td>
<td>S&lt;sub&gt;7&lt;/sub&gt;*M</td>
</tr>
</tbody>
</table>

Booth's algorithm can be implemented by repeatedly adding (with ordinary unsigned binary addition) one of two predetermined values A and S to a product P, then performing a rightward arithmetic shift on P. Let m and r be the multiplicand and multiplier, respectively; and let x and y represent the number of bits in m and r.

1. Determine the values of A and S, and the initial value of P. All of these numbers should have a length equal to (x + y + 1).
   i. A: Fill the most significant (leftmost) bits with the value of m. Fill the remaining (y + 1) bits with zeros.
   ii. S: Fill the most significant bits with the value of (−m) in two's complement notation. Fill the remaining (y + 1) bits with zeros.
   iii. P: Fill the most significant x bits with zeros. To the right of this, append the value of r. Fill the least significant (rightmost) bit with a zero.

2. Determine the two least significant (rightmost) bits of P.
   i. If they are 01, find the value of P + A. Ignore any overflow.
   ii. If they are 10, find the value of P + S. Ignore any overflow.
   iii. If they are 00, do nothing. Use P directly in the next step.
   iv. If they are 11, do nothing. Use P directly in the next step.

3. Arithmetically shift the value obtained in the 2nd step by a single place to the right. Let P now equal this new value.

4. Repeat steps 2 and 3 until they have been done y times.
5. Drop the least significant (rightmost) bit from $P$. This is the product of $m$ and $r$.

To Booth recode the multiplier term, we consider the bits in blocks of three, such that each block overlaps the previous block by one bit. Grouping starts from the LSB, and the first block only uses two bits of the multiplier. Figure 2 shows the grouping of bits from the multiplier term for use in modified booth encoding.

![Fig 2: Grouping of Bits from the Multiplier Term](image)

Each block is decoded to generate the correct partial product. The encoding of the multiplier $Y$, using the modified booth algorithm, generates the following five signed digits, $-2$, $-1$, $0$, $+1$, $+2$. Each encoded digit in the multiplier performs a certain operation on the multiplicand, $X$, as illustrated in Table 1.

| $0000010008$ | $0000010000$ |
| $00000000001000000000000000X_Y$ | $1X_Y$ |
| $00000000001000$ | $1X_Y$ |
| $00000000000$ | $0X_Y$ |
| $00000000$ | $0X_Y$ |

| $00000000101000000160$ |

### IV. SPST Adder

The SPST uses a detection logic circuit to detect the effective data range of arithmetic units, e.g., adders or multipliers. When a portion of data does not affect the final computing results, the data controlling circuits of the SPST latch this portion to avoid useless data transitions occurring inside the arithmetic units. Besides, there is a data asserting control realized by using registers to further filter out the useless spurious signals of arithmetic unit every time when the latched portion is being turned on. This asserting control brings evident power reduction. Figure 3 shows the design of low power adder/subtraction with SPST.

The adder/subtraction is divided into two parts, the most significant part (MSP) and the least significant part (LSP). The MSP of the original adder/subtraction is modified to include detection logic circuits, data controlling circuits, sign extension circuits, logics for calculating carry in and carry out signals. The most important part of this study is the design of the control signal asserting circuits, denoted as asserting circuits in Figure 3.
Although this asserting circuit brings evident power reduction, it may induce additional delay. There are two implementing approaches for the control signal assertion circuits. The first implementing approach of control signal assertion circuit is using registers. This is illustrated in Figure 3. The three output signals of the detection logic are close, Carr_ctrl, sign. The three output signals the detection logic unit are given a certain amount of delay before they assert. Spurious transitions (also called glitches) in combinational CMOS logic are a well known source of unnecessary power dissipation. Reducing glitch power is a highly desirable target because in the vast majority of digital CMOS circuits, only one signal transition per clock cycle is functionally meaningful. Unfortunately, glitch power is heavily dependent on the low-level implementation details, namely, gate propagation delays and input transitions misalignments.

The MSP of the original adder is modified to include detection logic circuits, data controlling circuits, sign extension circuits, logic for calculating carry-in and carry-out signals. Simple logic gates are used to implement the latches and the sign extension circuits in order to reduce the additional overhead as far as possible.

Low power adder/subtraction consists of
- latch,
- Detection logic,
- Sign extension logic.

Detection Logic using registers

V. MODIFIED BOOTH ENCODER WITH SPST

Figure 5 shows a computing example of Booth multiplying two numbers “2AC9” and “006A”. The shadow denotes that the numbers in this part of Booth multiplication are all zero so that this part of the computations can be neglected. Saving those computations can significantly reduce the power consumption caused by the transient signals. According to the analysis of the multiplication shown in figure 8, we propose the SPST-equipped modified-Booth encoder, which is controlled by a detection unit. The detection unit has one of the two
operands as its input to decide whether the Booth encoder calculates redundant computations as shown in Figure 6.

The latches can, respectively, freeze the inputs of MUX-4 to MUX-7 or only those of MUX-6 to MUX-7 when the PP4 to PP7 or the PP6 to PP7 are zero; to reduce the transition power dissipation.

Applying the SPST on the Compression Tree
The proposed SPST-equipped multiplier is illustrated in figure 7. The PP generator generates five candidates of the partial products, i.e., \{-2A, -A, 0, A, 2A\}. These are then selected according to the Booth encoding results of the operand B. When the operand besides the Booth encoded one has a small absolute value, there are opportunities to reduce the spurious power dissipated in the compression tree.

VI. RESULTS AND CONCLUSION
In this project we are examining the performance of the proposed high speed low power multiplier with respect of array multiplier. This multiplier can be implemented using Verilog-HDL coding. In order to get the power report and delay report we are synthesizing this multiplier using Xilinx.

![Waveforms of Booth Multiplier](image1)

**Fig8: Waveforms of Booth Multiplier**

![Waveform of SPST ADDER](image2)

**Fig9 : waveform of SPST ADDER**

<table>
<thead>
<tr>
<th>CONSTRAINTS</th>
<th>BOOTH MULTIPLIER</th>
<th>BOOTH MULTIPLIER WITH SPST ADDER</th>
</tr>
</thead>
<tbody>
<tr>
<td>DELAY</td>
<td>23.79ns</td>
<td>10.391ns</td>
</tr>
<tr>
<td>POWER</td>
<td>0.305mW</td>
<td>0.211mW</td>
</tr>
</tbody>
</table>

**VII. APPLICATIONS**

The SPST can drastically reduce the power dissipation of combinational VLSI designs for multimedia/DSP applications. There are several DSP applications, out of which one is an efficient multi Transform Design (ETD) for MPEG-4 and the second one is a Versatile Multimedia Functional Unit (VMFU) design. The ETD can compute forward, inverse, hadamard transforms. On the other hand, the VMFU can compute six commonly used arithmetic operations in multimedia/DSP processing like addition, subtraction, multiplication,
MAC (Multiplier and accumulator), interpolation, Sum-of-Absolute Difference (SAD) Encapsulating the VMFUs, designers can increase the flexibility and scalability of multimedia/DSP processors. When applying the SPST to these two designs, the realization issues in every design highly differ from each other due to the large hardware-configuration differences.

VIII. CONCLUSION

This work presents the designing of a 16x16 multiplier with high speed low-power technique called SPST (Spurious Power Suppression Technique). A Modified Radix-4 Booth Encoder circuit is used for this multiplier architecture. Compared to array multiplier, the booth multiplier has the highest operational speed and less hardware count.

The presented low-power technique called SPST and the theoretical analysis of the SPST are fully discussed. The proposed SPST can obviously decrease the switching (or dynamic) power dissipation, which decreases a significant portion of the whole power dissipation in integrated circuits. Simulation results and Power analysis reports of SPST-equipped multiplier and array multiplier are shown.

REFERENCES