

# Analysis of Accuracy Tradeoff Multiplier for Imprecise computing system

Deborah Roseline. P¹, Venkatesh Babu.S², Annamalai. P³, PREM SANGEETH.V  $^4,$  THANGADURAI. R  $^5$ 

1,3. Assistant Professor, Department of BioMedical Engineering, Jaya Sakthi Engineering College Chennai 602024, INDIA

2. Professor, Department of BioMedical Engineering, Jaya Engineering College Chennai 602024, INDIA.

4,5 Assistant Professor,Department of Electrical & Electronics Engineering, Jaya Engineering College Chennai 602024, INDIA

# Abstract

Dynamic—Multiplication is a key focal limit with respect to some blunder tolerant applications. Harsh augmentation is seen as a successful methodology for trading off vitality against execution and precision. This paper proposes a 16 bit multiplier accumulator. The proposed arrangement can effectively pick the length of the convey proliferation to satisfy the exactness essentials adaptable. The partial product tree of the multiplier is approximated by the proposed tree blower. A multiplier design is realized by using the carry maskable adder and the compressor. The Multiplier-accumulator (MAC) unit supports huge number of digital signal processing (DSP) applications.

**KEYWORDS:** MAC, compressor, carry maskable adder.

# Introduction

Various irrefutably standard applications, for instance, picture preparing and affirmation, are distinctively tolerant of little oversights. These applications are computationally mentioning and increase is their essential calculating limit, which makes an opportunity to trade off computational exactness for diminished power utilization.

The MAC unit is a unit that is generally requested in DSP applications. Mac unit performs both multiply and addition functions. It works in two phases. Right off the bat it figures the result of given numbers and forward the outcome for the second stage task for example addition/accumulate. In the event that both the processing is

#### 8353 | Deborah Roseline.P Multiplier for Imprecise computing system

executed in a solitary adjusting, at that point it is said to be combined multiply-add/accumulate (MAC) unit.

This paper bases on a harsh multiplier plan that can control precision logically. A carry maskable adder (CMA) is suggested that can be effectively intended to function as a standard carry propagation adder (CPA), a great deal of bit-parallel OR portals, or a blend of the two. This configurability is recognized by veiling convey proliferation: the CPA in the last period of themultiplier is replaced by the proposed CMA. A deduced tree compressor is utilized to diminish the hoarding layer significance of the partial product tree.

Our approach displays a term addressing the power and precision essentials which streamline the partial product reduction (PPR) portion as required. An induced multiplier is arranged using the proposed adder and compressor. This multiplier, together with a customary multiplier and the as of late thought about vague multipliers, was realized in Verilog HDL using a 45-nm library to survey the power utilization, fundamental way deferral, and plan zone. Examinations with the set up assessed multipliers, none of which have any novel re-configurability, demonstrate that the proposed multiplier gave the best trade off of power and delay against precision. All the multiplier structures are then evaluated in a veritable picture handling application.

#### LITERATURE SURVEY

The adder is a basic part of for the most part multipliers. Mahdiani et al. [2] proposed the lower-part-OR adder, which uses OR doors for development of the lower bits and exact adders for extension of the upper bits. It resembles our proposed CMA in that it uses OR ways to make the whole around, anyway our CMA is in like manner logically reconfigurable.

Liu et al. [3] utilized an inaccurate adder to diminish convey engendering delay in incomplete item accumulation. They also proposed a recovery vector to improve precision. The bit width of the blunder recovery vector can be picked by the originator to satisfy exactness requirements. Hashemi et al. [4] proposed a methodology that decreases the degree of the multiplier by perceiving the fundamental one bit of the information operands and picking the going with bits as compacted operands for the two information sources, where is an organizer described regard that decides the move speed used in the inside exact multiplier. Both [3] and [4] license a static trade off between power utilization and precision.

The bit lengths of the recovery vector [3] and the information operands [4] are settled in the midst of the structure methodology and the exactness isn't logically controllable, not in the slightest degree like with our proposed multiplier. Moons et al. [5] proposed a structure level system that impedes some part of the combinational justification and reconfigures the pipelined registers and combinational method of reasoning.

It can trade off exactness for power continuously by changing the amounts of pipeline stages and voltage-precision scaling modes. Our proposed multiplier moreover debilitates some bit of the combinational method of reasoning in the CPA to achieve lower control utilization, yet our very own does not require a pipeline structure or control circuits for voltage scaling.

#### **PROPOSED METHOD**

#### MAC UNIT:

The MAC unit underpins chiefly three functions:
1.Signed and unsigned whole number multiplies
2. Multiply-collect activities supporting marked, unsigned,
3. and marked fragmentary operands Miscellaneous register tasks

A accumulator/adder and multiplier together structure a MAC unit. Typically Carry-select/Carry- save adders are generally executed because of the DSP application prerequisite of quick speed[1]. The memory gets the contributions from its area to multiplier for further multiply-accumulate activities. The created aftereffect of MAC unit is put away at an important memory area. The circumstance requests this total procedure ought to be done into a solitary clock cycle



Figure 1: Block diagram of MAC Unit

#### A. ACCURACY-CONTROLLABLE MULTIPLIER

An ordinary multiplier contains three segments: (i) incomplete item age using an AND entryway; (ii) PPR using a snake tree; and (iii) development to make the last result using a CPA. Power utilization and circuit multifaceted nature are overpowered by the PPR [6], and the multiplier's essential way is led by the spread convey chain in the CPA [7].

#### **B. APPROXIMATE TREE COMPRESSOR**

8355 | Deborah Roseline.P Multiplier for Imprecise computing system

From precise half adder, for which the accompanying condition can be acquired:

$$\{c, s\} = a + b = 2c + s = (c + s) + c,$$

where  $\{,\}$  and + indicate connection and expansion, separately. The esteem c is created by an AND b and s is produced by a XOR b, so (C+S) can be produced by an OR b. In view of the above mentioned, consider the cell appeared in Fig. 1(b), for which the accompanying conditions can be gotten:

$$p = c + s,$$
  

$$q = c,$$
  

$$\{c, s\} = a + b = p + q.$$

This is called an incomplete adder cell (iCAC). Table I demonstrates reality tables for a precise half adder and an iCAC. Note that the bit position of c and that of s, p, and q are extraordinary. As can be seen, q is equivalent to c. While p isn't rise to s, the exact total can be gotten by including p and q, so the iCAC isn't an estimated adder however a component of an exact adder.

By stretching out the above condition to  $\_$  bits, the accompanying condition can be acquired:

$$S = A + B = P + Q.$$

where A, B, P, and Q are m-bit esteems, the bits of which compare to a, b, p, and q, separately. A column of eight iCACs, utilized for 8-bit inputs, is shown in Fig. 2. Consider the case of a 8-bit snake with the two data sources A = 01011111 and B = 00110110. The precise whole S is 10010101, while the column of iCACs produces P = 01111111 and Q = 00010110. Once more, it is obvious that the accompanying holds :

# S = P + Q.

While S is obtained from P and Q, P can be used as an approximation for S, and Q can be used as an error recovery vector for the approximate sum P.



Figure 2. Accurate half adder

#### **C.CARRY-MASKABLE ADDER**

A CMA is proposed to control the precision adaptable and continuously. A \_-bit CMA incorporates carry maskable full adders and one carry maskable half adder, and

8356 | Deborah Roseline.P Multiplier for Imprecise computing system

its structure resembles that of a \_-bit CPA. The structures of the proposed carry maskable half and full adders. Toward the day's end, the undertaking of the proposed half adder can be compelled by the dynamic low sign mask\_x. Right when mask\_x is crippled (=1), it fills in as a precise half adder, and when mask\_x is enabled (=0), Cout is secured to 0 and it functions as an OR door with yield S. The action of the proposed full adder resembles the half adder: when mask\_x is incapacitated (=1), it fills in as a precise full adder, and when mask\_x is engaged (=0), Cout is equal to Cin and S is the yield of an OR door.



Fig. 3. (a) Carry-maskable half adder, (b) Carry-maskable full adder.

#### RESULTS

The RTL schematic and simulation result of proposed method is as follows:



Figure 4: RTL schematic of proposed method

8357 | Deborah Roseline.P Multiplier for Imprecise computing system



Figure 5: Internal RTL Schematic



Table I : Comparison of delay and LUTs

|                    | Delay   | Number of LUTs |
|--------------------|---------|----------------|
| EXISTING<br>SYSTEM | 21.2896 | 155            |
| PROPOSE            | 4.576   | 151            |
| DSYSTEM            |         |                |

#### CONCLUSION

A precision controllable deduced multiplier has been proposed in this paper expends less number of LUTs and has a shorter fundamental path delay than the standard structure. Its dynamic controllability is recognized by the proposed CMA. The multiplier was evaluated at both the circuit and application levels. The test outcomes demonstrate that the proposed multiplier had the alternative to pass on

#### 8358 | Deborah Roseline.P Multiplier for Imprecise computing system

significant power venture assets and speedups while keeping up a basically humbler circuitzone than that of the common Wallace tree multiplier. In addition, for a comparative precision, the proposed multiplier passed on progressively noticeable updates in both power utilization and fundamental path delay than other as of late pondered harsh multipliers.

#### REFERENCES

[1] S. Venkataramani, V. K. Chippa, S. T. Chakradhar, K. Roy, and A. Raghunathan. "Quality programmable vector processors for approximate computing," *46th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)*, pp. 1-12, Dec. 2013.

[2] H. R. Mahdiani, A. Ahmadi, S. M. Fakhraie, and C. Lucas, "Bio-Inspired imprecise computational blocks for efficient VLSI implementation of Soft-Computing applications," *IEEE Transactions on Circuits and Systems I: Regular Papers*, vol. 57, no. 4, pp. 850-862, Apr. 2010.

[3] C. Liu, J. Han, and F. Lombardi, "A Low-Power, High-Performance approximate multiplier with configurable partial error recovery," *Design, Automation & Test in Europe Conference & Exhibition (DATE)*, Mar. 2014.

[4] S. Hashemi, R. I. Bahar, and S. Reda, "DRUM: A Dynamic Range Unbiased Multiplier for approximate applications," *IEEE/ACM International Conference on Computer-Aided Design (ICCAD)*, pp. 418- 425, Nov. 2015.

[5] B. Moons, M. Verhelst, "DVAS: Dynamic Voltage Accuracy Scaling for increased energy- efficiency in approximate computing," *IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)*, Jul. 2015.

[6] A. Momeni, J. Han, P. Montuschi, and F. Lombardi, "Design and analysis of approximate compressors for multiplication," *IEEE Transactions on Computers*, vol. 64, no. 4, pp. 984-994, Apr. 2015.

[7] K. C. Bickerstaff, E. E. Swartzlander, and M. J. Schulte, "Analysis of column compression multipliers," *15th IEEE Symposium on Computer Arithmetic*, pp. 33-39, Jun. 2001.

[8] Z. Yang, J. Han, and F. Lombardi, "Approximate compressors for Error- Resilient multiplier design," *IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFTS)*, pp. 183-186, Oct. 2015.

[9] NanGate, Inc. NanGate FreePDK45 Open Cell Library, http://www.nangate.com/?page\_id=2325, 2008

[10] J. Liang, J. Han, and F. Lombardi, "New metrics for the reliability of approximate and probabilistic adders," *IEEE Transactions on computers*, vol. 62, no. 9, pp. 1760-1771, Sep. 2013.

[11] M. S. Lau, K. V. Ling, and Y. C. Chu, "Energy-Aware probabilistic multiplier: Design and Analysis," *2009 international Conferrence on Compliers, architeture, and synthesis for embedded systems,* pp. 281-290, Oct. 2009.