60 likes | 78 Vues
Filter plays a major role for removal of unwanted signal noise from the original signal, especially Adaptive FIR filter is easy to attract for many applications, where it is need to minimize computational requirement. This paper shows a novel pipelined design for low power, high throughput, and low area implementation of adaptive lter based on distributed arithmetic DA . The DA formulation utilized for two separate blocks weight update block and filtering operations requires larger area and is n't suited for higher order filters therefore causes reduction in the throughput. These issues have been overcome by efficient distributed formulation of Adaptive filters. LMS adaptation performed on a sample by sample basis is replaced by a dynamic LUT update by the weight update scheme. Adder based shift accumulation for inner product computation replaced by conditional signed carry save accumulation to reduces the sampling period and area density. Yamuna P | B K Venu Gopal "FPGA Based Hybrid LMS Algorithm Design on Distributed Arithmetic" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-1 , December 2018, URL: https://www.ijtsrd.com/papers/ijtsrd19018.pdf Paper URL: http://www.ijtsrd.com/engineering/electronics-and-communication-engineering/19018/fpga-based-hybrid-lms-algorithm-design-on-distributed-arithmetic/yamuna-p<br>
E N D
International Journal of Trend in International Open Access Journal International Open Access Journal | www.ijtsrd.com International Journal of Trend in Scientific Research and Development (IJTSRD) Research and Development (IJTSRD) www.ijtsrd.com ISSN No: 2456 ISSN No: 2456 - 6470 | Volume - 3 | Issue – 1 | Nov 1 | Nov – Dec 2018 FPGA Bas Design Design on Distributed Arithmetic FPGA Based Hybrid LMS Algorithm Yamuna P1, B K Venu Gopal2 Yamuna P 1P PG.Scholar, 2Associate Professor f Electronics and Communication Engineering, University Visvesvaraya College of Engineering, Bengaluru,Karnataka Department of University Visvesvaraya College Karnataka, India ABSTRACT Filter plays a major role for removal of unwanted signal/noise from the original signal, especially Adaptive FIR filter is easy to attract for many applications, where it is need to minimize computational requirement. This paper shows a novel pipelined design for low-power, high-throughput, and low-area implementation of adaptive filter based on distributed arithmetic (DA). The DA formulation utilized for two separate blocks weight update block and filtering operations requires larger area and is n’t suited for higher order filters therefore causes reduction in the throughput. These issues have been overcome by efficient distributed formulation of Adaptive filters. LMS adaptation performed on a sample by sample basis is replaced by a dynamic LUT update by the weight update scheme. Adder based shift accumulation for inner product computation replaced by conditional accumulation to reduces the sampling period and area density. KEY WORDS: Distributed Arithmetic, LMS Adaptive Filter, VHDL, DA, Adaptive Filter, LMS algorithm, Carrysave adder, Noise Cancellation, Digital Signal Processing 1.INTRODUCTION Adaptive digital filters find wide application in few advanced digital signal processing (DSP) areas, e.g noise and echo cancellation, system identification, channel estimation, channel equalization and so forth [1]. The tapped-delay-line response(FIR) filter whose weights are updated by the celebrated “Widrow Hoff” least-mean- algorithm [2] might be considered as the most straight forward known adaptive filter. The LMS adaptive filter is prominent because of its low-complexity, as well as due to its stability and attractive performance [3]. because of current relevance and expanding limitations time, and power complexity, efficient implementation of the LMS adaptive filter is still very vital implement the LMS algorithm, one needs the filter weights during each sampling period using the estimated error, which equals the difference between the current filter output and the desired response. The new weights of the are refreshed. Then µ convergence factor or step which is usually assumed to be a positive number an Nis the quantity of weights utilized adaptive filter. The pipelining structure is over the time consuming combinational blocks of the structure. The normal LMS algorithm does not support pipelined implementation due to behavior. It is adjusted to a form called the delayed LMS (DLMS) algorithm. This DLMS utilize an altered systolic architecture to reduce the adaptation delay where they have utilized moderately extensive processing elements (PEs) for accomplishing adaptation delay with the basic way of one MAC task The direct form configuration on the forwar the FIR filter subsequently prompts a long critical path because an inner product a filter output. subsequently, when the input signal is having a high sampling rate, it becomes crucial to reduce the critical path of the structure wi that it could not surpass the sampling period Distributed Arithmetic based technique comp structure which is stay away from of multiplier that expands the throughput. 2.FIR FILTER The FIR filter is a circuit that filters a digital (samples of numbers) and provides output that i (samples of numbers) and provides output that is Filter plays a major role for removal of unwanted signal/noise from the original signal, especially Adaptive FIR filter is easy to attract for many applications, where it is need to minimize computational requirement. This paper shows a novel o its stability and attractive convergence its few essential use of vance and expanding limitations on area, time, and power complexity, efficient implementation S adaptive filter is still very vital. To ement the LMS algorithm, one needs to update the filter weights during each sampling period using the estimated error, which equals the difference between the current filter output and the desired . The new weights of the LMS adaptive filter . Then µ convergence factor or step-size, which is usually assumed to be a positive number and Nis the quantity of weights utilized in the LMS adaptive filter. The pipelining structure is proposed the time consuming combinational blocks of the structure. The normal LMS algorithm does not pipelined implementation due to its recursive to a form called the delayed . This DLMS utilize an ystolic architecture to reduce the adaptation they have utilized moderately extensive ing elements (PEs) for accomplishing a lower ion delay with the basic way of one MAC task. The direct form configuration on the forward path of the FIR filter subsequently prompts a long critical inner product computation to acquire throughput, and filter based on distributed arithmetic (DA). The DA formulation utilized for two separate blocks weight update block and filtering operations requires larger area and is n’t suited for higher order filters therefore causes reduction in the throughput. These issues have been overcome by efficient distributed formulation of Adaptive filters. LMS adaptation performed on a sample by sample basis is replaced by a dynamic LUT e by the weight update scheme. Adder based shift accumulation for inner product computation replaced by conditional accumulation to reduces the sampling period and area signed signed carry carry-save Distributed Arithmetic, LMS Adaptive DA, Adaptive Filter, LMS algorithm, Carrysave adder, Noise Cancellation, Digital Signal , when the input signal is find wide application in few digital signal processing (DSP) areas, e.g., noise and echo cancellation, system identification, channel equalization and so forth having a high sampling rate, it becomes crucial to critical path of the structure with the goal surpass the sampling period [4]. Distributed Arithmetic based technique comprises of a structure which is stay away from of multiplier that line eights are updated by the -square (LMS) finite finite-impulse- might be considered as the most straight known adaptive filter. The LMS adaptive The FIR filter is a circuit that filters a digital signal complexity, as @ IJTSRD | Available Online @ www.ijtsrd.com www.ijtsrd.com | Volume – 3 | Issue – 1 | Nov-Dec 2018 Dec 2018 Page: 558
International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456 International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456 International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456-6470 another digital signal with attributes that are reliant the response of the filter. The FIR fi recursive. It utilizes a finite duration of none zero input values and produces a finite duration of output values which are non-zero. These days, Field Programmable Gate Array technology is widely utilized in digita processing area of the fact that FPGA-based solution can accomplish high speeds owing to its parallel structure and configurable logic, thus providing great amount of flexibility and high reliability throughout design process and later maintenance. A number of architectures have been reported in the writing memory-based implementation of DSP algorithms which includes orthogonal transforms and digital filters [6]. FIR filters use addition to calculate their outputs simply like averaging does. The primitive elements utilized in the outline of a FIR filter are delays, multipliers and adders. The FIR filter comprises series of delays, multiplications and additions to create the time domain output response. The impulse response of the FIR filter is the coefficients utilized. The phase of a FIR which is its dominant feature. Filter is im sequentially utilizing a multiplier and an adder with feedback which is outlined in the high level schematic in Fig. 1[11]. another digital signal with attributes that are reliant on the response of the filter. The FIR filter is non – a finite duration of none zero signal) is minimized. Most adaptive filters are dynamic filters which change their attributes to desired output. Adaptive filter has various signal) is minimized. Most adaptive filters are dynamic filters which change accomplish desired output. Adaptive filter has various parameters which should be ch maintain optimal filtering. Adaptive filtering types of algorithms one is Least and the other Recursive0Least Squares filtering. one of the applications of adaptive filter noise cancellation. Adaptive filter has two vital work successfully in unknown environment; second, it is used to track the input signal of time characteristics [7]. Due to DSP has serial architecture, it can't process high sampling rate used only for amazingly c tasks. An example of adaptive filter system in Fig. 3. The objective of the modify the coefficient of the adaptive fi with the end goal to limit the error term e[n] in the mean-squared sense. The adaptive algorithm distinguishes a vector of coefficient w[n] that minimizes the following quadratic equation, ???? ? ??|????|2 where e[n] = d[n] ?y[n], d[n] is the output from the unknown system (desired signal), y[n] is the output from the adaptive filter, the expected value, and _[n] is the mean squared error (MSE). Many to solve Eqn. 3.1,-0- the most widely recognized being the strategy for steepest descent. This procedure proceeds until the adaptive algorithm achieves state, where the concession solution vector and the optimal solution, or MSE, is at its minimum. In general, adaptive algorithms can be separated into four steps that are performed consecutively: filtering, calculating the coefficient updates coefficients. When split into this form, the essential deference prediction and interference canceling. and interference canceling. duration of output be changed in order to daptive filtering has two Least Mean0Square (LMS) Least Squares (RLS) optimal days, Field Programmable Gate Array (FPGA) in digital signal based solution filtering. one of the applications of adaptive filter is can accomplish high speeds owing to its parallel and configurable logic, thus providing great amount of flexibility and high reliability throughout design process and later maintenance. A number of been reported in the writing for properties: first, it can in unknown environment; second, it is used to track the input signal of time-varying . Due to DSP has serial architecture, igh sampling rate applications and of DSP algorithms orthogonal transforms and digital complex math-intensive An example of adaptive filter system is shown adaptive algorithm is to the coefficient of the adaptive filter, w[n], the error term e[n] in the squared sense. The adaptive algorithm basically vector of coefficient w[n] that minimizes the following quadratic equation, ? 2 (3.1) to calculate their outputs oes. The primitive elements of a FIR filter are delays, comprises of a cations and additions to the time domain output response. The impulse multiplication . The phase of a FIR filter is linear ilter is implemented a multiplier and an adder with a in the high level schematic y[n], d[n] is the output from the unknown system (desired signal), y[n] is the output expected value, and _[n] is the mean squared error (MSE). Many methods exist the most widely recognized being the strategy for steepest descent. This procedure l the adaptive algorithm achieves steady ere the concession between the current solution vector and the optimal solution, or MSE, is at aptive algorithms can be eps that are performed filtering, ly: computing computing updates, and updating the plit into this form, the essential the the error, error, Fig.1: FIR Filter The LMS algorithm updates the coefficient in an iterative way and feeds to the FIR filter. The FIR fil at that point utilizes this coefficient c(n) in addition to the input reference signal x(n) to generate the output response y(n). The output y(n) is then subtracted from the desired signal d(n) to produce an error signal e(n), or, in other words by the LMS algorithm to compute the next set of coefficients. 3.ADAPTIVE FIR FILTER The idea behind a closed loop adaptive filter is th variable filter is balanced until the error (the difference between the filter output and the desired difference between the filter output and the desired efficient in an e FIR filter. The FIR filter s this coefficient c(n) in addition to the input reference signal x(n) to generate the output response y(n). The output y(n) is then subtracted from an error signal e(n), LMS algorithm to compute The idea behind a closed loop adaptive filter is that a until the error (the Fig 3 Unknown and adaptive filter have different length length Unknown and adaptive filter have different @ IJTSRD | Available Online @ www.ijtsrd.com www.ijtsrd.com | Volume – 3 | Issue – 1 | Nov-Dec 2018 Dec 2018 Page: 559
International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456 International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456 International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456-6470 Adaptive filtering has two basic operations: the filtering process and the adaptation process filter used to generate output signal from an input signal by the filtering process, whereas the adaptation process consists of an algorithm which desired cost function by adjusting the coefficients of the filter. 4.LMS ADAPTIVE DESIGN) LMS algorithm is introduced. LMS remain Mean-Square. This algorithm was created by “Bernard Widrow” during the 1960’ first0generally used adaptive algorithm. It is sti generally utilized in adaptive digital signal processing and adaptive antenna arrays, primarily because of its simplicity, ease of implementation and great convergence properties. The objective of the LMS algorithm is to deliver mean square error0 for the given condition weights that minimize the mean-squared error between a desired signal and the arrays ou loosely speaking, it attempts to maximi towards the desired signal (who or what the array is trying to communicate with) and limit reception from the interfering or undesirable signals in the Fig 4 desirable signals in the Fig 4 two basic operations: the adaptation process. A digital he number of weights used in the LMS adaptive . It is outstanding that the LMS convergence for correlated rcumstance that can show up in identification applications is when the coefficients of varying. The adaptive algorithm a mechanism to track the changes of the in the LMS algorithm described has a The number of weights used in the LMS a filter is denoted as N. It is outstanding algorithm has a moderate convergence for correlated inputs. Another circumstance that can show up identification applications is when the coefficients of the model are time-varying. The a should give a mechanism to track t model. Fig 4 in the LMS algorithm describ very low computational0complexity (number of additions, subtractions, divisions, multiplications per iteration) and memory loa exceptionally interesting implementations. It is notable influences the performances of the a spite of its low complexity, the LMS has additionally some drawback. 5.DISTRIBUTED ARITHMETIC (DA) Distributed arithmetic (DA) is signal processing algorithms where figurin product of two vectors involves majority computational work load. This sort profile depicts a huge part algorithms, so the potential usage of dist arithmetic is huge. The inner product is us processed utilizing multipl Distributed0Arithmetic, Arithmetic, are computation algorithms that perform multiplication with look-up table based plan blended some enthusiasm more than two decades prior however have mulled from that point onward fact, DA particularly focuses on (sometimes referred to as the vector dot produ computation that spread a large number DSP filtering and frequency transforming functions. XILINX FPGA look up table are not familiar f users and DA algorithm popularly used to produce filter designs. The introduction of the DA algorithm to a great degree straightforward yet its applica are to a great degree wide. The science incorporates a blend of Boolean and ordinary alge prior preparation - even for the logic designer. DA fundamentally diminishes the quantity of additions required for filtering. especially detectable for filters accuracy. This reduction in the remaining burden is a result of storing the pre computed partial sums of the filter co memory table. Distributed arithmetic needs arithmetic computing resources and no when compared with other alternati when compared with other alternatives. filter used to generate output signal from an input , whereas the adaptation process consists of an algorithm which minimize a he coefficients of LMS ADAPTIVE FILTER FILTER (EXISTING (EXISTING complexity (number of is introduced. LMS remains for Least- . This algorithm was created by additions, subtractions, divisions, multiplications per iteration) and memory load, which makes it exceptionally interesting mplementations. It is notable that the step-size µ influences the performances of the adaptive filter. In low complexity, the LMS has additionally ’s, and is the for practical used adaptive algorithm. It is still in adaptive digital signal processing and adaptive antenna arrays, primarily because of its ease of implementation and great DISTRIBUTED ARITHMETIC (DA) ibuted arithmetic (DA) is ordinarily utilized for essing algorithms where figuring the inner vectors involves majority of the mputational work load. This sort of computing rofile depicts a huge part of signal processing algorithms, so the potential usage of distributed arithmetic is huge. The inner product is usually multipliers Arithmetic, Arithmetic, are computation algorithms that perform up table based plans. Both blended some enthusiasm more than two decades have mulled from that point onward. In fact, DA particularly focuses on the sum of products (sometimes referred to as the vector dot product) computation that spread a large number of the vital DSP filtering and frequency transforming functions. XILINX FPGA look up table are not familiar for DSP users and DA algorithm popularly used to produce tion of the DA algorithm is to a great degree straightforward yet its applications the LMS algorithm is to deliver the condition and squared error between a desired signal and the arrays output, to maximize reception the desired signal (who or what the array is reception from and adders. Modulo alongside alongside Fig 4 Conventional LMS adaptive filter Conventional LMS adaptive filter some data is required before optimal weights can be resolved. The weights of LMS adaptive the nth emphasis are updated according to the following equation. The updated weight is wn+1= wn+ µenxn0 en = dn – yn0 yn = wTnx0 Where en is the error, dn is the desired during nth iteration calculated, µis convergence or step size. The weight vector Wn and input vector xn is given by ptimal weights can be The weights of LMS adaptive filter during are updated according to the following equation. The updated weight is The science incorporates a Boolean and ordinary algebra and require no even for the logic designer. fundamentally diminishes the quantity of ed for filtering. This decrease is pecially detectable for filters with high piece . This reduction in the computational is a result of storing the pre- computed partial sums of the filter co-efficient in the ed arithmetic needs fewer arithmetic computing resources and no0multipliers, dn is the desired response , µis convergence factor or step size. The weight vector Wn and input vector @ IJTSRD | Available Online @ www.ijtsrd.com www.ijtsrd.com | Volume – 3 | Issue – 1 | Nov-Dec 2018 Dec 2018 Page: 560
International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456 International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456 International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456-6470 6.DISTRIBUTED APPROACH (PROPOSED DESIGN) The essential block diagram for the proposed system for the design of an Adaptive FIR Filter using Distributive Arithmetic is proposed Adaptive Filter The proposed structure of DA adaptive filter of length N = 4 is appeared 6.1[11]. It comprises of a four block and a weight-increment block along with additional circuits for the calculation e(n) and control word t for the barrel shifters. On relating with existing filter0implementations the proposed architecture has achieved low complexity. DISTRIBUTED APPROACH (PROPOSED DESIGN) block diagram for the proposed system for the design of an Adaptive FIR Filter using is proposed DA-Based LMS The proposed structure of DA-based ARITHMETIC ARITHMETIC BASED BASED This part of distributed arithmetic is a good computing environments with limited computational resources, especially multipliers. These kind computing environments can be found on older field programmable gate arrays (FPGAs) and lo FPGAs. ibuted arithmetic is a good one for computing environments with limited computational pecially multipliers. These kind of computing environments can be found on older field- ys (FPGAs) and low-end, ease = 4 is appeared in Fig s of a four-point inner product increment block along with nal circuits for the calculation of error value e(n) and control word t for the barrel shifters. On isting the proposed architecture has DA DA based based adaptive adaptive Fig 6.1 DA-based LMS adaptive LMS adaptive filter of filter length N length N For accomplishing this preferred standpoint Binary Coding (OBC) has been design -based LMS adaptive delay element DA-table structure. 6.1.FOUR POINT INNER- The underlying part of the filtering process of the LMS adaptive filter, for each cycle, is the need to perform an inner product calculation Arithmetic (DA) procedure is bit It is entirely a bit-level modification and accumulation operation. The computational algorithm that of the weighted sum of0produ four-point inner-product block shown in Fig.6 incorporates a DA table comprisin registers which stores the partial inner products yl for {0 < l ≤ 15} and a 16: 1 multiplexor (MUX) to choose the substance of one of those registers. Bit cut weights A = {w3l w2l w1l w0l} for 0 fed to the MUX as control in L the carry Save accumulator is fed by the output of the MUX. Fig 5 Distributed arithmetic Distributed arithmetic For accomplishing this preferred standpoint Offset Binary Coding (OBC) has been utilized in this filter “Note that cK for{0 ≤ k ≤ 2N – 1} registered and put away in RAM-based LUT of 2N words. Notwithstanding, rather than putting away 2N words in LUT, store (2N − 1) words in a DA table of 2N − 1 registers. A case of such a DA tab is appeared in Fig 5. It contains just 15 registers to store the pre computed sums of input words. Seven new estimations of ck0 are computed by seven adders in parallel Fig 5[11]. AK is the incorporate when Xkb =1 that is the least significant bit. The substance of the kth LUT area can be identified as kj. where kj is the (j + 1)th bit of N representation of integer k for 0 ≤ k ≤ 2N (K = 4) execution of the DA filter is appeared Figure 3.1. The contains all 16 possible combination sums of the filter weights w0;w1;w2 and of shift registers in Fig5[11] stores 4 consecutive input samples (x[n - i]; i =0…0……..3). The connection of rightmost bits of the shift registers becomes the address of the memory table clock cycle the shift register is shifted right. he shift register is shifted right. } can be pre- based LUT of 2N filter design requires 16 table structure. Notwithstanding, rather than putting away 2N 1) words in a DA table of of such a DA table for N = 4 -PRODUCT BLOCK part of the filtering process of the LMS adaptive filter, for each cycle, is the need to orm an inner product calculation. Distributed etic (DA) procedure is bit-sequential in nature. level modification of the multiply and accumulation operation. The fundamental DA is a computational algorithm that affords effective usage products, or dot product. The product block shown in Fig.6.2 incorporates a DA table comprising of an array of 15 registers which stores the partial inner products yl for 1 multiplexor (MUX) to choose of those registers. Bit cuts of weights A = {w3l w2l w1l w0l} for 0 ≤ l ≤ L − 1 are fed to the MUX as control in LSB-to- MSB order, and Save accumulator is fed by the output of the 15 registers to of input words. Seven are computed by seven adders . AK is the incorporate into Sum significant bit. The substance of the kth LUT area can be identified as kj. where kj is the (j + 1)th bit of N-bit binary ≤ 2N − 1. A 4-tap xecution of the DA filter is appeared in Figure 3.1. The contains all 16 possible combination w0;w1;w2 and w3.The bank stores 4 consecutive ……..3). The ightmost bits of the shift registers becomes the address of the memory table”. At every @ IJTSRD | Available Online @ www.ijtsrd.com www.ijtsrd.com | Volume – 3 | Issue – 1 | Nov-Dec 2018 Dec 2018 Page: 561
International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456 International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456 International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456-6470 After L bit cycles, the carry-save accumulator shif accumulates all the partial inner products and produces a sum word and a carry word of size (L + 2) bit each. The carry and sum words are shifted added with an input carry “1” to generate filter output which is consequently subtracted from the desired ou d(n) to acquire the error e(n). save accumulator shift tial inner products and significant one in the estimated error by appropriate The barrel shifter decides, added with or subtracted from the current weights. significant one in the estimated e number of locations. The barrel shifter decides, with or subtracted from the current weights. Realize the corresponding multiplication of a shift operation. To update n weights block consists of n carry-save units units performs the multiplication of shifted error values with the delayed input samples along with the addition with the old weights. weight-increment term the addition of old weight with weight increment term is merged with multiplicat To control for adder or subtract the error is used such that, when sign bit is zero barrel-shifter output added and when one it is subtracted from the content of the corresponding current value in the Multiplication is performed by each MAC unit, shifted value of error with the delayed input samples xn-m followed by the additions. by the additions. s a sum word and a carry word of size (L + 2) bit each. The carry and sum words are shifted added erate filter output which he desired output Realize the corresponding multiplication of a shift o update n weights, the weight-update save units. Each carry-save ts performs the multiplication of shifted error values with the delayed input samples along with the with the old weights. To the calculation of the addition of old weight with weight increment term is merged with multiplication. To control for adder or subtract cells the sign bit of at, when sign bit is zero the shifter output added and when one it is subtracted from the content of the corresponding current value in the is performed by each MAC unit, of the shifted value of error with the delayed input samples weight weight register [11]. Fig 6.2 Four-point inner-product block product block Each one the bits of the error except the MSB bit ignored, such that multiplication of input error is executed by a right shift, in the magnitude of the error the number of locations are number of leading zeros. “To generate the control word t for the barrel shifter the magnitude of the computed error is decoded by the logic convergence factor µ is taken as0 (1/N). also we use as 2−i/N, where i is a small integer”. The number of shifts t in that case is increased by i, and the input to the barrel shifters is pre shifted by i locations accordingly to reduce the hardware complexity. used to compute the inner0(dot) product of a consta coefficient vector and a variable inpu single bit serial operation”. This is the task that contributes to most of the critical path. product be thought to be y(n)=?????(?) , e(n)=d(n) where the weight vector w(n) and input vector x(n) the nth iteration are respectively given by {X(n)=[x(n),x(n-1),…….x(n-N+W(n)=[ (n),….,}the corresponding current value in the weight register. 6.2 THE WEIGHT INCREMENT UNIT It comprises of four barrel shifters and fo tractor cells in the Fig.6.3[11]. “The bar shifts the distinctive input values xk for N – 1} determined by the location of the most determined by the location of the most error except the MSB bit are ignored, such that multiplication of input xk by the in the magnitude of given by the o generate the control he magnitude of the by the logic. The (1/N). also we use µ . The number of shifts t in that case is increased by i, and the input to the barrel shifters is pre shifted by i locations accordingly to reduce the hardware complexity. “DA (dot) product of a constant coefficient vector and a variable input vector in a Fig 6.3 the weight increment unit weight increment unit The final outputs of the carry- updated weights which serve as an input to the error computation block as well as the weight for the next iterations. To keep the critical to one addition time a pair of delays are introduced before and after the final addition of the weight update block. 7.SIMULATION AND RESULTS Simulation and synthesis is performed by software using VHDL programming. may be seen that the power consump to just below half of that in the existing design. This has resulted because of a reduced switching activity of the design based on carry-save adder. An effic pipelined architecture implementation of DA-based adaptiv based adaptive filter. -save units constitute the weights which serve as an input to the error computation block as well as the weight-update block o keep the critical-path equal pair of delays are introduced before and after the final addition of the weight- . This is the task that . Let the inner e(n)=d(n)-y(n) ) and input vector x(n) at the nth iteration are respectively given by RESULTS N+W(n)=[?0 (n), ?1 and synthesis is performed by XILINX using VHDL programming. Here it very well power consumption has reduced to just below half of that in the existing design. This has resulted because of a reduced switching activity of the corresponding current value in the weight THE WEIGHT INCREMENT UNIT s of four barrel shifters and four adder/sub save adder. An efficient and low The barrel shifter pipelined architecture for for, delay input values xk for {k =0, 1, . . ., @ IJTSRD | Available Online @ www.ijtsrd.com www.ijtsrd.com | Volume – 3 | Issue – 1 | Nov-Dec 2018 Dec 2018 Page: 562
International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456 International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456 International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456-6470 9.ACKNOWLEDGMENTS Thanks for the base idea paper by Dr. S. A. Whit “Applications of the distributed arithmetic to digital signal processing: A tutorial review Mag., vol. 6. We would like to thank our mentor have been greatly supporting and provided guidance and offered help to make 10. REFERENCES 1.Apolinário Jr, José A., and Sergio L. Netto. "Introduction to Adaptive Filters." In QRD Adaptive Filtering, pp. 1-27. Springer US, 2009. 2.B. Widrow and S. D. Ste processing. Prentice Hall, Englewood Cliffs, NJ, 1985. 3.S. Háykin and B. Widrow, Least adaptive filters. Wiley-Interscience, Hoboken, NJ, 2003. 4.Park, Sang Yoon, and Pr "Lowpower, high-throughput, adaptive FIR filter arithmetic." Circuits and Systems II: Express Briefs, IEEE Transactions on 60, no. 6 (2013): 346-350. 5.D. J. Allréd, H. Yoo, V. Krishnan, W. Huang, and D. V. Anderson, “LMS adaptive filters using distributed arithmetic for high throughput,” IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 52, no. 7, pp. 1327–1337, Jul. 2005. 6.P. K. Meher, ‘LUT Optimization for Memory Based Computation,’ IEEE Trans on Circuits & Systems-II, pp.285-289, April 2010. 7.Háykin, Simon S. Adaptive filter theory. Pearson Education India, pp.18, 1996. 8.A. Croisiér, D. Estebán, M. L “Digital filter for PCM encoded signals US Patent 3, 777, 130,” 1973. 9.S. Zohar, “New Hardware Realizations of Nonrecursive Digital Filters,” IEEE Transactions on Computers, vol. C-22, no. 4, pp. 328 1973. 10.A. Peléd and B. Liu, “A New Hardware Realization of Digital Filters,” IEEE Transactions on ASSP, vol. 22, no. 6, pp. 456 on ASSP, vol. 22, no. 6, pp. 456–462, 1974. ACKNOWLEDGMENTS paper by Dr. S. A. Whité, Applications of the distributed arithmetic to digital signal processing: A tutorial review,” IEEE ASSP Fig 7.1 Simulation output for DA based weighted LMS algorithm The simulation results in the fig.7.1 describes the DA based weighted LM output and filtered, shifted outputs of adaptive LMS filter algorithm respectively. Simulation output for DA based weighted ur mentor manikandan who greatly supporting and provided steady and offered help to make this a success. in the fig.7.1 and fig.7.2 DA based weighted LMS algorithm of adaptive LMS Apolinário Jr, José A., and Sergio L. Netto. "Introduction to Adaptive Filters." In QRD-RLS 27. Springer US, 2009. B. Widrow and S. D. Steárns, Adaptive signal l, Englewood Cliffs, NJ, ykin and B. Widrow, Least-mean-square Interscience, Hoboken, NJ, Fig 7.2 Simulation output for adaptive LMS based filter algorithm A carry-save accumulation scheme of signed partial inner products for the computation of filter output has been implemented. From the synthesis results, it was found that the proposed design consumes less power over our previous DA based FIR adaptive filter. In future, work can be implemented on digital communication, signal processing application, digital radio receivers, software radio receivers and echo cancellation. Gate Count EXISTING 31,500 PROPOSED 40,000 Table 1: Result Comparison of Existing a Proposed design in terms of Area, Delay and Power 8.CONCLUSION This paper presents significantly less adaptation delay and provided significant saving of area and power. In proposed we are going to deal with direct form LMS algorithm along with transposed form of delayed least mean square algorithm (TDLMS), where the power is significantly reduced and the performance is increased along with decrease in area. From the synthesis results, it was found that the proposed design consumes less power over our previous DA FIR adaptive filter. Simulation output for adaptive LMS based Park, Sang Yoon, and Prámod Kumar Mehér. throughput, daptive FIR filter arithmetic." Circuits and Systems II: Express Briefs, IEEE Transactions on 60, no. 6 (2013): and on on and low low-area distributed based based distributed save accumulation scheme of signed partial inner products for the computation of filter output has been implemented. From the synthesis results, it was the proposed design consumes less power based FIR adaptive filter. In future, work can be implemented on digital communication, signal processing application, digital radio receivers, software radio receivers and echo d, H. Yoo, V. Krishnan, W. Huang, and D. V. Anderson, “LMS adaptive filters using thmetic for high throughput,” IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 52, no. 7, Delay (nS) 2.3 1.70 Power (mW) 91 67 DESIGN P. K. Meher, ‘LUT Optimization for Memory- Based Computation,’ IEEE Trans on Circuits & 289, April 2010. Table 1: Result Comparison of Existing and Simon S. Adaptive filter theory. Pearson Education India, pp.18, 1996. Area, Delay and n, M. Lévilion, and V. Rizo, “Digital filter for PCM encoded signals US Patent significantly less adaptation delay S. Zohar, “New Hardware Realizations of igital Filters,” IEEE Transactions 22, no. 4, pp. 328–338, and provided significant saving of area and power. In proposed we are going to deal with direct form LMS algorithm along with transposed form of delayed least TDLMS), where the power is ly reduced and the performance is increased d and B. Liu, “A New Hardware Realization of Digital Filters,” IEEE Transactions From the synthesis results, it was found that the proposed design consumes less power over our previous DA-based @ IJTSRD | Available Online @ www.ijtsrd.com www.ijtsrd.com | Volume – 3 | Issue – 1 | Nov-Dec 2018 Dec 2018 Page: 563