html5-img
1 / 16

High speed digital systems laboratory

Technion - Israel institute of technology department of Electrical Engineering . High speed digital systems laboratory. High-Throughput FFT. Student : Andrey Kuyel Supervised by Mony Orbach Spring 2011 Midterm Presentation (One semestrial project). Project goals.

lona
Télécharger la présentation

High speed digital systems laboratory

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Technion- Israel institute of technology department of Electrical Engineering High speed digital systems laboratory High-Throughput FFT Student : AndreyKuyelSupervised by MonyOrbachSpring 2011Midterm Presentation (One semestrial project)

  2. Project goals • The project goal is to design and implement on FPGA device FFT that capable to deal with data transmitted at the rates up to 10Ms/sec*. • The design will be written on VHDL • The project has aspects of: signal processing and logic design and high rate data processing. *- 5Ms/sec for each of I and Q components .

  3. The DFT (N- length vector) definition is: FFT - Theoretic overview The time-complexity of the DFT is: The FFT algorithm (developed at first by J.W. Cooleyand John Tukey at 1965) comes to reduce the time-complexity of DFT into This algorithm called: "The Cooley–Tukey radix-2 FFT algorithm". It is one of the most common FFT algorithms.

  4. FFT radix-2 - Theoretic overview The Cooley–Tukey radix-2 FFT algorithm The idea of algorithm is to compute in two parts: 1. Calculation two series While: : The first series is using all of the even components of the input vector - , and the second uses the odd components - This calculation takes

  5. FFT radix-2 - Theoretic overview • Calculation of the twiddles: The FFT (N- length vector) definition is:

  6. FFT radix-2 - Theoretic overview The FFT schematic Radix-2 diagram (for N=2) – called butterfly unit: 4 real multipliers and 2 add/sub 16*16 bit 2 (16bits)adders/substractors The butterfly unit is used as a sub-unit in any Radix-2 FFT-N-size unit. The "twiddle factor" is the sine/cosine imaginary/real factor:

  7. The FFT (N=8) radix 2 data flow C language program for creating FFT data flow for N points FFT

  8. FFT core features • FFT core will have the following features: • Real and imaginary Inputs: 8 bits width each (for each of N point). • Real and imaginary outputs: 16 bits width each, where 8 MSB bits for integer part and 8 LSB bits for fractional part (for each N point). • Drop-in module for Virtex-6 (xc6vlx240T) • Forward complex FFT • Transform sizes N = 16,32,(possibly 64 and 128 ) • Arithmetic type: Fixed-point • Truncation after the butterfly • Block RAM or/and Distributed RAM for data storage • Bit/digit reversed or natural output order • Input data at frequency 10 Gs/sec (total rate for real and image part of data )

  9. Parallel N points radix 2 FFT – Block diagram clock Parallel N points radix 2 FFT Synchronization signals Controls signals Input data (real). N points each of 8 bits width Output data (real). N points each of 16 bits width Input data (imag). N points each of 8 bits width Input data (imag). N points each of 16 bits width

  10. The FFT (N=16) radix 2 data flow – pipeline stages separation

  11. FFT verification and performance measurement • The FFT core will be tested on Virtex-6 FPGA (Ml605 based on xc6vlx240T) Test module • Virtex-6 FPGA A2D@10Gs/sec* Output signals Data in Data out FFT core Controls out Controls in Stimulus Memory MEM DATA@10Gs/sec • Currently implementation will be made on Virtex-6 FPGA Family (xc6vlx240T), the devise contains 768 DSP slices, distributed RAM 3,650 Kb and block RAM of the size 14,976 Kb. • See attached specification table at the end of document. *- Primary verification will be done on the reference data from FPGA internal memory that will emulate data sampled at the rate 10Ms/sec

  12. Post synthesis report • Maximum operational frequency up to 350MHz (5.6 Gs/sec per component). • The design have: • 8*4*4=128 real (16*16 bits) multipliers. Where one 16*16 bits multiplier take 280 6LUTs • 128 (16 bits) adders/subtractors each of the takes 16 of 6LUTS

  13. Simulation and verification • C program for generating radix 2 data flow • Radix 2 FFT N points parallel C language model for verification VHDL implementation of design.

  14. Project Schedule

  15. Project Schedule (continued…)

More Related