1 / 41

Introduction to Field Programmable Gate Arrays

Introduction to Field Programmable Gate Arrays. Lecture 2/3 CERN Accelerator School on Digital Signal Processing Sigtuna, Sweden, 31 May – 9 June 2007 Javier Serrano, CERN AB-CO-HT. Outline. Digital Signal Processing using FPGAs Introduction. Why FPGAs for DSP?

rock
Télécharger la présentation

Introduction to Field Programmable Gate Arrays

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Introduction to Field Programmable Gate Arrays Lecture 2/3 CERN Accelerator School on Digital Signal Processing Sigtuna, Sweden, 31 May – 9 June 2007 Javier Serrano, CERN AB-CO-HT

  2. Outline • Digital Signal Processing using FPGAs • Introduction. Why FPGAs for DSP? • Fixed point and its subtleties. • Doing arithmetic in hardware. • Distributed Arithmetic (DA). • COordinate Rotation DIgital Computer (CORDIC).

  3. Outline • Digital Signal Processing using FPGAs • Introduction. Why FPGAs for DSP? • Fixed point and its subtleties. • Doing arithmetic in hardware. • Distributed Arithmetic (DA). • COordinate Rotation DIgital Computer (CORDIC).

  4. Why FPGAs for DSP? (1) Reason 1: FPGAs handle high computational workloads FPGA Conventional DSP Device(Von Neumann architecture) Data In Data In Reg1 Reg2 Reg255 Reg0 Reg C0 C1 C2 C255 .... MAC unit Data Out Data Out 256 Loops needed to process samples All 256 MAC operations in 1 clock cycle

  5. FPGAs are ideal for multi-channel DSP designs • Many low sample rate channels can be multiplexed (e.g. TDM) and processed in the FPGA, at a high rate. • Interpolation (using zeros) can also drive sample rates higher. 80MHz Samples ch1 20MHz Samples LPF ch2 LPF LPF ch3 LPF Multi Channel Filter ch4 LPF

  6. × × × × + + + + + + Why FPGAs for DSP? (2) Reason 2: Tremendous Flexibility A Q = (A x B) + (C x D) + (E x F) + (G x H) can be implemented in parallel B C D Q E F G H But is this the only way in the FPGA?

  7. × × × × × × × D Q + + + + + + + + + + + + Customize Architectures to Suit Your Ideal Algorithms Parallel Semi-Parallel Serial D Q Speed Optimized for? Cost FPGAs allow Area (cost) / Performance tradeoffs

  8. Hundreds of Termination Resistors A/D AFE A/D DSP Procs. DDC DDC MACs MACs FPGA DDC DDC Control Control SDRAM SDRAM FPGA D/A DUC DUC DSP Card D/A DUC DUC SDRAM SDRAM Why FPGAs for DSP? (3) Reason 3: Integration simplifies PCBs ASSP SSTL3 Translators Quad TRx PowerPC NetworkCard Quad TRx A/D ASSP A/D Control PL4 3.125 Gbps PowerPC PowerPC MACs, DUCs, DDCs, Logic D/A PowerPC PowerPC D/A Control CORBA

  9. Outline • Digital Signal Processing using FPGAs • Introduction. Why FPGAs for DSP? • Fixed point and its subtleties. • Doing arithmetic in hardware. • Distributed Arithmetic (DA). • COordinate Rotation DIgital Computer (CORDIC).

  10. Fixed point binary numbers Example: 3 integer bits and 5 fractional bits

  11. Fixed point truncation vs. rounding Note that in 2’s complement, truncation is biased while rounding isn’t.

  12. Outline • Digital Signal Processing using FPGAs • Introduction. Why FPGAs for DSP? • Fixed point and its subtleties. • Doing arithmetic in hardware. • Distributed Arithmetic (DA). • COordinate Rotation DIgital Computer (CORDIC).

  13. The Full Adder (FA)

  14. Add/subtract circuit S = A+B when Control=‘0’ S = A-B when Control=‘1’

  15. Saturation You can’t let the data path become arbitrarily wide. Saturation involves overflow detection and a multiplexer. Useful in accumulators (like the one in the PI controller we use in the lab).

  16. Multiplication: pencil & paper approach

  17. A 4-bit unsigned multiplier using Full Adders and AND gates Of course, you can use embedded multipliers if your chip has them!

  18. Constant coefficient multipliers using ROM For “easy” coefficients, there are smarter ways. E.g. to multiply a number A by 31, left-shift A by 5 places then subtract A.

  19. Division: pencil & paper • Uses add/subtract blocks presented earlier. • MSB produced first: this will usually imply we have to wait for whole operation to finish before feeding result to another block. • Longer combinational delays than in multiplication: an N by N division will always take longer than an N by N multiplication.

  20. Pipelining the division array

  21. Square root • Take a division array, cut it in half (diagonally) and you have square root. Square root is therefore faster than division! • Although with less ripple through, this block suffers from the same problems as the division array. • Alternative approach: first guess with a ROM, then use an iterative algorithm such as Newton-Raphson.

  22. Outline • Digital Signal Processing using FPGAs • Introduction. Why FPGAs for DSP? • Fixed point and its subtleties. • Doing arithmetic in hardware. • Distributed Arithmetic (DA). • COordinate Rotation DIgital Computer (CORDIC).

  23. Distributed Arithmetic (DA) 1/2 Digital filtering is about sums of products: c[n] constant (prerequisite to use DA) x[n] input signal B bits wide Let’s assume: xb[n] is bit number b of x[n] (either 0 or 1) Then: And after some rearrangement of terms: This can be implemented with an N-input LUT

  24. Distributed Arithmetic (DA) 2/2 LUT + Register xB[0] …… x1[0] x0[0] xB[1] …… x1[1] x0[1] y …….... …….... …….... xB[N-1] …… x1[N-1] x0[N-1] 2-1 Generates a result every B clock ticks. Replicating logic one can trade off speed vs. area, to the limit of getting one result per clock tick.

  25. Outline • Digital Signal Processing using FPGAs • Introduction. Why FPGAs for DSP? • Fixed point and its subtleties. • Doing arithmetic in hardware. • Distributed Arithmetic (DA). • COordinate Rotation DIgital Computer (CORDIC).

  26. COrdinate Rotation DIgital Computer

  27. Pseudo-rotations

  28. Basic CORDIC iterations

  29. Angle accumulator

  30. The scaling factor

  31. Rotation Mode

  32. Example: calculate sin and cos of 30º

  33. Vectoring Mode Vector magnitude

  34. Circular coordinate system

  35. Other coordinate systems

  36. Generalized CORDIC equations

  37. Summary of CORDIC functions

  38. Precision and convergence

  39. FPGA implementation

  40. Iterative bit-serial design

  41. Acknowledgements • Many thanks to Jeff Weintraub (Xilinx University Program) and Bob Stewart (University of Strathclyde) for many of these slides.

More Related