1 / 108

Adaptive Thermoregulation for Applications on Reconfigurable Devices

Adaptive Thermoregulation for Applications on Reconfigurable Devices. Funded by NSF Grant ITR 0313203. Phillip Jones Applied Research Laboratory Washington University Saint Louis, Missouri, USA http://www.arl.wustl.edu/arl/~phjones Iowa State University Seminar April 2008.

kalei
Télécharger la présentation

Adaptive Thermoregulation for Applications on Reconfigurable Devices

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Adaptive Thermoregulation for Applications on Reconfigurable Devices Funded by NSF Grant ITR 0313203 Phillip Jones Applied Research Laboratory Washington University Saint Louis, Missouri, USA http://www.arl.wustl.edu/arl/~phjones Iowa State University Seminar April 2008

  2. What are FPGAs? CLB CLB CLB CLB CLB CLB CLB CLB Configurable Logic Block CLB CLB CLB CLB CLB CLB CLB CLB • FPGA: Field Programmable Gate Array • Sea of general purpose logic gates

  3. What are FPGAs? Configurable Logic Block • FPGA: Field Programmable Gate Array • Sea of general purpose logic gates CLB CLB CLB CLB CLB CLB CLB CLB CLB CLB CLB CLB CLB CLB CLB

  4. What are FPGAs? Configurable Logic Block • FPGA: Field Programmable Gate Array • Sea of general purpose logic gates CLB CLB CLB CLB CLB CLB CLB CLB

  5. FPGA Usage Models • Experimental ISA • Experimental Micro • Architectures • Run-time adaptation • Run-time Customization CPU + Specialized HW - Sparc-V8 Leon Partial Reconfiguration System on Chip (SoC) Fast Prototyping Full Reconfiguration Parallel Applications • Image Processing • Computational • Biology • Remote Update • Fault Tolerance

  6. Some FPGA Details CLB CLB CLB CLB

  7. Some FPGA Details ABCD Z 0000 0001 1110 1111 4 input Look Up Table CLB CLB CLB Z A LUT B C D

  8. Some FPGA Details CLB CLB CLB Z A LUT B C D ABCD Z 0000 0001 1110 1111 0 0 0 1 A Z AND B 4 input Look Up Table C D

  9. Some FPGA Details CLB CLB CLB Z A LUT B C D ABCD Z 0000 0001 1110 1111 0 1 1 1 A Z OR B 4 input Look Up Table C D

  10. Some FPGA Details CLB CLB CLB Z A LUT B C D ABCD Z B X000 X001 X110 X111 0 1 1 1 Z 2:1 Mux C 4 input Look Up Table D

  11. Some FPGA Details CLB CLB CLB Z A LUT B C D

  12. Some FPGA Details CLB CLB Programmable Interconnection Point PIP CLB Z A LUT DFF B C D

  13. Some FPGA Details CLB CLB Programmable Interconnection Point PIP CLB Z A LUT DFF B C D

  14. Outline • Why Thermal Management? • Measuring Temperature • Thermally Driven Adaptation • Experimental Results • Temperature-Safe Real-time Systems • Future Directions

  15. Why Thermal Management?

  16. Why Thermal Management? Hot Regulated Cold Location?

  17. Why Thermal Management? Mobile? Hot Regulated Cold

  18. Why Thermal Management? Microcontroller Plasma Physics Reconfigurability FPGA

  19. Why Thermal Management? Exceptional Events

  20. Why Thermal Management? Exceptional Events

  21. Local Experience • Thermally aggressive application • Disruption of air flow

  22. Damaged Board (bottom view) • Thermally aggressive application • Disruption of air flow

  23. Damaged Board (side view) • Thermally aggressive application • Disruption of air flow

  24. Response to catastrophic thermal events Very Inconvenient Not Feasible!! Easy Fix

  25. Solutions • Over provision • Large heat sinks and fans • Restrict performance • Limiting operating frequency • Limit amount chip utilization • Use thermal feedback • Dynamic operating frequency • Adaptive Computation • Shutdown device My approach

  26. Outline • Why Thermal Management? • Measuring Temperature • Thermally Driven Adaptation • Experimental Results • Temperature-Safe Real-time Systems • Future Directions

  27. Measuring Temperature FPGA

  28. Measuring Temperature FPGA A/D 60 C

  29. Background: Measuring Temperature S. Lopez-Buedo, J. Garrido, and E. Boemo, . Thermal testing on reconfigurable computers,. IEEE Design and Test of Computers, vol. 17, pp. 84.91, 2000. FPGA Temperature .0 1. .1 0. .0 1. Period

  30. Background: Measuring Temperature FPGA Temperature .0 1. 1. .1 0. 1. .0 1. 0. Period

  31. Background: Measuring Temperature FPGA Temperature .0 1. 1. .1 0. 1. .0 1. 0. Period

  32. Background: Measuring Temperature S. Lopez-Buedo, J. Garrido, and E. Boemo, . Thermal testing on reconfigurable computers,. IEEE Design and Test of Computers, vol. 17, pp. 84.91, 2000. FPGA Temperature 1. .1 .0 Period Voltage

  33. Background: Measuring Temperature FPGA Temperature 1. .1 .0 Period Voltage

  34. Background: Measuring Temperature 1. .1 .0 Voltage FPGA “Adaptive Thermoregulation for Applications on Reconfigurable Devices”,by Phillip H. Jones, James Moscola, Young H. Cho, and John W. Lockwood;Field Programmable Logic and Applications (FPL’07), Amsterdam, Netherlands Temperature Period

  35. Background: Measuring Temperature FPGA Mode 1 Core 1 Core 2 Temperature Core 3 Core 4 Period Frequency: High

  36. Background: Measuring Temperature FPGA Mode 1 Mode 2 Core 1 Core 2 Temperature Core 3 Core 4 Period Frequency: High

  37. Background: Measuring Temperature FPGA Mode 3 Mode 1 Mode 2 Core 1 Core 2 70C Temperature 40C Core 3 Core 4 Period 8,300 8,000 Frequency: Low Frequency: High

  38. Background: Measuring Temperature FPGA Mode 3 Mode 1 Mode 2 Pause Sample Controller Core 1 Core 2 Temperature Core 3 Core 4 Period Frequency: High

  39. Background: Measuring Temperature FPGA Mode 3 Mode 1 Mode 2 Pause Time out Counter Core 1 Core 2 Temperature Core 3 Core 4 Period Frequency: High

  40. Background: Measuring Temperature FPGA Mode 3 Mode 1 Mode 2 Pause Time out Counter 5 3 2 4 0 5 1 0 3 1 2 Core 1 Core 2 Temperature Core 3 Core 4 Period Frequency: High Frequency: Low

  41. Background: Measuring Temperature FPGA Mode 3 Mode 1 Mode 2 Pause Time out Counter 5 3 1 4 5 2 0 3 0 1 2 3 Core 1 Core 2 Temperature Core 3 Core 4 Period Frequency: High Frequency: Low

  42. Background: Measuring Temperature Mode 3 Mode 1 Mode 2 Sample Mode FPGA Pause Time out Counter 5 4 2 3 0 5 1 0 3 1 2 3 Core 1 Core 2 Temperature Core 3 Core 4 Period Frequency: High

  43. Temperature Benchmark Circuits Desired Properties: • Scalable • Work over a wide range of frequencies • Can easily increase or decrease circuit size • Simple to analyze • Regular structure • Distributes evenly over chip • Help reduce thermal gradients that may cause damage to the chip • May serve as standard • Further experimentation • Repeatability of results “A Thermal Management and Profiling Method for Reconfigurable Hardware Applications”,by Phillip H. Jones, John W. Lockwood, and Young H. Cho;Field Programmable Logic and Applications (FPL’06), Madrid, Spain,

  44. Temperature Benchmark Circuits LUT 00 LUT 05 DFF DFF LUT 70 LUT 75 DFF DFF Core Block (CB): Array of 48 LUTs and 48 DFF

  45. Temperature Benchmark Circuits AND 00 AND 05 DFF DFF AND 70 AND 75 DFF DFF Thermal workload unit: Computation Row Input Gen CB 0 CB 1 CB 16 CB 17 8 8 (1 LUT, 1 DFF) Array of 18 core blocks (864 LUTs, 864 DFFs) RLOC: Row, Col 0 , 0 7 , 5 Core Block (CB): Array of 48 LUTs and 48 DFF Each LUT configured to be a 4-input AND gate

  46. Temperature Benchmark Circuits AND 00 AND 05 DFF DFF AND 70 AND 75 DFF DFF RLOC_ORIGIN: Row, Col 100% Activation Rate RLOC: Row, Col 0 , 0 7 , 5 Core Block (CB): Array of 48 LUTs and 48 DFF Each LUT configured to be a 4-input AND gate Thermal workload unit: Computation Row 01 Input Gen CB 0 CB 1 CB 16 CB 17 00 1 0 1 0 8 8 (1 LUT, 1 DFF) Array of 18 core blocks (864 LUTs, 864 DFFs)

  47. Example Circuit Layout(Configuration 1x, 9% LUTs and DFFs) RLOC_ORIGIN: Row, Col (27,6) Thermal Workload Unit

  48. Example Circuit Layout(Configuration 4x, 36% LUTs and DFFs)

  49. Observed Temperature vs. Frequency T ~ P P ~ F*C*V2 Steady-State Temperatures Cfg4x Cfg10x Cfg2x Cfg1x

  50. Observed Temperature vs. Active Area Max rated Tj 85 C T ~ P P ~ F*C*V2 Steady-State Temperatures 200 MHz 100 MHz 50 MHz 25 MHz 10 MHz

More Related