1 / 22

Volume 7,Issue 1,April 2010 Presented by SHILPA CASULA

A Power aware mapping approach to map IP cores onto NOCs under bandwidth and latency constraints XIAO HANG WANG ,MEI YANG,YINGTAO JIANG &PENG LIU. Volume 7,Issue 1,April 2010 Presented by SHILPA CASULA. INTRODUCTION.

sari
Télécharger la présentation

Volume 7,Issue 1,April 2010 Presented by SHILPA CASULA

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A Power aware mapping approach to map IP cores onto NOCs under bandwidth and latency constraints XIAO HANG WANG ,MEI YANG,YINGTAO JIANG &PENG LIU Volume 7,Issue 1,April 2010 Presented by SHILPA CASULA

  2. INTRODUCTION • We are investigating the IP mapping problem by mapping a set of IP cores onto the tiles of a mesh based Network-on-Chip (NoC) architecture. • Power consumption due to intercore communications should be minimized. • This problem is considered under both bandwidth and latency constraints.

  3. Now for a target application to run on a NoC based MPSoC architecture, a 3 step design flow is followed. • A target application is partitioned into multiple concurrent tasks.

  4. The tasks are scheduled and allocated to selected IP cores. • Then the IP cores are optimally mapped onto the Noc based MPSoc architecture in terms of power, performance under the constraints of latency or bandwidth. After mapping, all the routing paths between any pair of communication IP cores are optimally identified.

  5. TEM ALGORITHM • Generates high mapping results with low runtime. • Achieves significant improvement over MOCA in terms of runtime and quality solutions. • Compared with BNB it produces results of same quality but with less CPU time.

  6. RELATED RESEARCH BRANCH AND BOUND(HU and MARCULESCU 2005) • With large queue size, it demands high memory and more CPU time. • Feasible for mapping IP cores with fairly small size. • Since the work queue is small, most solutions are trimmed off. GREEDY BASED HEURISTIC ALGORITHM( Murali and De Micheli 2004) • Very low runtime but often sacrifice the quality of solutions. • If properly designed, degradation of the solution is minimized. • IP cores with demanding communication requests should be mapped first to reduce overall power consumption.

  7. SA/GA/TS ALGORITHMS(HARMANI & FARAH 2008,LU 2008) • IP mapping can be done in divide and conquer like manner • High quality solutions are not guaranteed. • Require longer time than Greedy Based mapping algorithm. LINEAR PROGRAMMING ALGORITHM(MURALI AND DE MICHELI 2004) • Computation is time consuming and no guarantee for high quality solutions.

  8. APPROACH PROBLEM FORMULATION AND POWER MODEL

  9. The power model used in Hu and Marculescu in 2005 is followed in this study. • Bit power(E bit)is defined as the power consumed when 1 bit of data is transported through a router. It is given as

  10. Problem Description • Assumption: Before IP mapping is performed, a given application is already bounded and scheduled onto a list of selected IP cores. • The communication patterns between any pair of IP cores are modeled by target applications CTG. • Whereas the Noc architecture that the application will be mapped onto is described in terms of ACG.

  11. Classification of applications: Template 1: An application falls into template 1(tightly coupled) if there there is atleast one hot node in its CTG. Template 2: An application falls into template 2 (distributed) if there is no hot nodes in its CTG.

  12. MAPPING ALGORITHM FOR TEM Input 1)CTG of an application 2)ACG of an NoC Architecture 3)E,A sorted edge list of CTG Output 1)MAP1: The table storing the mapping results for each IP core Function Map the application(CTG) onto the NoC architecture(ACG) and allocate routing paths.

  13. MAPPING ALGORITHM FOR TEMPLATE1 Input 1)CTG of an application 2)ACG of an NoC Architecture 3)A sorted edge list of CTG 4)The set of hot nodes. Output 1)MAP 2: The table storing the mapping results for each IP core. Function Map the application of template1 onto the NoC architecture

  14. MAPPING ALGORITHM FOR TEMPLATE 2 Input 1)CTG of an application 2)ACG of an NoC Architecture Output 1)MAP 2: The table storing the mapping results for each IP core. Function Map the application of template2 onto the NoC architecture

  15. ROUTING PATH ALLOCATION • Next step after mapping is routing path allocation. • In NoC design ,the deterministic and static routing is preferred due to its simplicity. • All the routing paths have to be minimal and the resulting network has to be deadlock free. • Bandwidth and latency constraints need to be satisfied.

  16. Performance Evaluation • To evaluate the performance of TEM algorithm,template1 and template 2 algorithms are implemented and simulated • BNB and MOCA are also implemented . • Mapping results of all the 3 algorithms are compared. • Random and multimedia applications are adopted as benchmarks.

  17. Experiments on Random Applications and their Results

  18. Experiments on Multimedia Benchmarks and their results

  19. The multimedia system(MMS) benchmark is also simulated on a mesh based NoC. • Table 6 shows the reduction in power consumption of TEM over MOCA. • On average, the degradation of TEM over BNB is within 10% for both XY and Odd-Even routing. • With latency constraints, the power consumption of TEM is slightly lower than MOCA.

  20. CONCLUSION AND FUTURE RESEARCH • This article presented a Template based greedy algorithm to address the IP mapping problem. • Application falls into template 1 if there are 1 or more hot nodes. • Application falls into Template 2 if the communications are nearly evenly distributed among the vertices. • The experiment results showed that TEM generates high quality mapping results with low runtimes. • Future work includes extension of TEM to folded torus, fat tree and other NoC topologies.

  21. Questions????

More Related