Hardware accelerated basic blocks for power-aware intercommunication in HPC and embedded systems
dc.contributor.advisor | Papaefstathiou Ioannis | en |
dc.contributor.advisor | Παπαευσταθιου Ιωαννης | el |
dc.contributor.author | Tampouratzis Nikolaos | en |
dc.contributor.author | Ταμπουρατζης Νικολαος | el |
dc.contributor.committeemember | Dollas Apostolos | en |
dc.contributor.committeemember | Δολλας Αποστολος | el |
dc.contributor.committeemember | Pnevmatikatos Dionysios | en |
dc.contributor.committeemember | Πνευματικατος Διονυσιος | el |
dc.date.accessioned | 2024-10-31T15:44:46Z | |
dc.date.available | 2024-10-31T15:44:46Z | |
dc.date.issued | 2014 | |
dc.date.submitted | 2014-07-02 | |
dc.description.abstract | In the past, a transition to the next fabrication process typically translated to more transistors and frequency and less power. The higher frequencies paired with innovations in computer architecture defined the semiconductor industry and research until the mid-90s. At that point architecture research saturated and industry resided to the technology scaling for performance gains. During the mid-00s frequency scaling saturated as well. Transistor count, the only resource which reliably kept scaling, along with intra-chip parallelism, which could leverage and extend the existing knowledge of old-days supercomputers, emerged as the only solution to keep Moore’s law live. In parallel systems, computing nodes cooperate to solve processing intensive problems. The communication between nodes is achieved through a variety of protocols. Traditionally, research has focused on optimizing these protocols and identifying the most suitable ones per system and application. Recently, an attempt to unify the primitive operations of the proposed intercommunication protocols has been realized through the Portals system. Portals offer a set of low level communication routines which can be composed to model complex protocols. However, Portals modularity comes at a performance cost, as communication protocols have been tuned and many of their timing critical parts have been decoupled from the main execution thread and in many cases accelerated as dedicated hardware. This work targets to close the performance gap between a generic and reusable intercommunication layer, Portals, and the several monolithic but highly tuned protocols. A software driven hardware accelerated system is suggested which resides on execution of actual software to highlight the critical parts of the communication routines. Accelerating the bottlenecks starts by modeling the hardware in untimed virtual prototypes and the software in a range of candidate embedded processors. A novel path from hardware prototypes to actual silicon allows rapid characterization of the accelerator in terms of power, performance and area. The suggested approach triggers a speedup from one order of magnitude in bottleneck components of Portals, while it is up to two orders of magnitude faster in both MPI and GA baseline implementations in a recent embedded processor. | en |
dc.format.extent | 3 megabytes | en |
dc.identifier | 10.26233/heallink.tuc.18839 | |
dc.identifier.citation | Nikolaos Tampouratzis, "Hardware accelerated basic blocks for power-aware intercommunication in HPC and embedded systems", Master Thesis, Σχολή Ηλεκτρονικών Μηχανικών και Μηχανικών Υπολογιστών, Πολυτεχνείο Κρήτης, Chania, Greece, 2014 | en |
dc.identifier.citation | Νικόλαος Ταμπουρατζής, "Hardware accelerated basic blocks for power-aware intercommunication in HPC and embedded systems", Μεταπτυχιακή Διατριβή, Σχολή Ηλεκτρονικών Μηχανικών και Μηχανικών Υπολογιστών, Πολυτεχνείο Κρήτης, Χανιά, Ελλάς, 2014 | el |
dc.identifier.uri | https://dspace.library.tuc.gr/handle/123456789/743 | |
dc.language.iso | en | |
dc.publisher | Πολυτεχνείο Κρήτης | el |
dc.publisher | Technical University of Crete | en |
dc.relation.replaces | 5945 | |
dc.rights | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en |
dc.subject | Accelerate Intercommunication cost in HPC and Embedded Systems | en |
dc.subject | Embedded systems (Computer systems) | en |
dc.subject | embedded computer systems | en |
dc.subject | embedded systems computer systems | en |
dc.subject | HPC (Computer science) | en |
dc.subject | high performance computing | en |
dc.subject | hpc computer science | en |
dc.title | Hardware accelerated basic blocks for power-aware intercommunication in HPC and embedded systems | en |
dc.type | Μεταπτυχιακή Διατριβή | el |
dc.type | Master Thesis | en |
dcterms.mediator | Πολυτεχνείο Κρήτης::Σχολή Ηλεκτρονικών Μηχανικών και Μηχανικών Υπολογιστών | el |
dspace.entity.type | Publication |
Αρχεία
Πρωτότυπος φάκελος/πακέτο
1 - 1 από 1
Δεν υπάρχει διαθέσιμη μικρογραφία
- Ονομα:
- Tabouratzis_Nikolaos_MSc_2014.pdf
- Μέγεθος:
- 3.03 MB
- Μορφότυπο:
- Adobe Portable Document Format