Combining PREM Compilation and Static Scheduling for High-Performance and Predictable MPSoC Execution

Matějka J.; Forsberg B.; Sojka M.; Šůcha P.; Benini L.; Marongiu A.; Hanzálek Z.

dc.contributor.author	Matějka J.
dc.contributor.author	Forsberg B.
dc.contributor.author	Sojka M.
dc.contributor.author	Šůcha P.
dc.contributor.author	Benini L.
dc.contributor.author	Marongiu A.
dc.contributor.author	Hanzálek Z.
dc.date.accessioned	2020-03-31T15:36:35Z
dc.date.available	2020-03-31T15:36:35Z
dc.date.issued	2019
dc.identifier	V3S-328628
dc.identifier.citation	MATĚJKA, J., et al. Combining PREM Compilation and Static Scheduling for High-Performance and Predictable MPSoC Execution. Parallel Computing. 2019, 85, 27-44. ISSN 0167-8191. DOI 10.1016/j.parco.2018.11.002.
dc.identifier.issn	0167-8191 (print)
dc.identifier.issn	1872-7336 (online)
dc.identifier.uri	http://hdl.handle.net/10467/87215
dc.description.abstract	Many applications require both high performance and predictable timing. High performance can be provided by COTS Multi-Core Systems on Chip (MPSoC); however, because the cores in these systems share main memory, they are susceptible to interference from each other, which is a problem for timing predictability. We achieve predictability on multi-cores by employing the predictable execution model (PREM), which splits execution into a sequence of memory and compute phases and schedules these such that only a single core is executing a memory phase at a time. We present a toolchain consisting of a compiler and a scheduling tool. Our compiler uses region- and loop-based analysis and performs tiling to transform application code into PREM-compliant binaries. In addition to enabling predictable execution, the compiler transformation optimizes accesses to the shared main memory. The scheduling tool uses a state-of-the-art heuristic algorithm and is able to schedule industrial-size instances. For smaller instances, we compare the results of the algorithm with optimal solutions found by solving an Integer Linear Programming model. Furthermore, we solve the problem of scheduling execution on multiple cores while preventing interference of memory phases. We evaluate our toolchain on Advanced Driver Assistance System (ADAS) application workloads running on an NVIDIA Tegra X1 embedded system-on-chip (SoC). The results show that our approach maintains average performance similar to that of the original (unmodified) program code and execution, while reducing the variance of completion times by a factor of 9 with the identified optimal solutions and by a factor of 5 with schedules generated by our heuristic scheduler.	eng
dc.format.mimetype	application/pdf
dc.language.iso	eng
dc.publisher	Elsevier
dc.relation.ispartof	Parallel Computing
dc.relation.uri	http://rtime.felk.cvut.cz/publications/public/PARCO2019.pdf
dc.rights	Creative Commons Attribution-NonCommercial-NoDerivs (CC BY-NC-ND) 4.0
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject	PREM	eng
dc.subject	predictability	eng
dc.subject	LLVM	eng
dc.subject	static scheduling	eng
dc.subject	Integer Linear Programming	eng
dc.subject	NVIDIA TX1	eng
dc.title	Combining PREM Compilation and Static Scheduling for High-Performance and Predictable MPSoC Execution	eng
dc.type	článek v časopise	cze
dc.type	journal article	eng
dc.identifier.doi	10.1016/j.parco.2018.11.002
dc.relation.projectid	info:eu-repo/grantAgreement/EC/H20/688860/EU/High-Performance Real-time Architectures for Low-Power Embedded Systems/HERCULES
dc.rights.access	embargoedAccess
dc.date.embargoEndDate	2021-07-31
dc.identifier.wos	000471087700003
dc.type.status	Peer-reviewed
dc.type.version	publishedVersion
dc.identifier.scopus	2-s2.0-85064278558

Files in this item

Name: Matejka_Forsberg_Sojka_et_al__ ...
Size: 1.056Mb
Format: PDF
Description: PUBLISHED ## EMBARGOED:2021-07-31 ...

This item appears in the following collections

ČVUT publication activity [1342]


Except where otherwise noted, this item is licensed under Creative Commons Attribution-NonCommercial-NoDerivs (CC BY-NC-ND) 4.0