Zobrazit minimální záznam



dc.contributor.authorMatějka J.
dc.contributor.authorForsberg B.
dc.contributor.authorSojka M.
dc.contributor.authorŠůcha P.
dc.contributor.authorBenini L.
dc.contributor.authorMarongiu A.
dc.contributor.authorHanzálek Z.
dc.date.accessioned2020-03-31T15:36:35Z
dc.date.available2020-03-31T15:36:35Z
dc.date.issued2019
dc.identifierV3S-328628
dc.identifier.citationMATĚJKA, J., et al. Combining PREM Compilation and Static Scheduling for High-Performance and Predictable MPSoC Execution. Parallel Computing. 2019, 85 27-44. ISSN 0167-8191. DOI 10.1016/j.parco.2018.11.002.
dc.identifier.issn0167-8191 (print)
dc.identifier.issn1872-7336 (online)
dc.identifier.urihttp://hdl.handle.net/10467/87215
dc.description.abstractMany applications require both high performance and predictable timing. High-performance can be provided by COTS Multi-Core System on Chips (MPSoC), however, as cores in these systems share main memory, they are susceptible to interference from each other, which is a problem for timing predictability. We achieve predictability on multi-cores by employing the predictable execution model (PREM), which splits execution into a sequence of memory and compute phases, and schedules these such that only a single core is executing a memory phase at a time. We present a toolchain consisting of a compiler and a scheduling tool. Our compiler uses region and loop based analysis and performs tiling to transform application code into PREM-compliant binaries. In addition to enabling predictable execution, the compiler transformation optimizes accesses to the shared main memory. The scheduling tool uses a state-of-the-art heuristic algorithm and is able to schedule industrial-size instances. For smaller instances, we compare the results of the algorithm with optimal solutions found by solving an Integer Linear Programming model. Furthermore, we solve the problem of scheduling execution on multiple cores while preventing interference of memory phases. We evaluate our toolchain on Advanced Driver Assistance System (ADAS) application workloads running on an NVIDIA Tegra X1 embedded system-on-chip (SoC). The results show that our approach maintains similar average performance to the original (unmodified) program code and execution, while reducing variance of completion times by a factor of 9 with the identified optimal solutions and by a factor of 5 with schedules generated by our heuristic scheduler.eng
dc.format.mimetypeapplication/pdf
dc.language.isoeng
dc.publisherElsevier
dc.relation.ispartofParallel Computing
dc.relation.urihttp://rtime.felk.cvut.cz/publications/public/PARCO2019.pdf
dc.rightsCreative Commons Attribution-NonCommercial-NoDerivs (CC BY-NC-ND) 4.0
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectPREMeng
dc.subjectpredictabilityeng
dc.subjectLLVMeng
dc.subjectstatic schedulingeng
dc.subjectInteger Linear Programmingeng
dc.subjectNVIDIA TX1eng
dc.titleCombining PREM Compilation and Static Scheduling for High-Performance and Predictable MPSoC Executioneng
dc.typečlánek v časopisecze
dc.typejournal articleeng
dc.identifier.doi10.1016/j.parco.2018.11.002
dc.relation.projectidinfo:eu-repo/grantAgreement/EC/H20/688860/EU/High-Performance Real-time Architectures for Low-Power Embedded Systems/HERCULES
dc.rights.accessembargoedAccess
dc.date.embargoEndDate2021-07-31
dc.identifier.wos000471087700003
dc.type.statusPeer-reviewed
dc.type.versionpublishedVersion
dc.identifier.scopus2-s2.0-85064278558


Soubory tohoto záznamu


Tento záznam se objevuje v následujících kolekcích

Zobrazit minimální záznam

Creative Commons Attribution-NonCommercial-NoDerivs (CC BY-NC-ND) 4.0
Kromě případů, kde je uvedeno jinak, licence tohoto záznamu je Creative Commons Attribution-NonCommercial-NoDerivs (CC BY-NC-ND) 4.0