Multi-inference on the Edge: Scheduling Networks with Limited Available Memory


Abstract

The execution of multi-inference tasks on low-powered edge devices has become increasingly popular in recent years as a way to add value to data on-device. Optimization of such jobs has so far focused on hardware, neural network architectures, and frameworks to reduce execution time. However, it is not yet known how different scheduling policies affect the execution time of a multi-inference job. We performed an empirical study to investigate the effects of scheduling policies on multi-inference. The execution performance of multi-inference batch jobs under combinations of loading and scheduling policies was measured at varying levels of constrained memory. These results were obtained using EdgeCaffe, a framework developed to execute Caffe networks on edge-oriented devices. Our research shows that a novel scheduling policy, MeMa, can significantly reduce execution time under stringent memory availability. Overall, this study demonstrates that scheduling policies can significantly reduce the execution time of multi-inference jobs.
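To make the idea of memory-aware scheduling concrete, the sketch below shows one way such a policy could batch networks under a memory budget. This is a minimal illustration only, not EdgeCaffe's actual MeMa implementation: the `Network` type, the largest-footprint-first ordering, and the first-fit packing heuristic are all assumptions made for the example.

```python
# Illustrative sketch of a memory-aware batch scheduler in the spirit of MeMa.
# Assumption: networks are packed largest-footprint-first into batches that
# fit a memory budget; batches then run sequentially, loading and unloading
# networks between them. This is NOT EdgeCaffe's actual algorithm.
from dataclasses import dataclass


@dataclass
class Network:
    name: str
    memory_mb: int   # peak memory required while the network is loaded
    runtime_ms: int  # estimated inference time


def mema_schedule(networks, memory_budget_mb):
    """Greedily group networks into batches that respect the memory budget.

    Networks are sorted by memory footprint (descending) and packed
    first-fit into batches whose combined footprint stays under the cap.
    """
    pending = sorted(networks, key=lambda n: n.memory_mb, reverse=True)
    batches = []
    while pending:
        batch, used = [], 0
        for net in list(pending):
            if used + net.memory_mb <= memory_budget_mb:
                batch.append(net)
                used += net.memory_mb
                pending.remove(net)
        if not batch:
            # A single network exceeds the budget; it can never be scheduled.
            raise ValueError(f"{pending[0].name} does not fit in the budget")
        batches.append(batch)
    return batches


if __name__ == "__main__":
    nets = [Network("vgg16", 900, 150), Network("resnet50", 420, 90),
            Network("mobilenet", 60, 20), Network("squeezenet", 30, 15)]
    for i, batch in enumerate(mema_schedule(nets, memory_budget_mb=1000)):
        print(f"batch {i}: {[n.name for n in batch]}")
```

Under stringent memory limits such a grouping reduces the number of load/unload cycles per network, which is one plausible mechanism for the execution-time reductions the study reports.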