Home // ICSEA 2021, The Sixteenth International Conference on Software Engineering Advances // View article


Portable Fast Platform-Aware Neural Architecture Search for Edge/Mobile Computing AI Applications

Authors:
Kuo-Teng Ding
Hui-Shan Chen
Yi-Lun Pan
Hung-Hsin Chen
Yuan-Ching Lin
Shih-Hao Hung

Keywords: Portable, Neural Architecture Search, Performance Prediction

Abstract:
The recent rise and progress of neural network-based artificial intelligence are obvious, which we have settled up many traditional hard problems by deep learning. But the problem in neural network deployment on diverse hardware platforms is met, which needs lots of computational capability and time to ”try out” the best architecture tipping the balance of model accuracy and execution latency. The proposed Portable Fast Platform-Aware Neural Architecture Search (PFP-NAS) system allows users to use the trained neural network model easily without considering the hardware architecture of the edge/mobile computing on the client-side. The portable neural architecture search device shrinks the data center and converts it to allow users to utilize it on-demand and dynamically. This just-in-time, secure, and portable neural architecture search method is mainly based on the platform-aware client-side and applying the neural network model trained on the data center. Another key thing to remember is that users use the expandable modules of this device-Performance Prediction Module and Client Requirement-oriented Module: ie Accuracy, Latency, Throughput-FLOPs, MAC, Cost, etc., and then the device can detect hardware architectures such as USB/TPU/GPU/FPGA on edge/mobile computing. When detected, the device will send signals to connect the data center and drive the trained model in the data center for the corresponding hardware architecture. The proposed technique has the following characteristics: a. Only a boot medium is needed to detect and determine the hardware and then get the most suitable neural network from the server; b. Provide performance prediction module and client requirement-oriented module; c. Automatically match the model and the corresponding hardware architecture; d. Designed with modular scalability, and there is no need to configure any settings on the client-side. Consequently, the proposed framework achieves a portable data center.

Pages: 98 to 105

Copyright: Copyright (c) IARIA, 2021

Publication date: October 3, 2021

Published in: conference

ISSN: 2308-4235

ISBN: 978-1-61208-894-5

Location: Barcelona, Spain

Dates: from October 3, 2021 to October 7, 2021