Distributed DNN serving in the network data plane

K. Razavi, G. Karlos, V. Nigade, M. Mühlhäuser, L. Wang

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

166 Downloads (Pure)

Abstract

Programmable networks have received tremendous attention recently. Apart from exciting network innovations, in-network computing has been explored as a means to accelerate a variety of distributed systems concerns, by leveraging programmable network devices. In this paper, we extend in-network computing to an important class of applications called deep neural network (DNN) serving. In particular, we propose to run DNN inferences in the network data plane in a distributed fashion and make our programmable network a powerful accelerator for DNN serving. We demonstrate the feasibility of this idea through a case study with a real-world DNN on a typical data center network architecture.
Original languageEnglish
Title of host publicationEuroP4 '22
Subtitle of host publicationProceedings of the 5th International Workshop on P4 in Europe
PublisherAssociation for Computing Machinery, Inc
Pages67-70
Number of pages4
ISBN (Electronic)9781450399357
DOIs
Publication statusPublished - Dec 2022
Event5th International Workshop on P4 in Europe, EuroP4 2022, co-located with ACM CoNEXT 2022 - Rome, Italy
Duration: 9 Dec 2022 → …

Conference

Conference5th International Workshop on P4 in Europe, EuroP4 2022, co-located with ACM CoNEXT 2022
Country/TerritoryItaly
CityRome
Period9/12/22 → …

Funding

FundersFunder number
Open Competition Domain Science XS
Deutsche Forschungsgemeinschaft
Google Research
Not added12611

    Fingerprint

    Dive into the research topics of 'Distributed DNN serving in the network data plane'. Together they form a unique fingerprint.

    Cite this