TY - GEN
T1 - DaRPC
T2 - 5th ACM Symposium on Cloud Computing, SOCC 2014
AU - Stuedi, Patrick
AU - Trivedi, Animesh
AU - Metzler, Bernard
AU - Pfefferle, Jonas
PY - 2014/11/3
Y1 - 2014/11/3
N2 - Remote Procedure Call (RPC) has been the cornerstone of distributed systems since the early 80s. Recently, new classes of large-scale distributed systems running in data centers are posing extra challenges for RPC systems in terms of scaling and latency. We find that existing RPC systems make very poor usage of resources (CPU, memory, network) and are not ready to handle these upcoming workloads. In this paper we present DaRPC, an RPC framework which uses RDMA to implement a tight integration between RPC message processing and network processing in user space. DaRPC efficiently distributes computation, network resources and RPC resources across cores and memory to achieve a high aggregate throughput (2-3M ops/sec) at a very low per-request latency (10μs with iWARP). In the evaluation we show that DaRPC can boost the RPC performance of existing distributed systems in the cloud by more than an order of magnitude for both throughput and latency.
AB - Remote Procedure Call (RPC) has been the cornerstone of distributed systems since the early 80s. Recently, new classes of large-scale distributed systems running in data centers are posing extra challenges for RPC systems in terms of scaling and latency. We find that existing RPC systems make very poor usage of resources (CPU, memory, network) and are not ready to handle these upcoming workloads. In this paper we present DaRPC, an RPC framework which uses RDMA to implement a tight integration between RPC message processing and network processing in user space. DaRPC efficiently distributes computation, network resources and RPC resources across cores and memory to achieve a high aggregate throughput (2-3M ops/sec) at a very low per-request latency (10μs with iWARP). In the evaluation we show that DaRPC can boost the RPC performance of existing distributed systems in the cloud by more than an order of magnitude for both throughput and latency.
KW - RDMA
KW - Remote Procedure Call
UR - http://www.scopus.com/inward/record.url?scp=84912072267&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84912072267&partnerID=8YFLogxK
U2 - 10.1145/2670979.2670994
DO - 10.1145/2670979.2670994
M3 - Conference contribution
AN - SCOPUS:84912072267
T3 - Proceedings of the 5th ACM Symposium on Cloud Computing, SOCC 2014
BT - Proceedings of the 5th ACM Symposium on Cloud Computing, SOCC 2014
PB - Association for Computing Machinery, Inc
Y2 - 3 November 2014 through 5 November 2014
ER -