Towards Fast Overlapping Community Detection

Ismail Elhelw, R.F.H. Hofman, Henri E. Bal

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

Accelerating sequential algorithms in order to achieve high performance is often a nontrivial task. However, there are certain properties that can exacerbate this process and make it particularly daunting. For example, building an efficient parallel solution for a data-intensive algorithm requires a deep analysis of the memory access patterns and data reuse potential. Attempting to scale out the computations on clusters of machines introduces further complications due to network speed limitations. In this context, the optimization landscape can be extremely complex owing to the large number of trade-off decisions. In this paper, we discuss our experience designing two parallel implementations of an existing data-intensive machine learning algorithm that detects overlapping communities in graphs. The first design uses a single GPU to accelerate the computations of small data sets. We employed a code generation strategy in order to test and identify the best performing combination of optimizations. The second design uses a cluster of machines to scale out the computations for larger problem sizes. We used a mixture of MPI, RDMA and pipelining in order to circumvent networking overhead. Both these efforts bring us closer to understanding the complex relationships hidden within networks of entities.

Original languageEnglish
Title of host publicationProceedings - 2016 16th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2016
PublisherInstitute of Electrical and Electronics Engineers, Inc.
Pages175-178
Number of pages4
ISBN (Electronic)9781509024520
DOIs
Publication statusPublished - 18 Jul 2016
Event16th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2016 - Cartagena, Colombia
Duration: 16 May 201619 May 2016

Conference

Conference16th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2016
Country/TerritoryColombia
CityCartagena
Period16/05/1619/05/16

Keywords

  • Algorithms for Accelerators and Heterogeneous Systems
  • Combinatorial and Data In-tensive Application
  • Performance Analysis
  • Statistical Learning

Fingerprint

Dive into the research topics of 'Towards Fast Overlapping Community Detection'. Together they form a unique fingerprint.

Cite this