A distributed algorithm for solving large-scale p-median problems using expectation maximization

Harsha Gwalani; Joseph Helsing; Sultanah M. Alshammari; Chetan Tiwari; Armin R. Mikler

doi:10.7717/peerj-cs.2446

PeerJ Computer Science (Nov 2024)

Affiliations

Harsha Gwalani: Department of Computer Science and Engineering, University of North Texas, Denton, Texas, United States
Joseph Helsing: Department of Electrical and Computer Engineering, Stevens Institute of Technology, Hoboken, New Jersey, United States
Sultanah M. Alshammari: Center of Research Excellence in Artificial Intelligence and Data Science, King Abdul Aziz University, Jeddah, Saudi Arabia
Chetan Tiwari: Department of Computer Science and Department of Geosciences, Georgia State University, Atlanta, Georgia, United States
Armin R. Mikler: Department of Computer Science, Georgia State University, Atlanta, Georgia, United States

DOI: https://doi.org/10.7717/peerj-cs.2446
Journal volume & issue: Vol. 10, p. e2446

Abstract

The p-median problem selects p source locations to serve n destinations such that the average distance between the destinations and their corresponding sources is minimized. It is a well-studied NP-hard combinatorial optimization problem with many existing heuristic solutions; however, existing algorithms do not scale to large problem instances. The fast interchange (FI) heuristic, which yields objective function values close to those of the optimal solution, becomes too slow to be practical for large-scale problems. We present a novel distributed divide-and-conquer algorithm, EM-FI, to solve large-scale p-median problems quickly even with limited computing resources. The algorithm identifies the existing spatial clusters of the destination locations using expectation maximization (EM) and solves the clusters concurrently as independent p-median problems using integer programming or FI. The proposed algorithm showed an order-of-magnitude improvement in running time without loss of quality in terms of the objective function value on synthetic and real datasets.
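
The divide-and-conquer idea described above can be illustrated with a minimal Python sketch. It is not the authors' EM-FI implementation. It assumes the clusters have already been identified (the paper uses EM for this step), it assumes the number of sources per cluster is supplied by the caller (the abstract does not say how p is split), and it substitutes a naive vertex-substitution search for the fast interchange heuristic. The function names `pmedian_cost`, `interchange`, and `solve_by_cluster` are hypothetical. The cost is a sum of distances; dividing by n gives the average distance named in the problem statement.

```python
from math import dist


def pmedian_cost(points, medians):
    """Sum, over all destinations, of the distance to the nearest source."""
    return sum(min(dist(pt, m) for m in medians) for pt in points)


def interchange(points, p):
    """Naive vertex substitution (a stand-in for FI, not FI itself).

    Start from the first p points and keep swapping a chosen source
    with an unchosen point whenever the swap lowers the cost.
    """
    medians = list(points[:p])
    best = pmedian_cost(points, medians)
    improved = True
    while improved:
        improved = False
        for i in range(p):
            for cand in points:
                if cand in medians:
                    continue
                trial = medians[:i] + [cand] + medians[i + 1:]
                cost = pmedian_cost(points, trial)
                if cost < best - 1e-12:
                    medians, best = trial, cost
                    improved = True
    return medians, best


def solve_by_cluster(clusters, p_per_cluster):
    """Solve each cluster as an independent p-median problem.

    Assumption: clusters and the per-cluster source counts are given.
    The sub-problems are independent, so they could run concurrently;
    this sketch runs them one after another for simplicity.
    """
    medians, total = [], 0.0
    for pts, p in zip(clusters, p_per_cluster):
        m, c = interchange(pts, p)
        medians += m
        total += c
    return medians, total


if __name__ == "__main__":
    clusters = [
        [(0, 0), (1, 0), (2, 0)],
        [(10, 0), (11, 0), (12, 0)],
    ]
    print(solve_by_cluster(clusters, [1, 1]))
```

On the two well-separated toy clusters in the usage example, each sub-problem picks its middle point as the source. Because each cluster is solved in isolation, the cost of a swap is evaluated only against that cluster's points, which is where the time saving of the divide-and-conquer approach comes from.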

Published in PeerJ Computer Science

ISSN: 2376-5992 (Online)
Publisher: PeerJ Inc.
Country of publisher: United States
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://peerj.com/computer-science/