Applied Sciences (Nov 2024)
Development of HTC-DBSCAN: A Hierarchical Trajectory Clustering Algorithm with Automated Parameter Tuning
Abstract
Existing route-clustering methods often fail to identify abnormal sections or similarities between routes, mainly when working with large or long datasets. While sub-route clustering can detect regional patterns, it struggles to accurately capture the overall route structure. The present study proposes a new ship route-clustering method that enhances computational efficiency and noise recognition while addressing these limitations. We refined Automatic Identification System data via four data-cleaning processes and applied a statistical distance measurement to assess ship trajectory similarity. Dimensionality reduction was then used to facilitate clustering. The clustering of ship route similarities is non-parametric and can be applied to datasets not separated based on density to find clusters of various densities. Density-Based Spatial Clustering of Applications (DBSCA) applies to many research fields; using the DBSCA with Noise (DBSCAN) algorithm, we propose an improved DBSCAN algorithm that automatically determines the parameters Epsilon and MinPts. In this study, as a core ship route-clustering process, we propose a sub-route clustering process by setting the distance and density of data points to clear standards for re-analysis and completion. The proposed approach demonstrates markedly enhanced clustering performance, offering a more sophisticated and efficient basis for ship route decision-making.
Keywords