MethodsX (Dec 2023)
Classification of broadband network devices using text mining technique
Abstract
The Broadband Internet industry is highly competitive, with service providers investing heavily in network development to meet customer demands and competing on pricing. Effective cost management is crucial for profitability in this market. This work proposes a model for classifying broadband network devices based on text mining techniques applied to a device list from a leading broadband network company in Thailand. The device descriptions are used to generate a feature vector, which is then employed by a classification algorithm to categorize devices into core, access, and last mile hierarchies. Various algorithms including decision tree, naïve Bayes, Bayesian network, k-nearest neighbor, support vector machine, and deep neural network are compared, with support vector machine achieving the highest accuracy of 90.35%. The results are visualized to provide insights into network hierarchy, device replacement dates, and budget requirements, enabling support for cost management, budget planning, maintenance, and investment decision-making. The methodology outline includes, • Obtaining a device list from a major broadband network company and extracting device descriptions through text mining and generating a feature vector. • Using a support vector machine for classification and comparing algorithm performances. • Visualizing the results for actionable insights in cost management, budget planning, and investment decisions.