Abstract
The article presents the application of cluster analysis, one of the most common machine learning methods, to the study of the level of regional transport development, segmentation of regions by the existing demand for transport services, and determination of the most important components of the transport system for certain regions. The purpose of the study is to divide the regions of the country into clusters that are relatively homogeneous in the main aspects of demand for transport services. Each cluster combines regions with similar economic, geographical and economic characteristics, which determines similarity in the most demanded modes of transport and transport infrastructure facilities. The research methodology is based on machine learning algorithms, the application of mathematical metrics to sets of statistical data. The research includes: selection of significant factors; analysis and normalization of statistical data; various clustering methods. In normalization, the data are converted to a single scale from 0 to 100 points with outliers excluded. As a result of cluster analysis, regions are distributed into four main clusters. Technical implementation of different variants of cluster analysis is possible in tabular editors and statistical packages. Based on the clustering results, each cluster is interpreted and common characteristics of the regions within them are identified. The prospects of the study include its annual updating based on updated statistical data. The results can be used to: analyze and develop the country’s transport system; identify priorities in the development of regional transport infrastructure; assess the significance and necessity of regional transport projects.
Affiliated Institutions
Related Publications
Robust Improper Maximum Likelihood: Tuning, Computation, and a Comparison With Other Methods for Robust Gaussian Clustering
The two main topics of this paper are the introduction of the "optimally\ntuned improper maximum likelihood estimator" (OTRIMLE) for robust clustering\nbased on the multivariate...
Model-Based Clustering, Discriminant Analysis, and Density Estimation
Cluster analysis is the automated search for groups of related observations in a dataset. Most clustering done in practice is based largely on heuristic but intuitively reasonab...
Integration K-Means Clustering Method and Elbow Method For Identification of The Best Customer Profile Cluster
Clustering is a data mining technique used to analyse data that has variations and the number of lots. Clustering was process of grouping data into a cluster, so they contained ...
<b>clValid</b>: An<i>R</i>Package for Cluster Validation
The R package clValid contains functions for validating the results of a clustering analysis. There are three main types of cluster validation measures available, "internal", "s...
CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure
Abstract Motivation: Clustering of individuals into populations on the basis of multilocus genotypes is informative in a variety of settings. In population-genetic clustering al...
Publication Info
- Year
- 2025
- Type
- article
- Volume
- 1
- Issue
- 3
- Pages
- 77-88
- Citations
- 0
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.26794/3033-7097-2025-1-3-77-88