Comparative framework for analyzing distance metrics in high-dimensional spaces

Authors

DOI:

https://doi.org/10.30837/2522-9818.2025.1.143

Keywords:

distance metrics; high-dimensional spaces; metric comparison; norm; norm; data analysis; machine learning.

Abstract

Subject of the research – developing a comprehensive framework to measure and analyze the relationships between different distance metrics in high-dimensional spaces. Aim of the research – to create a comparative framework that quantifies the "distance" between various distance metrics in high-dimensional settings. This framework aims to provide deeper insights into the interrelationships of these metrics and to guide practitioners in selecting the most appropriate metric for specific data analysis tasks. The research tasks include a theoretical formulation of methods to measure the "distance between distances", enabling a systematic comparison of different metrics. We conduct a thorough analysis of how these relationships evolve with increasing dimensionality. This involves developing mathematical models and employing visualization techniques to illustrate and interpret the relationships between metrics like the Manhattan distance, Euclidean distance, and others in high-dimensional spaces. A series of experiments are conducted on synthetic datasets to validate the theoretical findings and demonstrate the practical utility of the proposed framework. These datasets are carefully selected to cover a wide range of dimensionalities and data characteristics, ensuring a comprehensive evaluation of the framework's effectiveness. The methodology includes statistical analyses and visualization methods such as multidimensional scaling and heatmaps to represent the relationships between distance metrics clearly. The findings of the research are significant, revealing that the relationships between different distance metrics change notably as dimensionality increases. The results show patterns of convergence or divergence among certain metrics, providing valuable insights into their behavior in high-dimensional spaces. These insights are crucial for improving the accuracy and efficiency of data analysis techniques that rely on distance computations. The conclusions indicate that the proposed framework successfully quantifies the relationships between various distance metrics in high-dimensional spaces. By enhancing the understanding of how these metrics relate to one another, the research offers a valuable tool for selecting appropriate distance measures in high-dimensional data analysis. This contributes to more accurate and efficient analytical processes across various fields, including machine learning, data mining, and pattern recognition.

Author Biographies

Dmytro Chernyshov, Kharkiv National University of Radio Electronics

Department of System Engineering, PhD of Computer Science

Mykhailo Koziuberda, Kharkiv National University of Radio Electronics

postgraduate student of the Department of Software Engineering

References

References

Halder, R. K., Uddin, M. N., Uddin, Md. A., Aryal, S., and Khraisat, A. (2024), "Enhancing K-nearest neighbor algorithm: a comprehensive review and performance analysis of modifications". Journal of Big Data, Springer Science and Business Media LLC. Vol. 11, No. 1. Aug. 11. DOI: 10.1186/s40537-024-00973-y

Aggarwal, C. C., Hinneburg, A., and Keim, D. A. (2001), "On the Surprising Behavior of Distance Metrics in High Dimensional Space". Lecture Notes in Computer Science, Springer Berlin Heidelberg. Vol. 1973. P. 420–434. DOI: 10.1007/3-540-44503-x_27

Silva, E. C. de M. (2024), "Asymptotic behavior of the Manhattan distance in n-dimensions: Estimating multidimensional scenarios in empirical experiments". arXiv. DOI: 10.48550/ARXIV.2406.15441

Jamali, A. R. M. J. U. U., and Alam, Md. A. (2019), "Approximate relations between Manhattan and Euclidean distance regarding Latin hypercube experimental design". Journal of Physics: Conference Series, IOP Publishing. Vol. 1366, No. 1. Nov. 1, 012030 р. DOI: 10.1088/1742-6596/1366/1/012030

Cardarilli, G. C., Di Nunzio, L., Fazzolari, R., Nannarelli, A., Re, M., and Spano, S. (2020), "NN-Dimensional Approximation of Euclidean Distance". IEEE Transactions on Circuits and Systems II: Express Briefs, IEEE. Vol. 67, No. 3. Mar., P. 565–569. DOI: 10.1109/tcsii.2019.2919545

Cahya, F. N., Mahatma, Y., and Rohimah, S. R. (2023), "Perbandingan Metode Perhitungan Jarak Euclidean dengan Perhitungan Jarak Manhattan pada K-Means Clustering Dalam Menentukan Penyebaran Covid di Kota Bekasi". JMT: Jurnal Matematika dan Terapan, Universitas Negeri Jakarta. Vol. 5, No. 1. Feb. 28, P. 43–55. DOI: 10.21009/jmt.5.1.5

Liu, W., and Zhang, W. (2020), "A Quantum Protocol for Secure Manhattan Distance Computation". IEEE Access, IEEE. Vol. 8. P. 16456–16461. DOI: 10.1109/access.2020.2966800

Blackburn, S. R., Homberger, C., and Winkler, P. (2019), "The minimum Manhattan distance and minimum jump of permutations". Journal of Combinatorial Theory, Series A, Elsevier BV. Vol. 161. Jan., P. 364–386. DOI: 10.1016/j.jcta.2018.09.002

Fkih, F. (2022), "Similarity measures for Collaborative Filtering-based Recommender Systems: Review and experimental comparison", Journal of King Saud University - Computer and Information Sciences, Vol. 34, no. 9, Elsevier, Р. 7645–7669, Oct. 2022, DOI: 10.1016/j.jksuci.2021.09.014

Vančura, V.; Kordík, P., Straka, M. (2024), "beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems", arXiv, DOI: 10.48550/ARXIV.2409.10309

Yang, J. (2025), "DFF: Decision-Focused Fine-tuning for Smarter Predict-then-Optimize with Limited Data,” arXiv, DOI: 10.48550/ARXIV.2501.01874

Monteiro, T. (2024), "AI-Powered Energy Algorithmic Trading: Integrating Hidden Markov Models with Neural Networks", arXiv, DOI: 10.48550/ARXIV.2407.19858

Samantaray, A. K., Rahulkar, A. D. (2019), "Comparison of Similarity Measurement Metrics on Medical Image Data", in 2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT), IEEE, Р. 1–5, DOI: 10.1109/icccnt45670.2019.8944781

Maudet, G., Batton-Hébert, M., Maillé, P., Toutain, L. (2022), "Emission Scheduling Strategies for Massive-IoT: Implementation and Performance Optimization", in NOMS 2022 IEEE/IFIP Network Operations and Management Symposium, IEEE, Р. 1–4, Apr. 25, DOI: 10.1109/noms54207.2022.9789769

Shafique, A., Asad, M., Aslam, M., Shaukat, S., Cao, G. (2022), "Multi-hop similarity-based-clustering framework for IoT-oriented software-defined wireless sensor networks", IET Wireless Sensor Systems, Vol. 12, No. 2, Institution of Engineering and Technology (IET), Р. 67–80, DOI: 10.1049/wss2.12037

Downloads

Published

2025-03-31

How to Cite

Chernyshov, D., & Koziuberda, M. (2025). Comparative framework for analyzing distance metrics in high-dimensional spaces. INNOVATIVE TECHNOLOGIES AND SCIENTIFIC SOLUTIONS FOR INDUSTRIES, (1(31), 143–150. https://doi.org/10.30837/2522-9818.2025.1.143