Comparative framework for analyzing distance metrics in high-dimensional spaces
DOI:
https://doi.org/10.30837/2522-9818.2025.1.143Keywords:
distance metrics; high-dimensional spaces; metric comparison; norm; norm; data analysis; machine learning.Abstract
Subject of the research – developing a comprehensive framework to measure and analyze the relationships between different distance metrics in high-dimensional spaces. Aim of the research – to create a comparative framework that quantifies the "distance" between various distance metrics in high-dimensional settings. This framework aims to provide deeper insights into the interrelationships of these metrics and to guide practitioners in selecting the most appropriate metric for specific data analysis tasks. The research tasks include a theoretical formulation of methods to measure the "distance between distances", enabling a systematic comparison of different metrics. We conduct a thorough analysis of how these relationships evolve with increasing dimensionality. This involves developing mathematical models and employing visualization techniques to illustrate and interpret the relationships between metrics like the Manhattan distance, Euclidean distance, and others in high-dimensional spaces. A series of experiments are conducted on synthetic datasets to validate the theoretical findings and demonstrate the practical utility of the proposed framework. These datasets are carefully selected to cover a wide range of dimensionalities and data characteristics, ensuring a comprehensive evaluation of the framework's effectiveness. The methodology includes statistical analyses and visualization methods such as multidimensional scaling and heatmaps to represent the relationships between distance metrics clearly. The findings of the research are significant, revealing that the relationships between different distance metrics change notably as dimensionality increases. The results show patterns of convergence or divergence among certain metrics, providing valuable insights into their behavior in high-dimensional spaces. These insights are crucial for improving the accuracy and efficiency of data analysis techniques that rely on distance computations. The conclusions indicate that the proposed framework successfully quantifies the relationships between various distance metrics in high-dimensional spaces. By enhancing the understanding of how these metrics relate to one another, the research offers a valuable tool for selecting appropriate distance measures in high-dimensional data analysis. This contributes to more accurate and efficient analytical processes across various fields, including machine learning, data mining, and pattern recognition.
References
References
Halder, R. K., Uddin, M. N., Uddin, Md. A., Aryal, S., and Khraisat, A. (2024), "Enhancing K-nearest neighbor algorithm: a comprehensive review and performance analysis of modifications". Journal of Big Data, Springer Science and Business Media LLC. Vol. 11, No. 1. Aug. 11. DOI: 10.1186/s40537-024-00973-y
Aggarwal, C. C., Hinneburg, A., and Keim, D. A. (2001), "On the Surprising Behavior of Distance Metrics in High Dimensional Space". Lecture Notes in Computer Science, Springer Berlin Heidelberg. Vol. 1973. P. 420–434. DOI: 10.1007/3-540-44503-x_27
Silva, E. C. de M. (2024), "Asymptotic behavior of the Manhattan distance in n-dimensions: Estimating multidimensional scenarios in empirical experiments". arXiv. DOI: 10.48550/ARXIV.2406.15441
Jamali, A. R. M. J. U. U., and Alam, Md. A. (2019), "Approximate relations between Manhattan and Euclidean distance regarding Latin hypercube experimental design". Journal of Physics: Conference Series, IOP Publishing. Vol. 1366, No. 1. Nov. 1, 012030 р. DOI: 10.1088/1742-6596/1366/1/012030
Cardarilli, G. C., Di Nunzio, L., Fazzolari, R., Nannarelli, A., Re, M., and Spano, S. (2020), "NN-Dimensional Approximation of Euclidean Distance". IEEE Transactions on Circuits and Systems II: Express Briefs, IEEE. Vol. 67, No. 3. Mar., P. 565–569. DOI: 10.1109/tcsii.2019.2919545
Cahya, F. N., Mahatma, Y., and Rohimah, S. R. (2023), "Perbandingan Metode Perhitungan Jarak Euclidean dengan Perhitungan Jarak Manhattan pada K-Means Clustering Dalam Menentukan Penyebaran Covid di Kota Bekasi". JMT: Jurnal Matematika dan Terapan, Universitas Negeri Jakarta. Vol. 5, No. 1. Feb. 28, P. 43–55. DOI: 10.21009/jmt.5.1.5
Liu, W., and Zhang, W. (2020), "A Quantum Protocol for Secure Manhattan Distance Computation". IEEE Access, IEEE. Vol. 8. P. 16456–16461. DOI: 10.1109/access.2020.2966800
Blackburn, S. R., Homberger, C., and Winkler, P. (2019), "The minimum Manhattan distance and minimum jump of permutations". Journal of Combinatorial Theory, Series A, Elsevier BV. Vol. 161. Jan., P. 364–386. DOI: 10.1016/j.jcta.2018.09.002
Fkih, F. (2022), "Similarity measures for Collaborative Filtering-based Recommender Systems: Review and experimental comparison", Journal of King Saud University - Computer and Information Sciences, Vol. 34, no. 9, Elsevier, Р. 7645–7669, Oct. 2022, DOI: 10.1016/j.jksuci.2021.09.014
Vančura, V.; Kordík, P., Straka, M. (2024), "beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems", arXiv, DOI: 10.48550/ARXIV.2409.10309
Yang, J. (2025), "DFF: Decision-Focused Fine-tuning for Smarter Predict-then-Optimize with Limited Data,” arXiv, DOI: 10.48550/ARXIV.2501.01874
Monteiro, T. (2024), "AI-Powered Energy Algorithmic Trading: Integrating Hidden Markov Models with Neural Networks", arXiv, DOI: 10.48550/ARXIV.2407.19858
Samantaray, A. K., Rahulkar, A. D. (2019), "Comparison of Similarity Measurement Metrics on Medical Image Data", in 2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT), IEEE, Р. 1–5, DOI: 10.1109/icccnt45670.2019.8944781
Maudet, G., Batton-Hébert, M., Maillé, P., Toutain, L. (2022), "Emission Scheduling Strategies for Massive-IoT: Implementation and Performance Optimization", in NOMS 2022 IEEE/IFIP Network Operations and Management Symposium, IEEE, Р. 1–4, Apr. 25, DOI: 10.1109/noms54207.2022.9789769
Shafique, A., Asad, M., Aslam, M., Shaukat, S., Cao, G. (2022), "Multi-hop similarity-based-clustering framework for IoT-oriented software-defined wireless sensor networks", IET Wireless Sensor Systems, Vol. 12, No. 2, Institution of Engineering and Technology (IET), Р. 67–80, DOI: 10.1049/wss2.12037
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Our journal abides by the Creative Commons copyright rights and permissions for open access journals.
Authors who publish with this journal agree to the following terms:
Authors hold the copyright without restrictions and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0) that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-commercial and non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
Authors are permitted and encouraged to post their published work online (e.g., in institutional repositories or on their website) as it can lead to productive exchanges, as well as earlier and greater citation of published work.












