Accelerating Image Classification based on a Model for Estimating Descriptor-to-Class Distance

Authors

  • Volodymyr Gorokhovatskyi
  • Svitlana Gadetska
  • Natalia Stiahlyk

DOI:

https://doi.org/10.47839/ijc.22.4.3355

Keywords:

image classification, keypoint descriptor, distance estimation, classification speed

Abstract

The article describes a method of image classification based on the estimation of the distance to the etalon class. The implementation of estimates gives a significant gain in classification speed compared to linear search while maintaining a decent level of accuracy. The methodology is based on the use of the triangle inequality for images given by a set of binary vectors as descriptors of the image key points. The evaluation is applied to the "object   descriptor – etalon" classification method, which is based on the descriptor voting procedure. An analysis of evaluation options is carried out using the parameters of the etalon sets in the form of a medoid and the closest or farthest points from it. The gain in classification time compared to the traditional method proportionally depends on the number of descriptors in the etalon description. Software simulation of classifiers with the implementation of evaluation shows a gain in speed of 350-450 times for the description of 500 descriptors while maintaining one hundred percent classification accuracy on the training set of similar NFT images. A control sample experiment shows that the classifier with estimation can respond better to image details compared to the traditional method.

References

R. Tymchyshyn, O. Volkov, O. Gospodarchuk, Yu. Bogachuk, “Modern approaches to computer vision,” Control Systems and Computers, no. 6, pp. 46-73, 2018. https://doi.org/10.15407/usim.2018.06.046.

J. Leskovec, A. Rajaraman, J. D. Ullman, Mining of Massive Datasets, Cambridge Univ. Press, New York, NY, USA, 2020, 511 p. https://doi.org/10.1017/9781108684163.

P. Flach, Machine learning. The Art and Science of Algorithms that Make Sense of Data, Cambridge University Press, New York, USA, 2012, 409 p. https://doi.org/10.1017/CBO9780511973000.

C. D. Manning, P. Raghavan, and H. Schutze, Introduction to Information Retrieval, Cambridge University Press, 2008, 528 p. [Online]. Avai;ab;e at: https://nlp.stanford.edu/IR-book/pdf/irbookonlinereading.pdf

S. Mashtalir, V. Mashtalir, “Spatio-temporal video segmentation,” In: V. Mashtalir, I. Ruban, V. Levashenko (Eds), Advances in Spatio-Temporal Segmentation of Visual Data. Studies in Computational Intelligence, vol 876. Springer, Cham, 2020, pp. 161-210. https://doi.org/10.1007/978-3-030-35480-0_4.

Y. I. Daradkeh, V. Gorokhovatskyi, I. Tvoroshenko, M. Zeghid, “Tools for fast metric data search in structural methods for image classification,” IEEE Access, vol. 10, pp. 124738-124746, 2022. https://doi.org/10.1109/ACCESS.2022.3225077.

W. Hu, Y. Chen, L. Wu, G. Shi, M. Jian, “Boundary-aware hashing for hamming space retrieval,” Appl. Sci., vol. 12, issue 1, p. 508, 2022. https://doi.org/10.3390/app12010508.

A. Babenko, A. Slesarev, A. Chigorin, V. Lempitsky, “Neural codes for image retrieval,” Conference Paper. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8689 LNCS(PART 1), рp. 584-599, 2014. https://doi.org/10.1007/978-3-319-10590-1_38.

B. Ramesh, C. Xiang, T. H. Lee, “Shape classification using invariant features and contextual information in the bag-of-words model,” Pattern Recognition, vol. 48, pp. 894-906, 2015. https://doi.org/10.1016/j.patcog.2014.09.019.

P. Dhar, S. Guha, “Fish image classification by XgBoost based on Gist and GLCM features,” International Journal of Information Technology and Computer Science (IJITCS), vol. 13, no. 4, pp. 17-23, 2021. https://doi.org/10.5815/ijitcs.2021.04.02.

W. H. Press, S. A. Teukolsky, W. T. Vetterling, B. P. Flannery, Numerical Recipes: The Art of Scientific Computing, 3nd ed., Cambridge University Press, 2007, 1262 p.

E. Chavez, G. Navarro, “A compact space decomposition for effective metric indexing,” Pattern Recognition Letters, vol. 26, no. 9, pp. 1363−1376, 2005. https://doi.org/10.1016/j.patrec.2004.11.014.

G. Hjaltason, H. Samet, “Index-driven similarity search in metric spaces,” ACM Trans. on Database Systems (TODS), vol. 28, no. 4, pp. 517−580, 2003. https://doi.org/10.1145/958942.958948.

A. Gionis, P. Indyk, R. Motwani, “Similarity search in high dimensions via hashing,” Proceedings of the Intl. Conf. on Very Large Databases, 1999, pp. 518–529.

D. Kinoshenko, V. Mashtalir, E. Yegorova, V. Vinarsky. “Hierarchical partitions for content image retrieval from large-scale database,” Machine Learning and Data Mining in Pattern Recognition, Lecture Notes in Artificial Intelligence, Springer-Verlag, vol. 3587, pp. 445−455, 2005. https://doi.org/10.1007/11510888_44.

Y. Liu, D. Zhanga, G. Lua, W.-Y. Ma, “A survey of content-based image retrieval with high-level semantics,” Pattern Recognition, vol. 40, no. 1, pp. 262–282, 2007. https://doi.org/10.1016/j.patcog.2006.04.045.

A. Berman, L. Shapiro, “A flexible image database system for content-based retrieval,” Computer Vision and Image Understanding, vol. 75, no. 1/2, pp. 175–195, 1999. https://doi.org/10.1006/cviu.1999.0772.

C. Celik, H. Sakir, “Content based image retrieval with sparse representations and local feature descriptors: A comparative study,” Pattern Recognit, vol. 68, pp. 1–13, 2017. https://doi.org/10.1016/j.patcog.2017.03.006.

M. Ghahremani, Y. Liu, and B. Tiddeman, “FFD: Fast Feature Detector,” IEEE Trans. Image Process., vol. 30, pp. 1153–1168, 2021. https://doi.org/10.1109/TIP.2020.3042057.

V. Gorokhovatskiy, “Compression of descriptions in the structural image recognition,” Telecommunications and Radio Engineering, vol. 70, no. 15, pp. 1363–1371, 2011. https://doi.org/10.1615/TelecomRadEng.v70.i15.60.

A. Oliinyk, S. Subbotin, V. Lovkin, O. Blagodariov, T. Zaiko, “The system of criteria for feature informativeness estimation in pattern recognition,” Radio Electronics, Computer Science, Control, no. 4, pp. 85–96, 2017. https://doi.org/10.15588/1607-3274-2017-4-10.

H. Kuchuk, A. Podorozhniak, N. Liubchenko, and D. Onishchenko, “System of license plate recognition considering large camera shooting angles,” Radioelectronic and Computer Systems, vol. 4, no. 100, pp. 82–91, 2021. https://doi.org/10.32620/reks.2021.4.07.

S. Gadetska, V. Gorokhovatskyi, N. Stiahlyk, N. Vlasenko, “Aggregate parametric representation of image structural description in statistical classification methods,” CEUR Workshop Proceedings: Computer Modeling and Intelligent Systems (CMIS-2022), vol. 3137, pp. 68-77, 2022. https://doi.org/10.32782/cmis/3137-6.

T. Kohonen, Self-Organizing Maps, Springer-Verlag, Berlin Heidelberg, 1995. https://doi.org/10.1007/978-3-642-97610-0.

R. Duda, P. Hart, D. Stork, Pattern Classification, Wiley, 2000, 738p.

A. Aho, J. Hopcroft, and J. Ullman, Data Structures and Algorithms, Delhi, India: Pearson, 2003, 479 p.

S. Leutenegger, M. Chli and R. Y. Siegwart, “BRISK: Binary Robust invariant scalable keypoints,” Proceedings of the International Conference on Computer Vision, Barcelona, Spain, November 3-16, 2011, pp. 2548-2555. https://doi.org/10.1109/ICCV.2011.6126542.

M. Khamsi, and W. Kirk, An Introduction to Metric Spaces and Fixed Point Theory”, John Wiley & Sons, 2001. https://doi.org/10.1002/9781118033074.ch1.

O. Gorokhovatskyi, O. Peredrii, “Image pair comparison for near-duplicates detection,” International Journal of Computing, vol. 22, issue 1, pp. 51-57, 2023. https://doi.org/10.47839/ijc.22.1.2879.

K. Laskhmaiah, S. Murali Krishna, B. Eswara Reddy, “An optimized k-means with density and distance-based clustering algorithm for multidimensional spatial databases,” International Journal of Computer Network and Information Security (IJCNIS), vol. 13, no. 6, pp. 70-82, 2021. https://doi.org/10.5815/ijcnis.2021.06.06.

OpenCV Open Source Computer Vision, [Online]. Available at: https://docs.opencv.org/master/index.html.

Downloads

Published

2023-12-31

How to Cite

Gorokhovatskyi, V., Gadetska, S., & Stiahlyk, N. (2023). Accelerating Image Classification based on a Model for Estimating Descriptor-to-Class Distance. International Journal of Computing, 22(4), 485-492. https://doi.org/10.47839/ijc.22.4.3355

Issue

Section

Articles