Vol. 3 No. 2 (2025): SJESR - June 2025
Articles

Multi-class classification framework for ear imaging using attention-based hybrid dense CNN and non-linear manifold approximation

Mohammed Mahmood Ali Alezzi School of Engineering and Natural Sciences, Electrical and Computer Engineering, Altınbaş University, Istanbul, Turkey

Published 2025-06-30

Keywords

  • Ear Identification,
  • Convolutional Neural Network,
  • Attention,
  • Deep Feature Fusion,
  • Instance Learning

How to Cite

Multi-class classification framework for ear imaging using attention-based hybrid dense CNN and non-linear manifold approximation. (2025). Samarra Journal of Engineering Science and Research, 3(2), 1-23. https://doi.org/10.65115/7dnryk35

Abstract

The ear identification problem is the task of recognizing or authenticating an individual's identity based on the unique characteristics of their ear. Like fingerprints or face recognition, ear identification tasks mainly depend on the unique shape, structure, and geometry of the ear for accurate classification. Moreover, realizing high classification accuracy is challenging for both security systems and forensic investigations since it boosts the robustness of security systems and favors law enforcement efforts. In this work, we develop a novel deep learning-based multi-class classification approach for ear images that uses a hybrid-dense CNN architecture combined with a self-attention mechanism and uniform manifold approximation and projection (UMAP) for dimensionality reduction. We conducted our experiments with ear images labeled for multi-class classification, using a Dense CNN network built from Xception-Net and Dense-Net architectures to capture intricate patterns. The self-attention mechanism enhances feature representation, while UMAP optimizes feature space by simplifying high-dimensional data. The model processes low-dimensional features through fully connected layers and softmax for classification.
The experimental findings of the proposed methodology showed classification accuracies of 98.24% on the first dataset and 94.66% on the second. When compared to cutting-edge models like VGG-Net, Inception Networks, and ResNet-50, our technique outperformed them in terms of classification accuracy, precision, recall, and F1 score. This framework has great potential for use in a variety of applications, including biometric identification, medical diagnostics, forensic science, security systems, and assistive technology.

Downloads

Download data is not yet available.

References

  1. [1] Ţ. Emeršič, V. Štruc, and P. Peer, "Ear recognition: More than a survey," Neurocomputing, vol. 255, pp. 26–39, 2017.
  2. [2] H. Chen and B. Bhanu, "Human ear recognition in 3D," IEEE Trans. Pattern Anal. Mach. Intell., vol. 29, no. 4, pp. 718–737, 2007.
  3. [3] I. I. Ganapathi, S. S. Ali, S. Prakash, N. S. Vu, and N. Werghi, "A survey of 3D ear recognition techniques," ACM Comput. Surv., vol. 55, no. 10, pp. 1–36, 2023.
  4. [4] A. S. Anwar, K. K. A. Ghany, and H. Elmahdy, "Human ear recognition using geometrical features extraction," Procedia Comput. Sci., vol. 65, pp. 529–537, 2015.
  5. [5] A. Korichi, S. Slatnia, and O. Aiadi, "TR-ICANet: A fast unsupervised deep-learning-based scheme for unconstrained ear recognition," Arabian J. Sci. Eng., vol. 47, no. 8, pp. 9887–9898, 2022.
  6. [6] M. Sharkas, "Ear recognition with ensemble classifiers: A deep learning approach," Multimedia Tools Appl., vol. 81, no. 30, pp. 43919–43945, 2022.
  7. [7] H. Alshazly, C. Linse, E. Barth, S. A. Idris, and T. Martinetz, "Towards explainable ear recognition systems using deep residual networks," IEEE Access, vol. 9, pp. 122254–122273, 2021.
  8. [8] L. Tian and Z. Mu, "Ear recognition based on deep convolutional network," in Proc. 9th Int. Congr. Image Signal Process., Biomed. Eng. Informatics (CISP-BMEI), Oct. 2016, pp. 437–441.
  9. [9] D. P. Chowdhury, S. Bakshi, G. Guo, and P. K. Sa, "On applicability of tunable filter bank based feature for ear biometrics: A study from constrained to unconstrained," J. Med. Syst., vol. 42, 2018, Art. no. 1.
  10. [10] M. Z. Alom et al., "The history began from AlexNet: A comprehensive survey on deep learning approaches," arXiv preprint arXiv:1803.01164, 2018.
  11. [11] A. Vedaldi and A. Zisserman, "VGG convolutional neural networks practical," Dept. Eng. Sci., Univ. Oxford, Tech. Rep., 2016.
  12. [12] B. Koonce, SqueezeNet, in Convolutional Neural Networks with Swift for TensorFlow: Image Recognition and Dataset Categorization, 2021, pp. 73–85.
  13. [13] K. Maharana, S. Mondal, and B. Nemade, "A review: Data pre-processing and data augmentation techniques," Global Transitions Proc., vol. 3, no. 1, pp. 91–99, 2022.
  14. [14] R. A. Priyadharshini, S. Arivazhagan, and M. Arun, "A deep learning approach for person identification using ear biometrics," Appl. Intell., vol. 51, no. 4, pp. 2161–2172, 2021.
  15. [15] A. Kumar and C. Wu, "Automated human identification using ear imaging," Pattern Recognit., vol. 45, no. 3, pp. 956–968, 2012.
  16. [16] E. Gonzalez, L. Alvarez, and L. Mazorra, "AMI ear database," [Online]. Available: https://webctim.ulpgc.es/research_works/ami_ear_database/, Accessed Apr. 14, 2024.
  17. [17] I. I. Ganapathi, S. Prakash, I. R. Dave, and S. Bakshi, "Unconstrained ear detection using ensemble-based convolutional neural network model," Concurrency Comput. Pract. Exp., vol. 32, no. 1, 2020, Art. no. e5197.
  18. [18] L. Yuan and F. Zhang, "Ear detection based on improved AdaBoost algorithm," in Proc. Int. Conf. Mach. Learn. Cybern., Jul. 2009, vol. 4, pp. 2414–2417.
  19. [19] K. V. Joshi and N. C. Chauhan, "Edge detection and template matching approaches for human ear detection," in Proc. Int. Conf. Intell. Syst. Data Process. (ICISD), 2011, pp. 50–55.
  20. [20] J. C. Yoo and T. H. Han, "Fast normalized cross-correlation," Circuits Syst. Signal Process., vol. 28, pp. 819–843, 2009.
  21. [21] S. Prakash and P. Gupta, "An efficient ear localization technique," Image Vis. Comput., vol. 30, no. 1, pp. 38–50, 2012.
  22. [22] N. K. A. Wahab, E. E. Hemayed, and M. B. Fayek, "HEARD: An automatic human ear detection technique," in Proc. Int. Conf. Eng. Technol. (ICET), Oct. 2012, pp. 1–7.
  23. [23] P. Chidananda, P. Srinivas, K. Manikantan, and S. Ramachandran, "Entropy-cum-Hough-transform-based ear detection using ellipsoid particle swarm optimization," Mach. Vis. Appl., vol. 26, pp. 185–203, 2015.
  24. [24] H. Peng and C. Ding, "Minimum redundancy and maximum relevance feature selection and recent advances in cancer classification," in Feature Selection for Data Mining, 2005, p. 52.
  25. [25] L. Wang and H. Li, "HMCNet: Hybrid efficient remote sensing images change detection network based on cross-axis attention MLP and CNN," IEEE Trans. Geosci. Remote Sens., vol. 60, pp. 1–14, 2022.
  26. [26] T. Y. Lin, P. Dollár, R. Girshick, K. He, B. Hariharan, and S. Belongie, "Feature pyramid networks for object detection," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2017, pp. 2117–2125.
  27. [27] L. McInnes, J. Healy, and J. Melville, "UMAP: Uniform manifold approximation and projection for dimension reduction," arXiv preprint arXiv:1802.03426, 2018.
  28. [28] F. Chollet, "Xception: Deep learning with depthwise separable convolutions," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2017, pp. 1251–1258.
  29. [29] F. Iandola, M. Moskewicz, S. Karayev, R. Girshick, T. Darrell, and K. Keutzer, "DenseNet: Implementing efficient convnet descriptor pyramids," arXiv preprint arXiv:1404.1869, 2014.
  30. [30] C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna, "Rethinking the Inception architecture for computer vision," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2016, pp. 2818–2826.
  31. [31] B. Koonce, ResNet 50, in Convolutional Neural Networks with Swift for TensorFlow: Image Recognition and Dataset Categorization, 2021, pp. 63–72.
  32. [32] M. Lebrun, "An analysis and implementation of the BM3D image denoising method," Image Process. OnLine, vol. 2, pp. 175–213, 2012.
  33. [33] X. Xia, C. Xu, and B. Nan, "Inception-v3 for flower classification," in Proc. 2nd Int. Conf. Image Vis. Comput. (ICIVC), Jun. 2017, pp. 783–787.
  34. [34] B. Koonce, ResNet 50, in Convolutional Neural Networks with Swift for TensorFlow: Image Recognition and Dataset Categorization, 2021, pp. 63–72.
  35. [35] N. K. Kumar and J. Schneider, "Literature survey on low rank approximation of matrices," Linear Multilinear Algebra, vol. 65, no. 11, [incomplete reference].
  36. [36] L. Torrey and J. Shavlik, "Transfer learning," in Handbook of Research on Machine Learning Applications and Trends: Algorithms, Methods, and Techniques, IGI Global, 2010, pp. 242–264.
  37. [37] A. Tiwari and A. Chaturvedi, "A hybrid feature selection approach based on information theory and dynamic butterfly optimization algorithm for data classification," Expert Syst. Appl., vol. 196, p. 116621, 2022.
  38. [38] A. Tiwari, "Wilson's disease classification using higher-order Gabor tensors and various classifiers on a small and imbalanced brain MRI dataset," Multimedia Tools Appl., vol. 82, no. 23, pp. 35121–35147, 2023.
  39. [39] A. U. Rahman, G. Tikhonov, J. Oksanen, T. Rossi, and O. Ovaskainen, "Accelerating joint species distribution modelling with Hmsc-HPC by GPU porting," PLoS Comput. Biol., vol. 20, no. 9, p. e1011914, 2024.
  40. [40] F. Belletti, D. King, K. Yang, R. Nelet, Y. Shafi, Y. F. Shen, and J. Anderson, "Tensor processing units for financial Monte Carlo," in Proc. SIAM Conf. Parallel Process. Sci. Comput., 2020, pp. 12–23.
  41. [41] J. Waldo, G. Wyant, A. Wollrath, and S. Kendall, "A note on distributed computing," in Int. Workshop Mobile Object Syst., Jul. 1996, pp. 49–64.