Preview

Russian Technological Journal

Advanced search

The structure of the local detector of the reprint model of the object in the image

https://doi.org/10.32362/2500-316X-2021-9-5-7-13

Abstract

Currently, methods for recognizing objects in images work poorly and use intellectually unsatisfactory methods. The existing identification systems and methods do not completely solve the problem of identification, namely, identification in difficult conditions: interference, lighting, various changes on the face, etc. To solve these problems, a local detector for a reprint model of an object in an image was developed and described. A transforming autocoder (TA), a model of a neural network, was developed for the local detector. This neural network model is a subspecies of the general class of neural networks of reduced dimension. The local detector is able, in addition to determining the modified object, to determine the original shape of the object as well. A special feature of TA is the representation of image sections in a compact form and the evaluation of the parameters of the affine transformation. The transforming autocoder is a heterogeneous network (HS) consisting of a set of networks of smaller dimension. These networks are called capsules. Artificial neural networks should use local capsules that perform some rather complex internal calculations on their inputs, and then encapsulate the results of these calculations in a small vector of highly informative outputs. Each capsule learns to recognize an implicitly defined visual object in a limited area of viewing conditions and deformations. It outputs both the probability that the object is present in its limited area and a set of “instance parameters” that can include the exact pose, lighting, and deformation of the visual object relative to an implicitly defined canonical version of this object. The main advantage of capsules that output instance parameters is a simple way to recognize entire objects by recognizing their parts. The capsule can learn to display the pose of its visual object in a vector that is linearly related to the “natural” representations of the pose that are used in computer graphics. There is a simple and highly selective test for whether visual objects represented by two active capsules A and B have the correct spatial relationships for activating a higher-level capsule C. The transforming autoencoder solves the problem of identifying facial images in conditions of interference (noise), changes in illumination and angle.

About the Author

A. A. Kulikov
IREA – Russian Technological University
Russian Federation

Alexander А. Kulikov, Cand. Sci. (Eng.), Associate Professor, Department of the Tool and Applied Software, Institute of Information Technologies

78, Vernadskogo pr., Moscow, 119454 



References

1. Parfinovich S.N. Algorithms of face recognition for identity verification by image. In: “Molodoi issledovatel’: vyzovy i perspektivy”: sb. mat. CXIV Mezhdunarodnoi nauchno-prakticheskoi konferentsii” (Proceedings CXIV International Scientific and Practical Conference “Young Researcher: Challenges and Prospects”). Moscow: Internauka; 2019, p. 115–163. (in Russ.).

2. Akhmedov A.A., Sagidov G.S., Kurbanismailov G.M. Algorithm of face recognition based on the Viola–Jones method. In: “Molodoi issledovatel’: vyzovy i perspektivy”: sb. mat. CXVIII Mezhdunarodnoi nauchno-prakticheskoi konferentsii”(ProceedingsCXVIII International Scientific and Practical Conference “Young Researcher: Challenges and Prospects”). Moscow: Internauka; 2019, p. 270−274. (in Russ.).

3. Pentland A., Choudhary T. Face recognition for smart environments. Otkrytye sistemy =Open Systems Publications. 2000;03 (in Russ.). Available from URL: https://www.osp.ru/os/2000/03/177939

4. Gorelik A.L., Gurevich I.B., Skripkin V.A. Sovremennoe sostoyanie problemy raspoznavaniya: Nekotorye aspekty (The current state of the recognition problem: Some aspects). Moscow: Radio i svyaz’; 1985. 161 р. (in Russ.).

5. Samal D.I., Frolov I.I. Algorithm of preparation of the training sample using 3D face modeling. Sistemnyi analiz i prikladnaya informatika = System analysis and applied Information science. 2016;4:17–23 (in Russ.). Available from URL: https://sapi.bntu.by/jour/article/view/128/105

6. Zavalov R.A., Garaev R.A. Implementation of the Viola–Jones algorithm on a microcontroller with limited resources. Nauka i obrazovanie segodnya = Science and Education Today. 2018;6(29):18−23 (in Russ.). Available from URL: https://cyberleninka.ru/article/n/realizatsiya-algoritma-violy-dzhonsa-na-mikrokontrollere-s-ogranichennymi-resursami/viewer

7. Baldin A.V., Eliseev D.V. Multidimensional matrix algebra for adapted data model processing. Nauka i obrazovanie: nauchnoe izdanie MGTU im. N.E. Baumana= Science and Education of Bauman MSTU. 2011;7:4 (in Russ.). Available from URL: http://technomag.edu.ru/doc/199561.html

8. Korotkov A. Database index for approximate string matching. In: Proceedings of the 4th Spring/Summer Young Researchers’ Colloquium on Software Engineering. SYRCoSE ’10. 2010, p. 136−140. https://doi.org/10.15514/syrcose-2010-4-27

9. Kononykhin I.A., Ezhov F.V., Martynyuk R.A., et al. Implementation of a face recognition and tracking system. Molodoi uchenyi = Young Scientist. 2020;28(318):8−12 (in Russ.). Available from URL: https://moluch.ru/archive/318/72492/

10. Hinton G.E., Krizhevsky A., Wang S.D. Transforming auto-encoders. In: Honkela T., Duch W., Girolami M., Kaski S. (Eds.). Artificial Neural Networks and Machine Learning – ICANN 2011. ICANN 2011. Lecture Notes in Computer Science. Springer, Berlin, Heidelberg; 2011. V. 6791. P. 44−51. https://doi.org/10.1007/978-3-642-21735-7_6

11. Alghaili M., Li Z., Ali H.A.R. FaceFilter: Face identification with deep learning and filter algorithm. Scientific Programming. 2020:1−9. https://doi.org/10.1155/2020/7846264

12. Fitzgerald R.J., Price H.L., Valentine T. Eyewitness identification: Live, photo, and video lineups. Psychology, Public Policy, and Law. 2018;24(3):307−325. http://dx.doi.org/10.1037/law0000164

13. Etemad K., Chellapa R. Discriminant analysis for recognition of human face images. Journal of the Optical Society of America A. 2004;14(8):1724−1733. https://doi.org/10.1364/JOSAA.14.001724

14. Kulikov A.A. The model is a reprint of an object in the image. Rossiiskii tekhnologicheskii zhurnal = Russian technological journal. 2020;8(3):7−13 (in Russ.). https://doi.org/10.32362/2500-316X-2020-8-3-7-13

15. Romanenko A.O., Yufryakov A.V. Image blur evaluation for biometric identification. Nauka i obrazovanie segodnya= Science and Education Today. 2018;7(30):16−19 (in Russ.). Available from URL: https://cyberleninka.ru/article/n/otsenka-razmytiya-izobrazheniya-dlya-biometricheskoy-identifikatsii/viewer


Supplementary files

1. Three capsules of transforming autoencoder
Subject
Type Исследовательские инструменты
View (139KB)    
Indexing metadata ▾

To recognize objects in images, a local detector for a reprint model of an object in an image was developed and described. A transforming autocoder, a model of a neural network, was developed for the local detector. The local detector is able, in addition to determining the modified object, to determine the original shape of the object as well. The transforming autocoder is a heterogeneous network (HS) consisting of a set of networks of smaller dimension. These networks are called capsules. The capsule can learn to display the pose of its visual object in a vector. The transforming autoencoder solves the problem of identifying facial images in conditions of interference (noise), changes in illumination and angle.

Review

For citations:


Kulikov A.A. The structure of the local detector of the reprint model of the object in the image. Russian Technological Journal. 2021;9(5):7-13. https://doi.org/10.32362/2500-316X-2021-9-5-7-13

Views: 623


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 2782-3210 (Print)
ISSN 2500-316X (Online)