Kushnir D. Methods and means of searching and recognizing objects in video images on the mobile platform in real-time.

Українська версія

Thesis for the degree of Doctor of Philosophy (PhD)

State registration number

0824U000366

Applicant for

Specialization

  • 123 - Комп’ютерна інженерія

28-06-2023

Specialized Academic Board

ID 1684

Lviv Polytechnic National University

Essay

The Ph.D. thesis is devoted to solving the current scientific and technical problem of developing real-time methods to search and recognize objects in video images on a mobile platform. The introduction substantiates the relevance of the topic of dissertation research, formulates the purpose of the study and the scientific and technical tasks necessary to achieve it, shows the connection of the study with scientific programs and topics, provides the scientific novelty of the results obtained, their practical value and the personal contribution of the applicant. Information about the work results' testing and the author's personal contribution and publication are presented. The first section analyzes existing approaches to integrating search and object recognition systems, namely, varieties and architectural features of recognition models and algorithms for tracking an arbitrary class of objects. The analysis results showed that integrating such systems requires applying a particular set of filters, specialized activation functions, and object-tracking algorithms. During the analysis, the Yolo family of convolutional neural network models was chosen as the basic neural network, as the most promising in the field of object recognition. In addition, an analysis of existing mobile systems for searching and recognizing objects in real time was carried out. It was determined that a significant problem of such systems is the lack of an effective platform for automatic training and integrating models into the mobile platform. Also, one of the problems is increasing the efficiency of such systems since they mostly have limited hardware capabilities. As a conclusion to the first chapter, a set of methods and tools for solving the problem of search and recognition in video images on a mobile platform in real time was formed, and the task of the dissertation research was formulated. In the second section, metrics for evaluating the results of object recognition and tracking were proposed. The general structure of the Yolov4 convolutional neural network model for the mobile platform is formed and described. A modified method of recognition object clustering based on k-means++ was used to create recognition anchors. Methods of filtering recognition results have been developed. Three object tracking algorithms have been developed: algorithmic, algorithmic with reinforcement learning, and an operational tracking algorithm based on the IOU minimization filter, using the Hungarian algorithm as a convergence function. Methods of memoization of tracking objects have been developed. Finally, a method of quantizing the output weight coefficients of a convolutional neural network by affine transformations is proposed. In the third chapter, according to the proposed methods and tools, algorithms for training the convolutional neural network model, automatic annotation of input images, and conversion of the model into CoreML format for the mobile platform are developed. According to the selected means of scaling and containerization of Docker, the structure of the system of autonomous annotation, training, and conversion of such a model was built. From this structure, Docker containers can be extracted for each module/service, which will offer scalable hardware capabilities of the operating system. The interdependence between each element of such a system is described. A means of integrating a built-in module for tracking moving objects on the iOS mobile platform is proposed. The integration takes place with the use of the JavaScriptCore library for data transfer between the system and the module.. The fourth chapter presents the developed system architecture of the iOS mobile operating system and the Ubuntu operating system and justifies the choice of components of such systems. The results of system analysis and testing are presented. The obtained research results confirmed the effectiveness of search and recognition algorithms in real time. Keywords: object recognition, object tracking algorithm, results filtering, scalable environment, activation functions, video images, mobile platform, convolutional neural network, real-time map, object search time, object recognition time, scalable Docker system, Yolo cluster of convolutional neural network models.

Research papers

Kushnir D. Methods and means for small dynamic objects recognition and tracking // Computers, Materials & Continua. 2022. Vol. 73, iss. 2. P. 3649–3665.

Paramud Y., Kushnir D. The algorithm of cyber-physical system targeting on a movable object using the smart sensor unit // Advances in Cyber-Physical Systems. 2020. Vol. 5, № 1. P. 16–22.

Кушнір Д. О. Методи та засоби покращення точності розпізнавання об’єктів на мобільній платформі iOS в реальному часі // Комп’ютерні системи та мережі. 2021. Вип. 3, № 1. С. 80–88.

Кушнір Д. О., Парамуд Я. С. Методи пошуку та розпізнавання об'єктів у відеозображеннях на мобільній платформі IOS в реальному часі // Комп’ютерні системи та мережі. 2019. Вип. 1, № 1. С. 24–34.

Парамуд Я. С., Кушнір Д. О. Алгоритм оперативного наведення засобів вимірювально-керувального вузла кіберфізичної системи на рухомий об’єкт // Комп’ютерні системи та мережі. 2020. Вип. 2, № 1. С. 44–52.

Kushnir D., Paramud Y. Model for real-time object searching and recognizing on mobile platform // Advanced trends in radioelectronics, telecommunications and computer engineering : proceedings of 15th International conference, February 25–29, 2020, Lviv, Slavske, Ukraine. 2020. P. 127–130.

Paramud Y., Kushnir D., Ocherklevich O. Deep neural network model for text semantic analysis based on word embeddings // Advanced computer information technologies, ACIT’2021 : proceedings of the 11th International conference (Deggendorf, Germany, September 15-17, 2021). 2021. P. 718–721.

Kushnir D., Vavruk E. Mobile system for text recognition and translation with using Microsoft Cognitive API // VІIІ Міжнародний молодіжний науковий форум "Litteris et Artibus" & 13-та Міжнародна конференція "Молоді вчені до викликів сучасної технології" : матеріали, 22–24 листопада, 2018, Львів, Україна. 2018. C. 81–84

Ваврук Є. Я., Кушнір Д. О. Система розпізнавання та перекладу текстової інформації в мобільних додатках з використанням бібліотеки Microsoft Cognitive OCR // Вісник Національного університету “Львівська політехніка”. Серія: Комп’ютерні системи та мережі. 2018. № 905. С. 33–41.

Impact of optical illumination on transmission of subterahertz electromagnetic waves by Bi12GeO20 crystals / N. Andrushchak, D. Vynnyk, M. Melnyk, P. Bajurko, J. Sobolewski, V. Haiduchok, D. Kushnir, A. Andrushchak, Y. Yashchyshyn // Acta Physica Polonica A. 2022. Vol. 141, № 4 : Proceedings of the International conference on oxide materials for electronic engineering (OMEE 2021) September 28 – October 2, 2021. P. 415–419.

Kushnir D., Paramud Y., Borak T. Microprocessor subsystem of the smart house to control the multichannel irrigation of the room plants // Advances in Cyber-Physical Systems. 2022. Vol. 7, № 1. P. 1–7.

Files

Similar theses