Khabarlak K. Image Classification and Segmentation Methods Based on Changeable Convolutional Neural Networks

Українська версія

Thesis for the degree of Doctor of Philosophy (PhD)

State registration number

0825U000392

Thesis Registration Form

0825U000392.pdf

Applicant for

Konstantin Khabarlak

Specialization

122 - Комп’ютерні науки

Date of defense

25-10-2023

Specialized Academic Board

ДФ 08.080.021

Dnipro University of Technology

Essay

PhD thesis in specialty 122 Computer Science. – Dnipro University of Technology, Dnipro, 2023. Convolutional neural networks show high quality in solving computer vision tasks. A significant amount of research is devoted to the development of neural networks, that target inference on powerful servers. However, in a number of cases their use is complicated for the following reasons: when the Internet connection is unstable or absent at all, when the user does not agree to share private data from his device, when the data volume is too large to be transferred from all devices to the server, etc. If it is necessary to process images on a mobile or low-power device, a number of problems arise: — such devices have limited computing resources, and the network inference might be unacceptably long for the target task. Therefore, deep convolutional neural network architectures with many parameters that show high quality on servers need to be modified for mobile applications; — inference when running on battery implies that the number of computations should be minimized. Because of this, great interest of the scientific community is devoted to the development of mobile neural network architectures that take into account the mobile device limitations at the design stage. Such architectures require the network configuration to be finalized before the start of the training procedure, as a result long training procedure should be repeated after each network architecture adjustment, which is a problem; — if the application is expected to be installed on an IoT device, this adds another category of devices with less computing power and presents the neural network developer with a choice: either to train one network that will be fast enough for all devices, but potentially have poor performance; or to train a separate network for each category of devices, which, given the long training time of deep neural networks, will significantly increase the cost of development of the system. The purpose of the work is to accelerate convolutional neural network training and inference for the tasks of image classification and segmentation without recognition quality loss (or with as little loss as possible) by developing changeable neural networks and their training methods. By changeable neural network we mean a convolutional network with changeable complexity. Scientific novelty of the obtained results: — for the first time, the changeable convolutional neural network and its training method were developed for the tasks of classification and segmentation. In contrast to the existing ones, changeable neural networks enable configuration selection among the ones with different computational complexities during or after training. On the ImageNet dataset the developed neural network in terms of efficiency (in the sense of recognition quality/execution time ratio) took fifth place among the 17 considered state-of-the-art neural network architectures on the ImageNet dataset. On the CamVid dataset the speed up is above 6 % without quality loss; — for the first time, the Λ-patterns method of optimization meta-learning acceleration was developed, which, unlike existing methods, allows to change neural network complexity during training and, consequently, speed up the neural network few-shot adaptation by 7.5 % with the quality loss below 0.4 %.

Thesis supervisor

Larisa Koriashkina

Official opponents

Sergei Subbotin
Pylyp Prystavka

Reviewers

Katerina L. Sergieieva
Irina M. Udovik

Research papers

Khabarlak K., Koriashkina L. Fast Facial Landmark Detection and Applications: A Survey // Journal of Computer Science and Technology. 2022. Квіт. Т. 22, № 1. С. 12–41.

Khabarlak K. S. Faster Optimization-Based Meta-Learning Adaptation Phase // Radio Electronics, Computer Science, Control. 2022. Квіт. № 1. С. 82–92. DOI: 10.15588/1607-3274-2022-1-10.

Khabarlak K. S., Koriashkina L. S. Scoping Adversarial Attack for Improving Its Quality // Radio Electronics, Computer Science, Control. 2019. Трав. № 2. С. 108–118.

Khabarlak K. S., Koriashkina L. S. Mobile Access Control System Based on RFID Tags and Facial Information // Bulletin of National Technical University ”KhPI”. Series: System Analysis, Control and Information Technologies. 2020. Т. 2, № 4.

Хабарлак К. С. Особливості роботи методів пошуку облич на мобільних пристроях // System Technologies. 2021. Т. 6, № 137. С. 34 – 45.

Khabarlak K. Post-Train Adaptive U-Net for Image Segmentation // Information Technology: Computer Science, Software Engineering and Cyber Security. 2022. № 2. С. 73–78.

Комп’ютерна програма «Мобільна нейромережева система пошуку облич із анті-спуфінгом» : авт. свід. України №110917 / К. С. Хабарлак. 11.01.2022

Khabarlak K. Post-Train Adaptive MobileNet for Fast Anti-Spoofing // Proceedings of the 3rd International Workshop on Intelligent Information Technologies & Systems of Information Security, Khmelnytskyi, Ukraine, March 23–25. Т. 3156. CEUR-WS.org, 2022. С. 44–53. (CEUR Workshop Proceedings)

Хабарлак К. С., Коряшкіна Л. С. Деякі особливості гіперпараметрів глибоких нейронних мереж // III Всеукраїнська Інтернет-конференція здобувачів вищої освіти і молодих учених «Інформаційні технології: теорія і практика». Харків, 03.2020. С. 98–99

Khabarlak K., Koriashkina L. Top image classification accuracy through hyperparameter search // 15th International Forum for Students and Young Researchers “Widening our horizons”. Dnipro, 05.2020. С. 271–274

Khabarlak K. Mobile Application for RFID Access Control System // V міжнародна науково-практична конференція «Прикладні науково-технічні дослідження». Івано-Франківськ, 04.2021. С. 99–100

Хабарлак К. С. Анти-спуфінг для системи контролю доступу із RFID мітками // Збірник матеріалів III Всеукраїнської конференції «Теоретико-практичні проблеми використання математичних методів і комп’ютерно-орієнтованих технологій в освіті та науці». Київ, 04.2021. С. 141–142

Хабарлак К. С. Проблеми нейронних мереж для розпізнавання на пристроях із різними обчислювальними можливостями // Тези VI Всеукраїнської науково-практичної Інтернет-конференції здобувачів вищої освіти і молодих учених Інформаційні технології: теорія і практика. Харків, 03.2023. С. 101–102

Хабарлак К. С. Конфігурація після навчання нейронної мережі для сегментації зображень // Матеріали XIII Міжнародної науково-технічної конференції аспірантів та молодих вчених «Наукова весна». Дніпро, 03.2023. С. 194–195

Khabarlak K. Semantic segmentation with Post-Train Adaptive Neural Network // Тези XI міжнародної науково-практичної конференції «Сучасні проблеми і досягнення в галузі радіотехніки, телекомунікацій та інформаційних технологій». Запоріжжя, 12.2022. С. 124–125

Хабарлак К. С. Нейро-мережева система класифікації із конфігурацією після навчання // Матеріали X Міжнародної науково-технічної конференції студентів, аспірантів і молодих вчених «Молодь: наука та інновації». Дніпро, 11.2022. С. 383

0825U000098

Rostyslav Zakharov

Information Technology for Developing Educational Competencies

0824U003733

Yevhen H. Hulak

Models and methods of guaranteeing capability and cyber security of information and communication systems of the energy sector

0824U003503

Shevtsova Olga Serhiivna

Information technology for pre-processing and classification of multi-time satellite images of high spatial resolution

0824U003501

Ilya K. Ziborov

Information technology for decision support in complex production processes based on the evolutionary method

0824U003415

Yaroslav Shestak

Modeling Technology of HEI Information Infrastructure