Nasirov E. Parallelization of non-negative huge sparse linguistic matrix and tensors factorization


Thesis for the degree of Candidate of Sciences (CSc)

State registration number

0421U102388

Applicant for

Specialization

  • 01.05.01 - Theoretical foundations of informatics and cybernetics

13-05-2021

Specialized Academic Board

Д 26.001.09

Taras Shevchenko National University of Kyiv

Abstract

The thesis describes algorithms and methods for parallelizing non-negative factorization of sparse matrices and tensors, a popular technique in artificial intelligence in general and in computational linguistics in particular. Two methods for parallelizing the non-negative matrix factorization algorithm are proposed: a local algorithm that uses a hard disk together with GPU computation, and a distributed algorithm that uses a network of computational nodes equipped with GPUs. The thesis also proposes a block-diagonal approach to the factorization of inherently sparse linguistic matrices and tensors that can be reduced to block-diagonal form. The proposed method further allows the model to be supplemented with new data without re-running the non-negative factorization of the entire very large tensor from scratch. Finally, it is proposed to use latent Dirichlet allocation to reduce matrices and tensors to block-diagonal form by constructing thematic diagonal blocks.
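The key observation behind the block-diagonal approach is that once a sparse matrix is permuted into block-diagonal form, each diagonal block can be factorized independently, and the resulting factors are themselves block-diagonal; the per-block factorizations are embarrassingly parallel. The sketch below illustrates this idea with standard Lee-Seung multiplicative updates for NMF. It is an illustrative reconstruction, not code from the thesis: the function names `nmf_multiplicative` and `block_diagonal_nmf`, the iteration count, and the use of dense NumPy arrays (rather than sparse GPU tensors) are all assumptions made for clarity.

```python
import numpy as np

def nmf_multiplicative(V, rank, iters=300, eps=1e-9, seed=0):
    """Approximate V >= 0 as W @ H with W, H >= 0, using the classic
    Lee-Seung multiplicative update rules (minimizing Frobenius error)."""
    rng = np.random.default_rng(seed)
    m, n = V.shape
    W = rng.random((m, rank)) + eps
    H = rng.random((rank, n)) + eps
    for _ in range(iters):
        # Multiplicative updates preserve non-negativity by construction.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

def block_diagonal_nmf(blocks, rank):
    """Factorize each diagonal block independently.

    For a block-diagonal matrix diag(B1, ..., Bk), the factors of the whole
    matrix are diag(W1, ..., Wk) and diag(H1, ..., Hk), so the per-block
    calls below could be dispatched to separate GPUs or cluster nodes.
    Appending new data as an extra block only requires factorizing that
    block, not re-factorizing the whole matrix.
    """
    return [nmf_multiplicative(B, rank) for B in blocks]

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    # Two synthetic non-negative low-rank blocks.
    B1 = rng.random((4, 2)) @ rng.random((2, 5))
    B2 = rng.random((6, 3)) @ rng.random((3, 4))
    factors = block_diagonal_nmf([B1, B2], rank=3)
    for (W, H), B in zip(factors, [B1, B2]):
        err = np.linalg.norm(B - W @ H) / np.linalg.norm(B)
        print(f"relative reconstruction error: {err:.4f}")
```

In this sketch the list comprehension runs the blocks sequentially; in a real deployment each block would be handed to a separate worker, which is what makes the approach attractive for very large sparse linguistic matrices.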

