Nikolaievskyi O. Models, methods and information technology of automated natural language text processing


Thesis for the degree of Candidate of Sciences (CSc)

State registration number

0417U004041

Applicant for

Specialization

  • 05.13.06 - Information technologies

20-10-2017

Specialized Academic Board

Д 26.056.01

Kyiv National University of Construction and Architecture

Abstract

The dissertation presents the research and development of models, methods and information technology for natural language text processing. Theoretical and practical developments in linguistic research and in information technologies for search engines, referencing systems and machine translation systems are analyzed. Linguistic databases for automated morphological and semantic analysis are reviewed. The main problems of existing approaches are identified, and a model for representing linguistic information about word-forms is proposed. Methods for constructing the vocabulary of quasi-endings and the other dictionaries that form the basis of the linguistic database are developed to solve problems at the morphological and semantic levels of natural language text processing. A comparative analysis of the proposed methods and models against existing ones is carried out, and the advantages of the proposed methods are shown. The analytical and grammar dictionaries developed in the research are unified and multilingual, which makes them broadly applicable. The theoretical research has been carried through to a software implementation: two software products, ARM PARADIGMA and ARM EXPERT, have been developed. The dissertation examines the initial results produced by these programs and their effectiveness, and gives examples of their practical use by experts.
