Izvarin I. Semantic database of unstructured and structured documents with different requisite content

Українська версія

Thesis for the degree of Candidate of Sciences (CSc)

State registration number

0412U001541

Applicant for

Specialization

  • 01.05.03 - Математичне та програмне забезпечення обчислювальних машин і систем

20-02-2012

Specialized Academic Board

К 26.139.03

Essay

The thesis for the Candidate?s degree in technical sciences on speciality 01.05.03 -software computers and computer systems. - Open international university of human development "Ukraine", Kyiv, 2012. The thesis is devoted to the investigation and solution of scientific and technical problems of constructing semantic database of unstructured and structured documents with different requisite content and based on the RDMS, and implementation of information retrieval from documents that are stored in a semantic database. The definition of the document was given and formalized; document's requisite set and semantic properties of the requisites was given. Were considered methods for formalization of unstructured and poorly structured documents, proposed the use of semantic properties of the requisites of the documents and grouping of requisites into appropriate blocks on their semantic value, which adds a second level to the semantic decomposition of documents. This, in turn, simplifies the search for information within documents of various types and build database queries with different types of documents without the participation of the analyst, or a specialist in information technology. Search algorithm was developed to search information in semantic database of the documents of different types that consists of three parts: forming of the search conditions, selection of the set of the documents and forming the report. Based on developed algorithms and libraries were built corresponding software implementations and were tested their functioning and effectiveness. Key words: document, document structure, database, semantic characteristic, information system.

Files

Similar theses