RU2544739C1

RU2544739C1 - Method to transform structured data array

Info

Publication number: RU2544739C1
Application number: RU2014111223/08A
Authority: RU
Inventors: Игорь Петрович Рогачев
Original assignee: Игорь Петрович Рогачев
Priority date: 2014-03-25
Filing date: 2014-03-25
Publication date: 2015-03-20
Also published as: EA033096B1; WO2015147706A3; WO2015147706A9; WO2015147706A2; EA201600675A1

Abstract

FIELD: information technologies.SUBSTANCE: in the method of structured data array transformation, which contains text in natural language, they create (101) the first data structure of the structured data array from the end data structure of the structured data array. They create (102) a data base of logical connections between logical sections of elements of the first data structure. They create (103) the second data structure of the structured data array. They create (104) a data base of semantic parts of logical sections of elements of the second data structure. They create (105) grammatically and orthographically correct semantic parts of logical sections of the second data structure elements by means of linguistic transformations over the specified semantic parts. They create (106) the end data structure of the structured data array.EFFECT: creation of logically, grammatically and orthographically true data structure, providing for quick and convenient navigation by structure elements.17 cl, 15 dwg, 3 tbl

Description

Группа изобретений относится к решениям в области обработки массивов данных, в частности к решениям в области обработки структурированных массивов данных, содержащих текст на естественном языке, и может быть использовано для предварительного преобразования структурированного массива данных, содержащего текст на естественном языке, для удобства его последующей обработки.The group of inventions relates to solutions in the field of processing data arrays, in particular to solutions in the field of processing structured data arrays containing natural language text, and can be used for preliminary transformation of a structured data array containing natural language text, for the convenience of its subsequent processing .

УРОВЕНЬ ТЕХНИКИBACKGROUND

Из патента EA 002016 B1, G06F 17/30, 22.10.2001 (МАТВЕЕВ ЛЕВ ЛАЗАРЕВИЧ И ДР.) известен способ поиска похожих по текстовому и/или смысловому содержимому фрагментов в электронных документах, хранимых на устройствах хранения данных, заключающийся в индексировании каждого сохраняемого в архиве документа, разбиении упомянутых документов на фрагменты и формировании тематик из одного и более фрагмента, определении параметров поиска, проведении поиска, ранжировании полученного в результате поиска списка фрагментов документов, причем в качестве параметров поиска определяют множество входящих в выбранный фрагмент документа уникальных блоков информации и расширяют его посредством предварительной обработки каждого из упомянутых уникальных блоков информации, где под уникальным блоком информации понимают блок информации, встретившийся в выбранном фрагменте документа один и более раз, где в качестве предварительной обработки используют операцию получения, по меньшей мере, из одного уникального блока информации, одного или нескольких блоков информации, связанных с уникальным блоком информации заданным соотношением.From patent EA 002016 B1, G06F 17/30, 10.22.2001 (MATVEEV LEV LAZAREVICH AND OTHERS) a method is known for searching for fragments similar in text and / or semantic content in electronic documents stored on data storage devices, which consists in indexing each stored in document archive, dividing the mentioned documents into fragments and forming topics from one or more fragments, determining search parameters, conducting a search, ranking the list of document fragments obtained as a result of a search, moreover, as parameters by they determine the set of unique blocks of information included in the selected fragment of the document and expand it by pre-processing each of the mentioned unique blocks of information, where a unique block of information means the block of information that occurs one or more times in the selected fragment of the document, where the operation is used as preliminary processing receiving at least one unique block of information, one or more blocks of information associated with a unique block of information and the predetermined ratio.

Из патента RU 2476927 C2, G06F 17/30, 27.02.2013 (АНШУКОВ СЕРГЕЙ АЛЕКСАНДРОВИЧ И ДР.) известен способ позиционирования текстов в пространстве знаний, заключающийся в том, что из входных данных выделяют элементы, соответствующие паттернам, входящим в таксоны, образующие таксономии, объединенные в онтологии; определяют значимые таксоны, которые взвешивают с учетом условий, приписанных паттернам; составляют набор взвешенных векторов, позиционирующих входной документ в пространстве знаний, отличающийся тем, что в нем для позиционирования используют множество онтологий, а также тем, что при составлении наборов векторов рассматривают только те элементы, которые соответствуют паттернам, входящим в один таксон или в таксоны, имеющие общие родительские таксоны.From the patent RU 2476927 C2, G06F 17/30, 02.27.2013 (ANSHUKOV SERGEY ALEXANDROVICH AND OTHERS), a method for positioning texts in the knowledge space is known, which consists in extracting elements from the input data that correspond to patterns that are included in taxa that form taxonomies combined in ontology; significant taxa are determined that are weighted based on the conditions assigned to the patterns; make up a set of weighted vectors that position the input document in the knowledge space, characterized in that it uses a lot of ontologies for positioning, and also in the fact that when compiling sets of vectors, only those elements that correspond to patterns included in one taxon or in taxons are considered, having common parent taxa.

Из патента RU 2210809 C2, G06F 17/28, 20.08.2003 (ОТКРЫТОЕ АКЦИОНЕРНОЕ ОБЩЕСТВО «МОСКОВСКАЯ ТЕЛЕКОММУНИКАЦИОННАЯ КОРПОРАЦИЯ») известен способ автоматического преобразования исходного текста в совокупность взаимосвязанных объектов на основе таблицы настроек, содержащей знания о структуре исследуемой системы в виде совокупности образующих ее классов, включающих определенный набор атрибутов (в том числе взаимосвязи и отношения между объектами заданных классов) и установленные для каждого атрибута правила распознавания атрибута в тексте. Предусматривается возможность определения формата исходного текста и автоматического перевода его фрагментов в ходе формирования объектов.From patent RU 2210809 C2, G06F 17/28, 08/20/2003 (OPEN JOINT-STOCK COMPANY “MOSCOW TELECOMMUNICATION CORPORATION”), a method is known for automatically converting the source text into a set of interconnected objects based on a settings table containing knowledge about the structure of the system under study in the form of the set forming it classes that include a specific set of attributes (including relationships and relationships between objects of given classes) and rules for recognizing an attribute in the text established for each attribute. It is possible to determine the format of the source text and automatically translate its fragments during the formation of objects.

Из патента RU 2292078 C1, G06F 17/30, 20.01.2007 (ЗАКРЫТОЕ АКЦИОНЕРНОЕ ОБЩЕСТВО «МЕДИАЛИНГВА») известен способ поиска, разметки и отображения информации, включающий ввод искомых объектов данных исходных электронных документов, подлежащих поиску по информационным сетям с терминала абонента сети, осуществляющего функцию источника запроса искомых объектов данных, сравнение искомых объектов данных исходных электронных документов с контрольными объектами данных ассоциированной информации в информационной сети, а при совпадении искомых объектов с контрольными преобразование объектов данных исходных электронных документов посредством разметки объектов данных исходных электронных документов гиперссылками, визуализацию на терминале абонента сети электронных документов с гиперссылками и вызов на терминал абонента сети данных ассоциированной информации информационной сети, отличающийся тем, что перед разметкой создают, по меньшей мере, две области данных, по меньшей мере, одна из которых является резидентной областью для источника запроса искомых объектов данных и обеспечивает привязку к объектам данных первичных гиперссылок, содержащих дополнительные параметры, для адресации, по меньшей мере, к одной другой области, а, по меньшей мере, одна другая область является нерезидентной для источника запроса искомых объектов данных и обеспечивает привязку к объектам данных вторичных гиперссылок для адресации, по меньшей мере, к одному ресурсу данных ассоциированной информации для доступа к ней с терминалов абонентов, по меньшей мере, одной резидентной области, являющейся источником первичной гиперссылки, при этом в резидентной области создают массив контрольных объектов данных с соответствующими каждому указанному объекту первичными гиперссылками в качестве связанных данных, а в нерезидентной области создают массив контрольных объектов данных ассоциированной информации с соответствующей каждому указанному объекту, по меньшей мере, одной вторичной гиперссылкой в качестве связанных данных ассоциированной информации информационной сети.From patent RU 2292078 C1, G06F 17/30, 20.01.2007 (MEDIA LINGUA CLOSED JOINT-STOCK COMPANY), a method for searching, marking and displaying information is known, including entering the desired data objects of the source electronic documents to be searched on information networks from a network subscriber terminal, performing the function of the request source of the desired data objects, comparing the desired data objects of the source electronic documents with the control data objects of the associated information in the information network, and if the desired objects coincide with the control conversion of data objects of source electronic documents by marking data objects of source electronic documents with hyperlinks, visualization of electronic documents with hyperlinks at the subscriber terminal and calling to the subscriber terminal of the data network of associated information of the information network, characterized in that at least two are created before marking data areas, at least one of which is a resident area for the request source of the desired data objects and provides a link access to data objects of primary hyperlinks containing additional parameters for addressing at least one other region, and at least one other region is non-resident for the query source of the desired data objects and provides binding to data objects of secondary hyperlinks for addressing at least one data resource of associated information for access to it from subscriber terminals of at least one resident area that is the source of the primary hyperlink, while in the res In this area, an array of control data objects is created with primary hyperlinks corresponding to each specified object as related data, and in a non-resident area, an array of control data objects of associated information is created with at least one secondary hyperlink corresponding to each specified object as information associated information network.

Из патента RU 2386166 C2, G06F 17/30, 10.04.2010 (ОТКРЫТОЕ АКЦИОНЕРНОЕ ОБЩЕСТВО «ТАГАНРОГСКИЙ АВИАЦИОННЫЙ НАУЧНО-ТЕХНИЧЕСКИЙ КОМПЛЕКС ИМ. Г.М. БЕРИЕВА») известен способ формирования базы знаний, которую формируют в виде трехмерного информационного пространства, в котором данные о документе или его части определяют в кластер или кластеры, образованные единичными отрезками (ортами) характеристических признаков. Полный идентификационный номер документа формируют из кодов орт составляющих характеристических признаков и идентификационного номера документа. Производят анализ каждого кластера на полноту определения ограниченной им сферы деятельности содержащимися в кластере документами. Результат анализа заносят в этот же кластер. Поиск и анализ данных производят как с помощью формирования и обработки запроса, так и в обратном направлении с помощью подготовки базы данных для ожидаемого пользователя. В системе также предусмотрены средства для работы с базой данных, для поиска, контроля и анализа информации, документов, областей деятельности, для создания и корректировки документов системными администраторами, экспертами и пользователями в соответствии с правами доступа.From patent RU 2386166 C2, G06F 17/30, 04/10/2010 (OPEN JOINT-STOCK COMPANY “TAGANROG AVIATION SCIENTIFIC AND TECHNICAL COMPLEX NAMED AFTER GM BERIEV”), a method of forming a knowledge base is known which is formed in the form of a three-dimensional information space in which data about a document or its parts is determined in a cluster or clusters formed by single segments (orts) of characteristic features. The full identification number of the document is formed from the codes orth component characteristic features and the identification number of the document. Each cluster is analyzed for the completeness of the definition of the limited scope of activity contained in the documents contained in the cluster. The result of the analysis is recorded in the same cluster. Search and analysis of data is carried out both by means of the formation and processing of the request, and in the opposite direction by preparing the database for the expected user. The system also provides tools for working with the database, for searching, controlling and analyzing information, documents, areas of activity, for creating and updating documents by system administrators, experts and users in accordance with access rights.

Из патента RU 2253893 C2, G06F 17/27, 10.06.2005 (ЧЕРНИКОВ БОРИС ВАСИЛЬЕВИЧ) известен способ автоматизированного лексикологического синтеза документов, включающий создание и сохранение унифицированной формы документа, классификацию содержания документа путем выделения унифицированной постоянной информации и переменной информации, сохранения постоянной информации в базах данных, внесения постоянной информации в унифицированную форму документа и введения переменной информации в документ, при котором в переменной информации выделяют переменную унифицированную информацию, связанную с устойчивыми формулировками, переменную вводимую информацию, представляющую собой конкретизирующие сведения, и переменную неунифицированную информацию, содержащую свободные формулировки, причем переменную унифицированную информацию выделяют путем формирования совокупности опорных слов, однозначно определяющих конкретные формулировки в документе и составляющих лексикологический скелет документа, и сохраняют в машинной базе данных с избытком по отношению к отдельно взятому экземпляру документа, формируют лексикологическое дерево документа путем определения взаимозависимости отдельных опорных слов и затем формируют информационный контур управления документом путем установления способа внедрения формулировок переменной унифицированной и неунифицированной информации в зависимости от характера связи опорного слова с фрагментом документа.From patent RU 2253893 C2, G06F 17/27, 06/10/2005 (CHERNIKOV BORIS VASILIEVICH) a method for automated lexicological synthesis of documents is known, including the creation and preservation of a unified form of a document, classification of the content of a document by highlighting unified constant information and variable information, storing constant information databases, entering constant information in a unified form of the document and introducing variable information into the document, in which the variable un is allocated in the information variable fictitious information associated with stable formulations, variable input information representing concretizing information, and variable non-standardized information containing free formulations, the variable unified information being distinguished by forming a set of support words that uniquely identify specific formulations in the document and constitute the lexicological skeleton of the document, and save the document in excess with respect to a single copy in a machine database They form the lexical tree of a document by defining interdependency separate support and then form the word document information control loop by setting method variable unified language implementation and non-unified information depending on the nature of the bond with the fragment of the reference words of the document.

Из заявки WO 2013043160 A1, G06F 17/21, 28.03.2013 (HEWLETT PACKARD DEVELOPMENT СО ЕТ AL.) известен способ обработки текстового массива данных, который заключается в построении графа, представляющего микромодель сущностей, образующих тело обрабатываемого документа. Разбиение такого текста на узлы графа, причем каждый узел относится к своей особенности выделенного фрагмента из текста, причем упомянутые узлы графа связываются между собой отношениями аналогично связи фрагментов текста, соответствующего упомянутым узлам. В дальнейшем выстроенные узлы графа ранжируются для определения релевантных данных относительно запроса пользователя.From WO 2013043160 A1, G06F 17/21, 03/28/2013 (HEWLETT PACKARD DEVELOPMENT WITH ET AL.), A method for processing a text data array is known, which consists in constructing a graph representing a micromodel of entities that form the body of the processed document. The division of such text into nodes of the graph, with each node referring to its own peculiarity of the selected fragment from the text, and the said nodes of the graph are interconnected by relations similar to the relationship of fragments of text corresponding to the said nodes. Subsequently, the constructed nodes of the graph are ranked to determine the relevant data regarding the user's request.

Из заявки WO 2001001289 A1, G06F 17/27, 04.01.2001 (TNV MACHINE CORP INC) известен способ, который заключается в семантической обработке данных, представленных на естественном языке, причем способ включает ввод и хранение пользовательских условий, которые в дальнейшем используются для поиска в массивах данных, содержащих данные на естественном языке, представлений текста, содержащих релевантную пользовательскому вводу информацию, форматирование упомянутых представлений, извлечение из отформатированных представлений текста отношений типа субъект-действие-объект (СДО) и их сохранение в удаленном месте хранения, например базе данных, реструктуризация выявленных СДО в нормализованный вид, назначение частей СДО, таких как действие-объект (ДО) в качестве наименования папок, в которых содержаться части СДО и назначение с указанными папками одного или более идентичных ассоциированных частей субъекта (S1, S2...Sn), которые ассоциированы с соответствующими ДО частями. Способ позволяет также ассоциировать предложения, содержащие соответствующие элементы субъектов S1, S2…Sn, и выделять в них релевантные СДО с их последующей маркировкой на фоне общего массива данных.From the application WO 2001001289 A1, G06F 17/27, 04/04/2001 (TNV MACHINE CORP INC), a method is known which consists in semantic processing of data presented in a natural language, the method including entering and storing user conditions, which are then used for searching in data arrays containing natural language data, text representations containing information relevant to user input, formatting the aforementioned representations, extracting subject-action-object relationships from formatted text representations CT (LMS) and their storage in a remote storage location, for example, a database, restructuring the identified LMS into a normalized form, assigning parts of the LMS, such as an action-object (DOS), as the name of the folders that contain parts of the LMS and the destination with the specified folders one or more identical associated parts of the subject (S1, S2 ... Sn), which are associated with the corresponding TO parts. The method also allows you to associate sentences containing the corresponding elements of the subjects S1, S2 ... Sn, and select relevant SDOs in them with their subsequent labeling against the background of a common data array.

Из патента US 8229730 B2, G06F 17/30, 24.07.2012 (MICROSOFT CORP ЕТ AL.) известен способ поиска данных по запросу пользователя, представленного на естественном языке, причем способ заключается в том, что производят парсинг текстового массива данных с назначением грамматических ролей терминам и их последующей индексацией, которые находятся в семантической связи с терминами поискового запроса, причем упомянутые роли содержат доминантную и второстепенные роли, которые выявляются при анализе пользовательского запроса. Данный способ позволяет определять релевантные части документа, содержащие термины с ролями, совпадающими с ролями текста запроса пользователя.From patent US 8229730 B2, G06F 17/30, 24.07.2012 (MICROSOFT CORP ET AL.) There is a method of searching for data at the request of a user presented in natural language, the method consisting in parsing a text data array with assignment of grammatical roles terms and their subsequent indexation, which are in semantic connection with the terms of the search query, and the mentioned roles contain dominant and secondary roles that are identified in the analysis of the user query. This method allows you to determine the relevant parts of the document containing terms with roles that match the roles of the text of the user’s request.

Из заявки EP 2400400 A1, G06F 17/27, 28.12.2011 (TNBENTA PROFESSIONAL SERVICES S L) известен способ семантического поиска релевантной информации, заключающийся в том, что с помощью лексических функций и критерия значения текста в массиве данных, представленных на естественном языке, формируют фразы или выражения, полученные из базы данных содержания, и выбирают ответ, обладающий нависшим показателем семантического соответствия, причем способ заключается в трансформации контентов и запроса самостоятельных слов или групп слов с присвоенными им токенами, которые преобразовываются в семантические представления, тем самым применяя правила критерия значения текста посредством лексических функций, причем каждое из таких семантических представлений состоит из леммы и семантической категории.From the application EP 2400400 A1, G06F 17/27, 12/28/2011 (TNBENTA PROFESSIONAL SERVICES SL), a method for semantic search for relevant information is known, which consists in the fact that using lexical functions and a criterion, the values of the text in the data array presented in natural language are formed phrases or expressions obtained from the content database and select an answer with an impending indicator of semantic correspondence, and the method consists in transforming the content and requesting independent words or groups of words with the tokens assigned to them that transform zovyvayutsya in semantic representations, thereby applying the rules of the meaning of the text by means of the criterion of lexical functions, each of these semantic representations consists of a lemma and a semantic category.

Из заявки WO 2010105216 A2, G06F 17/20, 16.09.2010 (INVENTION MACHINE CORP) известен способ маркировки текстовых данных документа, заключающийся в том, что осуществляют лингвистический анализ документа, сравнивают документ после его анализа с шаблоном требуемых семантических отношений между объектами, формируют семантически размеченный текст с помощью применения семантических связей, основанный на лингвистическом анализе текста и сравнения с шаблоном семантических связей, причем семантические метки ассоциированы со словами или фразами предложений текста, и идентифицируют компоненты определенных семантических отношений с последующим сохранением в базе данных семантически размеченного текста для последующего поиска релевантной информации по полученной структуре данных.From the application WO 2010105216 A2, G06F 17/20, September 16, 2010 (INVENTION MACHINE CORP), a method for marking text data of a document is known, which consists in linguistically analyzing a document, comparing the document after its analysis with a template of the required semantic relations between objects, forming semantically marked-up text using semantic links, based on linguistic analysis of the text and comparison with a template of semantic links, with semantic labels associated with words or phrases of sentences of the text, and ide components of certain semantic relations are identified, with the subsequent storage of semantically marked-up text in the database for the subsequent search for relevant information on the obtained data structure.

Из заявки ЕР 2105847 A1, G06F 17/30, 30.09.2009 (ALCATEL LUCENT) известен способ автоматического формирования онтологии, заключающийся в том, что принимают термин, для которого необходимо сформировать онтологию, определяют значение упомянутого термина с помощью словаря, извлекают подходящие определения для упомянутого термина, определяют значение каждого из извлеченных определений с помощью упомянутого словаря, выполняют построение для каждого из определенных значений термина и каждого подходящего термина для термина начального запроса на создание онтологии, по меньшей мере, один логический пункт, описывающий взаимоотношение между парой упомянутых подходящих терминов, причем упомянутые логические пункты определяют онтологию термина ввода.From the application EP 2105847 A1, G06F 17/30, 09/30/2009 (ALCATEL LUCENT), a method for automatically generating an ontology is known, which consists in accepting a term for which it is necessary to form an ontology, determining the meaning of the term using a dictionary, and extracting suitable definitions for the said term, determine the meaning of each of the extracted definitions using the aforementioned dictionary, build for each of the defined meanings of the term and each suitable term for the term of the initial ontol creation request ogy, at least one logical paragraph describing the relationship between said pair of suitable terms, said logic input points define the ontology term.

Все указанные выше решения не позволяют формировать семантически и логически верно структурированный массив данных из исходного массива данных, содержащего текст, представленный на естественном языке, путем разбиения упомянутого массива на логические разделы, которые подвергаются семантическому разбиению конструкций самих разделов и элементов, входящих в упомянутые разделы, последующий их орфографический и грамматический анализ, и последующую оценку их взаимосвязанности в исходном массиве данных.All of the above solutions do not allow the formation of a semantically and logically correctly structured data array from the original data array containing text presented in natural language by splitting the said array into logical sections, which are subjected to semantic decomposition of the structures of the sections themselves and the elements included in the sections mentioned, their subsequent spelling and grammatical analysis, and the subsequent assessment of their interconnectedness in the original data array.

Ближайшим аналогом (прототипом) заявленного решения принят способ автоматизированной обработки текста на естественном языке путем его семантической индексации, описанный в патенте RU 2399959 C2, G09B 19/00, 20.09.2010 (ЗАКРЫТОЕ АКЦИОНЕРНОЕ ОБЩЕСТВО «АВИКОМП СЕРВИСЕЗ»). Известный способ представляет собой способ, при котором текст сегментируют в электронной форме на элементарные единицы, выявляют устойчивые словосочетания, формируют предложения, выявляют семантически значимые объекты и семантически значимые отношения между ними, формируют для каждого семантически значимого отношения множество триад, в которых единственная триада первого типа соответствует связи, устанавливаемой семантически значимым отношением между двумя семантически значимыми объектами, причем каждая из триад второго типа соответствует значению конкретного атрибута одного из этих семантически значимых объектов, каждая из триад третьего типа соответствует значению конкретного атрибута самого семантически значимого отношения, индексируют на множестве сформированных триад все связанные семантически значимыми отношениями семантически значимые объекты по отдельности, запоминают в базе данных сформированные триады и полученные индексы вместе со ссылкой на исходный текст, из которого сформированы эти триады.The closest analogue (prototype) of the claimed solution is a method for automated processing of natural language text by means of its semantic indexing, described in patent RU 2399959 C2, G09B 19/00, 09/20/2010 (CLOSED JOINT STOCK COMPANY “AVIKOMP SERVISES”). The known method is a method in which text is segmented in electronic form into elementary units, identifies stable phrases, form sentences, identifies semantically significant objects and semantically significant relationships between them, form many triads for each semantically significant relationship, in which a single triad of the first type corresponds to a relationship established by a semantically significant relationship between two semantically significant objects, each of the triads of the second type of co corresponds to the value of a specific attribute of one of these semantically significant objects, each of the triads of the third type corresponds to the value of a specific attribute of the semantically significant relation, index all related semantically meaningful relations separately from the set of triads, store the generated triads and obtained indices in the database together with a link to the source text from which these triads are formed.

Недостатком данного способа является то, что при формировании упомянутых триад, текст сегментируется непосредственно на элементарные единицы, т.е. слова, а не на логические разделы, при этом данный способ не предусматривает формирования промежуточной структуры исходного текстового массива для ее последующего грамматического и орфографического анализа и не обеспечивает формирование конечной логически, грамматически и орфографически верной структуры данных, пригодной для быстрой и удобной навигации по элементам структуры.The disadvantage of this method is that when forming the mentioned triads, the text is segmented directly into elementary units, i.e. words, and not into logical sections, while this method does not provide for the formation of an intermediate structure of the original text array for its subsequent grammatical and spelling analysis and does not provide the formation of a final logical, grammatical and spelling correct data structure suitable for quick and convenient navigation through structure elements .

РАСКРЫТИЕ ИЗОБРЕТЕНИЯSUMMARY OF THE INVENTION

Исходя из этого задачей, на решение которой направлено заявленное изобретение, является обеспечение такой обработки структурированного массива данных, содержащего текст на естественном языке, которая позволила бы генерировать логически, грамматически и орфографически верную преобразованную структуру, содержащую логические конструкции элементов массива и обеспечивающую быструю и удобную навигацию по элементам массива.Based on this, the task to be solved by the claimed invention is aimed at providing such processing of a structured data array containing natural language text that would allow generating a logically, grammatically and spelling-correct transformed structure containing the logical structures of the array elements and providing quick and convenient navigation by array elements.

Техническим результатом является формирование логически, грамматически и орфографически верной структуры данных, пригодной для быстрой и удобной навигации по элементам структуры.The technical result is the formation of a logically, grammatically and spelling-correct data structure suitable for quick and convenient navigation through structural elements.

Заявленный технический результат достигается за счет того, что выполняют способ преобразования структурированного массива данных, содержащего, по меньшей мере, текст на естественном языке, причем упомянутый способ содержит, по меньшей мере, этапы, на которых:The claimed technical result is achieved due to the fact that they perform a method of converting a structured data array containing at least natural language text, said method comprising at least the steps of:

A) формируют первую структуру данных структурированного массива данных, содержащую элементы упомянутой первой структуры данных, причем упомянутые элементы первой структуры данных содержат первые логические разделы и вторые логические разделы;A) form a first data structure of a structured data array containing the elements of said first data structure, said elements of the first data structure containing first logical partitions and second logical partitions;

Б) формируют базу данных логических связей логических разделов упомянутых элементов первой структуры данных;B) form a database of logical connections of logical sections of the mentioned elements of the first data structure;

B) формируют вторую структуру данных структурированного массива данных, содержащую элементы упомянутой второй структуры данных, причем упомянутые элементы второй структуры данных содержат логические конструкции логических разделов упомянутых элементов первой структуры данных, сформированные с использованием информации из упомянутой базы данных логических связей логических разделов, причем упомянутые логические разделы содержат первые семантические части и вторые семантические части;B) form a second data structure of a structured data array containing the elements of said second data structure, said elements of the second data structure containing logical structures of logical partitions of said elements of the first data structure generated using information from said logical linking database of logical partitions, said logical sections contain the first semantic parts and the second semantic parts;

Г) формируют базу данных семантических частей логических разделов из упомянутых вторых семантических частей, причем упомянутые вторые семантические части исключаются из соответствующих упомянутых логических разделов;D) form a database of semantic parts of logical sections from said second semantic parts, wherein said second semantic parts are excluded from the corresponding mentioned logical sections;

Д) формируют грамматически и орфографически верные семантические части упомянутых логических разделов путем лингвистических преобразований над упомянутыми семантическими частями;E) form the grammatically and spelling-correct semantic parts of the mentioned logical sections by linguistic transformations over the mentioned semantic parts;

Е) формируют итоговую структуру данных структурированного массива данных, содержащую элементы упомянутой итоговой структуры данных, причем упомянутые элементы итоговой структуры данных содержат логические конструкции, содержащие, по меньшей мере, упомянутые грамматически и орфографически верные семантические части логических разделов.E) form the final data structure of the structured data array containing the elements of the said final data structure, said elements of the final data structure containing logical structures containing at least the grammatically and spelling-correct semantic parts of logical sections.

Варианты осуществления настоящего изобретения относятся к способу, устройству, системе и машиночитаемому носителю для обеспечения эффективного преобразования структурированного массива данных, содержащего, по меньшей мере, текст на естественном языке.Embodiments of the present invention relate to a method, apparatus, system, and computer-readable medium for efficiently converting a structured data array containing at least natural language text.

КРАТКОЕ ОПИСАНИЕ ЧЕРТЕЖЕЙBRIEF DESCRIPTION OF THE DRAWINGS

Иллюстративные варианты осуществления настоящего изобретения описываются далее подробно со ссылкой на прилагаемые чертежи, которые включены в данный документ посредством ссылки и на которых:Illustrative embodiments of the present invention are described below in detail with reference to the accompanying drawings, which are incorporated herein by reference and in which:

На фиг.1 изображена общая схема выполнения этапов заявленного способа преобразования структурированного массива данных, содержащего, по меньшей мере, текст на естественном языке.Figure 1 shows a General diagram of the steps of the claimed method of converting a structured data array containing at least natural language text.

На фиг.2 изображена общая схема этапа формирования первой структуры данных.Figure 2 shows a General diagram of the stage of formation of the first data structure.

На фиг.3 изображена общая структура исходной структуры данных, из которой сформирована первая структура данных.Figure 3 shows the general structure of the original data structure from which the first data structure is formed.

На фиг.4 изображена общая схема этапа формирования базы данных логических связей логических разделов.Figure 4 shows a General diagram of the stage of forming a database of logical connections of logical partitions.

На фиг.5 изображен общий принцип формирования базы данных логических связей логических разделов.Figure 5 shows the general principle of forming a database of logical connections of logical partitions.

На фиг.6 изображена общая схема этапа формирования второй структуры данных.Figure 6 shows a General diagram of the stage of formation of the second data structure.

На фиг.7 изображена общая структура второй структуры данных.7 shows the general structure of a second data structure.

На фиг.8 изображена общая схема этапа формирования базы данных семантических частей.On Fig shows a General diagram of the stage of forming a database of semantic parts.

На фиг.9 изображен общий принцип формирования базы данных семантических частей.Figure 9 shows the general principle of forming a database of semantic parts.

На фиг.10 изображена общая схема этапа формирования грамматически и орфографически верных семантических частей.Figure 10 shows a General diagram of the stage of formation of grammatically and spelling correct semantic parts.

На фиг.11 изображена общая схема второй структуры данных, полученной после выполнения этапа формирования грамматически и орфографически верных семантических частей.11 shows a general diagram of a second data structure obtained after performing the step of forming grammatically and spelling-correct semantic parts.

На фиг.12 изображена общая схема этапа формирования итоговой структуры данных.On Fig shows a General diagram of the stage of formation of the final data structure.

На фиг.13 изображена общая схема итоговой структуры данных.On Fig shows a General diagram of the final data structure.

На фиг.14 изображена общая структура элементов итоговой структуры данных.On Fig shows the General structure of the elements of the final data structure.

На фиг.15 изображена общая схема системы преобразования структурированного массива данных, содержащей устройство преобразования структурированного массива данных.On Fig shows a General diagram of a system for converting a structured data array containing a device for converting a structured data array.

ВАРИАНТЫ ОСУЩЕСТВЛЕНИЯ ИЗОБРЕТЕНИЯMODES FOR CARRYING OUT THE INVENTION

Далее приводятся варианты осуществления настоящего изобретения, раскрывающие примеры его реализации в частных исполнениях. Тем не менее само описание не предназначено для ограничения объема прав, предоставляемых данным патентом. Скорее следует исходить из того, что заявленное изобретение также может быть осуществлено другими способами таким образом, что будет включать в себя отличающиеся этапы или комбинации этапов, аналогичных этапам, описанным в данном документе, в сочетании с другими существующими и будущими технологиями.The following are embodiments of the present invention, revealing examples of its implementation in private versions. However, the description itself is not intended to limit the scope of the rights granted by this patent. Rather, it should be assumed that the claimed invention can also be carried out in other ways in such a way that it will include different steps or combinations of steps similar to the steps described herein, in combination with other existing and future technologies.

Заявленный способ будет рассмотрен на примере обработки структурированного массива данных, содержащего текст на естественном языке, представляющего собой, не ограничиваясь, нормативно-правовые акты (НПА). Для специалиста должно быть очевидно, что, несмотря на то что в данном конкретном примере реализации настоящего изобретения осуществляется преобразование НПА, такой способ преобразования может быть применен в отношении любого структурированного массива данных, схожего с НПА.The claimed method will be considered on the example of processing a structured data array containing text in a natural language, which is, but is not limited to, legal acts (NLA). It should be obvious to a person skilled in the art that, although in this particular implementation example of the present invention, the conversion of NLA is carried out, such a conversion method can be applied to any structured data array similar to the NLA.

НПА - это документ, характеризующийся следующими признаками:NLA is a document characterized by the following features:

1) НПА имеют правотворческий характер: в них нормы права либо устанавливаются, либо изменяются, либо отменяются. Нормативные правовые акты - это носители правовых норм;1) NLAs are of a law-making nature: in them, the rules of law are either established, changed or canceled. Normative legal acts are carriers of legal norms;

2) НПА содержит правовые инструменты, с помощью которых осуществляется правовое регулирующее воздействие.2) the regulatory legal act contains legal instruments with the help of which the legal regulatory influence is implemented.

3) НПА издается только в пределах компетенции правотворческого органа;3) NPA is published only within the competence of the law-making body;

4) НПА облекается в документальную форму и имеет следующие реквизиты: вид нормативного акта, его наименование, орган, его принявший, дату, место принятия акта, номер;4) NPA is clothed in documentary form and has the following details: type of regulatory act, its name, authority that adopted it, date, place of adoption of the act, number;

5) НПА не является хаотичным набором положений (предложений), а имеет определенную структуру;5) NPA is not a chaotic set of provisions (proposals), but has a certain structure;

6) НПА должен соответствовать конституции или иному вышестоящему НПА, имеющему большую юридическую силу.6) The NLA must be consistent with the constitution or another higher NLA, which has greater legal force.

7) НПА обязательно подлежит доведению до сведения граждан и организаций, т.е. опубликованию, и лишь только после этого государство имеет право требовать его неукоснительного исполнения исходя из презумпции знания закона и налагать санкции за его неисполнение.7) NPA must be brought to the attention of citizens and organizations, i.e. publication, and only after that the state has the right to demand its rigorous execution on the basis of the presumption of knowledge of the law and impose sanctions for its non-fulfillment.

Необходимо отметить, что под термином «структурированный массив данных» в рамках заявленного изобретения может рассматриваться не только совокупность НПА, но и отдельный самостоятельный НПА, представляющий собой, например: Конституцию, закон, указ, постановление и т.п. Отдельный НПА может состоять, например, из частей, глав, разделов, статей. При этом инструментом правового регулирующего воздействия НПА является юридическое правило, обличенное в структуру нормативного предписания, которое, в свою очередь, является элементом (частью) нормы права (правовой нормы).It should be noted that the term “structured data array” within the framework of the claimed invention can be considered not only a set of legal acts, but also a separate independent legal acts, which is, for example: the Constitution, law, decree, regulation, etc. A separate LA can, for example, consist of parts, chapters, sections, articles. At the same time, the instrument of the legal regulatory impact of the legal acts is the legal rule, encapsulated in the structure of the regulatory prescription, which, in turn, is an element (part) of the rule of law (legal norm).

В первом варианте осуществления настоящего изобретения обеспечивается способ преобразования структурированного массива данных, содержащего, по меньшей мере, текст на естественном языке, причем упомянутый способ содержит, по меньшей мере, этапы, на которых:In a first embodiment of the present invention, there is provided a method for converting a structured data array containing at least natural language text, said method comprising at least the steps of:

А) формируют первую структуру данных структурированного массива данных, содержащую элементы упомянутой первой структуры данных, причем упомянутые элементы первой структуры данных содержат первые логические разделы и вторые логические разделы;A) form the first data structure of a structured data array containing the elements of the aforementioned first data structure, said elements of the first data structure containing the first logical partitions and second logical partitions;

В) формируют вторую структуру данных структурированного массива данных, содержащую элементы упомянутой второй структуры данных, причем упомянутые элементы второй структуры данных содержат логические конструкции логических разделов упомянутых элементов первой структуры данных, сформированные с использованием информации из упомянутой базы данных логических связей логических разделов, причем упомянутые логические разделы содержат первые семантические части и вторые семантические части;C) form a second data structure of a structured data array containing elements of said second data structure, said elements of the second data structure containing logical structures of logical partitions of said elements of the first data structure generated using information from said logical linking database of logical partitions, said logical sections contain the first semantic parts and the second semantic parts;

Во втором варианте осуществления настоящего изобретения обеспечивается способ преобразования структурированного массива данных, содержащего, по меньшей мере, текст на естественном языке, причем упомянутый способ содержит, по меньшей мере, этапы, на которых:In a second embodiment of the present invention, there is provided a method for converting a structured data array containing at least natural language text, said method comprising at least the steps of:

А) идентифицируют исходную структуру данных структурированного массива данных; идентифицируют элементы исходной структуры данных; идентифицируют первые логические разделы упомянутых элементов исходной структуры данных и вторые логические разделы упомянутых элементов исходной структуры данных; и формируют первую структуру данных структурированного массива данных, содержащую элементы упомянутой первой структуры данных, причем упомянутые элементы первой структуры данных содержат первые логические разделы и вторые логические разделы;A) identify the original data structure of a structured data array; identify elements of the original data structure; identifying the first logical partitions of said elements of the original data structure and the second logical partitions of said elements of the original data structure; and form a first data structure of a structured data array containing the elements of said first data structure, said elements of the first data structure containing first logical partitions and second logical partitions;

В третьем варианте осуществления настоящего изобретения обеспечивается способ преобразования структурированного массива данных, содержащего, по меньшей мере, текст на естественном языке, причем упомянутый способ содержит, по меньшей мере, этапы, на которых:In a third embodiment of the present invention, there is provided a method for converting a structured data array containing at least natural language text, said method comprising at least the steps of:

Б) идентифицируют элементы первой структуры данных, содержащие один упомянутый первый логический раздел, и элементы первой структуры данных, содержащие один упомянутый второй логический раздел; идентифицируют элементы первой структуры данных, содержащие более одного упомянутого первого логического раздела, и элементы первой структуры данных, содержащие более одного упомянутого второго логического раздела; в элементах первой структуры данных, содержащих более одного упомянутого первого логического раздела, и в элементах первой структуры данных, содержащих более одного упомянутого второго логического раздела, идентифицируют логические связи между упомянутыми первыми логическими разделами или между упомянутыми вторыми логическими разделами; в элементах первой структуры данных, содержащих более одного упомянутого первого логического раздела, и в элементах первой структуры данных, содержащих более одного упомянутого второго логического раздела, идентифицируют элементы первой структуры данных, не имеющие логической связи между логическими разделами; и формируют базу данных логических связей логических разделов элементов первой структуры данных;B) identify the elements of the first data structure containing one mentioned first logical section, and the elements of the first data structure containing one mentioned second logical section; identifying elements of a first data structure containing more than one of said first logical partitions, and elements of a first data structure containing more than one of said second logical partition; in the elements of the first data structure containing more than one of said first logical partitions, and in the elements of the first data structure containing more than one of said second logical partitions, logical relationships between said first logical partitions or between said second logical partitions are identified; in the elements of the first data structure containing more than one of said first logical partitions, and in the elements of the first data structure containing more than one of said second logical partitions, elements of the first data structure that do not have a logical connection between logical partitions are identified; and form a database of logical relationships of logical partitions of the elements of the first data structure;

В четвертом варианте осуществления настоящего изобретения обеспечивается способ преобразования структурированного массива данных, содержащего, по меньшей мере, текст на естественном языке, причем упомянутый способ содержит, по меньшей мере, этапы, на которых:In a fourth embodiment of the present invention, there is provided a method for converting a structured data array containing at least natural language text, said method comprising at least the steps of:

В) формируют логические конструкции логических разделов элементов первой структуры данных, используя информацию из базы данных логических связей логических разделов элементов первой структуры данных и логические разделы упомянутых элементов первой структуры данных, содержащих один упомянутый первый логический раздел, и логические разделы упомянутых элементов первой структуры данных, содержащих один упомянутый второй логический раздел; и формируют вторую структуру данных, содержащую элементы второй структуры данных, причем упомянутые элементы второй структуры данных представляют собой сформированные логические конструкции логических разделов первой структуры данных;C) form logical structures of logical partitions of elements of the first data structure using information from a database of logical connections of logical partitions of elements of the first data structure and logical partitions of said elements of the first data structure containing one of the first logical partitions and logical partitions of the said elements of the first data structure, containing one said second logical partition; and form a second data structure containing elements of the second data structure, said elements of the second data structure being formed logical structures of logical partitions of the first data structure;

В пятом варианте осуществления настоящего изобретения обеспечивается способ преобразования структурированного массива данных, содержащего, по меньшей мере, текст на естественном языке, причем упомянутый способ содержит, по меньшей мере, этапы, на которых:In a fifth embodiment of the present invention, there is provided a method for converting a structured data array containing at least natural language text, said method comprising at least the steps of:

Г) идентифицируют первые логические разделы элементов второй структуры данных и вторые логические разделы элементов второй структуры данных; в упомянутых первых логических разделах и вторых логических разделах элементов второй структуры данных идентифицируют первые семантические части и вторые семантические части; и в упомянутых первых и вторых логических разделах элементов второй структуры данных идентифицируют, по меньшей мере, особые семантические части первых логических разделов элементов второй структуры данных и особые семантические части вторых логических разделов элементов второй структуры данных и формируют базу данных особых семантических частей логических разделов элементов второй структуры данных путем перемещения упомянутых особых семантических частей в упомянутую формируемую базу данных особых семантических частей логических разделов элементов второй структуры данных;D) identify the first logical partitions of the elements of the second data structure and the second logical partitions of the elements of the second data structure; in said first logical sections and second logical sections of the elements of the second data structure, the first semantic parts and the second semantic parts are identified; and in said first and second logical partitions of the elements of the second data structure, at least special semantic parts of the first logical partitions of the elements of the second data structure and the special semantic parts of the second logical partitions of the elements of the second data structure are identified and form a database of special semantic parts of the logical partitions of the elements of the second data structures by moving said special semantic parts to said logically generated database of special semantic parts x sections of the elements of the second data structure;

В шестом варианте осуществления настоящего изобретения обеспечивается способ преобразования структурированного массива данных, содержащего, по меньшей мере, текст на естественном языке, причем упомянутый способ содержит, по меньшей мере, этапы, на которых:In a sixth embodiment of the present invention, there is provided a method for converting a structured data array containing at least natural language text, said method comprising at least the steps of:

Д) в упомянутых вторых семантических частях упомянутых вторых логических разделов элементов второй структуры данных идентифицируют, по меньшей мере, уточняющие структуры вторых семантических частей вторых логических разделов; и осуществляют лингвистические преобразования над всеми семантическими частями, за исключением упомянутых особых семантических частей упомянутых первых и вторых логических разделов, для формирования грамматически и орфографически верных семантических частей логических разделов элементов второй структуры данных;E) in said second semantic parts of said second logical sections of the elements of the second data structure, at least the refinement structures of the second semantic parts of the second logical sections are identified; and carry out linguistic transformations over all semantic parts, with the exception of the mentioned special semantic parts of the mentioned first and second logical sections, to form grammatically and spelling-correct semantic parts of logical sections of the elements of the second data structure;

В седьмом варианте осуществления настоящего изобретения обеспечивается способ преобразования структурированного массива данных, содержащего, по меньшей мере, текст на естественном языке, причем упомянутый способ содержит, по меньшей мере, этапы, на которых:In a seventh embodiment of the present invention, there is provided a method for converting a structured data array containing at least natural language text, said method comprising at least the steps of:

Е) формируют из первых грамматически и орфографически верных семантических частей вторых логических разделов элементов второй структуры данных и упомянутых грамматически и орфографически верных уточняющих структур вторых семантических частей вторых логических разделов элементов второй структуры данных смысловые сочетания грамматически и орфографически верных семантических частей вторых логических разделов элементов третьей структуры данных; и формируют итоговую структуру данных, содержащую элементы итоговой структуры данных, причем упомянутые элементы итоговой структуры данных представляют собой логические конструкции, содержащие упомянутые грамматически и орфографически верные семантические части логических разделов элементов второй структуры данных.E) form semantic combinations of grammatically and spelling correct semantic parts of the second logical sections of the elements of the third structure from the first grammatically and orthographically correct semantic parts of the second logical sections of the elements of the second data structure and the grammatically and spelling correct semantic parts of the second semantic parts of the second logical sections of the elements of the second data structure data; and form the final data structure containing the elements of the final data structure, and the said elements of the final data structure are logical constructs containing the mentioned grammatically and spelling correct semantic parts of the logical sections of the elements of the second data structure.

В восьмом варианте осуществления настоящего изобретения обеспечивается способ преобразования структурированного массива данных, содержащего, по меньшей мере, текст на естественном языке, причем упомянутый способ содержит, по меньшей мере, этапы, на которых:In an eighth embodiment of the present invention, there is provided a method for converting a structured data array containing at least natural language text, said method comprising at least the steps of:

Е) формируют из первых грамматически и орфографически верных семантических частей вторых логических разделов элементов второй структуры данных и упомянутых грамматически и орфографически верных уточняющих структур вторых семантических частей вторых логических разделов элементов второй структуры данных смысловые сочетания грамматически и орфографически верных семантических частей вторых логических разделов элементов третьей структуры данных; и формируют итоговую структуру данных, содержащую элементы итоговой структуры данных, причем упомянутые элементы итоговой структуры данных представляют собой логические конструкции, содержащие упомянутые грамматически и орфографически верные семантические части логических разделов элементов второй структуры данных; причем упомянутые логические конструкции из упомянутой итоговой структуры данных дополнительно могут содержать упомянутые сформированные смысловые сочетания грамматически и орфографически верных семантических частей вторых логических разделов элементов второй структуры данных.E) form semantic combinations of grammatically and spelling correct semantic parts of the second logical sections of the elements of the third structure from the first grammatically and orthographically correct semantic parts of the second logical sections of the elements of the second data structure and the grammatically and spelling correct semantic parts of the second semantic parts of the second logical sections of the elements of the second data structure data; and form the final data structure containing the elements of the final data structure, and the said elements of the final data structure are logical constructs containing the mentioned grammatically and spelling correct semantic parts of the logical sections of the elements of the second data structure; moreover, said logical constructions from said final data structure may additionally contain said formed semantic combinations of grammatically and spelling-correct semantic parts of second logical sections of elements of the second data structure.

В девятом варианте осуществления настоящего изобретения обеспечивается способ преобразования структурированного массива данных, содержащего, по меньшей мере, текст на естественном языке, причем упомянутый способ содержит, по меньшей мере, этапы, на которых:In a ninth embodiment of the present invention, there is provided a method for converting a structured data array containing at least natural language text, said method comprising at least the steps of:

В) формируют логические конструкции логических разделов элементов первой структуры данных, используя информацию из базы данных логических связей логических разделов элементов первой структуры данных, и логические разделы упомянутых элементов первой структуры данных, содержащих один упомянутый первый логический раздел, и логические разделы упомянутых элементов первой структуры данных, содержащих один упомянутый второй логический раздел; и формируют вторую структуру данных, содержащую элементы второй структуры данных, причем упомянутые элементы второй структуры данных представляют собой сформированные логические конструкции логических разделов первой структуры данных;C) form logical structures of logical partitions of elements of the first data structure, using information from a database of logical relationships of logical partitions of elements of the first data structure, and logical partitions of said elements of the first data structure containing one of the first logical partitions, and logical partitions of said elements of the first data structure containing one said second logical partition; and form a second data structure containing elements of the second data structure, said elements of the second data structure being formed logical structures of logical partitions of the first data structure;

В десятом варианте осуществления настоящего изобретения обеспечивается способ преобразования структурированного массива данных, содержащего, по меньшей мере, текст на естественном языке, причем упомянутый способ содержит, по меньшей мере, этапы, на которых:In a tenth embodiment of the present invention, there is provided a method for converting a structured data array containing at least natural language text, said method comprising at least the steps of:

При этом для специалиста в области техники, к которой относится настоящее изобретение, должно быть очевидным, что варианты осуществления изобретения со второго по десятый характеризуют уточненные этапы способа, охарактеризованного первым вариантом осуществления изобретения, и иные варианты осуществления изобретения могут быть реализованы, причем такие иные варианты осуществления изобретения будут включать различные комбинации уточненных этапов способа.Moreover, for a specialist in the field of technology to which the present invention relates, it should be obvious that the second to tenth embodiments characterize the specified steps of the method described by the first embodiment of the invention, and other embodiments of the invention can be implemented, and such other options embodiments of the invention will include various combinations of the specified process steps.

В одиннадцатом варианте осуществления настоящего изобретения обеспечивается устройство преобразования структурированного массива данных, содержащее, по меньшей мере:In an eleventh embodiment of the present invention, there is provided a device for converting a structured data array comprising at least:

один или более процессоров;one or more processors;

один или более модулей ввода/вывода (I/O); иone or more input / output (I / O) modules; and

память, содержащую код программы, который при выполнении побуждает упомянутые один или более процессоры упомянутого устройства и/или устройства, связанного с упомянутым устройством, выполнять действия способа по любому из вариантов осуществления настоящего изобретения с первого по десятый, и содержащую подлежащие преобразованию один или несколько структурированных массивов данных, содержащих, по меньшей мере, текст на естественном языке.a memory containing program code, which upon execution causes said one or more processors of said device and / or device associated with said device to perform the actions of the method according to any of the first to tenth embodiments of the present invention and containing one or more structured ones to be converted data sets containing at least natural language text.

В двенадцатом варианте осуществления настоящего изобретения обеспечивается устройство преобразования структурированного массива данных, содержащее, по меньшей мере:In a twelfth embodiment of the present invention, there is provided a device for converting a structured data array comprising at least:

один или более процессоров;one or more processors;

память, содержащую код программы, который при выполнении побуждает упомянутые один или более процессоры упомянутого устройства и/или устройства, связанного с упомянутым устройством, выполнять действия способа по любому из вариантов осуществления настоящего изобретения с первого по десятый, и содержащую подлежащие преобразованию один или несколько структурированных массивов данных, содержащих, по меньшей мере, текст на естественном языке, причем упомянутые подлежащие преобразованию один или несколько структурированных массивов данных являются загружаемыми, а упомянутое устройство выполнено с возможностью соединения с базой данных, в которой хранятся упомянутые загружаемые подлежащие преобразованию один или несколько структурированных массивов данных, для осуществления загрузки в упомянутую память устройства, по меньшей мере, одного загружаемого подлежащего преобразованию структурированного массива данных.a memory containing program code, which upon execution causes said one or more processors of said device and / or device associated with said device to perform the actions of the method according to any of the first to tenth embodiments of the present invention and containing one or more structured ones to be converted data arrays containing at least natural language text, said one or more structured data arrays to be converted s are downloadable, and said device is configured to connect to a database in which said downloadable one or more structured data arrays to be converted are stored to load at least one loadable structured data array to be converted into said device memory.

В тринадцатом варианте осуществления настоящего изобретения обеспечивается система преобразования структурированного массива данных, содержащая, по меньшей мере:In a thirteenth embodiment of the present invention, there is provided a structured data array transformation system comprising at least:

одно или более устройств, выполненных в виде устройств по любому из одиннадцатого или двенадцатого вариантов осуществления настоящего изобретения;one or more devices made in the form of devices according to any one of the eleventh or twelfth embodiments of the present invention;

один или несколько серверов, обеспечивающих регулирование обменом данных в системе;one or more servers providing regulation of data exchange in the system;

одну или несколько баз данных, предназначенных для хранения данных, выполненных с возможностью взаимодействия с упомянутыми одним или более устройствами;one or more databases for storing data configured to interact with said one or more devices;

одну или более сетей передачи данных, через которые осуществляется взаимодействие упомянутых устройств, серверов и баз данных.one or more data transmission networks through which said devices, servers and databases interact.

В четырнадцатом варианте осуществления настоящего изобретения обеспечивается система преобразования структурированного массива данных, содержащая, по меньшей мере:In a fourteenth embodiment of the present invention, there is provided a structured data array transformation system comprising at least:

одну или более сетей передачи данных, через которые осуществляется взаимодействие упомянутых устройств, серверов и баз данных; причемone or more data transmission networks through which said devices, servers and databases interact; moreover

способ по любому из вариантов осуществления настоящего изобретения с первого по десятый осуществляется одним или более упомянутыми серверами, а упомянутые устройства представляют собой тонкий клиент.the method according to any one of the first to tenth embodiments of the present invention is carried out by one or more of said servers, and said devices are a thin client.

В пятнадцатом варианте осуществления настоящего изобретения обеспечивается система преобразования структурированного массива данных, содержащая, по меньшей мере:In a fifteenth embodiment of the present invention, there is provided a structured data array transformation system comprising at least:

способ по любому из вариантов осуществления настоящего изобретения с первого по десятый осуществляется одним или более упомянутыми серверами, а упомянутые устройства представляют собой тонкий клиент; причемthe method according to any one of the first to tenth embodiments of the present invention is carried out by one or more of said servers, and said devices are a thin client; moreover

упомянутая база данных служит для хранения данных, представляющих собой, по меньшей мере, одно из: код программы, который при выполнении побуждает упомянутые один или более процессоры упомянутого устройства и/или устройства, связанного с упомянутым устройством, выполнять действия способа по любому из вариантов осуществления настоящего изобретения с первого по десятый, подлежащие преобразованию один или несколько структурированных массивов данных, содержащих, по меньшей мере, текст на естественном языке.said database is used to store data representing at least one of: a program code, which upon execution causes said one or more processors of said device and / or device associated with said device, to perform the actions of the method according to any one of the embodiments first to tenth of the present invention, one or more structured data arrays to be converted containing at least natural language text to be converted.

В шестнадцатом варианте осуществления настоящего изобретения обеспечивается система по любому из вариантов осуществления настоящего изобретения с тринадцатого по пятнадцатый, причем упомянутая сеть передачи данных представляет собой одно из локальная сеть (LAN), глобальная сеть (WAN), информационно-телекоммуникационная сеть Интернет, виртуальная частная сеть (VPN).In a sixteenth embodiment of the present invention, there is provided a system according to any of the thirteenth to fifteenth embodiments of the present invention, said data network being one of a local area network (LAN), wide area network (WAN), telecommunications network Internet, virtual private network (VPN).

В семнадцатом варианте осуществления настоящего изобретения обеспечивается машиночитаемый носитель данных, содержащий код программы, который при выполнении побуждает процессор или процессоры устройства, с которым взаимодействует машиночитаемый носитель данных, выполнять действия способа по любому из вариантов осуществления настоящего изобретения с первого по десятый.In a seventeenth embodiment of the present invention, there is provided a computer-readable storage medium comprising program code that, upon execution, causes a processor or processors of a device with which the computer-readable storage medium interacts to perform method steps according to any one of the first to tenth embodiments of the present invention.

ДЕТАЛЬНОЕ ОПИСАНИЕ ЧЕРТЕЖЕЙDETAILED DESCRIPTION OF THE DRAWINGS

Описанные в данном разделе возможные осуществления вариантов настоящего изобретения представлены на неограничивающих объем правовой охраны примерах, применительно к конкретным вариантам осуществления настоящего изобретения, которые во всех их аспектах предполагаются иллюстративными и не накладывающими ограничения. Альтернативные варианты реализации настоящего изобретения, не выходящие за пределы объема его правовой охраны, являются очевидными специалистам в данной области, имеющим обычную квалификацию, на которых это изобретение рассчитано.The possible implementations of the embodiments of the present invention described in this section are presented in non-limiting examples, with reference to specific embodiments of the present invention, which in all their aspects are assumed to be illustrative and not limiting. Alternative embodiments of the present invention, not beyond the scope of its legal protection, are obvious to experts in this field with the usual qualifications on which this invention is intended.

На фиг.1 в качестве примера, но не ограничения, изображена общая схема выполнения этапов заявленного способа 100 преобразования структурированного массива данных, содержащего, по меньшей мере, текст на естественном языке. Заявленный способ 100 преобразования структурированного массива данных, содержащего, по меньшей мере, текст на естественном языке, характеризуется выполнением этапа 101 формирования первой структуры данных, на котором формируют первую структуру данных структурированного массива данных, содержащую элементы упомянутой первой структуры данных, причем упомянутые элементы первой структуры данных содержат первые логические разделы и вторые логические разделы; выполнением этапа 102 формирования базы данных логических связей логических разделов, на котором формируют базу данных логических связей логических разделов упомянутых элементов первой структуры данных; выполнением этапа 103 формирования второй структуры данных, на котором формируют вторую структуру данных структурированного массива данных, содержащую элементы упомянутой второй структуры данных, причем упомянутые элементы второй структуры данных содержат логические конструкции логических разделов упомянутых элементов первой структуры данных, сформированные с использованием информации из упомянутой базы данных логических связей логических разделов, причем упомянутые логические разделы содержат первые семантические части и вторые семантические части; выполнением этапа 104 формирования базы данных семантических частей логических разделов, на котором формируют базу данных семантических частей логических разделов из упомянутых вторых семантических частей, причем упомянутые вторые семантические части исключаются из соответствующих упомянутых логических разделов; выполнением этапа 105 формирования грамматически и орфографически верных семантических частей, на котором формируют грамматически и орфографически верные семантические части упомянутых логических разделов путем лингвистических преобразований над упомянутыми семантическими частями; и выполнением этапа 106 формирования итоговой структуры данных, на котором формируют итоговую структуру данных структурированного массива данных, содержащую элементы упомянутой итоговой структуры данных, причем упомянутые элементы итоговой структуры данных содержат логические конструкции, содержащие, по меньшей мере, упомянутые грамматически и орфографически верные семантические части логических разделов.Figure 1 as an example, but not limitation, shows a General diagram of the steps of the claimed method 100 for converting a structured data array containing at least natural language text. The claimed method 100 for converting a structured data array containing at least natural language text is characterized by performing step 101 of generating a first data structure in which a first data structure of a structured data array containing elements of said first data structure is formed, said elements of the first structure data contains first logical partitions and second logical partitions; performing step 102 of forming a database of logical connections of logical partitions, on which a database of logical connections of logical partitions of said elements of the first data structure is formed; performing step 103 of generating a second data structure, which forms a second data structure of a structured data array containing elements of said second data structure, said elements of the second data structure containing logical structures of logical partitions of said elements of the first data structure generated using information from said database logical connections of logical sections, moreover, the mentioned logical sections contain the first semantic parts and second semantic part; performing step 104 of forming a database of semantic parts of logical partitions, on which a database of semantic parts of logical partitions from said second semantic parts is formed, said second semantic parts being excluded from corresponding logical partitions; performing step 105 of forming grammatically and spelling correct semantic parts, in which grammatically and spelling correct semantic parts of said logical sections are formed by linguistic transformations over said semantic parts; and performing step 106 of forming the final data structure, which forms the final data structure of the structured data array containing the elements of the said final data structure, said elements of the final data structure containing logical constructs containing at least the grammatically and spelling-correct semantic parts of the logical sections.

На фиг.2 в качестве примера, но не ограничения, изображена общая схема выполнения этапов этапа 101 формирования первой структуры данных. Этап 101 характеризуется выполнением этапа 1011 идентификации исходной структуры, на котором идентифицируют исходную структуру данных 1 структурированного массива данных; выполнением этапа 1012 идентификации элементов, на котором идентифицируют элементы 11 исходной структуры данных 1; выполнением этапа 1013 идентификации логических разделов, на котором идентифицируют первые логические разделы 111 элементов 11 исходной структуры данных 1 и вторые логические разделы 112 элементов 11 исходной структуры данных 1; и выполнением этапа 1013 формирования первой структуры данных, на котором формируют первую структуру данных 2 структурированного массива данных, содержащую элементы 21 первой структуры данных, которые являются элементами 11 исходной структуры данных 1, причем элементы 21 первой структуры данных 2 содержат первые логические разделы 111 элементов 11 исходной структуры данных 1, и содержащую элементы 22 первой структуры данных, причем элементы 22 первой структуры данных 2 содержат вторые логические разделы 112, элементов 11 исходной структуры данных 1.Figure 2 as an example, but not limitation, shows a General diagram of the steps of step 101 of the formation of the first data structure. Step 101 is characterized by performing step 1011 of identifying the source structure, in which the source data structure 1 of the structured data array is identified; performing step 1012 of the identification of the elements, which identify the elements 11 of the original data structure 1; by performing logical partition identification step 1013, in which the first logical partitions 111 of the elements 11 of the original data structure 1 and the second logical partitions 112 of the elements 11 of the original data structure 1 are identified; and performing step 1013 of forming the first data structure, which forms the first data structure 2 of the structured data array containing the elements 21 of the first data structure, which are the elements 11 of the original data structure 1, the elements 21 of the first data structure 2 containing the first logical sections 111 of the elements 11 the original data structure 1, and containing the elements 22 of the first data structure, the elements 22 of the first data structure 2 containing the second logical sections 112, elements 11 of the original data structure 1.

На фиг.3 в качестве примера, но не ограничения, изображена общая структура исходной структуры данных 1, из которой формируется первая структура данных 2. Исходная структура данных 1 представляет собой структурированный массив данных, содержащий, по меньшей мере, текст на естественном языке. Как уже упоминалось выше, такой массив данных может представлять собой, в частности, нормативно-правовой акт (НПА). Исходная структура данных 1 содержит элементы 11, которые представляют собой положения, являющиеся предложениями - грамматически организованными соединениями слов. При этом каждое предложение характеризуется смысловой законченностью. Идентификация предложений на этапе 1012 идентификации элементов производится путем выявления в тексте на естественном языке признаков окончания предложения. Признаками окончания предложения при этом являются: точка, точка с запятой, многоточие и т.п. Идентификация предложений осуществляется в совокупности с выявлением признаков начала предложения. Признаками начала предложения являются: заглавная буква, цифра, цифра с закрывающей скобкой, цифра с точкой и т.п. При этом при идентификации также учитывается наличие определенных сочетаний знаков препинаний, а именно знаков пунктуации - точки, запятой, скобки, двоеточия и т.п., - словоразделителей, а именно - пробела и т.п., - и типографики - абзаца, номера, градуса и т.п. На этапе 1012 идентификации элементов осуществляется выявление элементарных смысловых единиц - предложений, являющихся суждениями. Простое суждение - это суждение, никакая часть которого не является суждением. Языковой формой выражения суждения являются повествовательные предложения. Идентификация элементов 11 исходной структуры данных 1 осуществляется путем выявления и дефрагментации предложения на первичные составляющие предложения, а именно - на слова, частицы, союзы, предлоги и т.п., и знаки препинания. После чего из первичных элементов формируются понятия, выраженные в отдельных словах и/или словосочетаниях на основании различных справочников и словарей. Затем из сформированных понятий формируются простые суждения, которые представляют собой группы взаимосвязанных понятий, при этом взаимосвязанность понятий определяется на основании синтаксических или иных связей между понятиями. Для формирования простых суждений осуществляется лингвистическо-семантический анализ элементов 11 исходной структуры данных 1, посредством чего в элементах 11 осуществляется выявление структурных элементов простых суждений. Под структурными элементами простых суждений понимаются субъект суждения, предикат суждения, связка и кванторное слово. Субъектом суждения (S) является понятие, выражающее предмет суждения, т.е. то, о чем говорится в данном суждении. Предикатом суждения (Р) является понятие, выражающее ту или иную информацию о предмете суждения. Субъект суждения и предикат суждения - основные структурные элементы суждения, являющиеся терминами суждения. Связь между субъектом суждения и предикатом суждения, отражающая реальные отношения между мыслимыми в понятиях объектами, раскрывается посредством логической связки. В русском языке связка выражается словами: «есть» («не есть»), «является» («не является»), «имеется» («не имеется») и т.д., обозначается тире, двоеточием, а также может подразумеваться, выражаясь согласованием слов («Идет дождь», «Собака лает»). Связка - это логическая постоянная, поскольку в ней заключено неизменное содержание - она всякий раз служит показателем наличия или отсутствия чего-либо у предмета мысли. «Кванторное слово» (например, «каждый», «все», «ни один», «некоторый» и т.д.) указывает, относится ли информация о предикате суждения ко всему объему понятия, выражающему субъект суждения, или к его части. Например, в суждении «Всякое преступление - противоправное деяние» субъектом суждения является понятие «преступление», предикатом суждения - «противоправное деяние», - связка выражена знаком тире, а кванторное слово «всякое» указывает на то, что характеристика «противоправное деяние» относится ко всему объему (к каждому элементу) понятия «преступление». В самом общем виде простое суждение можно выразить формулой: «S есть (не есть) P». Таким образом, в результате идентификации исходной структуры данных 1 элементы 11 разделяются на то количество суждений, которое в них заложено, т.е. число суждений равно количеству логических разделов в предложении. Далее осуществляется идентификация первых логических разделов 111 и вторых логических разделов 112 элементов 11 исходной структуры данных 1. Идентификация осуществляется на основании результатов идентификации выявленных суждений, определенных как сложные суждения. Сложные суждения - это группа простых суждений, в которых присутствует связь между отдельными суждениями, установленная с помощью логических союзов «и», «или», «если…, то…», «тогда и только тогда…, когда», «неверно, что…». Виды связей между отдельными суждениями выражаются соответствующими логическими связками и приведены в Таблице 1.Figure 3 shows, by way of example, but not limitation, the general structure of the original data structure 1 from which the first data structure 2 is formed. The original data structure 1 is a structured data array containing at least natural language text. As mentioned above, such an array of data may constitute, in particular, a regulatory legal act (NLA). The original data structure 1 contains elements 11, which are positions that are sentences - grammatically organized word combinations. Moreover, each sentence is characterized by semantic completeness. The identification of sentences at step 1012 identification of elements is carried out by identifying in the natural language text signs of the end of the sentence. Signs of the end of the sentence are: period, semicolon, ellipsis, etc. Identification of proposals is carried out in conjunction with the identification of signs of the beginning of the proposal. Signs of the beginning of a sentence are: a capital letter, a number, a number with a closing bracket, a number with a period, etc. Moreover, the identification also takes into account the presence of certain combinations of punctuation marks, namely punctuation marks - periods, semicolons, brackets, colons, etc., - word separators, namely space, etc., and typography - paragraph, number , degrees, etc. At step 1012, the identification of elements reveals elementary semantic units - sentences that are judgments. A simple judgment is a judgment, no part of which is a judgment. The linguistic form of expression of judgment is narrative sentences. The identification of the elements 11 of the initial data structure 1 is carried out by identifying and defragmenting the sentence into the primary components of the sentence, namely words, particles, conjunctions, prepositions, etc., and punctuation marks. After that, concepts expressed in separate words and / or phrases based on various directories and dictionaries are formed from the primary elements. Then, simple judgments are formed from the formed concepts, which are groups of interconnected concepts, while the interconnectedness of concepts is determined on the basis of syntactic or other relationships between the concepts. To form simple judgments, a linguistic-semantic analysis of the elements 11 of the original data structure 1 is carried out, whereby the elements of simple judgments are identified in the elements 11. The structural elements of simple judgments are understood as the subject of judgment, the predicate of judgment, the connective and the quantifier word. The subject of judgment (S) is the concept expressing the subject of judgment, i.e. what is said in this statement. The predicate of judgment (P) is a concept that expresses this or that information about the subject of judgment. The subject of judgment and the predicate of judgment are the basic structural elements of judgment, which are terms of judgment. The connection between the subject of judgment and the predicate of judgment, which reflects the real relationship between objects conceivable in concepts, is revealed through a logical connective. In Russian, the connective is expressed by the words: “is” (“is not”), “is” (“is not”), “is” (“is not available”), etc., is indicated by a dash, a colon, and can also be implied by the agreement of words (“It is raining”, “The dog barks”). A connective is a logical constant, because it contains unchanged content - it always serves as an indicator of the presence or absence of something in the subject of thought. A “quantifier word” (for example, “everyone”, “all”, “none”, “some”, etc.) indicates whether the information on the predicate of a judgment applies to the entire volume of a concept expressing a subject of a judgment, or to its part . For example, in the judgment “Every crime is an illegal act”, the subject of the judgment is the concept of “crime”, the predicate of the judgment is “illegal act”, the connective is expressed by a dash, and the quantifier word “every” indicates that the characteristic “illegal act” refers to to the entire volume (to each element) of the concept of “crime”. In its most general form, a simple proposition can be expressed by the formula: “S is (not is) P”. Thus, as a result of the identification of the initial data structure 1, the elements 11 are divided into the number of judgments that are embedded in them, i.e. the number of judgments is equal to the number of logical sections in the sentence. Next, the identification of the first logical sections 111 and the second logical sections 112 of the elements 11 of the original data structure 1. The identification is based on the results of the identification of the identified judgments, defined as complex judgments. Complex judgments are a group of simple judgments in which there is a connection between separate judgments, established using logical conjunctions "and", "or", "if ... then ...", "if and only if ... when", "it’s wrong, what…". Types of relationships between individual judgments are expressed by the corresponding logical connectives and are shown in Table 1.

Характер связи определяется смыслом логических союзов, который состоит в ответе на вопрос: «При каких условиях сложное суждение будет истинно, а при каких - ложно?». Иначе говоря - при каких сочетаниях истинности и ложности простых суждений, из которых состоит сложное суждение, логический союз определяет истинную связь, а при каких - ложную. Суждение рассматривается как истинное, если даваемое им описание соответствует действительности (реальной ситуации), и как ложное, если не соответствует ей. «Истина» и «ложь» называются истинностными значениями суждения и являются основной логической характеристикой суждений. Смысл логических союзов можно определить с помощью истинностной таблицы (Таблица 2), в которой в столбцах 1 и 2 содержатся все возможные комбинации истинностных значений простых суждений, а в столбцах 3-9 содержатся значения сложного суждения, образованного из простых суждений с помощью соответствующего логического союза. При этом исходные простые суждения обозначают буквами «А», «В», а значения истинности символами: «и» - истинно, «л» - ложно.The nature of the connection is determined by the meaning of logical alliances, which consists in answering the question: “Under what conditions will a complex judgment be true, and under which it will be false?” In other words, under what combinations of truth and falsity of simple judgments that make up a complex proposition, a logical union determines the true connection, and in which - the false one. A judgment is considered true if the description it gives is true (a real situation), and false if it does not. “Truth” and “false” are called truth values of judgment and are the main logical characteristic of judgments. The meaning of logical alliances can be determined using the truth table (Table 2), in which columns 1 and 2 contain all possible combinations of truth values of simple judgments, and columns 3-9 contain the values of a complex proposition formed from simple judgments using the corresponding logical union . In this case, the initial simple judgments are denoted by the letters "A", "B", and the truth values by the symbols: "and" - true, "l" - false.

Для выявления первых логических разделов 111 и вторых логических разделов 112 элементов 11 упомянутой исходной структуры данных 1 необходимо в предложениях текста выявить простые суждения, имеющие между собой импликативную (условную) логическую связь и взаимную (двойную) импликативную (условную) связь. Условное суждение (импликативное суждение) - это сложное суждение, в котором простые суждения объединяются логическим союзом «если…, то…». Например: «Если гражданин нарушает закон, то это порождает ответственность за нарушение» или «Если число делится на 2 без остатка, то оно четное». Условное суждение состоит из двух составляющих его видов суждений. Суждение, записанное после слова «если», называется основанием (предыдущим). Суждение, записанное после слова «то» называется следствием (последующим). Формула условного суждения может быть представлена как «A→B», где A - основание, B - следствие. При этом основания и следствия сами по себе могут быть как простыми суждениями, так и сложными суждениями. Образованное из предыдущего и последующего суждений условное суждение, прежде всего, подразумевает, что не может быть так, чтобы то, о чем говорится в основании, имело место, а то, о чем говорится в следствии - отсутствовало. Иными словами, если основание будет истинным, а следствие - ложным, то такое условное суждение будет ложным. Это условие определяет то, что условное суждение истинно во всех случаях, кроме одного: когда предшествующее есть, а последующего нет, т.е. - суждение по формуле «A→B» - ложно только в одном случае, когда A - истинно, а B - ложно (см. таблицу 2, столбец 6). В форме условных суждений могут быть выражены как объективные зависимости одних объектов от других, так и права и обязанности субъектов правоотношений, связанные с теми или иными условиями. Эквивалентное суждение (двойная импликация) - это сложное суждение, в котором объединяются суждения с взаимной условной зависимостью. Эквивалентные суждения образуются с помощью логического союза «если и только если…, то…», который обозначается символом «↔». Формула эквивалентности: «A↔B», где A, B - суждения, из которых образуется эквивалентное суждение, например: «Человек имеет право на пенсию по возрасту, если и только если он достиг пенсионного возраста». В естественном языке, в том числе, в экономических и юридических текстах, для выражения эквивалентных суждений используются грамматические союзы: «лишь при условии, что…, то…», «только тогда, когда…, то…», «в том и только в том случае, когда…, тогда…». Условия истинности эквивалентных суждений представлены в столбце 7 таблицы 2. Эквивалентное суждение истинно в двух случаях - когда оба составляющих его суждения истинны или, когда оба ложны. Иными словами, связь между элементами эквивалентного суждения можно охарактеризовать как необходимую: «истинность A достаточна для признания истинности B и наоборот» и «ложность A служит показателем ложности B и наоборот». По причине того, что в двойных импликативных суждениях отсутствуют четко выраженные основания и следствия, главным фактором в идентификации первых и вторых логических разделов 111, 112 является наличие в суждениях, находящихся во взаимной импликативной зависимости признаков юридического факта. На примере упомянутого импликативного суждения «Человек имеет право на пенсию по возрасту, если и только если он достиг пенсионного возраста» можно установить, что суждение «Человек имеет право на пенсию по возрасту» не содержит таких признаков, а суждение «если и только если он достиг пенсионного возраста» имеет признак юридического факта, которым является событие - «достижение пенсионного возраста». Таким образом, именно суждение, содержащее признаки юридического факта, признается для целей идентификации основанием (А). Другое суждение признается следствием (В). Термин «юридический факт» означает определенное жизненное обстоятельство, с которым норма права связывает возникновение, изменение или прекращение правоотношения или правоотношений. Если предложение содержит одно простое суждение (или несколько простых суждений) или одно сложное суждение (или несколько сложных суждений), которое не идентифицировано как условное суждение, то в результате лингвистическо-семантического анализа текста, «окружающего» такое предложение, может быть выявлен фактический контекстный вид данного простого суждения - первый логический раздел 111 или второй логический раздел 112. Формируемая на этапе 101 первая структура данных 2 содержит такие элементы, как предложения (элемент 11 исходной структуры данных 1) и суждения логические разделы 111 или 112 элементов 11 исходной структуры данных 1. При этом суждения дополнительно идентифицированы по упомянутой логической связи как основания, т.е. как первый логический раздел 111 элемента исходной структуры данных 11, имеющий логическую связь 1-го типа и являющийся суждением «A», и как следствия, т.е. как второй логический раздел 112 элемента 11 исходной структуры данных 1, имеющий логическую связь 1-го типа и являющийся суждением «B». В первой структуре данных 2 все элементы исходной структуры данных 1 сепарированы по признаку наличия в них упомянутых первых логических разделов 111 или вторых логических разделов 112 исходной структуры данных 1, за счет чего сформированы элементы 21 первой структуры данных 2, имеющие первые логические разделы 111, и элементы 22 первой структуры данных 2, имеющие вторые логические разделы 112.In order to identify the first logical sections 111 and the second logical sections 112 of the elements 11 of the mentioned initial data structure 1, it is necessary to identify simple propositions in the text sentences that have an implicative (conditional) logical connection and mutual (double) implicative (conditional) connection. A conditional proposition (implicative proposition) is a complex proposition in which simple propositions are united by the logical union “if ... then ...”. For example: “If a citizen violates the law, this creates liability for the violation” or “If the number is divisible by 2 without a remainder, then it is even.” A conditional proposition consists of two types of judgments that comprise it. The judgment written after the word “if” is called the basis (previous). The judgment written after the word "that" is called the consequence (subsequent). The conditional propositional formula can be represented as “A → B”, where A is the basis, B is the consequence. At the same time, the foundations and consequences themselves can be both simple judgments and complex judgments. The conditional proposition formed from the previous and subsequent judgments, first of all, implies that it cannot be that what is said in the basis takes place, but what is said in the investigation is absent. In other words, if the ground is true and the effect is false, then such a conditional proposition will be false. This condition determines that a conditional proposition is true in all cases except one: when the previous one is, and the next is not, i.e. - a judgment by the formula “A → B” is false only in one case, when A is true and B is false (see table 2, column 6). In the form of conditional judgments can be expressed as the objective dependence of some objects on others, as well as the rights and obligations of subjects of legal relations related to certain conditions. Equivalent judgment (double implication) is a complex proposition in which propositions with mutual conditional dependence are combined. Equivalent judgments are formed using the logical union “if and only if ... then ...”, which is indicated by the symbol “↔”. Equivalence formula: “A↔B”, where A, B are the judgments from which the equivalent judgment is formed, for example: “A person has the right to an old-age pension if and only if he has reached retirement age”. In the natural language, including economic and legal texts, grammatical unions are used to express equivalent judgments: “only if ..., then ...”, “only when ..., then ...”, “and only in the case when ... then ... ". The truth conditions for equivalent judgments are presented in column 7 of table 2. An equivalent proposition is true in two cases - when both components of its judgments are true or when both are false. In other words, the connection between the elements of an equivalent judgment can be described as necessary: "the truth of A is sufficient to recognize the truth of B and vice versa" and "the falsity of A is an indicator of the falsity of B and vice versa." Due to the fact that double implicative judgments do not have clear grounds and consequences, the main factor in identifying the first and second logical sections 111, 112 is the presence in judgments that are in the mutual implicative dependence of the signs of a legal fact. On the example of the mentioned implicative judgment “A person has the right to an old-age pension if and only if he has reached retirement age”, it can be established that the judgment “A person has a right to an old-age pension” does not contain such signs, and a judgment “if and only if he has reached retirement age ”has a sign of legal fact, which is the event -“ reaching retirement age ”. Thus, it is a judgment containing signs of a legal fact that is recognized for identification purposes by reason (A). A different judgment is recognized by consequence (B). The term "legal fact" means a specific circumstance of life with which a rule of law relates the occurrence, change or termination of a legal relationship or legal relationship. If a sentence contains one simple proposition (or several simple propositions) or one complex proposition (or several complex propositions) that is not identified as a conditional proposition, then a linguistic-semantic analysis of the text “surrounding” such an proposition may reveal the actual contextual the form of this simple proposition is the first logical section 111 or the second logical section 112. The first data structure 2 formed in step 101 contains elements such as sentences (element 11 of the original structure data 1) and the judgment logical partitions 111 or 112 members 11 original data structure 1. In this case, judgment is further identified by said logical link as a base, i.e., as the first logical section 111 of the element of the original data structure 11, which has a logical connection of the 1st type and is the judgment “A”, and as a consequence, i.e. as the second logical section 112 of element 11 of the original data structure 1, having a logical connection of the 1st type and which is the judgment "B". In the first data structure 2, all elements of the original data structure 1 are separated by the presence of the aforementioned first logical partitions 111 or second logical partitions 112 of the original data structure 1, whereby the elements 21 of the first data structure 2 having the first logical partitions 111 are formed, and elements 22 of the first data structure 2 having second logical partitions 112.

На фиг.4 в качестве примера, но не ограничения, изображена общая схема выполнения этапов этапа 102 формирования базы данных 3 логических связей логических разделов. Этап 102 формирования базы данных 3 логических связей логических разделов характеризуется выполнением этапа 1021 идентификации элементов первой структуры данных 2, на котором идентифицируют элементы 21 первой структуры данных 2, содержащие один упомянутый первый логический раздел 111, представляющие собой элементы 31 первой структуры данных 2, и элементы 22 первой структуры данных 2, содержащие один упомянутый второй логический раздел 112, представляющие собой элементы 32 первой структуры данных 2; выполнением этапа 1022 идентификации элементов первой структуры данных 2, на котором идентифицируют элементы 21 первой структуры данных, содержащие более одного упомянутого первого логического раздела 111, представляющие собой элементы 33 первой структуры данных 2 и элементы 22 первой структуры данных, содержащие более одного упомянутого второго логического раздела, представляющие собой элементы 34 первой структуры данных 2; выполнением этапа 1023 идентификации логических связей, на котором среди элементов 33 первой структуры данных 2, содержащих более одного упомянутого первого логического раздела 111 и элементов 34 первой структуры данных 2, содержащих более одного упомянутого второго логического раздела 112, идентифицируют логические связи между упомянутыми первыми логическими разделами 111 или логические связи между упомянутыми вторыми логическими разделами 112; выполнением этапа 1024 идентификации отсутствия логических связей, на котором среди элементов 33 первой структуры данных 2, содержащих более одного упомянутого первого логического раздела 111, и среди элементов 34 первой структуры данных 2, содержащих более одного упомянутого второго логического раздела 112, идентифицируют элементы 35 первой структуры данных 2, не имеющие логических связей между своими логическими разделами; и выполнением этапа 1025 формирования базы данных, на котором формируют базу данных 3 логических связей логических разделов элементов первой структуры данных. Для уточнения всех дополнительных (не только импликативных) имеющихся логических связей между логическими разделами (суждениями) все элементы первой структуры данных 2, а именно массивы элементов 21 предложений, содержащих первые логические разделы 111, и элементов 22 предложений, содержащих вторые логические разделы 112, необходимо сепарировать на группы элементов 31, 33 и 32, 34, содержащих либо только первые логические разделы 111, либо только вторые логические разделы 112, соответственно. При этом каждый элемент, входящий в массивы элементов 31, 33, содержащий первый логический раздел 111, идентифицируется как элемент 31, имеющий только один логический раздел 111, или как элемент 33, имеющий более одного логического раздела 111. При этом в случае наличия в идентифицированных элементах 31, 33 вторых логических разделов 112, вторые логические разделы 112 удаляются из идентифицированных элементов 31, 33. Полученные в итоге массивы элементов 31, 33 по-прежнему связаны с тем элементом, из которого они выделены и по этому основанию идентифицированы как отдельные элементы данного массива элементов. В свою очередь, каждый элемент, входящий в массивы элементов 32, 34, содержащий второй логический раздел 112, идентифицируется как элемент 32, имеющий только один логический раздел 112, или как элемент 34, имеющий более одного логического раздела 112. При этом в случае наличия в идентифицированных элементах 32, 34 первых логических разделов 111, первые логические разделы 111 удаляются из идентифицированных элементов 32, 34. Полученные в итоге массивы элементов 32, 34 по-прежнему связаны с тем элементом, из которого они выделены и по этому основанию идентифицированы как отдельные элементы данного массива элементов. Далее устанавливается характер логических связей между однотипными суждениями в элементах двух созданных массивов элементов 31, 33 и элементов 32, 34. В однотипных элементах массивов элементов 33, 34 выявляются логические связи между суждениями, указанные в таблицах 1 и 2, а именно - соединительные связи (конъюнкция), разделительные связи (дизъюнкция, строгая дизъюнкция), эквивалентные связи (эквиваленция). Конъюнктивное (соединительное) суждение - это сложное суждение, образованное из исходных суждений посредством логического союза «и», обозначаемого символом «^∧». Например, суждение: «Сегодня я пойду на лекцию по логике и в кино», - является «конъюнктивным суждением», состоящим из двух простых суждений (обозначим их A и B, соответственно) - «Сегодня я пойду на лекцию по логике» (А), «Сегодня я пойду в кино» (В). Данное сложное суждение можно представить формулой: «A^∧B», где A, B - элементы конъюнкции; «^∧» - символ логического союза - конъюнкция. В русском языке конъюнктивный логический союз выражается многими грамматическими союзами: «и», «а», «но», «да», «хотя», «однако», «а также…». Нередко подобные грамматические союзы заменяются знаками препинания - запятой, двоеточием, точкой с запятой. Дизъюнктивное (разделительное) суждение - это сложное суждение, образованное из «исходных» суждений посредством логического союза «или», обозначаемого символом «V». Например, суждение: «Право может способствовать экономическому развитию или препятствовать ему», - является дизъюнктивным суждением, состоящим из двух простых суждений: «Право может способствовать экономическому развитию» и «Право может препятствовать экономическому развитию». Соответственно, обозначив их через буквы A, B, такое суждение можно представить через формулу: «A V B». Поскольку связка «или» употребляется в двух разных значениях - неисключающем и исключающем, то различают слабую и сильную (строгую) дизъюнкции. Слабая дизъюнкция является истинной в тех случаях, когда истинно, по крайней мере, одно из составляющих ее суждений (или оба вместе) и ложна, когда оба составляющих ее суждения ложны (см. таблицу 2, столбец 4). Сильная дизъюнкция (символ «VV») отличается от слабой дизъюнкции тем, что ее составляющие исключают друг друга. Например, «Преступление может быть умышленным или по неосторожности». Для того, чтобы подчеркнуть строго разделительный, исключающий характер связи, в естественном языке используется усиленная двойная форма разделения: «…либо…, либо…», «…или…, или…», например: «Либо я найду путь, либо я проложу его». Строгая дизъюнкция истинна лишь тогда, когда одно из составляющих ее суждений истинно, а другое - ложно (см. таблицу 2, столбец 5). В результате выявляются все логические связи между элементами (суждениями) массивов элементов 33, 34. При этом допускается, что часть элементов данных массивов может не иметь логических связей друг с другом. Далее для уточнения вида суждений, которые не были до сих пор идентифицированы как сложные суждения, элементы массивов элементов должны быть подвергнуты идентификации на их соответствие отрицаемому суждению. Этому анализу должны быть подвергнуты массивы элементов 31, 32 и, частично, массивы элементов 33, 34, в которых остались суждения, не имеющие логических связей с другими суждениями. Отрицаемое суждение - это сложное суждение, образованное с помощью логического союза «неверно, что…» (или просто «не»), который, как правило, представлен знаком отрицания (символ «~»). В отличие от упомянутых выше бинарных союзов, такой союз относится к одному суждению. Прибавление этого союза к какому-либо суждению означает образование нового суждения, которое находится в определенной зависимости от исходного суждения - отрицаемое суждение истинно, если исходное суждение ложно, и наоборот (см. таблицу 2, столбцы 8, 9). Например, если исходное суждение: «Все свидетели правдивы», - то отрицаемое: «Неверно, что все свидетели правдивы». Если отдельный логический раздел (простое суждение) остается не идентифицированным с точки зрения логического характера суждения, то в результате лингвистическо-семантического анализа текста, окружающего предложения, в котором содержится такой раздел, может быть выявлен фактический контекстный вид данного простого суждения. Таким образом, первая структура данных содержит логические разделы предложений, формирующих исходную структуру данных. По итогам идентификации всех элементов первой структуры данных выявляются все логические разделы предложений (суждения), с точки зрения их наличия и характера связей суждений, формирующих сложные суждения. На основании выявленных характеров связей суждений (в том числе их отсутствия) формируется база данных 3 логических связей логических разделов (фиг.5).Figure 4 as an example, but not limitation, shows a General diagram of the steps of step 102 of forming a database 3 of logical connections of logical partitions. The step 102 of creating a database 3 of logical connections of logical partitions is characterized by the execution of step 1021 of identifying the elements of the first data structure 2, which identifies the elements 21 of the first data structure 2, containing one of the first logical partition 111, which are the elements 31 of the first data structure 2, and the elements 22 of a first data structure 2, comprising one of said second logical partitions 112, which are elements 32 of a first data structure 2; performing step 1022 of identifying the elements of the first data structure 2, which identifies the elements 21 of the first data structure containing more than one of the first logical partitions 111, which are the elements 33 of the first data structure 2 and the elements 22 of the first data structure containing more than one of the second logical partitions representing the elements 34 of the first data structure 2; by performing step 1023 of identifying the logical connections, where among the elements 33 of the first data structure 2 containing more than one of the first logical partitions 111 and the elements 34 of the first data structure 2 containing more than one of the second logical part 112, the logical relationships between the first logical partitions are identified 111 or logical connections between said second logical partitions 112; by performing step 1024 of identifying the absence of logical connections, where among the elements 33 of the first data structure 2 containing more than one of the first logical partitions 111, and among the elements 34 of the first data structure 2 containing more than one of the second logical partitions 112, the elements 35 of the first structure are identified 2 data that do not have logical connections between their logical partitions; and the implementation of step 1025 of the formation of the database, which form the database 3 logical relationships of logical sections of the elements of the first data structure. To clarify all additional (not only implicative) logical connections between logical partitions (judgments), all elements of the first data structure 2, namely arrays of elements of 21 sentences containing the first logical sections 111, and elements of 22 sentences containing the second logical sections 112, it is necessary to separate into groups of elements 31, 33 and 32, 34, containing either only the first logical partitions 111, or only the second logical partitions 112, respectively. Moreover, each element included in the arrays of elements 31, 33 containing the first logical partition 111 is identified as element 31 having only one logical partition 111, or as element 33 having more than one logical partition 111. In this case, if identified elements 31, 33 of the second logical partitions 112, the second logical partitions 112 are deleted from the identified elements 31, 33. The resulting arrays of elements 31, 33 are still associated with the element from which they are selected and identified on this basis as separate elements of a given array of elements. In turn, each element in the arrays of elements 32, 34 containing the second logical partition 112 is identified as element 32 having only one logical partition 112, or as element 34 having more than one logical partition 112. In this case, if any in the identified elements 32, 34 of the first logical partitions 111, the first logical partitions 111 are deleted from the identified elements 32, 34. The resulting arrays of elements 32, 34 are still associated with the element from which they are selected and for this reason identified ovany as separate elements of the array elements. Next, the nature of the logical relationships between the same judgments in the elements of two created arrays of elements 31, 33 and the elements 32, 34 is established. In the same elements of the arrays of elements 33, 34, logical relationships between the judgments shown in tables 1 and 2 are revealed, namely, connecting connections ( conjunction), separation bonds (disjunction, strict disjunction), equivalent bonds (equivalent). A conjunctive (connecting) proposition is a complex proposition formed from the initial propositions by means of a logical union “and”, denoted by the symbol “ ^∧ ”. For example, the proposition: “Today I will go to a lecture on logic and in cinema” is a “conjunctive proposition” consisting of two simple propositions (we denote them by A and B, respectively) - “Today I will go to a lecture on logic” (A ), “Today I will go to the cinema” (B). This complex proposition can be represented by the formula: “A ^∧ B”, where A, B are conjunction elements; " ^∧ " - a symbol of logical union - conjunction. In Russian, the conjunctive logical union is expressed by many grammatical unions: “and”, “a”, “but”, “yes”, “although”, “however”, “as well as ...”. Often such grammatical unions are replaced by punctuation marks - a comma, a colon, a semicolon. A disjunctive (dividing) proposition is a complex proposition formed from "initial" propositions by means of a logical union "or", denoted by the symbol "V". For example, the proposition: “The law can contribute to economic development or hinder it,” is a disjunctive proposition consisting of two simple judgments: “The law can contribute to economic development” and “The law can impede economic development”. Accordingly, having designated them through the letters A, B, such a judgment can be represented through the formula: “AVB”. Since the connective "or" is used in two different meanings - non-exclusive and exclusive, they distinguish between weak and strong (strict) disjunctions. A weak disjunction is true when at least one of its component judgments (or both together) is true and false when both its judgments are false (see table 2, column 4). A strong disjunction (the symbol “VV”) differs from a weak disjunction in that its components are mutually exclusive. For example, “Crime may be intentional or negligent.” In order to emphasize the strictly dividing, excluding the nature of the connection, a reinforced double form of separation is used in the natural language: "... either ..., or ...", "... or ..., or ...", for example: "Either I will find the way, or I will pave him". A strict disjunction is true only when one of its component judgments is true and the other is false (see table 2, column 5). As a result, all logical connections between the elements (judgments) of the arrays of elements 33, 34 are revealed. It is assumed that some of the elements of these arrays may not have logical connections with each other. Further, to clarify the type of judgments that have not yet been identified as complex judgments, elements of the arrays of elements should be identified for their conformity to the denied judgment. Arrays of elements 31, 32 and, in part, arrays of elements 33, 34, in which there are judgments that have no logical connections with other judgments, should be subjected to this analysis. A denied proposition is a complex proposition formed by the logical union “it is not true that ...” (or simply “not”), which is usually represented by a negative sign (“~” symbol). Unlike the binary unions mentioned above, such an alliance refers to one proposition. Adding this union to a proposition means the formation of a new proposition, which depends on the original proposition - the denied proposition is true if the original proposition is false, and vice versa (see table 2, columns 8, 9). For example, if the initial judgment is: “All witnesses are truthful,” then the denied: “It is not true that all witnesses are truthful.” If a separate logical section (simple judgment) remains unidentified from the point of view of the logical nature of the judgment, then as a result of linguistic-semantic analysis of the text surrounding the sentence containing such a section, the actual contextual form of this simple judgment can be revealed. Thus, the first data structure contains logical sections of sentences that form the original data structure. According to the results of identification of all elements of the first data structure, all logical sections of sentences (judgments) are identified, in terms of their presence and the nature of the relationships of judgments forming complex judgments. Based on the identified nature of the relationships of judgments (including their absence), a database of 3 logical relationships of logical sections is formed (Fig. 5).

На фиг.6 в качестве примера, но не ограничения, изображена общая схема выполнения этапов этапа 103 формирования второй структуры данных 4. Этап 103 формирования второй структуры данных 4 характеризуется выполнением этапа 1031 формирования логических конструкций, на котором формируют логические конструкции 41 логических разделов 111, 112 элементов 31, 32, 33, 34 первой структуры данных 2, используя информацию из базы данных 3 логических связей логических разделов элементов 31, 32, 33, 34 первой структуры данных 2 и логические разделы упомянутых элементов 31 первой структуры данных 2, содержащих один упомянутый первый логический раздел 111, и логические разделы упомянутых элементов 32 первой структуры данных 2, содержащих один упомянутый второй логический раздел 112; и выполнением этапа 1032 формирования второй структуры данных 4, на котором формируют вторую структуру данных 4, содержащую элементы 41 второй структуры данных 4, причем упомянутые элементы второй структуры данных 4 представляют собой сформированные логические конструкции 41 логических разделов 111, 112 элементов 31, 32, 33, 34 первой структуры данных 2. Логические конструкции 42 - это результат преобразования данных преобразуемого структурированного массива данных. Логические конструкции 41 формируются в соответствии со спецификой преобразуемого текста на естественном языке, в частности НПА. Специфика НПА состоит в том, что он содержит нормы права (правовые нормы). Также специфика НПА состоит в том, что в теории нормы права существуют понятия логической нормы права и юридической нормы права. Эти понятия не тождественны. Разница заключается в том, что логическая норма права включает в себя содержание всех элементов нормы права, установленных в юридической науке, включая гипотезу, диспозицию и санкцию, а юридическая норма права отражает конкретные нормативные предписания, содержащиеся в конкретных предложениях конкретных НПА. Фактически разница состоит в том, что одна конкретная логическая норма права может содержаться в конкретном множестве юридических норм права, т.е. в множестве нормативных предписаний. Логическая конструкция - это основа (каркас) основного нормативного предписания, содержащая два основных элемента конструкции, - «ситуацию» и «правило» (см. табл.3). Основное нормативное предписание (далее - нормативное предписание) является инструментом правового регулирования и включает в себя регулятивные и охранительные нормативные предписания. При этом под ситуацией в нормативных предписаниях понимается любая обусловленность правила, а под правилом понимаются любые правила, включая правила (модель) поведения субъектов правоотношений. Иными словами, ситуация - это суждения, имеющие логическую импликативную связь и являющиеся основаниями, а правила - это суждения, имеющие логическую импликативную связь и являющиеся следствиями. При формировании логических конструкций 41, т.е. нормативных предписаний, также необходимо учитывать, что каждый из элементов этой конструкции (и ситуация и правило) могут состоять как из одного суждения, так и из группы суждений. Для формирования логических конструкций необходимо использовать Базу данных логических связей логических разделов. Кроме выявленных логических связей между логическими разделами для формирования логической конструкции необходимо обратиться к правилам формирования логических конструкций. Правила формирования логических конструкций отражают требования юридической науки и юридической практики в отношении состава и структуры нормативного предписания (предписания). Например, условие того, что одно предписание не может содержать двух разных правил, приводит к тому, что в правилах устанавливается, что если в одном предложении содержатся два следствия, которые имеют логически слабую дизъюнктивную связь, то это значит, что эти суждения входят в разные правила и соответственно разные предписания. При этом если эти же два следствия имеют логически сильную дизъюнктивную связь, то это их объединяет в рамках одного сложного правила в рамках одного предписания. По существу, правила формирования логических конструкций 42 сводятся к допустимым сочетаниям логических связей между однотипными суждениями в рамках одного нормативного предписания.6, by way of example, but not limitation, the general flowchart of the steps of step 103 of generating the second data structure 4 is shown. Step 103 of the formation of the second data structure 4 is characterized by the execution of step 1031 of generating logical structures, in which logical structures 41 of logical sections 111 are formed, 112 elements 31, 32, 33, 34 of the first data structure 2, using information from the database 3 of logical connections of logical partitions of elements 31, 32, 33, 34 of the first data structure 2 and logical partitions of the mentioned elements 31 per 2 nd data structure comprising one said first logical partition 111 and logical partition of said data elements 32 of the first structure 2, comprising one said second logical partition 112; and performing step 1032 of generating a second data structure 4, wherein a second data structure 4 is formed comprising elements 41 of a second data structure 4, said elements of the second data structure 4 being formed logical structures 41 of logical partitions 111, 112 of elements 31, 32, 33 , 34 of the first data structure 2. Logical constructions 42 are the result of transforming the data of the transformed structured data array. Logical constructions 41 are formed in accordance with the specifics of the converted text in natural language, in particular, normative documents. The specificity of the legal acts is that it contains the rule of law (legal norms). Also, the specificity of the legal regulation is that in the theory of the rule of law there are concepts of a logical rule of law and a legal rule of law. These concepts are not identical. The difference lies in the fact that the logical rule of law includes the content of all elements of the rule of law established in legal science, including hypothesis, disposition and sanction, and the legal rule of law reflects the specific regulatory requirements contained in specific proposals of specific legal acts. In fact, the difference lies in the fact that one particular logical rule of law can be contained in a specific set of legal rules of law, i.e. in a multitude of regulatory requirements. The logical design is the basis (framework) of the main regulatory requirement, containing two basic structural elements - “situation” and “rule” (see table 3). The main regulatory requirement (hereinafter referred to as the regulatory requirement) is an instrument of legal regulation and includes regulatory and protective regulatory requirements. In this case, the situation in the regulatory requirements means any conditionality of the rule, and the rule means any rules, including the rules (model) of behavior of subjects of legal relations. In other words, a situation is a judgment having a logical implicative connection and being a basis, and a rule is a judgment having a logical implicative connection and being a consequence. In the formation of logical structures 41, i.e. regulatory requirements, it is also necessary to take into account that each of the elements of this design (both the situation and the rule) can consist of both a single judgment and a group of judgments. For the formation of logical constructions, it is necessary to use the database of logical connections of logical partitions. In addition to the identified logical connections between logical sections for the formation of a logical structure, you must refer to the rules for the formation of logical structures. The rules for the formation of logical constructions reflect the requirements of legal science and legal practice in relation to the composition and structure of a normative prescription (prescription). For example, the condition that one prescription cannot contain two different rules leads to the fact that the rules establish that if one sentence contains two consequences that have a logically weak disjunctive connection, this means that these judgments are different rules and accordingly different regulations. Moreover, if the same two consequences have a logically strong disjunctive connection, then this unites them in the framework of one complex rule within the framework of one prescription. In essence, the rules for the formation of logical constructions 42 are reduced to admissible combinations of logical connections between the same type of judgments within the framework of a single normative prescription.

При этом если в предложении имеются несколько ситуаций, объединенных логикой «ИЛИ» (слабая дизъюнктивная связь), то это означает, что каждая из таких ситуаций формирует собственные отдельные предписания с теми же правилами, которые использовались в первом предписании. Это означает, что несколько ситуаций с такой логикой «ИЛИ» не могут быть в одном предписании. Кроме того, в предложении имеются несколько правил, объединенных логикой «ИЛИ» (слабая дизъюнктивная связь). Это означает, что каждое из таких правил формирует собственные отдельные предписания с теми же ситуациями, которые использовались в первом предписании. Это означает, что несколько «правил» с такой логикой «ИЛИ» не могут быть в одном предписании. Сформированная вышеуказанным образом вторая структура данных содержит такие элементы, как суждения (логический раздел элемента исходной структуры данных) и нормативные предписания (логическая конструкция 41 логических разделов элемента исходной структуры данных) (фиг.7). При этом суждения идентифицированы по наличию импликативной логической связи на как два основных логических раздела:Moreover, if a sentence contains several situations united by the logic “OR” (weak disjunctive connection), this means that each of these situations forms its own separate instructions with the same rules that were used in the first order. This means that several situations with such “OR” logic cannot be in the same prescription. In addition, the proposal has several rules, united by the logic “OR” (weak disjunctive connection). This means that each of these rules forms its own separate regulations with the same situations that were used in the first regulation. This means that several “rules” with such “OR” logic cannot be in one prescription. The second data structure formed in the above manner contains elements such as judgments (logical section of the element of the original data structure) and regulatory requirements (logical structure 41 of logical sections of the element of the original data structure) (Fig. 7). In this case, judgments are identified by the presence of an implicative logical connection into two main logical sections:

1) основания (первый логический раздел элемента исходной структуры данных, содержащий логическую импликативную связь, связь 1-го типа, вида А);1) foundations (the first logical section of the element of the original data structure containing a logical implicative connection, a connection of the 1st type, type A);

2) следствия (второй логический раздел элемента исходной структуры данных, содержащий логическую импликативную связь, связь 1-го типа, вида В);2) consequences (the second logical section of the element of the original data structure, containing a logical implicative connection, a connection of the 1st type, type B);

При этом основания и следствия дополнительно идентифицированы также по факту выявления иных логических связей между однотипными импликативными суждениями в рамках одного предложения как дополнительные логические разделы:In this case, the grounds and consequences are additionally identified also by the fact of identifying other logical connections between the same implicative judgments within the framework of one sentence as additional logical sections:

1) суждения «И» (логический раздел элемента исходной структуры данных, содержащий логическую конъюнктивную (соединительную) связь, связь 2-го типа);1) judgments “AND” (the logical section of the element of the original data structure containing the logical conjunctive (connecting) connection, the connection of the 2nd type);

2) суждения «ИЛИ» (логический раздел элемента исходной структуры данных, содержащий логическую слабую дизъюнктивную (разделительную) связь, связь 3-го типа);2) “OR” judgments (the logical section of the element of the original data structure containing the logical weak disjunctive (dividing) connection, type 3 communication);

3) суждения «ИЛИ*» (логический раздел элемента исходной структуры данных, содержащий логическую сильную дизъюнктивную (разделительную) связь, связь 4-го типа).3) “OR *” judgments (a logical section of an element of the original data structure containing a logical strong disjunctive (separation) connection, a type 4 connection).

Кроме того, вышеуказанные разделы могут быть отдельно идентифицированы как «отрицаемые суждения» (логический раздел элемента исходной структуры данных, содержащий отрицаемую логическую связь, связь 5-го типа).In addition, the above sections can be separately identified as “denied judgments” (a logical section of an element of the original data structure containing a denied logical link, a type 5 link).

На фиг.8 в качестве примера, но не ограничения, изображена общая схема выполнения этапов этапа 104 формирования базы данных 5 семантических частей. Этап 104 формирования базы данных 5 семантических частей характеризуется выполнением этапа 1041 идентификации логических разделов, на котором идентифицируют первые логические разделы 411 элементов 41 второй структуры данных 4 и вторые логические разделы 412 элементов второй структуры данных 4; выполнением этапа 1042 идентификации семантических частей, на котором в упомянутых первых логических разделах 411 и вторых логических разделах 412 элементов второй структуры данных 4 идентифицируют первые семантические части 4110 и вторые семантические части 4120; и выполнением этапа 1043 идентификации особых семантических частей, на котором в упомянутых первых и вторых логических разделах 411, 412 элементов 41 второй структуры данных 4 идентифицируют, по меньшей мере, особые семантические части 4111 первых логических разделов 411 элементов 41 второй структуры данных 4 и особые семантические части 4121 вторых логических разделов 412 элементов 41 второй структуры данных 4 и формируют базу данных 5 особых семантических частей логических разделов элементов 41 второй структуры данных 4 путем перемещения упомянутых особых семантических частей 4111, 4121 в упомянутую формируемую базу данных 5 особых семантических частей логических разделов элементов 41 второй структуры данных 4 (фиг.9). Сформированные во второй структуре данных логические конструкции 41 являются каркасом и основой нормативного предписания, но по-прежнему не соответствуют ему полностью. Для достижения максимального соответствия структуры логических конструкций 41 структуре нормативного предписания необходимо провести комплексный семантический анализ логических разделов 411, 412, включающий как минимум синтаксический и логический разборы терминов и понятий, выявление взаимосвязей между понятиями суждения и между терминами сложных понятий. Целью данного семантического анализа является выявление и идентификация в логических разделах 411, 412 логических конструкций 41 второй структуры данных 4 ряда специфических частей (вторых частей) логических разделов, которые приводят к:On Fig as an example, but not limitation, shows a General diagram of the stages of step 104 of the formation of the database 5 semantic parts. The step 104 of the formation of the database of 5 semantic parts is characterized by the execution of step 1041 identification of logical partitions, which identify the first logical partitions 411 elements 41 of the second data structure 4 and the second logical partitions 412 elements of the second data structure 4; performing step 1042 of identifying the semantic parts, in which in the first logical sections 411 and second logical sections 412 of the elements of the second data structure 4, the first semantic parts 4110 and the second semantic parts 4120 are identified; and by performing step 1043 of identifying the specific semantic parts, in which at least the specific semantic parts 4111 of the first logical sections 411 of the elements 41 of the second data structure 4 and the special semantic are identified in the first and second logical sections 411, 412 of the elements 41 of the second data structure parts 4121 of the second logical partitions 412 of the elements 41 of the second data structure 4 and form a database 5 of special semantic parts of the logical partitions of the elements 41 of the second data structure 4 by moving the of the general semantic parts 4111, 4121 to the said generated database 5 of special semantic parts of the logical sections of the elements 41 of the second data structure 4 (Fig. 9). The logical structures 41 formed in the second data structure are the framework and the basis of the normative prescription, but still do not fully comply with it. To achieve maximum compliance with the structure of logical constructions 41 to the structure of a normative prescription, it is necessary to conduct a comprehensive semantic analysis of logical sections 411, 412, including at least syntactic and logical analysis of terms and concepts, identifying the relationships between concepts of judgment and between terms of complex concepts. The purpose of this semantic analysis is to identify and identify in logical sections 411, 412 logical structures 41 of the second data structure 4 rows of specific parts (second parts) of logical sections that lead to:

1) смешению основных понятий правовой нормы - к смешению ситуаций и правил путем включения в суждение различных обусловленностей;1) a mixture of the basic concepts of a legal norm - to a mixture of situations and rules by including various conditions in the judgment;

2) размытию - расфокусированию смысла суждений путем включения в суждение различных качественных и количественных уточнений и детализаций.2) blurring - defocusing the meaning of judgments by including various qualitative and quantitative refinements and details into the judgment.

На данном этапе осуществляется выявление специфических частей, т.е. выявление в логических разделах 411, 412 логических конструкций 41 второй структуры данных первых и вторых семантических частей 4110, 4120 логических разделов 411, 412. Причем первые семантические части 4110 формируются путем удаления из логических разделов 411, 412 вторых семантических частей 4120 (специфических частей). Первая семантическая часть 4110 логического раздела - это смысловое ядро суждения, т.е. суждение, очищенное от специфических частей. Смысловым ядром суждения являются основные элементы суждения, такие как субъект суждения, предикат суждения и связка. Особенность связки состоит в том, что связки является частью смыслового ядра суждения только тогда, когда она не может быть истолкована в «объемном плане», в случаях, когда связка раскрывает включение (или исключение) подкласса в класс объектов или принадлежность (непринадлежность) элемента классу. Например, в суждении: «Преступление есть противоправное деяние», - субъектом суждения является слово «преступление», предикатом суждения словосочетание - «противоправное деяние», а связкой - слово «есть». Вторые семантические части 4120 логического раздела - это понятия суждения, которые идентифицированы как признаки субъекта суждения, предиката суждения, а также термины суждения - связка (когда она может быть истолкована в «объемном плане») и кванторное слово, а также иные, особые части. Например, понятие (субъект суждения) «преступление, предусмотренное Уголовным Кодексом», содержит понятие - слово «преступление» и признак понятия - словосочетание «предусмотренное Уголовным Кодексом». Признаки понятия - это содержание понятия, указывающее на наличие или отсутствие у него того или иного свойства, состояния или отношения. Иначе говоря, признаком понятия является все то, в чем понятия могут быть сходны или отличны друг от друга. Все признаки понятия, образующие содержание понятий идентифицируются как существенные и несущественные по принципу утраты своего качества (невозможности быть самим собой) без данного признака. Например, в суждении: «Преступление есть противоправное деяние», - предикатом суждения является понятие «противоправное деяние» (В), являющееся сложным понятием или «отношением», в котором «деяние» является субъектом суждения (А), а понятие «противоправное» - является признаком А. На примере «отношения» «противоправное деяние» видно, что в нем находятся «понятия», у которых объем одного полностью входит в объем другого, но не исчерпывает его. Иначе говоря, все элементы объема (В) являются элементами объема (А), но не наоборот. Вид таких отношений - «подчинение», т.е. родовидовое отношение, где более общее «понятие» выступает родом, а менее общее - видом. Кванторное слово указывает, относится ли информация о предикате суждения ко всему объему понятия, выражающему субъект суждения, или к его части. Например, в суждении: «Всякое преступление есть противоправное деяние», - кванторное слово «всякое» указывает, что информация о предмете суждения (словосочетание «противоправное деяние») относится ко всему объему (к каждому элементу объема) предмета суждения - слова «преступление». Под иными, особыми частями понимаются такие отдельные понятия и группы понятий суждения, которые также уточняют значение понятий, составляющих первую семантическую часть 4110 логических разделов 41, но формально не относятся к признакам понятия, связке или кванторному слову. Например, в суждении: «Преступление (в т.ч. мошенничество, кража, убийство) есть противоправное деяние», - понятия, указанные в круглых скобках («мошенничество», «кража», «убийство») являются реальными (жизненными) примерами понятия «преступление». Признак понятия «противоправное» является существенным для понятия «деяние», поскольку без него понятие «противоправное деяние» утрачивает свое качество, перестает быть самим собой. Объем понятия - это класс (множество) мыслимых в понятии сущностей. Признаки понятия и объем понятия взаимосвязаны в рамках каждого понятия. Эта взаимосвязь позволяет установить реальный объем понятия, т.е. то, что реально подразумевается в смысловом содержании понятия. Для достижения максимального соответствия структуры логических конструкций 41 структуре нормативного предписания необходимо сепарировать виды вторых семантических частей 4120. С технической точки зрения, идентифицированные в процессе комплексного семантического анализа иные, особые части не являются элементом основного (регулятивного или охранительного) нормативного предписания. В связи с этим они удаляются из логических разделов и формируют базу данных 5 особых семантических частей, представляющую собой нормативный справочник или иной нормативный справочный материал, представляющий собой множество особых нормативных предписаний. Информация из такого справочника является доступной и актуальной, но структурно и методологически она находится за рамками основных (регулятивных и охранительных) нормативных предписаний (фиг.9). С точки зрения юридической науки особые нормативные предписания представляют собой предписания, устанавливающие основные принципы, механизмы, порядок и цели правового регулирования общественных отношений, закрепляют правовые категории и понятия (например, дефинитивные предписания - предписания, закрепляющие в обобщенном виде признаки того или иного юридического понятия).At this stage, the identification of specific parts, i.e. the identification in logical sections 411, 412 of logical structures 41 of the second data structure of the first and second semantic parts 4110, 4120 of logical sections 411, 412. Moreover, the first semantic parts 4110 are formed by removing from the logical sections 411, 412 second semantic parts 4120 (specific parts). The first semantic part 4110 of the logical section is the semantic core of judgment, i.e. judgment cleared of specific parts. The semantic core of judgment is the basic elements of judgment, such as the subject of the judgment, the predicate of the proposition, and the connective. The peculiarity of the connective is that the connective is only part of the semantic core of the judgment when it cannot be interpreted in a “three-dimensional plan”, in cases where the connective discloses the inclusion (or exclusion) of a subclass in the class of objects or the belonging (non-affiliation) of an element to the class . For example, in the judgment: “Crime is an unlawful act”, the subject of the judgment is the word “crime”, the predicate of the judgment is the phrase “unlawful act”, and the link is the word “is”. The second semantic parts of the 4120 logical section are the concepts of judgment, which are identified as signs of the subject of the judgment, the predicate of judgment, as well as the terms of the judgment — a connective (when it can be interpreted in the “three-dimensional plane”) and a quantifier word, as well as other, special parts. For example, the concept (subject of judgment) “crime provided for by the Criminal Code” contains the concept - the word “crime” and the sign of the concept - the phrase “provided for by the Criminal Code”. Signs of a concept - this is the content of a concept indicating the presence or absence of one or another property, state or relationship. In other words, a sign of a concept is all that in which the concepts can be similar or different from each other. All signs of a concept that form the content of concepts are identified as significant and non-essential by the principle of loss of one's quality (inability to be oneself) without this attribute. For example, in a judgment: “Crime is an unlawful act”, the predicate of a judgment is the concept of “unlawful act” (B), which is a complex concept or “attitude” in which “act” is the subject of judgment (A), and the concept of “unlawful” - is a sign of A. On the example of “relationship”, “illegal act” shows that it contains “concepts” in which the volume of one is fully included in the volume of the other, but does not exhaust it. In other words, all elements of volume (B) are elements of volume (A), but not vice versa. The type of such relations is “submission”, i.e. generic relation, where the more general “concept” acts as a genus, and the less general - as a species. A quantifier word indicates whether information about a predicate of a judgment applies to the whole volume of a concept expressing a subject of a judgment, or to a part of it. For example, in the judgment: “Every crime is an unlawful act”, the quantifier word “any” indicates that information about the subject of the judgment (the phrase “unlawful act”) refers to the entire volume (to each element of the volume) of the subject of the judgment - the word “crime” . Other, special parts are understood to mean such separate concepts and groups of concepts of judgment, which also clarify the meaning of concepts that make up the first semantic part 4110 of logical sections 41, but do not formally relate to the features of a concept, a bunch, or a quantifier word. For example, in the judgment: “Crime (including fraud, theft, murder) is an unlawful act”, the terms indicated in parentheses (“fraud”, “theft”, “murder”) are real (life) examples concepts of "crime". The sign of the concept of “unlawful” is essential for the concept of “act”, since without it the concept of “unlawful act” loses its quality, ceases to be itself. The scope of a concept is a class (set) of entities conceivable in a concept. The signs of the concept and the scope of the concept are interconnected within each concept. This relationship allows us to establish the real scope of the concept, i.e. what is really implied in the semantic content of the concept. In order to achieve maximum conformity of the structure of logical constructions 41 to the structure of the normative prescription, it is necessary to separate the types of the second semantic parts 4120. From a technical point of view, other, special parts identified in the process of complex semantic analysis are not an element of the main (regulatory or protective) normative prescription. In this regard, they are removed from the logical sections and form a database of 5 special semantic parts, which is a regulatory reference or other regulatory reference material, which represents a lot of special regulatory requirements. Information from such a reference is accessible and relevant, but structurally and methodologically it is beyond the scope of the basic (regulatory and protective) regulatory requirements (Fig. 9). From the point of view of legal science, special regulatory prescriptions are prescriptions that establish the basic principles, mechanisms, order and goals of the legal regulation of public relations, consolidate legal categories and concepts (for example, definitive prescriptions - prescriptions that consolidate in a generalized form the attributes of a legal concept) .

На фиг.10 в качестве примера, но не ограничения, изображена общая схема выполнения этапов этапа 105 формирования грамматически и орфографически верных семантических частей, на котором формируют грамматически и орфографически верные семантические части упомянутых логических разделов 41 путем лингвистических преобразований над упомянутыми семантическими частями. Этап 105 включает выполнение этапа 1051 идентификации уточняющих структур, на котором в упомянутых вторых семантических частях 4120 упомянутых вторых логических разделов 412 элементов 41 второй структуры данных 4 идентифицируют, по меньшей мере, уточняющие структуры вторых семантических 4122 частей вторых логических разделов 412; и выполнением этапа 1052 лингвистических преобразований, на котором осуществляют лингвистические преобразования над всеми семантическими частями, за исключением упомянутых особых семантических частей 4111, 4121 упомянутых первых и вторых логических разделов 411, 412, для формирования грамматически и орфографически верных семантических частей 4123 и уточняющих структур 4122 логических разделов элементов 41 второй структуры данных 4. Общая схема полученной второй структуры данных 4 представлена на фиг.11. Для достижения максимального соответствия структуры логических конструкций 41 структуре нормативного предписания необходимо дополнительно идентифицировать оставшиеся виды вторых семантических частей. Дополнительная идентификация также производится в рамках комплексного семантического анализа. Предметом анализа будет являться массив значений, выявленных указанных видов вторых семантических частей 4120, т.е. массив понятий, содержащихся в логических разделах и идентифицированных в качестве соответствующих видов. Каждое понятие данных массивов должно быть идентифицировано с точки зрения его принадлежности к уточнениям 4124 или к зависимостям 4125. При этом уточнением является такая характеристика понятия, которая осуществляет переход от более широкого понятию к более узкому, а зависимости содержат признаки юридического факта, т.е. некого события, при наличии (отсутствии) которого понятие, к которому относится зависимость, актуализируется или наоборот становится неактуальным. Лингвистические преобразования над всеми семантическими частями логических разделов связаны с восстановлением правильной грамматики и орфографии отдельных семантических частей, которое потребуется в связи с фактическим разделением текста предложений на отдельные части - семантические части 4110, 4120 логических разделов, и с учетом удаления особых семантических разделов 4111, 4121 из указанного текста. Под указанными лингвистическими преобразованиями понимается, в частности, согласование родов, чисел, падежей, правка (замена и удаление) несоответствующих знаков препинания.Figure 10 shows, by way of example, but not limitation, the general flowchart of the steps of step 105 of forming grammatically and spelling-correct semantic parts, on which grammatically and spelling-correct semantic parts of said logical sections 41 are formed by linguistic transformations over said semantic parts. Step 105 includes performing a refinement structure identification step 1051, wherein at least second refinement structures of the second semantic 4122 parts of the second logical partitions 412 are identified in said second semantic parts 4120 of said second logical partitions 412 of elements of the second data structure 4; and performing step 1052 of linguistic transformations, where linguistic transformations are performed on all semantic parts, with the exception of the mentioned special semantic parts 4111, 4121 of the mentioned first and second logical sections 411, 412, to form grammatically and spelling-correct semantic parts 4123 and refinement structures 4122 of logical sections of the elements 41 of the second data structure 4. The General scheme of the obtained second data structure 4 is presented in Fig.11. To achieve maximum compliance with the structure of logical constructions 41 to the structure of a normative prescription, it is necessary to additionally identify the remaining types of the second semantic parts. Additional identification is also carried out as part of a comprehensive semantic analysis. The subject of analysis will be an array of values identified by the indicated types of the second semantic parts 4120, i.e. an array of concepts contained in logical sections and identified as corresponding species. Each concept of these arrays should be identified from the point of view of its belonging to refinements 4124 or to dependencies 4125. The refinement is such a characteristic of the concept that carries out the transition from a broader concept to a narrower one, and the dependencies contain signs of a legal fact, i.e. a certain event, in the presence (absence) of which the concept to which the dependence refers is actualized or vice versa becomes irrelevant. Linguistic transformations over all semantic parts of logical sections are associated with the restoration of the correct grammar and spelling of individual semantic parts, which will be required in connection with the actual division of the text of sentences into separate parts - semantic parts 4110, 4120 of logical sections, and taking into account the removal of special semantic sections 4111, 4121 from the specified text. By the indicated linguistic transformations we mean, in particular, the coordination of childbirth, numbers, cases, editing (replacement and removal) of inappropriate punctuation marks.

На фиг.12 в качестве примера, но не ограничения, изображена общая схема выполнения этапов этапа 106 формирования итоговой структуры данных 6, на котором формируют итоговую структуру данных 6 структурированного массива данных, содержащую элементы 61 упомянутой итоговой структуры данных 6, причем упомянутые элементы 61 итоговой структуры данных содержат логические конструкции 61, содержащие, по меньшей мере, упомянутые грамматически и орфографически верные семантические части 4123 логических разделов. Этап 106 характеризуется выполнением этапа 1061 формирования смысловых сочетаний, на котором формируют из первых грамматически и орфографически верных семантических частей 4123 вторых логических разделов 412 элементов 41 второй структуры данных 4 и упомянутых грамматически и орфографически верных уточняющих структур 4122 вторых семантических частей 4120 вторых логических разделов 412 элементов 41 второй структуры данных 4 смысловые сочетания 611 грамматически и орфографически верных семантических частей 4122, 4123 вторых логических разделов 412 элементов 41 второй структуры данных 4; и выполнением этапа 1062 формирования итоговой структуры данных 6, на котором формируют итоговую структуру данных 6, содержащую элементы 61 итоговой структуры данных 6, причем упомянутые элементы 61 итоговой структуры данных 6 представляют собой логические конструкции 61, содержащие упомянутые грамматически и орфографически верные семантические части 4122, 4123 логических разделов элементов 41 второй структуры данных 4. Возможна также ситуация, когда логические конструкции 61 из упомянутой итоговой структуры данных 6 дополнительно содержат упомянутые сформированные смысловые сочетания 611 грамматически и орфографически верных семантических частей 4122, 4123 вторых логических разделов 412 элементов 41 второй структуры данных 4 (фиг.13). Итоговая общая структура элемента 61 итоговой структуры данных 6 представлена на фиг.14. Основой нормативного предписания является юридическое правило (правило), которое может быть максимально корректно сформировано в результате проведения комплексного семантического анализа логических разделов логических конструкций. На этапе формирования логических конструкций большая часть обусловленностей была отделена от правила (второго логического раздела 412) и выделена в отдельный раздел - первый логический раздел 411. По результатам комплексного семантического анализа было выявлено смысловое ядро правила (второго логического раздела 412) и остатки обусловленностей были также выделены во вторые семантические части 4120, в которых некоторые из указанных частей идентифицированы как уточнение 4124. В результате всех преобразований стало возможным в структуре логических конструкций 61 создать смысловые сочетания 611, т.е. сочетания из первых семантических частей 4110 вторых логических разделов 412 и вторых семантических частей 4120 вторых логических разделов 412, идентифицированных как уточнение 4124. Данные смысловые сочетания являются юридическими правилами. Итоговая структура данных представляет собой структурированную конструкцию, элементы которой максимально соответствуют структуре нормативного предписания. Итоговая структура данных сформирована в таком виде с целью упрощения и регламентации профессиональной работы по созданию и корректировке НПА. Итоговая структура данных представляет собой конструкцию, которая позволяет буквально визуализировать нормативные предписания, увидеть все фактические элементы смысловой конструкции, что позволяет проводить их многосторонний полноценный анализ с целью проведения точечных корректировок как существующих нормативных предписаний, так и проектов предписаний на разных стадиях их создания.12, by way of example, but not limitation, a general flowchart of the steps of step 106 of generating the final data structure 6 is shown, in which the final data structure 6 of the structured data array containing the elements 61 of the said final data structure 6 is formed, said elements 61 of the final data structures contain logical constructs 61 containing at least the grammatically and spelling-correct semantic parts 4123 of logical partitions mentioned. Step 106 is characterized by the execution of the semantic combination step 1061, in which the second grammatical and spelling correct semantic parts 4123 of the second logical sections 412 of the elements 41 of the second data structure 4 and the grammatically and spelling correct refinement structures 4122 of the second semantic parts 4120 of the second logical sections of the 412 elements are formed 41 second data structures 4 semantic combinations 611 grammatically and spelling-correct semantic parts 4122, 4123 second logical sections 412 element s 41 of the second data structure 4; and performing step 1062 of forming the final data structure 6, which forms the final data structure 6 containing the elements 61 of the final data structure 6, said elements 61 of the final data structure 6 being logical structures 61 containing the grammatically and spelling-correct semantic parts 4122, 4123 logical partitions of the elements 41 of the second data structure 4. It is also possible that the logical constructs 61 of said final data structure 6 further comprise the old semantic combinations 611 of grammatically and spelling-correct semantic parts 4122, 4123 of the second logical sections 412 of the elements 41 of the second data structure 4 (Fig. 13). The final overall structure of element 61 of the final data structure 6 is shown in FIG. The basis of a normative prescription is a legal rule (rule), which can be formed as correctly as possible as a result of a comprehensive semantic analysis of logical sections of logical structures. At the stage of formation of logical constructions, most of the conditions were separated from the rule (second logical section 412) and separated into a separate section - the first logical section 411. Based on the results of complex semantic analysis, the semantic core of the rule (second logical section 412) was identified and the rest of the conditions were also highlighted in the second semantic parts 4120, in which some of these parts are identified as refinement 4124. As a result of all the transformations, it became possible in the logical structure of their constructions 61 to create semantic combinations 611, i.e. combinations of the first semantic parts 4110 of the second logical sections 412 and the second semantic parts 4120 of the second logical sections 412, identified as refinement 4124. These semantic combinations are legal rules. The final data structure is a structured structure, the elements of which correspond to the structure of the regulatory prescription as much as possible. The final data structure is formed in this form in order to simplify and regulate professional work on the creation and adjustment of legal acts. The final data structure is a design that allows you to literally visualize regulatory requirements, see all the actual elements of the semantic design, which allows them to be comprehensively analyzed in order to make precise adjustments to both existing regulatory requirements and draft regulations at different stages of their creation.

На фиг.15 в качестве примера, но не ограничения, проиллюстрирована примерная схема заявленной системы 200 преобразования структурированного массива данных, которая в предпочтительном варианте реализации содержит, по меньшей мере, одно или более устройств 201 преобразования структурированного массива данных, содержащих, по меньшей мере, один или более процессоров 2011, один или более модулей ввода/вывода (I/O) 2012 и память 2013. Упомянутые устройства 201 преобразования структурированного массива данных могут представлять собой, но не ограничиваться: персональный компьютер, портативный компьютер, планшетный компьютер, карманный компьютер, смартфон, тонкий клиент и тому подобное. Память (машиночитаемый носитель данных) 2013 устройства 201 преобразования структурированного массива данных, содержит код программы, который при выполнении побуждает упомянутые один или более процессоры 2011 упомянутого устройства 201 и/или устройства 201, связанного с упомянутым устройством 201, выполнять действия описанного выше способа преобразования структурированного массива данных, и содержит подлежащие преобразованию один или несколько структурированных массивов данных, содержащих, по меньшей мере, текст на естественном языке. Более того, подлежащие преобразованию один или несколько структурированных массивов данных могут являться загружаемыми и храниться, в частности, в базе данных 203 системы преобразования структурированного массива данных. В качестве примера, но не ограничения, машиночитаемый носитель данных может включать в себя оперативную память (RAM); постоянное запоминающее устройство (ROM); электрически-стираемое программируемое постоянное запоминающее устройство (EEPROM); флэш-память или другие технологии памяти; CDROM, цифровой универсальный диск (DVD) или другие оптические или голографические носители данных; магнитные кассеты, магнитную пленку, запоминающее устройство на магнитных дисках или другие магнитные запоминающие устройства, несущие волны или другой носитель данных, который может быть использован для кодирования требуемой информации, и к которому может быть осуществлен доступ посредством описываемого устройства. Память включает в себя носитель данных на основе запоминающего устройства компьютера в форме энергозависимой или энергонезависимой памяти, или их комбинации. Примерные аппаратные устройства включают в себя твердотельную память, накопители на жестких дисках, накопители на оптических дисках и т.д. В памяти хранится примерная среда, в которой при помощи компьютерных команд или кодов, хранящихся в памяти устройства, может быть осуществлена процедура преобразования структурированного массива данных. Устройство содержит один или более процессоров 2011, которые предназначены для выполнения компьютерных команд или кодов, хранящихся в памяти устройства с целью обеспечения выполнения процедуры преобразования структурированного массива данных. Компьютерные команды или коды, хранящиеся в памяти, предназначены для выполнения преобразования структурированного массива данных. Эти команды и коды включают, по меньшей мере, команды формирования первой структуры данных структурированного массива данных, команды формирования базы данных логических связей, команды формирования второй структуры данных структурированного массива данных, команды формирования базы данных семантических частей логических разделов, команды формирования грамматически и орфографически верных семантических частей, команды формирования итоговой структуры данных структурированного массива данных и соответствующие команды, предназначенные для выполнения упомянутых команд. Модули I/O 2012 устройства 201 представляют собой, не ограничиваясь, типичные и известные из уровня техники средства управления устройством: манипулятор типа «мышь», клавиатура, джойстик, тачпад, трекбол, электронное перо, стилус, сенсорный дисплей и тому подобное. Также модули I/O 2012 представляют собой, не ограничиваясь, типичные и известные из уровня техники средства демонстрирования информации: дисплей, монитор, проектор, принтер, графопостроитель и тому подобное. Система 200 также может включать в себя базу данных (БД) 202. БД 202 может представлять собой, но не ограничиваясь: иерархическую БД, сетевую БД, реляционную БД, объектную БД, объектно-ориентированную БД, объектно-реляционную БД, пространственную БД, комбинацию перечисленных двух и более БД, и тому подобное. БД 202 хранит данные в памяти, которая может представлять собой, но не ограничиваясь: постоянное запоминающее устройство (ROM), электрически-стираемое программируемое постоянное запоминающее устройство (EEPROM), флэш-память, CDROM, цифровой универсальный диск (DVD) или другие оптические или голографические носители данных; магнитные кассеты, магнитную пленку, запоминающее устройство на магнитных дисках или другие магнитные запоминающие устройства, несущие волны или другой носитель данных, который может быть использован для хранения требуемой информации и к которому может быть осуществлен доступ посредством устройства 201 преобразования структурированного массива данных и сервера 203. БД 202 служит для хранения данных, представляющих собой, по меньшей мере, команды формирования первой структуры данных структурированного массива данных, команды формирования базы данных логических связей, команды формирования второй структуры данных структурированного массива данных, команды формирования базы данных семантических частей логических разделов, команды формирования грамматически и орфографически верных семантических частей, команды формирования итоговой структуры данных структурированного массива данных, и соответствующие команды, предназначенные для выполнения упомянутых команд; подлежащие преобразованию один или несколько структурированных массивов данных, содержащих, по меньшей мере, текст на естественном языке, которые могут быть загружены в память 2013 устройства 201 преобразования структурированного массива данных; и других данных, необходимых для функционирования системы. Примерная система 200 преобразования структурированного массива данных дополнительно содержит серверное вычислительное устройство (сервер) 203, которое сохраняет и содействует манипуляции компьютерными командами или кодами, ранее описанными в данном документе, которые, соответственно, дополнительно не описываются. Сервер 203 может представлять собой: персональный компьютер, портативный компьютер, планшетный компьютер, карманный компьютер, смартфон, машину баз данных и тому подобное. Сервер 203 обеспечивает регулирование обменом данных в системе 200 преобразования структурированного массива данных, а также обеспечивает обработку данных при условии подключения к нему одного или более чем одного устройств 201 преобразования структурированного массива данных или когда устройство 201 преобразования структурированного массива данных представляет собой тонкий клиент. В этом случае все вычислительные мощности, необходимые для обеспечения выполнения процедуры преобразования структурированного массива данных, расположены на сервере 203. Система 200 также содержит одну или более сетей 204 передачи данных. Сети 204 передачи данных могут включать в себя, но не ограничиваться, одну или более локальных сетей (LAN) и/или глобальных сетей (WAN), или могут представлять собой информационно-телекоммуникационную сеть Интернет, или Интранет, или виртуальную частную сеть (VPN), или их комбинацию, и тому подобное. Сервер 203 также имеет возможность обеспечивать виртуальную вычислительную среду (Virtual Machine) для обеспечения взаимодействия между устройством 201 преобразования структурированного массива данных и БД 202. Сеть 204 служит для обеспечения взаимодействия между устройством 201, базой данных 202 и сервером 203 системы 200 преобразования структурированного массива данных.15, by way of example, but not limitation, an exemplary diagram of the inventive structured data array conversion system 200 is illustrated, which in a preferred embodiment includes at least one or more structured data array conversion devices 201 comprising at least one or more processors 2011, one or more input / output (I / O) modules 2012, and memory 2013. Said structured data array conversion devices 201 may be, but not limited to : personal computer, laptop computer, tablet computer, PDA, smartphone, thin client and the like. The memory (computer-readable storage medium) 2013 of the structured data array conversion device 201 contains a program code which, when executed, causes said one or more processors 2011 of said device 201 and / or device 201 associated with said device 201 to perform the steps of the structured data conversion method described above data array, and contains one or more structured data arrays to be converted, containing at least natural language text. Moreover, one or more structured data arrays to be converted may be downloadable and stored, in particular, in the database 203 of the structured data array transformation system. By way of example, but not limitation, a computer-readable storage medium may include random access memory (RAM); read-only memory device (ROM); Electrically Erasable Programmable Read-Only Memory (EEPROM); flash memory or other memory technologies; CDROM, digital versatile disc (DVD) or other optical or holographic storage media; magnetic cassettes, magnetic tape, magnetic disk storage device or other magnetic memory devices, wave carriers or other storage medium that can be used to encode the required information, and which can be accessed through the described device. The memory includes a storage medium based on a computer storage device in the form of volatile or non-volatile memory, or a combination thereof. Exemplary hardware devices include solid state memory, hard disk drives, optical disk drives, etc. An example environment is stored in the memory in which, using computer instructions or codes stored in the device’s memory, a procedure for converting a structured data array can be carried out. The device contains one or more processors 2011, which are designed to execute computer instructions or codes stored in the device’s memory in order to ensure the implementation of the procedure for converting a structured data array. Computer instructions or codes stored in memory are designed to perform the conversion of a structured data array. These commands and codes include at least commands for generating the first data structure of a structured data array, commands for generating a database of logical connections, commands for generating a second data structure for a structured data array, commands for generating a database of semantic parts of logical sections, and commands for generating grammatically and spelling correct semantic parts, commands for the formation of the final data structure of a structured data array and the corresponding commands, intended nnye to execute the commands mentioned above. The I / O 2012 modules of device 201 are, but are not limited to, typical and prior art device controls: a mouse, keyboard, joystick, touchpad, trackball, electronic pen, stylus, touch screen, and the like. Also, I / O 2012 modules are, but are not limited to, typical and known from the prior art means of displaying information: a display, a monitor, a projector, a printer, a plotter, and the like. System 200 may also include a database (DB) 202. DB 202 can be, but is not limited to: a hierarchical database, a network database, a relational database, an object database, an object-oriented database, an object-relational database, a spatial database, a combination listed two or more databases, and the like. The DB 202 stores data in memory, which may be, but not limited to: read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory, CDROM, digital versatile disk (DVD) or other optical or holographic data carriers; magnetic cassettes, magnetic tape, magnetic disk storage device or other magnetic memory devices, wave carriers or other storage medium that can be used to store the required information and which can be accessed by the structured array data conversion device 201 and server 203. The database 202 is used to store data representing at least a command for generating a first data structure of a structured data array, a command for generating logical link databases, commands for generating a second data structure for a structured data array, commands for generating a database of semantic parts of logical partitions, commands for generating grammatically and spelling correct semantic parts, commands for generating a final data structure for a structured data array, and corresponding commands for executing said commands ; one or more structured data arrays to be converted containing at least natural language text that can be loaded into the memory 2013 of the structured data array transform device 201; and other data necessary for the functioning of the system. An exemplary structured data array conversion system 200 further comprises a server computing device (server) 203 that stores and facilitates manipulation of computer instructions or codes previously described herein, which, accordingly, are not further described. Server 203 may be a personal computer, a laptop computer, a tablet computer, a handheld computer, a smartphone, a database machine, and the like. The server 203 provides data exchange control in the structured data array conversion system 200, and also provides data processing provided that one or more of the structured data array conversion devices 201 is connected to it or when the structured data array conversion device 201 is a thin client. In this case, all the computing power necessary to ensure the conversion of the structured data array is located on the server 203. The system 200 also includes one or more data networks 204. Data network 204 may include, but is not limited to, one or more local area networks (LANs) and / or wide area networks (WANs), or may be an information and telecommunications network Internet, or an Intranet, or a virtual private network (VPN) , or a combination thereof, and the like. The server 203 also has the ability to provide a virtual computing environment (Virtual Machine) to facilitate interaction between the structured data array conversion device 201 and the database 202. The network 204 serves to provide interaction between the device 201, the database 202 and the server 203 of the structured data array conversion system 200.

Claims

1. A method of converting a structured data array containing at least natural language text, said method comprising at least the steps of:
A) form a first data structure of a structured data array containing the elements of said first data structure, said elements of the first data structure containing first logical partitions and second logical partitions;
B) form a database of logical connections of logical sections of the mentioned elements of the first data structure;
B) form a second data structure of a structured data array containing the elements of said second data structure, said elements of the second data structure containing logical structures of logical partitions of said elements of the first data structure generated using information from said logical linking database of logical partitions, said logical sections contain the first semantic parts and the second semantic parts;
D) form a database of semantic parts of logical sections from said second semantic parts, wherein said second semantic parts are excluded from the corresponding mentioned logical sections;
E) form the grammatically and spelling-correct semantic parts of the mentioned logical sections by linguistic transformations over the mentioned semantic parts;
E) form the final data structure of the structured data array containing the elements of the said final data structure, said elements of the final data structure containing logical structures containing at least the grammatically and spelling-correct semantic parts of logical sections.

2. The method according to claim 1, characterized in that stage A) is characterized by at least the stages in which:
- identify the original data structure of the structured data array;
- identify the elements of the original data structure;
- identify the first logical partitions of said elements of the original data structure and the second logical partitions of said elements of the original data structure; and
- form the first data structure of a structured data array containing the elements of said first data structure, said elements of the first data structure containing first logical partitions and second logical partitions.

3. The method according to claim 1, characterized in that stage B) is characterized by at least the stages in which:
- identify the elements of the first data structure containing one mentioned first logical partition, and the elements of the first data structure containing one mentioned second logical partition;
- identify elements of the first data structure containing more than one of the first logical partition, and elements of the first data structure containing more than one of the second logical partition;
- among the elements of the first data structure containing more than one of said first logical partitions, and in the elements of the first data structure containing more than one of said second logical partitions, logical relationships between said first logical partitions or between said second logical partitions are identified;
- among the elements of the first data structure containing more than one of the first logical partitions, and in the elements of the first data structure containing more than one of the second logical partitions, the elements of the first data structure that do not have a logical connection between logical partitions are identified; and
- form a database of logical relationships of logical sections of the elements of the first data structure.

4. The method according to claim 1, characterized in that stage B) is characterized by at least the stages in which:
- form logical structures of logical partitions of elements of the first data structure using information from a database of logical connections of logical partitions of elements of the first data structure and logical partitions of said elements of the first data structure containing one of the first logical partitions and logical partitions of said elements of the first data structure containing one said second logical partition; and
- form a second data structure containing the elements of the second data structure, said elements of the second data structure being formed logical structures of logical partitions of the first data structure.

5. The method according to claim 1, characterized in that step D) is characterized by at least the stages in which:
- identify the first logical partitions of the elements of the second data structure and the second logical partitions of the elements of the second data structure;
- in the aforementioned first logical sections and second logical sections of the elements of the second data structure, the first semantic parts and the second semantic parts are identified; and
- in said first and second logical sections of the elements of the second data structure, at least special semantic parts of the first logical sections of the elements of the second data structure and the special semantic parts of the second logical sections of the elements of the second data structure are identified and form a database of special semantic parts of the logical sections of the elements of the second data structures by moving said special semantic parts to said logically generated database of special semantic parts Partition elements of the second data structure.

6. The method according to claim 1, characterized in that stage D) is characterized by at least the stages in which:
- in said second semantic parts of said second logical sections of the elements of the second data structure, at least the refinement structures of the second semantic parts of the second logical sections are identified; and
- carry out linguistic transformations over all semantic parts, with the exception of the mentioned special semantic parts of the mentioned first and second logical sections, to form grammatically and spelling-correct semantic parts of logical sections of the elements of the second data structure.

7. The method according to claim 1, characterized in that step E) is characterized by at least the stages in which:
- form semantic combinations of grammatically and spelling correct semantic parts of the second logical sections of the elements of the second data structure from the first grammatically and spelling correct semantic parts of the second logical sections of the elements of the second data structure ; and
- form the final data structure containing the elements of the final data structure, and the said elements of the final data structure are logical constructs containing the grammatically and spelling correct semantic parts of the logical sections of the elements of the second data structure.

8. The method according to claim 7, characterized in that said logical constructions from said final data structure may further comprise said semantic combinations of grammatically and spelling-correct semantic parts of second logical sections of elements of the second data structure.

9. The method according to claim 1, characterized in that said method of converting a structured data array containing at least natural language text comprises at least the steps of:
A) identify the original data structure of a structured data array; identify elements of the original data structure; identifying the first logical partitions of said elements of the original data structure and the second logical partitions of said elements of the original data structure; and form a first data structure of a structured data array containing the elements of said first data structure, said elements of the first data structure containing first logical partitions and second logical partitions;
B) identify the elements of the first data structure containing one mentioned first logical section, and the elements of the first data structure containing one mentioned second logical section; identifying elements of a first data structure containing more than one of said first logical partitions, and elements of a first data structure containing more than one of said second logical partition; in the elements of the first data structure containing more than one of said first logical partitions, and in the elements of the first data structure containing more than one of said second logical partitions, logical relationships between said first logical partitions or between said second logical partitions are identified; in the elements of the first data structure containing more than one of said first logical partitions, and in the elements of the first data structure containing more than one of said second logical partitions, elements of the first data structure that do not have a logical connection between logical partitions are identified; and form a database of logical relationships of logical partitions of the elements of the first data structure;
C) form logical structures of logical partitions of elements of the first data structure using information from a database of logical connections of logical partitions of elements of the first data structure and logical partitions of said elements of the first data structure containing one of the first logical partitions and logical partitions of the said elements of the first data structure, containing one said second logical partition; and form a second data structure containing elements of the second data structure, said elements of the second data structure being formed logical structures of logical partitions of the first data structure;
D) identify the first logical partitions of the elements of the second data structure and the second logical partitions of the elements of the second data structure; in said first logical sections and second logical sections of the elements of the second data structure, the first semantic parts and the second semantic parts are identified; and in said first and second logical partitions of the elements of the second data structure, at least special semantic parts of the first logical partitions of the elements of the second data structure and the special semantic parts of the second logical partitions of the elements of the second data structure are identified and form a database of special semantic parts of the logical partitions of the elements of the second data structures by moving said special semantic parts to said logically generated database of special semantic parts x sections of the elements of the second data structure;
E) in said second semantic parts of said second logical sections of the elements of the second data structure, at least the refinement structures of the second semantic parts of the second logical sections are identified; and carry out linguistic transformations over all semantic parts, with the exception of the mentioned special semantic parts of the mentioned first and second logical sections, to form grammatically and spelling-correct semantic parts of logical sections of the elements of the second data structure;
E) form semantic combinations of grammatically and spelling correct semantic parts of the second logical sections of elements of the third structure from the first grammatically and spelling correct semantic parts of the second logical sections of the elements of the second data structure data; and form the final data structure containing the elements of the final data structure, and the said elements of the final data structure are logical constructs containing the mentioned grammatically and spelling correct semantic parts of the logical sections of the elements of the second data structure.

10. The method according to claim 9, characterized in that said logical constructions from said final data structure may further comprise said semantic combinations of grammatically and spelling-correct semantic parts of second logical sections of elements of the second data structure.

11. A device for converting a structured data array containing at least:
one or more processors;
one or more input / output (I / O) modules; and
a memory containing program code, which upon execution causes said one or more processors of said device and / or device associated with said device to perform the actions of the method according to any one of claims 1 to 10, and containing one or more structured data arrays to be converted containing at least natural language text.

12. The device according to claim 11, characterized in that said one or more structured data arrays to be converted are downloadable, and said device is configured to connect to a database in which said loadable one or more structured data arrays to be converted are stored, for loading into said memory of the device of at least one loaded structured data array to be converted.

13. A system for converting a structured data array containing at least:
one or more devices made in the form of devices according to any one of claims 11 or 12 of the formula;
one or more servers providing regulation of data exchange in the system;
one or more databases for storing data configured to interact with said one or more devices;
one or more data transmission networks through which said devices, servers and databases interact.

14. The system according to item 13, wherein the method according to any one of claims 1 to 10 of the formula is carried out by one or more of the mentioned servers, and the said devices are a thin client.

15. The system of clause 14, wherein said database is used to store data representing at least one of: program code, which upon execution causes said one or more processors of said device and / or device connected with the said device, perform the steps of the method according to any one of claims 1 to 10 of the formula, to be converted one or more structured data arrays containing at least natural language text.

16. The system according to any one of paragraphs.13-15, characterized in that said data network is one of: local area network (LAN), wide area network (WAN), information and telecommunication network Internet, virtual private network (VPN).

17. A computer-readable storage medium containing program code, which upon execution causes the processor or processors of the device with which the computer-readable storage medium interacts, to perform the actions of the method according to any one of claims 1 to 10 of the formula.