CN118761390B - Method for hiding knowledge in text file for independent or combined reading based on AI technology - Google Patents
Method for hiding knowledge in text file for independent or combined reading based on AI technology Download PDFInfo
- Publication number
- CN118761390B CN118761390B CN202411216100.8A CN202411216100A CN118761390B CN 118761390 B CN118761390 B CN 118761390B CN 202411216100 A CN202411216100 A CN 202411216100A CN 118761390 B CN118761390 B CN 118761390B
- Authority
- CN
- China
- Prior art keywords
- file
- knowledge
- text
- ofd
- content
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
- G06F40/169—Annotation, e.g. comment data or footnotes
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/103—Formatting, i.e. changing of presentation of documents
- G06F40/109—Font handling; Temporal or kinetic typography
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/103—Formatting, i.e. changing of presentation of documents
- G06F40/117—Tagging; Marking up; Designating a block; Setting of attributes
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- General Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Mathematical Physics (AREA)
- Document Processing Apparatus (AREA)
Abstract
The invention discloses a method for hiding knowledge in text files for independent or combined reading based on an AI technology, which comprises the following steps: the method comprises the steps of setting a knowledge matching base, analyzing and judging the type of the uploaded file; analyzing the file to obtain the type of the field to which the file belongs; according to the field type of the file, analyzing the file to obtain a text queue to be expanded, inputting the queue into a knowledge model, and obtaining a key and value key value pair model in one-to-one correspondence; the method effectively fuses the knowledge in the file by using the AI technology and the OFD expansion file, creates a method for hiding the knowledge in the file, and processes the file by using the technology so that a large amount of knowledge content is attached to the original file, thereby improving the reading experience of readers and improving the reading efficiency.
Description
Technical Field
The technology belongs to the technical field of artificial intelligence, and in particular relates to a method for hiding knowledge in text files for independent or combined reading by an AI technology.
Background
When the traditional electronic file is read, the read content is limited to the file itself, some expansion or association matching can not be carried out on the file itself, and compared with a webpage, remarks are required to be added during maintenance, and automatic remarks, marks or judgment can not be carried out; and the file reading is online matching inquiry and can not be checked offline.
Disclosure of Invention
The present invention is directed to a method for hiding knowledge from text files by AI technology for independent or combined reading, so as to solve one or more of the problems set forth in the background art.
In order to achieve the above purpose, the present invention provides the following technical solutions: the method for hiding knowledge in text files for independent or combined reading based on AI technology comprises the following steps:
step S1: setting a knowledge matching base;
Step S2: analyzing and judging the uploaded file type, and if the file type is not the standard OFD file, converting the file type into the standard OFD file;
step S3: analyzing the file by using the NLP to acquire the field type of the file, and matching the universal data in the knowledge matching base if the field type of the file cannot be judged;
step S4: according to the file field type, analyzing and acquiring file content information to obtain a text queue to be expanded, inputting the text queue into a knowledge model, and acquiring a key and value key value pair model in one-to-one correspondence;
Step S5: typesetting the association relation information between the content (key) of the recorded original text and the interpretation (value) of the original text according to a format to form a knowledge base xml file, generating an expansion file package of the OFD, and generating a new OFD file which can be used for online or offline transmission and reading;
Step S6: and loading an original file of the OFD file by using the OFD reader for reading, analyzing the knowledge expansion file package if the original file has the knowledge expansion file package, displaying the content identification appointed by the original file, and displaying and reading the knowledge matching base of the file by adopting different display modes.
Preferably, the fields of the knowledge matching base include a description in the field, written description, specific meaning and content interpretation, provenance, content description.
Preferably, in the step S2, the type of the uploaded file is analyzed and judged, and if the file is PDF, PPT, PPTX, DOC, DOCX, XLS, XLXS, CAD, true, DLF, the file is converted into an OFD file; if the file is a picture or a scanned file, performing OCR on the file to generate a double-layer OFD file; if the file is an audio file, a video file and the like, the file is used as a resource to be filled into a frame of the OFD, characters in the file are identified, and the file is used as a subtitle of the audio and video to be synchronously displayed.
Preferably, the types of the fields to which the documents belong include medical industry, scientific industry, educational industry, politics and chemistry; the general data refers to: industry interpretation information is the same between different domains.
Preferably, the specific steps of analyzing the file in step S4 are:
step S41: analyzing file content information, and carrying out semantic analysis on phrases, entries and related sentences;
step S42: obtaining a text queue to be expanded;
Step S43: inputting the text queues into a knowledge model to obtain key and value models corresponding to each other one by one;
Step S44: the extended text generated in step S43 is saved to generate a new OFD file.
Preferably, the expanded text is an association relationship between the content for recording the original text and the explanation of the original text.
Preferably, the specific steps for reading by using the OFD reader in the step S5 are as follows:
Step S51, loading OFD original text basic content by using an OFD reader, and judging whether an expansion file exists in the original text;
step S52: if the original text has the expansion file, analyzing the expansion file, and displaying the identification of the specified content of the original text, wherein the specified content comprises words, entries and sentences of the original text;
Step S53: the content in the extension file is displayed in an OFD original document in various forms, wherein the display forms comprise mouse suspension display, and annotation content description is carried out on the right side of the OFD original document;
Step S54: the drawing menu shows the names and interpretations contained throughout the text in the text.
Preferably, the content operation in the knowledge matching base comprises editing, modifying and erasing; and updating and adjusting the content of the expansion file displayed by the OFD original text in real time according to the updating of the knowledge matching base.
Preferably, during reading, different modes can be selected to control whether the display content is dynamically displayed or hidden.
Preferably, when the original OFD file is a combined file, the content information of whether hidden knowledge exists can be displayed on the combined file directory.
Compared with the prior art, the invention has the beneficial effects that:
(1) The method effectively fuses the AI technology and the OFD expansion file, creates a method for hiding the knowledge in the file, and processes the file by the technology so that a large amount of knowledge content is attached to the original file, thereby improving the reading experience of readers and improving the reading efficiency;
(2) The invention can convert various files through the OFD conversion technology, can expand the range of hiding knowledge in the files, and is suitable for more file types;
(3) The invention supports offline reading, and the files can attach knowledge to the files after processing, and other online knowledge bases or any background support are not needed.
Drawings
FIG. 1 is a flow chart of the present invention;
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Examples
Referring to fig. 1, the present invention provides a method for hiding knowledge from text files by AI technology for independent or combined reading, comprising the following steps:
Step S1: setting a knowledge matching base, wherein the fields of the knowledge matching base comprise the description of the field and the writing, the specific meaning and the explanation, provenance and description of the content, and a series of operations can be performed on the content in the knowledge matching base, and the content can be edited, modified and erased;
Step S2: when uploading a file, uploading the file to a server, analyzing and judging the type of the uploaded file, if the type of the file is not a standard OFD file, converting the file into the standard OFD file, and when analyzing and judging the type of the uploaded file, converting the file into the OFD file if the type of the uploaded file is PDF, PPT, PPTX, DOC, DOCX, XLS, XLXS, CAD, true, DLF files, wherein the DLF file is a compressed package file, the DLF file internally comprises a guide file and a plurality of OFD files, and the guide file mainly records the association relation among the files, the entry of the file, the storage path of the OFD file, the triggering position and the skip event between the OFD and the OFD; if the file is a picture, for example PNG, JPG, TIF or a scanned file, performing OCR on the file to generate a double-layer OFD file; if the file is an audio file, a video file and the like, filling the file into a frame of the OFD as a resource, identifying characters in the file, and synchronously displaying the file as subtitles of the audio and video;
Step S3: analyzing the file by using the NLP to acquire the field type of the file, and matching the universal data in the knowledge matching base if the field type of the file cannot be judged, wherein the field type of the file comprises industries such as medical industry, scientific industry, education industry, politics, chemistry, agriculture and the like; the general data refers to: industry interpretation information that is the same between different domains;
Step S4: according to the type of the file field, analyzing and acquiring file content information to obtain a text queue to be expanded, inputting the queue into a knowledge model, and acquiring a key and value key value pair model in one-to-one correspondence, wherein the key refers to the content of an original text, the value refers to the explanation of the original text, and the specific steps of analyzing the file content information are as follows:
step S41: analyzing the content information of the file, and carrying out semantic analysis on phrases, entries and related sentences in the file;
step S42: obtaining a text queue to be expanded, wherein the expanded text queue refers to a list of original text contents;
Step S43: inputting the text queue, namely the list of the original text contents, into a knowledge model, and obtaining a key and value model corresponding to the corresponding interpretation original text contents one by one through original text retrieval and matching, so as to generate an expanded text;
Step S44: directly writing the generated expanded text, namely a text queue for recording the association relation between the content of the original text and the explanation of the original text, into an OFD file, thereby saving and generating a new OFD file;
Step S5: recording association relation information between the content (key) of the original text and the explanation (value) of the original text, typesetting according to a format to form a knowledge base xml file, generating an OFD expansion file package, generating a new OFD file according to the OFD expansion file package, and carrying out real-time updating and adjustment on the expansion file content displayed by the OFD original text according to the updating of a knowledge matching base, wherein the specific steps of reading by adopting an OFD reader are as follows:
step S51, loading and generating new OFD original text basic content by using an OFD reader, and judging whether an expansion file package of a 'knowledge matching base' exists in the original text if the OFD file is an independent file;
step S52: if the original text has the expansion file package, analyzing the expansion file package, and displaying the identification of the specified content of the original text, wherein the specified content comprises words, entries and sentences of the original text;
Step S53: the content in the extension file is displayed in an OFD original document in various forms, wherein the display forms comprise mouse suspension display, annotation content description on the right side of the OFD original document and the like;
Step S54: drawing a menu in an original text to show names and explanations contained in the whole text;
step S6: when reading, the original content of the OFD file is loaded by the OFD reader, then whether an expansion file package of the knowledge matching library exists is judged, if the original file exists, the expansion file of the knowledge matching library indicates that the OFD file already has knowledge hiding, the expansion file package of the knowledge matching library is analyzed, the appointed content in the original file such as words, terms and sentences is displayed, and the knowledge matching library of the file is displayed and read by adopting different display modes, such as mouse suspension display, content description of annotation state on the right side and unified display of all involved terms, so that whether the displayed content is dynamically displayed or hidden can be controlled.
When the original OFD file is a combined file, judging whether an extended file package of a knowledge matching library exists in the combined file when the combined file is read, if the extended file package of the knowledge matching library exists in the file, analyzing the extended file package of the knowledge matching library in the combined file, wherein the extended files of the combined file are all stored in respective file contents, the contents existing in the extended files of the knowledge matching library can be displayed in the OFD file in one or more modes of mouse suspension display, right side annotation state content description and unified display of all related terms, and appointed contents in the OFD file can be marked and displayed, a reader can dynamically control whether the knowledge is displayed or hidden, and meanwhile, the content information of the hidden knowledge can be displayed on the directory of the combined file, so that the AI technology and the OFD extended file can be effectively fused, a method for hiding the knowledge in the file is created, a great amount of knowledge contents can be attached to the original file through the processing of the original file, the reader experience is improved, and the reader reading efficiency is improved.
It will be evident to those skilled in the art that the invention is not limited to the details of the foregoing illustrative embodiments, and that the present invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The present embodiments are, therefore, to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein. Any reference sign in a claim should not be construed as limiting the claim concerned.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202411216100.8A CN118761390B (en) | 2024-09-02 | 2024-09-02 | Method for hiding knowledge in text file for independent or combined reading based on AI technology |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202411216100.8A CN118761390B (en) | 2024-09-02 | 2024-09-02 | Method for hiding knowledge in text file for independent or combined reading based on AI technology |
Publications (2)
Publication Number | Publication Date |
---|---|
CN118761390A CN118761390A (en) | 2024-10-11 |
CN118761390B true CN118761390B (en) | 2024-11-15 |
Family
ID=92949171
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202411216100.8A Active CN118761390B (en) | 2024-09-02 | 2024-09-02 | Method for hiding knowledge in text file for independent or combined reading based on AI technology |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN118761390B (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116341569A (en) * | 2023-03-14 | 2023-06-27 | 复旦大学 | Professional document intelligent auxiliary reading method based on domain knowledge base |
CN116522233A (en) * | 2023-07-03 | 2023-08-01 | 国网北京市电力公司 | Method and system for extracting and classifying key point review content of research document |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6711585B1 (en) * | 1999-06-15 | 2004-03-23 | Kanisa Inc. | System and method for implementing a knowledge management system |
US8805766B2 (en) * | 2010-10-19 | 2014-08-12 | Hewlett-Packard Development Company, L.P. | Methods and systems for modifying a knowledge base system |
CN117009455A (en) * | 2022-04-27 | 2023-11-07 | 第四范式(北京)技术有限公司 | Knowledge content extraction method and device and electronic equipment |
CN116579295B (en) * | 2023-07-13 | 2023-09-29 | 江苏中威科技软件系统有限公司 | Method for dynamically inserting pages, moving bits and hiding and displaying multi-format file subscription |
CN118113880A (en) * | 2024-02-29 | 2024-05-31 | 浪潮通用软件有限公司 | Knowledge graph-based financial audit chain construction method and system |
CN118193885B (en) * | 2024-05-20 | 2024-08-02 | 江苏中威科技软件系统有限公司 | Device for functional review and performance evaluation based on DLF files |
CN118485151A (en) * | 2024-07-16 | 2024-08-13 | 江苏中威科技软件系统有限公司 | Methods for explaining AI semantic analysis results |
-
2024
- 2024-09-02 CN CN202411216100.8A patent/CN118761390B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116341569A (en) * | 2023-03-14 | 2023-06-27 | 复旦大学 | Professional document intelligent auxiliary reading method based on domain knowledge base |
CN116522233A (en) * | 2023-07-03 | 2023-08-01 | 国网北京市电力公司 | Method and system for extracting and classifying key point review content of research document |
Also Published As
Publication number | Publication date |
---|---|
CN118761390A (en) | 2024-10-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111753500B (en) | Method for merging and displaying formatted electronic form and OFD (office file format) and generating catalog | |
CN111178027B (en) | Report generation system and method based on online creation template and online specified data | |
US6072479A (en) | Multimedia scenario editor calculating estimated size and cost | |
CN1838111B (en) | Method for editing file and recording modification mark | |
US7673235B2 (en) | Method and apparatus for utilizing an object model to manage document parts for use in an electronic document | |
US7240279B1 (en) | XML patterns language | |
EP1672528A2 (en) | File Formats, Methods, and Computer Program Products for Representing Workbooks | |
CN111259202A (en) | Document structured data embedding method and system | |
CN109857670A (en) | A kind of test report automatic generation method based on common template | |
CN106940693B (en) | Single structure processing method for laboratory original record | |
CN112597750B (en) | Method, editor and medium for designing dynamic detection report template online | |
CN110211581B (en) | A laboratory automatic speech recognition record identification system and method | |
US8244697B2 (en) | Versioning system for electronic textbooks | |
CN106776779B (en) | Method for generating entity file by JSON data based on Mac platform | |
US8423888B2 (en) | Document conversion and use system | |
CN118761390B (en) | Method for hiding knowledge in text file for independent or combined reading based on AI technology | |
CN112596732B (en) | Electronic teaching material manufacturing method and system | |
CN110533143B (en) | Method and device for generating electronic card, storage medium and computer equipment | |
CN118278376A (en) | Method and device for checking contract text content, storage medium and computer product | |
CN116681049A (en) | Text generation method and device | |
JP2018136636A (en) | Information processing apparatus and information processing method | |
CN112668282B (en) | Method and system for converting format of equipment procedure document | |
US20220215034A1 (en) | Electronic apparatus and controlling method thereof | |
CN113128191B (en) | Method and device for scripted generation of arbitration document | |
CN101057232A (en) | Document processing device and document processing method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 40112197 Country of ref document: HK |