CN114817378A - Tax file search engine-based retrieval method - Google Patents
- Publication number: CN114817378A
- Application number: CN202210269052.3A
- Authority: CN (China)
- Prior art keywords: retrieval, file, tax, level, split screen
- Legal status: Pending (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/248—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2455—Query execution
- G06F16/24552—Database cache management
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/26—Visual data mining; Browsing structured data
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/38—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Library & Information Science (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Technical Field
The invention relates to tax law document search engines, and in particular to a retrieval method based on a tax law document search engine.
Background Art
Tax law documents come in many types and complex versions, and finding the target document quickly and accurately has long been a problem for tax practitioners. Tax law document search engines emerged in response. However, existing products have flaws in their interface presentation, retrieval technology, and retrieval method design: retrieval is slow, or several searches are needed before a result is obtained, so they cannot efficiently meet users' needs when applying tax law.
Specifically, the prior art has the following shortcomings:
1. Constrained by their interaction technology, existing products cannot present central-level, province/autonomous region/municipality-level, and city/prefecture/autonomous prefecture-level tax law documents together on one screen, nor can they search these three levels of legal effect simultaneously and present the results on one screen. Handling a tax matter correctly, however, requires accurately applying documents from all three levels, so users have to search, save, and compare several times separately, which is inefficient.
2. Constrained by their display technology, existing products do not let a user who needs to compare old and new provisions, or to apply related regulations together, view them side by side on one computer screen. The user has to minimize and switch windows repeatedly, and the experience is very poor.
3. Existing products fetch data directly from the database and compute on it for every search. This leads to high memory usage, much repeated computation, sluggish retrieval, slow result feedback, and high concurrency risk, which harms the user experience and creates system risk.
4. Existing products search tax law documents either with a self-built search lexicon or with plain word-segmentation matching. Retrieval precision is low and the retrieval technology is incomplete.
In summary, existing products cost users extra time: when applying tax law to specific problems in work or study, document retrieval takes a long time, efficiency is low, the retrieval function is weak, and the user experience is poor.
Summary of the Invention
The purpose of the invention is to overcome the shortcomings of the prior art and to provide a retrieval method based on a tax law document search engine that is more efficient, more stable, and gives a better user experience.
The purpose of the invention is achieved through the following technical solution:
A retrieval method based on a tax law document search engine: when a tax law document is entered into the system, the system defines the regulatory level to which it belongs. The regulatory levels are central level, province/autonomous region/municipality level, and city/prefecture/autonomous prefecture level. When a user enters the system, the regulatory levels applicable to the user's located position are determined, and the latest-document list shows the tax law documents of those levels, presenting central-level, province/autonomous region/municipality-level, and city/prefecture/autonomous prefecture-level documents on one screen. The three regulatory levels are switchable tabs; after a tab is selected, the document list shows the tax law documents of that level. In the latest-document list, documents are sorted in descending order of their date of writing;
Vue's reactive interaction technology is used to present central-level, province/autonomous region/municipality-level, and city/prefecture/autonomous prefecture-level documents on one screen. When a user searches with a given retrieval condition, the documents of each of the three levels are filtered by that condition at the same time, and the filtered results for every level are presented on one screen.
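The same-screen, three-level filtering described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the `TaxDocument` type, the level names in `LEVELS`, the `search_all_levels` function, and the sample documents are all assumptions made for the example, and a plain substring test stands in for the retrieval condition.

```python
from dataclasses import dataclass
from datetime import date

# Hypothetical identifiers for the three regulatory levels.
LEVELS = ("central", "provincial", "municipal")


@dataclass(frozen=True)
class TaxDocument:
    title: str
    level: str      # one of LEVELS, assigned when the document is entered
    written: date   # date of writing
    body: str = ""


def search_all_levels(documents, keyword):
    """Apply one condition to every level and sort each level newest first."""
    results = {level: [] for level in LEVELS}
    for doc in documents:
        if keyword in doc.title or keyword in doc.body:
            results[doc.level].append(doc)
    for hits in results.values():
        hits.sort(key=lambda d: d.written, reverse=True)
    return results


# Made-up sample data for illustration only.
docs = [
    TaxDocument("VAT notice A", "central", date(2021, 3, 1)),
    TaxDocument("VAT notice B", "central", date(2022, 1, 15)),
    TaxDocument("Provincial VAT rule", "provincial", date(2020, 6, 30)),
    TaxDocument("Municipal stamp duty rule", "municipal", date(2021, 9, 9)),
]
hits = search_all_levels(docs, "VAT")
for level in LEVELS:
    print(level, [d.title for d in hits[level]])
```

Returning one result list per level, rather than one merged list, is what lets a front end bind each list to its own tab and show all three at once.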
The method further includes a step of displaying tax law document content in split screens, comprising the following sub-steps:
S101: When the first tax law document link is opened, a split-screen area is created on the basis of the document list, and the content of the document is displayed in it as the first pane, starting the split-screen display;
S102: When a second tax law document link is opened, the system first checks whether that document is already open. If so, it reports that the document is already open in the first pane. If not, the split-screen area is divided further, and the content of the second document is displayed as the second pane, side by side with the first pane in the split-screen area;
S103: When a third tax law document link is opened, the system first checks whether that document is already open. If so, it reports which pane the document is open in. If not, the content of the third document is displayed in the split-screen area as the third pane, the first pane is slid out of view, the split-screen area shows the second and third panes, and a scroll bar for dragging the display is generated in the split-screen area;
S104: When a fourth tax law document link is opened, the system first checks whether that document is already open. If it is, the system reports which pane it is open in and then checks whether that pane is currently slid out of view; if it is hidden, the display jumps to show that pane in the split-screen area. If the document is not open, its content is displayed in the split-screen area as the fourth pane, the second pane is slid out of view, and the split-screen area shows the third and fourth panes;
S105: And so on, until the number of documents the user has opened exceeds the system's preset maximum number of panes, at which point the system reports that the pane limit has been reached.
The split-screen display module presents several panes on one computer screen, and the split-screen technique is used in every module of the system. Using Vue swiper (slide-paging) technology, and following human reading habits and practical working needs, at most three panes are shown within one screen display area; content beyond three panes is reached by dragging the scroll bar. Hide and close buttons are provided so that the user can switch freely among the first, second, and third panes.
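The pane bookkeeping in S101 to S105 can be sketched as a small state machine. This is an illustrative model only: the class name `SplitScreenArea`, its methods, and the message strings are hypothetical, and the number of panes visible at once is left as a parameter because the steps above show two panes side by side while the module description allows up to three.

```python
class SplitScreenArea:
    """Tracks open documents and which panes are currently in view."""

    def __init__(self, visible=2, max_panes=4):
        self.visible = visible      # panes shown at once (assumed configurable)
        self.max_panes = max_panes  # preset maximum number of panes
        self.panes = []             # document ids in opening order
        self.first = 0              # index of the leftmost visible pane

    def visible_panes(self):
        return self.panes[self.first:self.first + self.visible]

    def open(self, doc_id):
        if doc_id in self.panes:
            index = self.panes.index(doc_id)
            if doc_id not in self.visible_panes():
                # The pane was slid out of view: scroll it back in.
                self.first = min(index, max(0, len(self.panes) - self.visible))
            return f"already open in pane {index + 1}"
        if len(self.panes) >= self.max_panes:
            return "pane limit reached"
        self.panes.append(doc_id)
        # Show the newest panes; older ones slide out of view.
        self.first = max(0, len(self.panes) - self.visible)
        return f"opened in pane {len(self.panes)}"


area = SplitScreenArea(visible=2, max_panes=4)
log = [area.open(doc) for doc in ("A", "B", "C")]
after_third = area.visible_panes()    # first pane hidden, second and third shown
reopen_hidden = area.open("A")        # reports the pane and scrolls it into view
after_reopen = area.visible_panes()
```

In a Vue front end the `panes` list and `first` index would be reactive state, and the scroll bar would simply adjust `first`.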
The method further includes a preload caching step: each tax law document is associated with its file name, document number, full-text keywords, date of writing, attachments, regulatory level, related links, and so on, and these associations are cached. When a user searches, if the retrieval condition matches a cached association, the result set is returned directly from the preload cache without fetching from the database and computing on it, so results are returned quickly, retrieval is smoother and more efficient, and the system runs more stably.
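One way to picture the preload cache is as an in-memory index built once at startup, mapping each (field, value) pair to the documents that carry it. The sketch below assumes that reading; the `PreloadCache` class, the field names, and the sample records (including the document numbers) are invented for illustration.

```python
from collections import defaultdict


class PreloadCache:
    """In-memory index from (field, value) to document ids, built at startup."""

    FIELDS = ("name", "number", "level", "year")  # hypothetical field names

    def __init__(self, documents):
        self.index = defaultdict(set)
        for doc in documents:
            for field in self.FIELDS:
                self.index[(field, doc[field])].add(doc["id"])
            for keyword in doc["keywords"]:
                self.index[("keyword", keyword)].add(doc["id"])

    def lookup(self, **conditions):
        """Return ids matching every condition, or None on a cache miss."""
        result = None
        for field, value in conditions.items():
            ids = self.index.get((field, value))
            if ids is None:
                return None  # not cached: the caller falls back to the database
            result = set(ids) if result is None else result & ids
        return result


# Made-up records; the document numbers are placeholders.
documents = [
    {"id": 1, "name": "Notice on VAT rates", "number": "Doc-001",
     "level": "central", "year": 2019, "keywords": ["VAT", "rate"]},
    {"id": 2, "name": "Provincial VAT guidance", "number": "Doc-002",
     "level": "provincial", "year": 2021, "keywords": ["VAT"]},
]
cache = PreloadCache(documents)
```

A `None` return is kept distinct from an empty set so that "nothing cached for this condition" is not mistaken for "cached, and no document matches".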
The method further includes a dynamic caching step: hot result data for the different combinations of retrieval conditions that users submit (regulatory level, date of writing, keywords, and so on) are cached automatically, associating each combination of conditions with its results. When another user later submits a combination that matches a cached one, the results are served from the cached hot data first, with no repeated computation, so retrieval is smoother and more efficient and the system runs more stably.
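The dynamic cache can be sketched as a dictionary keyed by the normalized combination of conditions. The `HotResultCache` class, its `search` signature, and the stand-in `slow_search` function below are assumptions for the example; eviction and invalidation when documents change are not covered by the text above and are left out here.

```python
class HotResultCache:
    """Caches results per combination of (level, date range, keywords)."""

    def __init__(self, search_fn):
        self.search_fn = search_fn  # the expensive database-backed search
        self.store = {}
        self.hits = 0
        self.misses = 0

    @staticmethod
    def key(level, date_range, keywords):
        # frozenset makes the key independent of keyword order.
        return (level, date_range, frozenset(keywords))

    def search(self, level, date_range, keywords):
        k = self.key(level, date_range, keywords)
        if k in self.store:
            self.hits += 1
            return self.store[k]
        self.misses += 1
        result = self.search_fn(level, date_range, keywords)
        self.store[k] = result
        return result


calls = []


def slow_search(level, date_range, keywords):
    """Stand-in for the real search; records each call it receives."""
    calls.append((level, date_range, tuple(keywords)))
    return [f"{level}:{'+'.join(sorted(keywords))}"]


cache = HotResultCache(slow_search)
first = cache.search("central", (2020, 2022), ["VAT", "invoice"])
second = cache.search("central", (2020, 2022), ["invoice", "VAT"])  # other user, same combination
```

Because the key is shared across users, the second query is answered from the cache even though the keywords arrive in a different order.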
The method further includes a step of automatically upgrading document retrieval, comprising the following sub-steps:
S201: Collect tax law documents and build a tax law keyword lexicon at multiple levels and along multiple dimensions;
S202: Using Jieba associative word segmentation (which can automatically form words according to the context of a phrase so that they match the actual meaning), segment all tax law documents into words to form a first segmentation lexicon;
S203: Merge the keyword lexicon with the first segmentation lexicon to obtain a second segmentation lexicon;
S204: Using Jieba's custom dictionary technique, give the keyword lexicon priority within the second segmentation lexicon, and split the second segmentation lexicon into a keyword lexicon and a non-keyword lexicon;
S205: Using a Bloom filter to distinguish keywords from non-keywords quickly, form a two-round combined mode for user searches. The user enters the query only once, and the retrieval module automatically handles it as two rounds of retrieval, as follows:
Using concurrent retrieval with multiple goroutines in Go, the retrieval module segments the query with the keyword lexicon and the non-keyword lexicon at the same time, producing several primary search terms from the keyword lexicon and several secondary search terms from the non-keyword lexicon;
In the first round, using sequence combination and merging, the different combinations formed from the primary search terms are matched against the file names and body text of the tax law documents, and the results are presented in descending order of match degree;
If the first round does not meet the user's needs, the user can press the "Continue search" button in the interface. For this search only, the retrieval module promotes the secondary search terms to primary search terms, searches again under the first-round rules, and presents the results. If the second round meets the user's needs, the retrieval module automatically records the secondary search terms used in it.
S206: Using intelligent retrieval recognition (search results are analyzed by document importance and a weight is computed for non-keyword search results, where more results give a larger weight, so that non-keywords to be added are identified automatically), the system periodically computes the term frequency and word vectors, within the tax law documents, of the secondary search terms (that is, non-keywords in the segmentation lexicon) that satisfied users in second-round searches, and dynamically promotes them into the keyword lexicon.
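The two-round flow of S205 can be sketched as below. Several simplifications are assumptions of this example, not part of the method: whitespace tokenization stands in for Jieba segmentation, the two small sets `KEYWORDS` and `NON_KEYWORDS` stand in for the lexicons, and the match degree is taken to be the number of search terms a document contains (which equals the size of the largest term combination it fully matches).

```python
# Hypothetical lexicons; a real system would build these in S201 to S204.
KEYWORDS = {"vat", "invoice"}
NON_KEYWORDS = {"electronic", "refund"}


def split_terms(query):
    """Split a query into primary (keyword) and secondary (non-keyword) terms."""
    tokens = query.lower().split()
    primary = [t for t in tokens if t in KEYWORDS]
    secondary = [t for t in tokens if t in NON_KEYWORDS]
    return primary, secondary


def rank(documents, terms):
    """Order matching documents by how many of the terms they contain."""
    scored = []
    for doc in documents:
        words = set((doc["name"] + " " + doc["body"]).lower().split())
        score = sum(1 for t in terms if t in words)
        if score:
            scored.append((score, doc["name"]))
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [name for _, name in scored]


def two_round_search(documents, query, continue_search=False):
    """Round one uses primary terms; "Continue search" also promotes secondary terms."""
    primary, secondary = split_terms(query)
    terms = primary + secondary if continue_search else primary
    used_secondary = secondary if continue_search else []
    return rank(documents, terms), used_secondary


documents = [
    {"name": "VAT invoice rules", "body": "rules for issuing a vat invoice"},
    {"name": "VAT rate notice", "body": "rate changes"},
    {"name": "Electronic ledger refund guide", "body": "electronic refund procedure"},
]
query = "electronic vat invoice refund"
round_one, _ = two_round_search(documents, query)
round_two, recorded = two_round_search(documents, query, continue_search=True)
```

The second return value is the list of secondary terms used in round two, which is what S205 says the retrieval module records for the later promotion step.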
The beneficial effects of the invention are:
1) The invention presents central-level, province/autonomous region/municipality-level, and city/prefecture/autonomous prefecture-level tax law documents together on one screen, and searches these three levels of legal effect simultaneously with the results presented on one screen. When handling tax matters, users can apply documents from all three levels accurately and efficiently, without searching, saving, and comparing several times separately, which improves working efficiency.
2) The invention supports split-screen display, which makes comparative reading of tax law documents more convenient. When searching, consulting, and applying tax law, users can work in split panes on one computer screen and avoid the inefficiency of repeatedly minimizing and switching windows. The detailed behaviors of the split-screen display, such as slide-to-hide, prompts, and jumps, give a friendly interface and help users work more efficiently.
3) The invention uses preloading and dynamic caching to avoid fetching from the database and computing on every search. This addresses high memory usage, repeated computation, sluggish retrieval, slow result feedback, and high concurrency risk, improving the user experience and guarding against system risk.
4) The invention provides an intelligently upgraded keyword lexicon and a two-round retrieval mode, which improve retrieval precision, adapt dynamically to changes in tax law, and meet users' retrieval and application needs.
Detailed Description
The technical solution of the invention is described clearly and completely below with reference to embodiments. The described embodiments are only some of the embodiments of the invention, not all of them. All other embodiments obtained by a person skilled in the art on the basis of these embodiments without creative effort fall within the scope of protection of the invention.
The invention provides a retrieval method for a tax law document search engine. Its intended users include enterprises and public institutions, finance and accounting practitioners, tax-related service practitioners at intermediary agencies (including tax agent firms, accounting firms, and law firms), tax authority staff, and teachers and students at finance and taxation schools.
A retrieval method based on a tax law document search engine: when a tax law document is entered into the system, the system defines the regulatory level to which it belongs. The regulatory levels are central level, province/autonomous region/municipality level, and city/prefecture/autonomous prefecture level. When a user enters the system, the regulatory levels applicable to the user's located position are determined, and the latest-document list shows the tax law documents of those levels, presenting central-level, province/autonomous region/municipality-level, and city/prefecture/autonomous prefecture-level documents on one screen. The three regulatory levels are switchable tabs; after a tab is selected, the document list shows the tax law documents of that level. In the latest-document list, documents are sorted in descending order of their date of writing;
Vue's reactive interaction technology is used to present central-level, province/autonomous region/municipality-level, and city/prefecture/autonomous prefecture-level documents on one screen. When a user searches with a given retrieval condition, the documents of each of the three levels are filtered by that condition at the same time, and the filtered results for every level are presented on one screen.
The method further includes a step of displaying tax law document content in split screens, comprising the following sub-steps:
S101: When the first tax law document link is opened, a split-screen area is created on the basis of the document list, and the content of the document is displayed in it as the first pane, starting the split-screen display;
S102: When a second tax law document link is opened, the system first checks whether that document is already open. If so, it reports that the document is already open in the first pane. If not, the split-screen area is divided further, and the content of the second document is displayed as the second pane, side by side with the first pane in the split-screen area;
S103: When a third tax law document link is opened, the system first checks whether that document is already open. If so, it reports which pane the document is open in. If not, the content of the third document is displayed in the split-screen area as the third pane, the first pane is slid out of view, the split-screen area shows the second and third panes, and a scroll bar for dragging the display is generated in the split-screen area;
S104: When a fourth tax law document link is opened, the system first checks whether that document is already open. If it is, the system reports which pane it is open in and then checks whether that pane is currently slid out of view; if it is hidden, the display jumps to show that pane in the split-screen area. If the document is not open, its content is displayed in the split-screen area as the fourth pane, the second pane is slid out of view, and the split-screen area shows the third and fourth panes;
S105: And so on, until the number of documents the user has opened exceeds the system's preset maximum number of panes, at which point the system reports that the pane limit has been reached.
The split-screen display module presents several panes on one computer screen, and the split-screen technique is used in every module of the system. Using Vue swiper (slide-paging) technology, and following human reading habits and practical working needs, at most three panes are shown within one screen display area; content beyond three panes is reached by dragging the scroll bar. Hide and close buttons are provided so that the user can switch freely among the first, second, and third panes.
The method further includes a preload caching step: each tax law document is associated with its file name, document number, full-text keywords, date of writing, attachments, regulatory level, related links, and so on, and these associations are cached. When a user searches, if the retrieval condition matches a cached association, the result set is returned directly from the preload cache without fetching from the database and computing on it, so results are returned quickly, retrieval is smoother and more efficient, and the system runs more stably.
The method further includes a dynamic caching step: hot result data for the different combinations of retrieval conditions that users submit (regulatory level, date of writing, keywords, and so on) are cached automatically, associating each combination of conditions with its results. When another user later submits a combination that matches a cached one, the results are served from the cached hot data first, with no repeated computation, so retrieval is smoother and more efficient and the system runs more stably.
The method further includes a step of automatically upgrading document retrieval, comprising the following sub-steps:
S201: Collect tax law documents and build a tax law keyword lexicon at multiple levels and along multiple dimensions;
S202: Using Jieba associative word segmentation (which can automatically form words according to the context of a phrase so that they match the actual meaning), segment all tax law documents into words to form a first segmentation lexicon;
S203: Merge the keyword lexicon with the first segmentation lexicon to obtain a second segmentation lexicon;
S204: Using Jieba's custom dictionary technique, give the keyword lexicon priority within the second segmentation lexicon, and split the second segmentation lexicon into a keyword lexicon and a non-keyword lexicon;
S205: Using a Bloom filter to distinguish keywords from non-keywords quickly, form a two-round combined mode for user searches. The user enters the query only once, and the retrieval module automatically handles it as two rounds of retrieval, as follows:
Using concurrent retrieval with multiple goroutines in Go, the retrieval module segments the query with the keyword lexicon and the non-keyword lexicon at the same time, producing several primary search terms from the keyword lexicon and several secondary search terms from the non-keyword lexicon;
In the first round, using sequence combination and merging, the different combinations formed from the primary search terms are matched against the file names and body text of the tax law documents, and the results are presented in descending order of match degree;
If the first round does not meet the user's needs, the user can press the "Continue search" button in the interface. For this search only, the retrieval module promotes the secondary search terms to primary search terms, searches again under the first-round rules, and presents the results. If the second round meets the user's needs, the retrieval module automatically records the secondary search terms used in it.
S206: Using intelligent retrieval recognition (search results are analyzed by document importance and a weight is computed for non-keyword search results, where more results give a larger weight, so that non-keywords to be added are identified automatically), the system periodically computes the term frequency and word vectors, within the tax law documents, of the secondary search terms (that is, non-keywords in the segmentation lexicon) that satisfied users in second-round searches, and dynamically promotes them into the keyword lexicon.
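Steps S205 and S206 mention a Bloom filter for telling keywords from non-keywords and a periodic promotion of secondary terms into the keyword lexicon. The sketch below shows one possible shape for both. The `BloomFilter` class is a generic textbook construction, not the one used by the applicant; the `promote` function, the usage counts, and the threshold are hypothetical, and a plain usage count stands in for the weight that the description derives from term frequency and word vectors.

```python
import hashlib


class BloomFilter:
    """Probabilistic set: no false negatives, occasional false positives."""

    def __init__(self, size=1024, hashes=3):
        self.size = size
        self.hashes = hashes
        self.bits = bytearray(size)

    def _positions(self, item):
        # Derive several bit positions by salting one hash function.
        for seed in range(self.hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.size

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos] = 1

    def might_contain(self, item):
        return all(self.bits[pos] for pos in self._positions(item))


def promote(keyword_filter, keyword_set, usage_counts, threshold):
    """Move frequently used secondary terms into the keyword lexicon."""
    promoted = sorted(
        term for term, count in usage_counts.items()
        if count >= threshold and term not in keyword_set
    )
    for term in promoted:
        keyword_set.add(term)
        keyword_filter.add(term)
    return promoted


keyword_set = {"vat", "invoice"}
keyword_filter = BloomFilter()
for word in keyword_set:
    keyword_filter.add(word)

# Hypothetical counts of secondary terms recorded from second-round searches.
usage_counts = {"refund": 5, "electronic": 1}
promoted = promote(keyword_filter, keyword_set, usage_counts, threshold=3)
```

Because a Bloom filter can report false positives, a positive answer would normally be confirmed against the exact lexicon; a negative answer is definitive and lets most non-keywords be rejected cheaply.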
Practical case:
User A is located in Chengdu. When handling a value-added tax (VAT) matter, User A needs tax law documents at three levels of legal effect at once: central, Sichuan provincial, and Chengdu municipal. After User A logs in, the latest-document list under "Policies and Regulations" presents documents of the three levels "Central, Sichuan, Chengdu" on one screen. By then selecting the category "VAT", User A retrieves with one click, and sees on one screen, the VAT-related documents of the central, Sichuan provincial, and Chengdu municipal levels. User A can switch among the related documents of the three levels through the tab bar and, combined with the split-screen display, relate, compare, and apply documents of different levels of legal effect.
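The Chengdu case can be traced with a small sketch that resolves a located city to its three applicable levels and builds one tab per level. The `REGIONS` lookup table, the function names, the record fields, and the sample documents are all assumptions made for this illustration.

```python
# Hypothetical lookup from a located city to its province.
REGIONS = {"Chengdu": "Sichuan"}


def applicable_levels(city):
    """Return the issuers whose documents apply at the user's location."""
    province = REGIONS.get(city)
    if province is None:
        return ["Central"]  # unknown location: only central-level documents
    return ["Central", province, city]


def latest_documents(documents, city, category=None):
    """Build one tab per applicable level, newest documents first."""
    tabs = {}
    for issuer in applicable_levels(city):
        hits = [
            d for d in documents
            if d["issuer"] == issuer
            and (category is None or d["category"] == category)
        ]
        # ISO dates sort correctly as strings.
        hits.sort(key=lambda d: d["written"], reverse=True)
        tabs[issuer] = [d["title"] for d in hits]
    return tabs


# Made-up documents for illustration only.
documents = [
    {"title": "Central VAT notice", "issuer": "Central", "category": "VAT", "written": "2022-01-10"},
    {"title": "Central income tax notice", "issuer": "Central", "category": "Income tax", "written": "2022-02-01"},
    {"title": "Sichuan VAT guidance", "issuer": "Sichuan", "category": "VAT", "written": "2021-07-05"},
    {"title": "Chengdu VAT circular (old)", "issuer": "Chengdu", "category": "VAT", "written": "2020-03-03"},
    {"title": "Chengdu VAT circular (new)", "issuer": "Chengdu", "category": "VAT", "written": "2021-11-20"},
]
tabs = latest_documents(documents, "Chengdu", category="VAT")
```

Selecting the "VAT" category corresponds to passing `category="VAT"`; leaving it out gives the unfiltered latest-document list for the same three tabs.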
通过法规层级与时间范围和关键字的搭配使用,结合响应式交互与分屏显示,可不用反复启动搜索引擎模块,避免操作界面反复跳转与低效检索,实现文件检索、应用的高效率。Through the combination of regulatory level, time range and keywords, combined with responsive interaction and split-screen display, it is not necessary to repeatedly start the search engine module, avoid repeated jumps and inefficient retrieval of the operation interface, and achieve high efficiency in file retrieval and application.
The foregoing describes only preferred embodiments of the present invention. It should be understood that the invention is not limited to the forms disclosed herein, and that this description should not be regarded as excluding other embodiments. The invention may be used in various other combinations, modifications, and environments, and may be altered within the scope of the concepts described herein on the basis of the above teachings or of skill or knowledge in the relevant field. Modifications and changes made by those skilled in the art that do not depart from the spirit and scope of the invention shall all fall within the protection scope of the appended claims.
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210269052.3A CN114817378A (en) | 2022-03-18 | 2022-03-18 | Tax file search engine-based retrieval method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210269052.3A CN114817378A (en) | 2022-03-18 | 2022-03-18 | Tax file search engine-based retrieval method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114817378A (en) | 2022-07-29 |
Family
ID=82531463
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210269052.3A Pending CN114817378A (en) | 2022-03-18 | 2022-03-18 | Tax file search engine-based retrieval method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114817378A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050114324A1 (en) * | 2003-09-14 | 2005-05-26 | Yaron Mayer | System and method for improved searching on the internet or similar networks and especially improved MetaNews and/or improved automatically generated newspapers |
CN101853252A (en) * | 2009-04-02 | 2010-10-06 | 深圳市辰飞信息技术有限公司 | Legal searching method and legal searching system |
CN105760489A (en) * | 2016-02-17 | 2016-07-13 | 国家电网公司 | Electronic document establishing and displaying method and device for electric power business expansion project review |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||