
CN112131459A - Intellectual property information retrieval software management system and method based on big data - Google Patents

Intellectual property information retrieval software management system and method based on big data

Info

Publication number
CN112131459A
Authority
CN
China
Prior art keywords
retrieval
user
value
module
time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010789517.9A
Other languages
Chinese (zh)
Other versions
CN112131459B (en)
Inventor
曾素梅
黄鹏
易露霞
唐小梦
王月甜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Kewo Technology Service Co ltd
Original Assignee
Guangzhou College of Technology and Business
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou College of Technology and Business
Priority to CN202010789517.9A
Publication of CN112131459A
Application granted
Publication of CN112131459B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2458 Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462 Approximate or statistical queries
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/248 Presentation of query results
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/958 Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10 Services
    • G06Q50/18 Legal services
    • G06Q50/184 Intellectual property management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Tourism & Hospitality (AREA)
  • Computational Linguistics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Technology Law (AREA)
  • Health & Medical Sciences (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Business, Economics & Management (AREA)
  • Operations Research (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses an intellectual property information retrieval software management system and method based on big data, comprising a data acquisition module, several user terminals, a browsing module, a user behavior analysis module, a server, a retrieval module, a data acquisition module, a database and an evaluation module. The retrieval module is used by the user terminal to issue retrieval information and send it to the server. The data acquisition module collects the basic information of each patent and transmits it to the database for storage. The investigation module monitors, at preset intervals, the communication information between a patent's inventor and users and transmits it to the data analysis module. The data analysis module receives the click information and the communication information and, in combination with the retrieval module, performs push analysis of patents. The invention uses big-data intelligent analysis and user behavior to improve search efficiency and reduce the user's burden.

Description

Intellectual property information retrieval software management system and method based on big data

Technical Field

The invention relates to the field of information retrieval, and in particular to an intellectual property information retrieval software management system and method based on big data.

Background

With the spread of Internet applications and the arrival of the big-data era, the number of web pages worldwide grows by tens of millions every day. To find the information they need in this vast network, users have come to rely on search engines as an indispensable aid for accessing the Internet.

The document with publication number CN106503199A discloses a network-based computer information retrieval system comprising a front-end information input system and a back-end information retrieval system, both bidirectionally electrically connected through a computer center system. The front-end information input system comprises a picture input subsystem, a language input subsystem and a text input subsystem; the back-end information retrieval system comprises an information retrieval subsystem, a retrieval retrieval subsystem and a retrieval sharing subsystem. When a search is needed, three kinds of retrieval input (picture, language and text) can be entered, which overcomes the single retrieval mode of traditional retrieval systems; the retrieval sharing subsystem enables sharing of searches and remote transmission.

However, that patent presents all possible results to the user, who must pick out the needed items personally; this increases the user's burden and lowers search efficiency. Moreover, user behavior is not fully considered when the retrieved items are ranked.

Summary of the Invention

In view of the deficiencies of the prior art, the object of the present invention is to provide an intellectual property information retrieval software management system and method based on big data. The invention uses big-data intelligent analysis and user behavior to improve search efficiency and reduce the user's burden; it also produces an effective evaluation of the retrieval service system that can be reviewed later.

The object of the present invention can be achieved through the following technical solutions:

An intellectual property information retrieval software management system based on big data comprises a data acquisition module, several user terminals, a browsing module, a user behavior analysis module, a server, a retrieval module, a data acquisition module, a database and an evaluation module;

The user terminal is used to enter the user's login information and registration information. A user who already has an account logs in by entering the login information through the user terminal; a user without an account enters registration information through the user terminal to register a new account and then logs in for the first time;

The retrieval module is used by the user terminal to issue retrieval information and send it to the server; the retrieval information comprises keywords and a technical field;

The data acquisition module collects the basic information of each patent and transmits it to the database for storage; the database stores the browsing records, evaluation records, retrieval information, login information and registration information received by the server;

The access statistics module counts the click information of each patent in the database within the 10 days before the current system time and transmits the click information to the data analysis module; the investigation module monitors, at preset intervals, the communication information between a patent's inventor and users and transmits the communication information to the data analysis module;

The data analysis module receives the click information and the communication information and, in combination with the retrieval module, performs push analysis of patents. The push analysis proceeds as follows:

S11: Obtain the patents that match the keywords and technical field in the retrieval information and mark them as primary-selection patents;

S12: For the 10 days before the current system time, mark the number of times the primary-selection patent was clicked each day as Bk, the viewing time of each click as Tki, the number of comments each day as Ck, the number of forwards each day as Dk, the number of favorites each day as Ek, and the number of likes each day as Fk; k=1,2,…,10; i=1,2,…,Bk;

S13: Mark the time for which the primary-selection patent was viewed on each of those 10 days as Tk (formula image BDA0002623256400000031 is not reproduced here; by the definitions in S12 it is presumably the sum of that day's per-click viewing times, Tk = Tk1 + Tk2 + … + TkBk);

S14: Using the formula in Figure BDA0002623256400000032 (formula image not reproduced here), calculate the daily popularity (heat) value Qk of the primary-selection patent, where b1, b2, b3, r1, r2, r3 and r4 are coefficient factors;

S15: Using the mean formula, obtain the average popularity value L of the primary-selection patent over the 10 days before the current time; using the standard deviation formula, obtain the standard deviation α of its daily popularity values over those 10 days; then use the formula β=(L×η1-α×η2)(η3+η4) to calculate the sustained popularity value β of the primary-selection patent, where η1, η2, η3 and η4 are coefficient factors;

S16: Mark each service evaluation coefficient as Ko, and average the coefficients Ko to obtain the mean service evaluation K;

S17: Mark the response time taken by the inventor of the primary-selection patent to answer a user's question as J3o, where J3o=J2o-J1o, o=1,...,n; sum the response times J3o and take the average to obtain the average response time J;

S18: Mark the total number of patents under the name of the inventor of the primary-selection patent as P1, and the number of patents under that name that have already been transacted as P2;

S19: Using the formula in Figure BDA0002623256400000033 (formula image not reproduced here), calculate the reputation value R of the inventor of the primary-selection patent, where c1, c2, c3 and c4 are coefficient factors;

S20: Using the formula in Figure BDA0002623256400000041 (formula image not reproduced here), obtain the push value TS of the primary-selection patent, where d1, d2, d3, d4 and d5 are preset proportional coefficients, λ=0.00564327, and P(x) is the user's interest value for the primary-selection patent;

The data analysis module transmits the push value TS to the server; the server sorts the primary-selection patents in descending order of push value TS and sends the sorted list to the user terminal.
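The statistical parts of the push analysis above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the patent's implementation: the function names are hypothetical, Tk is assumed to be the sum of the per-click viewing times, the standard deviation is assumed to be the population form, and the expression for β is read literally as a product (the original formula image may have carried an exponent that was lost in extraction). The formulas for Qk, R and TS exist only as images in the source, so they are not implemented; the ranking function simply takes precomputed TS values.

```python
from statistics import fmean, pstdev


def daily_viewing_time(per_click_seconds):
    """T_k for one day, assumed to be the sum of the per-click times T_ki."""
    return sum(per_click_seconds)


def sustained_popularity(daily_heat, eta1, eta2, eta3, eta4):
    """beta = (L*eta1 - alpha*eta2)(eta3 + eta4), read here as a product.

    daily_heat holds the daily popularity values Q_k for the window.
    """
    mean_heat = fmean(daily_heat)   # L: average popularity over the window
    spread = pstdev(daily_heat)     # alpha: population std dev (assumption)
    return (mean_heat * eta1 - spread * eta2) * (eta3 + eta4)


def rank_by_push_value(push_values):
    """Return patent identifiers in descending order of push value TS."""
    return sorted(push_values, key=push_values.get, reverse=True)
```

Subtracting the standard deviation penalizes patents whose popularity is concentrated in a short spike, which is presumably why the source calls β a sustained value.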

Further, the browsing module is used by the user terminal to browse patent information and send the browsing record to the server. The browsing record comprises the browsing time, the duration, and the behavioral features of commenting, forwarding, favoriting and liking; the browsing time is the time at which the user opens the patent link. The user behavior analysis module receives the browsing records transmitted by the server and analyzes them; the specific steps are:

S41: Obtain the browsing time from the browsing record and mark it as Hx; mark the duration as Rx, the comment behavior value as S(C), the forwarding behavior value as S(D), the favorite behavior value as S(E), and the like behavior value as S(F);

S42: Obtain the current system time and mark it as TV; using the formula in Figure BDA0002623256400000042 (formula image not reproduced here), calculate the timeliness value f(x) of the record, where g1 is a coefficient factor and σ is a preset factor; the closer Hx is to TV, the larger f(x);

S43: If the user has commented on the patent, S(C)=1, otherwise S(C)=0; if the user has forwarded the patent, S(D)=1, otherwise S(D)=0; if the user has favorited the patent, S(E)=1, otherwise S(E)=0; if the user has liked the patent, S(F)=1, otherwise S(F)=0;

S44: Using the formula in Figure BDA0002623256400000043 (formula image not reproduced here), calculate the user's interest value P(x) for the patent, where g2 is a preset coefficient factor.
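As a rough illustration of S41 and S43, a browsing record and its four binary behavior values might be represented as follows. The class and field names are hypothetical, chosen only for this sketch. The timeliness value f(x) and the interest value P(x) are not computed, because their formulas survive only as images in the source.

```python
from dataclasses import dataclass


@dataclass
class BrowsingRecord:
    """One browsing record; field names are illustrative assumptions."""
    opened_at: float          # Hx: time the user opened the patent link
    duration: float           # Rx: how long the patent was viewed
    commented: bool = False
    forwarded: bool = False
    favorited: bool = False
    liked: bool = False


def behavior_values(record):
    """Map each behavior to 1 if it occurred and 0 otherwise (step S43)."""
    return {
        "S(C)": int(record.commented),
        "S(D)": int(record.forwarded),
        "S(E)": int(record.favorited),
        "S(F)": int(record.liked),
    }
```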

Further, when the server receives the retrieval information transmitted by the retrieval module, it automatically drives the timing module to start timing. When the server returns the retrieval results to the user terminal, a retrieval signal is transmitted to the server through the retrieval module; when the browser is closed, a resolution signal is transmitted to the server through the retrieval module. On receiving the response signal and the resolution signal, the server drives the timing module to record the retrieval time and the resolution time respectively. The server marks the retrieval time as RT1 and the resolution time as RT2 and transmits both to the evaluation module;
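The timing behavior described above could be sketched as a small timer object. This is a hedged illustration: the class and method names are hypothetical, and it assumes that both RT1 and RT2 are measured from the moment the retrieval information is received, which the source does not state explicitly. The clock is injectable so the behavior can be checked without waiting in real time.

```python
import time


class RetrievalTimer:
    """Records retrieval time RT1 and resolution time RT2 for one search.

    Assumption: both are elapsed times since the retrieval information
    arrived at the server.
    """

    def __init__(self, clock=time.monotonic):
        self._clock = clock
        self._start = None
        self.rt1 = None   # retrieval time
        self.rt2 = None   # resolution time

    def on_retrieval_info(self):
        """Server received retrieval information: start timing."""
        self._start = self._clock()

    def on_results_returned(self):
        """Results were returned to the user terminal: record RT1."""
        self.rt1 = self._clock() - self._start

    def on_browser_closed(self):
        """Browser was closed (resolution signal): record RT2."""
        self.rt2 = self._clock() - self._start
```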

The evaluation module is used by the user to evaluate the patent retrieval service. The evaluation rule is: score the retrieval service, with a full score of 100 points. The evaluation module works as follows:

S51: Mark the service score as Qs; obtain the number of patents the user browsed during the whole retrieval process and mark it as Cs;

S52: Assign weights according to the importance that users in the big data attach to the service score Qs, the number of patents browsed Cs, the retrieval time RT1 and the resolution time RT2;

The service score Qs is assigned weight D1, the number of patents browsed Cs weight D2, the retrieval time RT1 weight D3 and the resolution time RT2 weight D4, with D1+D2+D3+D4=1 and D1>D2>D3>D4;

S53: Using the formula in Figure BDA0002623256400000051 (formula image not reproduced here), calculate the user's retrieval satisfaction value QR.
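The two constraints on the weights in S52 are easy to check mechanically. The helper below is a hypothetical sketch that only validates a candidate set of weights; it does not compute QR, since the QR formula is available only as an image in the source.

```python
import math


def validate_weights(d1, d2, d3, d4):
    """Check D1+D2+D3+D4 = 1 and D1 > D2 > D3 > D4, as required in S52."""
    sums_to_one = math.isclose(d1 + d2 + d3 + d4, 1.0, abs_tol=1e-9)
    strictly_decreasing = d1 > d2 > d3 > d4
    return sums_to_one and strictly_decreasing
```

For example, the weights 0.4, 0.3, 0.2 and 0.1 satisfy both conditions, while four equal weights of 0.25 fail the ordering condition. The sum is compared with a tolerance because floating-point addition of such values rarely yields exactly 1.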

Further, the evaluation module transmits the retrieval satisfaction value QR to the server; the server timestamps the retrieval satisfaction value QR, stores it in the storage module, and transmits it to the display module for real-time display. The basic information of a patent comprises the inventor, the invention type, the technical field and the title. The click information comprises the number of clicks, the viewing time of each click, and the behavioral features of commenting, forwarding, favoriting and liking. The communication information comprises the time J1x at which a user asks a question, the time J2x at which the inventor answers it, the service evaluation coefficient, the total number of patents under the inventor's name and the number of those patents already transacted; the rule for the service evaluation coefficient is: score the inventor's service, with a full score of 100 points.

Further, the intellectual property information retrieval method based on big data comprises the following steps:

Step 1: Users register and log in through the several user terminals, browse and view patents, and then issue retrieval information;

Step 2: The data analysis module receives the click information and the communication information and, in combination with the retrieval information, performs push analysis of patents, comprising:

X11: Obtain the patents that match the keywords and technical field in the retrieval information and mark them as primary-selection patents;

X12: For the 10 days before the current system time, mark the number of times the primary-selection patent was clicked each day as Bk, the viewing time of each click as Tki, the number of comments each day as Ck, the number of forwards each day as Dk, the number of favorites each day as Ek, and the number of likes each day as Fk;

X13: Mark the time for which the primary-selection patent was viewed on each of those 10 days as Tk (formula image BDA0002623256400000061 is not reproduced here; by the definitions in X12 it is presumably the sum of that day's per-click viewing times Tki);

X14: Using the formula in Figure BDA0002623256400000062 (formula image not reproduced here), calculate the daily popularity (heat) value Qk of the primary-selection patent;

X15: Using the mean formula, obtain the average popularity value L of the primary-selection patent over the 10 days before the current time; using the standard deviation formula, obtain the standard deviation α of its daily popularity values over those 10 days; then use the formula β=(L×η1-α×η2)(η3+η4) to calculate the sustained popularity value β of the primary-selection patent;

X16: Mark each service evaluation coefficient as Ko, and average the coefficients Ko to obtain the mean service evaluation K;

X17: Mark the response time taken by the inventor of the primary-selection patent to answer a user's question as J3o, where J3o=J2o-J1o; sum the response times J3o and take the average to obtain the average response time J;

X18: Mark the total number of patents under the name of the inventor of the primary-selection patent as P1, and the number of patents under that name that have already been transacted as P2;

X19: Using the formula in Figure BDA0002623256400000071 (formula image not reproduced here), calculate the reputation value R of the inventor of the primary-selection patent;

X20: Using the formula in Figure BDA0002623256400000072 (formula image not reproduced here), obtain the push value TS of the primary-selection patent;
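The inputs to the inventor reputation value in X16 and X17 are plain averages, which can be sketched as below. The function names and the representation of each question as a (J1o, J2o) pair are assumptions made for illustration. The reputation formula itself (X19) is an image in the source and is not reproduced.

```python
from statistics import fmean


def average_response_time(questions):
    """J: mean of J3o = J2o - J1o over (asked_at, answered_at) pairs."""
    return fmean(answered - asked for asked, answered in questions)


def mean_service_rating(ratings):
    """K: mean of the service evaluation coefficients Ko (scores out of 100)."""
    return fmean(ratings)
```

Both helpers raise `statistics.StatisticsError` on empty input, so a caller would need to decide how to treat an inventor who has not yet received any questions or ratings; the source does not cover that case.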

Step 3: Sort the patents in descending order of push value TS and send the sorted patents to the user terminal;

Step 4: The user terminal browses patent information through the browsing module and sends the browsing record to the server; the user behavior analysis module receives the browsing records transmitted by the server and analyzes them to obtain the user's interest value P(x) for the patent. The specific steps are:

X31: Obtain the browsing time from the browsing record and mark it as Hx; mark the duration as Rx, the comment behavior value as S(C), the forwarding behavior value as S(D), the favorite behavior value as S(E), and the like behavior value as S(F);

X32: Obtain the current system time and mark it as TV; using the formula in Figure BDA0002623256400000073 (formula image not reproduced here), calculate the timeliness value f(x) of the record, where g1 is a coefficient factor and σ is a preset factor; the closer Hx is to TV, the larger f(x);

X33: If the user has commented on the patent, S(C)=1, otherwise S(C)=0; if the user has forwarded the patent, S(D)=1, otherwise S(D)=0; if the user has favorited the patent, S(E)=1, otherwise S(E)=0; if the user has liked the patent, S(F)=1, otherwise S(F)=0;

X34: Using the formula in Figure BDA0002623256400000074 (formula image not reproduced here), calculate the user's interest value P(x) for the patent;

Step 5: After the retrieval is complete, the user evaluates the patent retrieval service through the evaluation module, comprising:

X41: Mark the service score as Qs; obtain the number of patents the user browsed during the whole retrieval process and mark it as Cs;

X42: Assign weights according to the importance that users in the big data attach to the service score Qs, the number of patents browsed Cs, the retrieval time RT1 and the resolution time RT2;

The service score Qs is assigned weight D1, the number of patents browsed Cs weight D2, the retrieval time RT1 weight D3 and the resolution time RT2 weight D4, with D1+D2+D3+D4=1 and D1>D2>D3>D4;

X43: Using the formula in Figure BDA0002623256400000081 (formula image not reproduced here), calculate the user's retrieval satisfaction value QR;

Step 6: The server timestamps the retrieval satisfaction value QR, stores it in the storage module, and transmits it to the display module for real-time display.
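Step 6 amounts to attaching a timestamp to each QR value before storing it. A minimal in-memory sketch follows; the names are hypothetical, and the use of UTC and ISO 8601 strings is an assumption, since the source does not specify a timestamp format or a storage backend.

```python
from datetime import datetime, timezone


def stamp_satisfaction(qr, now=None):
    """Attach a timestamp to a retrieval satisfaction value QR."""
    moment = now if now is not None else datetime.now(timezone.utc)
    return {"qr": qr, "timestamp": moment.isoformat()}


class SatisfactionStore:
    """Stand-in for the storage module: keeps records in a list."""

    def __init__(self):
        self.records = []

    def save(self, qr, now=None):
        """Timestamp a QR value, keep it, and return the stored record."""
        record = stamp_satisfaction(qr, now)
        self.records.append(record)
        return record

    def latest(self):
        """Most recent record, e.g. for the display module; None if empty."""
        return self.records[-1] if self.records else None
```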

The beneficial effects of the present invention are:

(1) The access statistics module counts the click information of each patent in the database within the 10 days before the current system time, and the investigation module monitors, at preset intervals, the communication information between a patent's inventor and users. The data analysis module receives the click information and the communication information and, in combination with the retrieval module, performs push analysis of patents: it first obtains the patents that match the keywords and technical field in the retrieval information and marks them as primary-selection patents, then derives the sustained popularity value β of each primary-selection patent with the relevant algorithms, and obtains the reputation value R of its inventor from the communication information. The push value TS of the primary-selection patent is then obtained with the formula in Figure BDA0002623256400000082 (formula image not reproduced here). The server sorts the primary-selection patents in descending order of push value TS and sends the sorted list to the user terminal, using big-data intelligent analysis to improve retrieval efficiency;

(2) Patent information is browsed through the browsing module, and the browsing record is sent to the server; the user behavior analysis module receives the browsing records transmitted by the server and analyzes them. The browsing time in the browsing record is marked as Hx, the duration as Rx, the comment behavior value as S(C), the forwarding behavior value as S(D), the favorite behavior value as S(E), and the like behavior value as S(F). The user's interest value P(x) for the patent is calculated with the formula in Figure BDA0002623256400000091 (formula image not reproduced here); combining it with the sustained popularity value β and the inventor's reputation value R, the push value TS of the primary-selection patent is obtained with the formula in Figure BDA0002623256400000092 (formula image not reproduced here). This makes the push results more accurate and improves retrieval efficiency;

(3) The patent retrieval service is evaluated through the evaluation module. Weights are assigned according to the importance that users in the big data attach to the service score Qs, the number of patents browsed Cs, the retrieval time RT1 and the resolution time RT2, and the user's retrieval satisfaction value QR is calculated with the formula in Figure BDA0002623256400000093 (formula image not reproduced here). The evaluation module transmits the retrieval satisfaction value QR to the server; the server timestamps it, stores it in the storage module, and transmits it to the display module for real-time display. The invention thus produces an effective evaluation of the retrieval service system that can be reviewed later.

Description of drawings

To facilitate understanding by those skilled in the art, the present invention is further described below with reference to the accompanying drawings.

FIG. 1 is a system block diagram of the present invention.

Detailed description

The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by those of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the protection scope of the present invention.

As shown in FIG. 1, the intellectual property information retrieval software management system and method based on big data includes a data acquisition module, several user terminals, a browsing module, a server, a retrieval module, a data acquisition module, a database and an evaluation module;

The user terminal is used to enter the user's login information and registration information. A user who already has an account logs in by entering the login information through the user terminal; a user without an account enters registration information through the user terminal to register a new account and then logs in for the first time;

The retrieval module is used by the user terminal to issue retrieval information and send it to the server; the retrieval information includes keywords and a technical field;

The data acquisition module collects the basic information of each patent and transmits it to the database for storage. The basic information of a patent includes the inventor, the type of invention, the technical field and the name. The database stores the browsing records, evaluation records, retrieval information, login information and registration information received by the server;

The access statistics module counts the click information of each patent in the database within the 10 days before the current system time and transmits the click information to the data analysis module. The click information includes the number of clicks, the viewing time of each click, and the behavioral characteristics of commenting, forwarding, favoriting and liking. The investigation module monitors, at preset intervals, the communication information between the inventor of a patent and users and transmits it to the data analysis module. The communication information includes the time J1x at which a user asks a question, the time J2x at which the inventor answers the question, the service evaluation coefficient, the total number of patents under the inventor's name, and the number of patents under the inventor's name that have been traded. The service evaluation coefficient is a score given to the inventor's service, with a full score of 100 points;

The data analysis module receives the click information and the communication information and, in combination with the retrieval module, performs push analysis of patents. The push analysis proceeds as follows:

S11: Obtain the patents that match the keywords and technical field in the retrieval information and mark them as primary patents;

S12: For the 10 days before the current system time, mark the number of times the primary patent is clicked each day as Bk, the viewing time of each click as Tki, the number of comments each day as Ck, the number of forwards each day as Dk, the number of favorites each day as Ek, and the number of likes each day as Fk; k=1, 2, …, 10; i=1, 2, …, Bk;

S13: Mark the total time for which the primary patent is viewed each day within the 10 days before the current system time as Tk;

Figure BDA0002623256400000111
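
The formula for Tk survives here only as an image reference. A minimal sketch of step S13 follows, under the assumption (suggested by the definitions of Tki and Bk in S12, but not confirmed by the text) that Tk is the sum of the per-click viewing times Tki over the Bk clicks of day k. The function name and the sample durations are hypothetical.

```python
from typing import Sequence


def daily_view_time(click_durations: Sequence[float]) -> float:
    """Return Tk for one day.

    Assumption: Tk is the sum of the per-click viewing times Tki
    (i = 1..Bk) recorded on day k. The patent's own formula is only
    available as an image, so this is an inferred reading.
    """
    return float(sum(click_durations))


# Hypothetical viewing times in seconds; each inner list holds the Tki of one day.
tki_by_day = [[30.0, 45.5], [], [12.0, 8.0, 20.0]]
tk_by_day = [daily_view_time(day) for day in tki_by_day]
print(tk_by_day)
```

A day with no clicks (Bk=0) yields Tk=0 under this reading.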

S14: Using the formula

Figure BDA0002623256400000112
calculate the daily heat value Qk of the primary patent, where b1, b2, b3, r1, r2, r3 and r4 are coefficient factors;

S15: Calculate the average heat value L of the primary patent over the 10 days before the current time using the mean formula, and the standard deviation α of its daily heat values over the same 10 days using the standard deviation formula; then use the formula β=(L×η1-α×η2)(η3+η4) to calculate the continuous heat value β of the primary patent, where η1, η2, η3 and η4 are coefficient factors;
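
Step S15 can be sketched as follows. The formula β=(L×η1-α×η2)(η3+η4) is taken from the text; everything else is an assumption made for illustration: the text does not say whether α is the population or the sample standard deviation (the population form is used here), and the coefficient values and daily heat values are hypothetical.

```python
from statistics import mean, pstdev
from typing import Sequence


def sustained_heat(daily_heat: Sequence[float],
                   eta1: float, eta2: float,
                   eta3: float, eta4: float) -> float:
    """Continuous heat value: beta = (L*eta1 - alpha*eta2) * (eta3 + eta4).

    L is the mean of the daily heat values Qk and alpha their standard
    deviation. Assumption: alpha is the population standard deviation;
    the source only says "standard deviation formula".
    """
    avg = mean(daily_heat)      # L
    alpha = pstdev(daily_heat)  # alpha (population form, assumed)
    return (avg * eta1 - alpha * eta2) * (eta3 + eta4)


# Hypothetical Qk for k = 1..10 and hypothetical coefficient factors.
qk = [8.0, 12.0] * 5
beta = sustained_heat(qk, eta1=1.0, eta2=1.0, eta3=0.5, eta4=0.5)
print(beta)
```

Because α is subtracted, two patents with the same average heat are separated by how steady that heat is: the one whose daily values fluctuate less receives the larger β.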

S16: Mark each service evaluation coefficient as Ko and average the coefficients Ko to obtain the service evaluation mean K;

S17: Mark the response time of the primary patent's inventor in answering a user's question as J3o, where J3o=J2o-J1o and o=1, …, n; sum the response times J3o and take the average to obtain the average response time J;
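
Steps S16 and S17 are plain averages and can be sketched as below. The representation of J1o and J2o as `datetime` values, the choice of seconds as the unit of J, and the sample data are assumptions; the text does not specify a time unit.

```python
from datetime import datetime
from statistics import mean
from typing import Sequence, Tuple


def service_evaluation_mean(scores: Sequence[float]) -> float:
    """K: mean of the service evaluation coefficients Ko (each scored out of 100)."""
    return mean(scores)


def average_response_seconds(exchanges: Sequence[Tuple[datetime, datetime]]) -> float:
    """J: mean of J3o = J2o - J1o over all n exchanges.

    Each pair is (J1o, J2o): when the user asked and when the inventor
    answered. Seconds are an assumed unit.
    """
    if not exchanges:
        raise ValueError("at least one question/answer pair is required")
    total = sum((answered - asked).total_seconds() for asked, answered in exchanges)
    return total / len(exchanges)


# Hypothetical exchanges: answered after 30 minutes and after 60 minutes.
exchanges = [
    (datetime(2020, 8, 1, 10, 0), datetime(2020, 8, 1, 10, 30)),
    (datetime(2020, 8, 2, 12, 0), datetime(2020, 8, 2, 13, 0)),
]
j = average_response_seconds(exchanges)
k = service_evaluation_mean([90.0, 80.0])
print(j, k)
```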

S18: Mark the total number of patents under the name of the primary patent's inventor as P1, and the number of patents under that inventor's name that have been traded as P2;

S19: Using the formula

Figure BDA0002623256400000113
calculate the reputation value R of the primary patent's inventor, where c1, c2, c3 and c4 are coefficient factors;

S20: Using the formula

Figure BDA0002623256400000114
obtain the push value TS of the primary patent, where d1, d2, d3, d4 and d5 are preset proportional coefficients, λ=0.00564327, and P(x) is the user's interest value for the primary patent;

The data analysis module transmits the push value TS to the server; the server sorts the primary patents in descending order of TS and sends the sorted primary patents to the user terminal;
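
The final ranking step can be sketched as follows. The push value TS is treated as already computed, since its formula is only available as an image; the `ScoredPatent` type, its field names and the sample identifiers are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ScoredPatent:
    patent_id: str     # hypothetical identifier field
    push_value: float  # TS, assumed to be computed upstream per S20


def rank_by_push_value(patents: List[ScoredPatent]) -> List[ScoredPatent]:
    """Return the primary patents in descending order of push value TS.

    sorted() is stable, so patents with equal TS keep their input order.
    """
    return sorted(patents, key=lambda p: p.push_value, reverse=True)


# Hypothetical primary patents with hypothetical TS values.
candidates = [
    ScoredPatent("A", 0.42),
    ScoredPatent("B", 0.91),
    ScoredPatent("C", 0.42),
]
ranked_ids = [p.patent_id for p in rank_by_push_value(candidates)]
print(ranked_ids)
```

The text does not say how ties in TS are broken; keeping the input order is one simple choice.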

The browsing module is used by the user terminal to browse patent information and to send browsing records to the server. A browsing record includes the browsing time, the duration, and the behavioral characteristics of commenting, forwarding, favoriting and liking; the browsing time is the time at which the user opens the patent link. The user behavior analysis module receives the browsing records transmitted by the server and analyzes them. The specific steps are:

S41: Obtain the browsing time from the browsing record and mark it as Hx; mark the duration as Rx, the comment behavior value as S(C), the forwarding behavior value as S(D), the favorite behavior value as S(E), and the like behavior value as S(F);

S42: Obtain the current system time and mark it as TV; using the formula

Figure BDA0002623256400000121
calculate the timeliness value f(x) of the record, where g1 is a coefficient factor and σ is a preset factor; the closer Hx is to TV, the larger f(x) is;

S43: If the user has commented on the patent, S(C)=1, otherwise S(C)=0; if the user has forwarded the patent, S(D)=1, otherwise S(D)=0; if the user has favorited the patent, S(E)=1, otherwise S(E)=0; if the user has liked the patent, S(F)=1, otherwise S(F)=0;
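
The 0/1 encoding of step S43 can be sketched as below. The `BrowsingRecord` type and its field names are hypothetical; only the mapping from "behavior occurred" to 1 and "did not occur" to 0 comes from the text.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class BrowsingRecord:
    # Hypothetical fields describing what the user did on one patent page.
    commented: bool
    forwarded: bool
    favorited: bool
    liked: bool


def behavior_values(record: BrowsingRecord) -> Dict[str, int]:
    """Map each behavior to 1 if it occurred and 0 otherwise, as in S43."""
    return {
        "S(C)": int(record.commented),
        "S(D)": int(record.forwarded),
        "S(E)": int(record.favorited),
        "S(F)": int(record.liked),
    }


# Hypothetical record: the user commented on and liked the patent.
values = behavior_values(
    BrowsingRecord(commented=True, forwarded=False, favorited=False, liked=True)
)
print(values)
```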

S44: Using the formula

Figure BDA0002623256400000122
calculate the user's interest value P(x) for the patent, where g2 is a preset coefficient factor;

When the server receives the retrieval information transmitted by the retrieval module, it automatically drives the timing module to start timing. When the server returns the retrieval results to the user terminal, a retrieval signal is transmitted to the server through the retrieval module; when the browser is closed, a resolution signal is transmitted to the server through the retrieval module. On receiving the response signal and the resolution signal, the server drives the timing module to record the retrieval time and the resolution time respectively. The server marks the retrieval time as RT1 and the resolution time as RT2 and transmits both to the evaluation module;

The evaluation module is used by the user to evaluate the patent retrieval service. The evaluation rule is to score the retrieval service, with a full score of 100 points. The evaluation module works as follows:

S51: Mark the service score as Qs; obtain the number of patents browsed by the user during the entire retrieval process and mark it as Cs;

S52: Assign weights according to the importance that users in the big data attach to the service score Qs, the number of browsed patents Cs, the retrieval time RT1 and the resolution time RT2;

The service score Qs is assigned weight D1, the number of browsed patents Cs weight D2, the retrieval time RT1 weight D3, and the resolution time RT2 weight D4, with D1+D2+D3+D4=1 and D1>D2>D3>D4;
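
The two constraints on the weights (they sum to 1 and are strictly decreasing) can be checked before QR is computed. The sketch below is only a validation helper; the function name, the floating-point tolerance and the sample weights are assumptions, and the QR formula itself is not reproduced because it is only available as an image.

```python
import math
from typing import Sequence


def validate_weights(weights: Sequence[float]) -> None:
    """Raise ValueError unless D1+D2+D3+D4 = 1 and D1 > D2 > D3 > D4."""
    if len(weights) != 4:
        raise ValueError("exactly four weights (D1..D4) are expected")
    d1, d2, d3, d4 = weights
    # A small tolerance absorbs floating-point rounding in the sum (assumed value).
    if not math.isclose(d1 + d2 + d3 + d4, 1.0, abs_tol=1e-9):
        raise ValueError("weights must sum to 1")
    if not (d1 > d2 > d3 > d4):
        raise ValueError("weights must satisfy D1 > D2 > D3 > D4")


# Hypothetical weights for Qs, Cs, RT1 and RT2 that satisfy both constraints.
validate_weights((0.4, 0.3, 0.2, 0.1))
print("weights accepted")
```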

S53: Using the formula

Figure BDA0002623256400000131
calculate the user's retrieval satisfaction value QR;

The evaluation module transmits the retrieval satisfaction value QR to the server; the server timestamps QR, stores it in the storage module, and transmits it to the display module for real-time display.

In operation of the intellectual property information retrieval software management system and method based on big data, users register and log in through the user terminals, browse patents through the browsing module, and then issue retrieval information, which includes keywords and a technical field, through the retrieval module. Meanwhile, the data acquisition module collects the basic information of each patent and transmits it to the database for storage. The access statistics module counts the click information of each patent in the database within the 10 days before the current system time and transmits it to the data analysis module. The investigation module monitors, at preset intervals, the communication information between the inventor of a patent and users and transmits it to the data analysis module. The data analysis module receives the click information and the communication information and, in combination with the retrieval module, performs push analysis of patents. It first obtains the patents that match the keywords and technical field in the retrieval information and marks them as primary patents. For the 10 days before the current system time, the number of times the primary patent is clicked each day is marked as Bk, the viewing time of each click as Tki, the number of comments each day as Ck, the number of forwards each day as Dk, the number of favorites each day as Ek, and the number of likes each day as Fk, where

Figure BDA0002623256400000141
Using the formula
Figure BDA0002623256400000142
the daily heat value Qk of the primary patent is calculated; using the formula β=(L×η1-α×η2)(η3+η4), the continuous heat value β of the primary patent is calculated;

Each service evaluation coefficient is marked as Ko, and the coefficients Ko are averaged to obtain the service evaluation mean K. The response time of the primary patent's inventor in answering a user's question is marked as J3o, and the response times J3o are summed and averaged to obtain the average response time J. The total number of patents under the inventor's name is marked as P1, and the number of patents under the inventor's name that have been traded is marked as P2. Using the formula

Figure BDA0002623256400000143
the reputation value R of the primary patent's inventor is calculated; using the formula
Figure BDA0002623256400000144
the push value TS of the primary patent is obtained. The server sorts the primary patents in descending order of TS and sends the sorted primary patents to the user terminal;

The user terminal browses patent information through the browsing module and sends the browsing records to the server. The user behavior analysis module receives the browsing records transmitted by the server and analyzes them: the browsing time in the record is marked as Hx, the duration as Rx, the comment behavior value as S(C), the forwarding behavior value as S(D), the favorite behavior value as S(E), and the like behavior value as S(F); the current system time is obtained and marked as TV; using the formula

Figure BDA0002623256400000145
the timeliness value f(x) of the record is calculated;

Using the formula

Figure BDA0002623256400000151
the user's interest value P(x) for the patent is calculated;

The evaluation module is used by the user to evaluate the patent retrieval service. The evaluation rule is to score the retrieval service, with a full score of 100 points. The service score is first marked as Qs; the number of patents browsed by the user during the entire retrieval process is obtained and marked as Cs; weights are assigned according to the importance that users in the big data attach to the service score Qs, the number of browsed patents Cs, the retrieval time RT1 and the resolution time RT2; using the formula

Figure BDA0002623256400000152
the user's retrieval satisfaction value QR is calculated. The evaluation module transmits QR to the server; the server timestamps QR, stores it in the storage module, and transmits it to the display module for real-time display.

The above formulas were obtained by collecting a large amount of data for software simulation, with parameters set by the relevant experts, so that the formulas agree with real results.

The preferred embodiments of the present invention disclosed above are intended only to help illustrate the invention. They do not describe every detail exhaustively, nor do they limit the invention to the specific embodiments described. Obviously, many modifications and variations are possible in light of this specification. These embodiments were selected and described in order to better explain the principles and practical applications of the invention, so that those skilled in the art can understand and use it well. The invention is limited only by the claims and their full scope and equivalents.

Claims (5)

1. An intellectual property information retrieval software management system based on big data, characterized by comprising a data acquisition module, a plurality of user terminals, a browsing module, a user behavior analysis module, a server, a retrieval module, a data acquisition module, a database and an evaluation module;
the user terminal is used for entering login information and registration information of a user; a user who has an account logs in after entering the login information through the user terminal, and a user who has no account enters the registration information through the user terminal to register a new account and then logs in for the first time;
the retrieval module is used by the user terminal to issue retrieval information and send the retrieval information to the server, the retrieval information comprising keywords and a technical field;
the data acquisition module is used for collecting the basic information of each patent and transmitting it to the database for storage; the database is used for storing browsing records, evaluation records, retrieval information, login information and registration information received by the server;
the access statistics module is used for counting the click information of each patent in the database within the 10 days before the current system time and transmitting the click information to the data analysis module; the investigation module is used for monitoring, at preset intervals, the communication information between the inventor of a patent and users and transmitting the communication information to the data analysis module;
the data analysis module receives the click information and the communication information and performs push analysis of patents in combination with the retrieval module, the push analysis proceeding as follows:
S11: obtaining the patents that match the keywords and technical field in the retrieval information and marking them as primary patents;
S12: for the 10 days before the current system time, marking the number of times the primary patent is clicked each day as Bk, the viewing time of each click as Tki, the number of comments each day as Ck, the number of forwards each day as Dk, the number of favorites each day as Ek, and the number of likes each day as Fk; k=1, 2, …, 10; i=1, 2, …, Bk;
S13: marking the total time for which the primary patent is viewed each day within the 10 days before the current system time as Tk;
Figure FDA0002623256390000021
S14: using the formula
Figure FDA0002623256390000022
calculating the daily heat value Qk of the primary patent, wherein b1, b2, b3, r1, r2, r3 and r4 are coefficient factors;
S15: obtaining the average heat value L of the primary patent over the 10 days before the current time according to the mean formula; obtaining the standard deviation α of the daily heat values of the primary patent over the same 10 days according to the standard deviation formula; and using the formula
Figure FDA0002623256390000023
calculating the continuous heat value β of the primary patent, wherein η1, η2, η3 and η4 are coefficient factors;
S16: marking each service evaluation coefficient as Ko, and averaging the service evaluation coefficients Ko to obtain the service evaluation mean K;
S17: marking the response time of the primary patent's inventor in answering a user's question as J3o, wherein J3o=J2o-J1o and o=1, …, n, and summing and averaging the response times J3o to obtain the average response time J;
S18: marking the total number of patents under the name of the primary patent's inventor as P1; marking the number of patents under that inventor's name that have been traded as P2;
S19: using the formula
Figure FDA0002623256390000024
calculating the reputation value R of the primary patent's inventor, wherein c1, c2, c3 and c4 are coefficient factors;
S20: using the formula
Figure FDA0002623256390000025
obtaining the push value TS of the primary patent; wherein d1, d2, d3, d4 and d5 are preset proportional coefficients; λ=0.00564327; P(x) is the user's interest value for the primary patent;
the data analysis module transmits the push value TS to the server, and the server sorts the primary patents in descending order of the push value TS and sends the sorted primary patents to the user terminal.
2. The intellectual property information retrieval software management system based on big data as claimed in claim 1, wherein the browsing module is used by the user terminal to browse patent information and send browsing records to the server; a browsing record comprises the browsing time, the duration, and the behavioral characteristics of commenting, forwarding, favoriting and liking; the browsing time is the time at which the user opens the patent link; the user behavior analysis module is used for receiving the browsing records transmitted by the server and analyzing them; the specific steps are as follows:
S41: obtaining the browsing time from the browsing record and marking it as Hx, marking the duration as Rx, the comment behavior value as S(C), the forwarding behavior value as S(D), the favorite behavior value as S(E), and the like behavior value as S(F);
S42: obtaining the current system time, marking it as TV, and using the formula
Figure FDA0002623256390000031
calculating the timeliness value f(x) of the record; wherein g1 is a coefficient factor, and the closer Hx is to TV, the larger f(x) is; σ is a preset factor;
S43: if the user has commented on the patent, S(C)=1, otherwise S(C)=0; if the user has forwarded the patent, S(D)=1, otherwise S(D)=0; if the user has favorited the patent, S(E)=1, otherwise S(E)=0; if the user has liked the patent, S(F)=1, otherwise S(F)=0;
S44: using the formula
Figure FDA0002623256390000032
calculating the user's interest value P(x) for the patent; wherein g2 is a preset coefficient factor.
3. The intellectual property information retrieval software management system based on big data as claimed in claim 1, wherein the server automatically drives the timing module to start timing when it receives the retrieval information transmitted by the retrieval module; a retrieval signal is transmitted to the server through the retrieval module when the server returns the retrieval results to the user terminal, and a resolution signal is transmitted to the server through the retrieval module when the browser is closed; on receiving the response signal and the resolution signal, the server drives the timing module to record the retrieval time and the resolution time respectively; the server marks the retrieval time as RT1 and transmits it to the evaluation module, and marks the resolution time as RT2 and transmits it to the evaluation module;
the evaluation module is used by the user to evaluate the patent retrieval service, the evaluation rule being: the retrieval service is scored, with a full score of 100 points; the evaluation module works as follows:
S51: marking the service score as Qs; obtaining the number of patents browsed by the user during the entire retrieval process and marking it as Cs;
S52: assigning weights according to the importance that users in the big data attach to the service score Qs, the number of browsed patents Cs, the retrieval time RT1 and the resolution time RT2;
assigning weight D1 to the service score Qs; assigning weight D2 to the number of browsed patents Cs; assigning weight D3 to the retrieval time RT1 and weight D4 to the resolution time RT2; with D1+D2+D3+D4=1 and D1>D2>D3>D4;
S53: using the formula
Figure FDA0002623256390000041
calculating the user's retrieval satisfaction value QR.
4. The intellectual property information retrieval software management system based on big data as claimed in claim 1, wherein the evaluation module is used for transmitting the retrieval satisfaction value QR to the server, and the server is used for timestamping the retrieval satisfaction value QR, storing it in the storage module, and transmitting it to the display module for real-time display; the basic information of a patent comprises the inventor, the type of invention, the technical field and the name; the click information comprises the number of clicks, the viewing time of each click, and the behavioral characteristics of commenting, forwarding, favoriting and liking; the communication information comprises the time J1x at which a user asks a question, the time J2x at which the inventor answers the question, the service evaluation coefficient, the total number of patents under the inventor's name, and the number of patents under the inventor's name that have been traded; the service evaluation coefficient rule is: the inventor's service is scored, with a full score of 100 points.
5. The intellectual property information retrieval method based on the big data is characterized by comprising the following steps:
the method comprises the following steps: a user registers and logs in through a plurality of user terminals, browses and checks patents, and then releases retrieval information;
step two: the data analysis module receives click information and exchange information and performs pushing analysis on patents by combining retrieval information; the method comprises the following steps:
x11: acquiring patents which accord with keywords and technical fields in the retrieval information and marking the patents as primary-selected patents;
x12: marking the clicked times of the primary selection patent every day within 10 days before the current time of the system as Bk, marking the watching time of each click as Tki, marking the commented times every day as Ck, marking the forwarded times every day as Dk, marking the collected times every day as Ek, and marking the praised times every day as Fk;
x13: marking the time when the primary selected patent is watched every day within 10 days before the current time of the system as Tk;
Figure FDA0002623256390000051
x14: using formulas
Figure FDA0002623256390000052
Calculating to obtain a heat value Qk of the initially selected patent every day;
x15: obtaining an average heat value L of the initially selected patent within 10 days before the current time according to an average value calculation formula; obtaining the standard deviation alpha of the calorific value of the initially selected patent every day in the first 10 days according to a standard deviation calculation formulaBy the formula
Figure FDA0002623256390000053
Calculating to obtain a continuous heat value beta of the initially selected patent;
x16: marking the service evaluation coefficient as Ko, and calculating the average value of the service evaluation coefficient Ko to obtain a service evaluation mean value K;
x17: the reaction time of the primary selected patent inventor for answering the user question is marked as J3 o; j3o ═ J2o-J1 o; summing the reaction times J3o and averaging to obtain an average reaction time J;
x18: marking the total number of patents in the name of the initially selected patent inventor as P1; marking the number of patents which have been committed in the name of the initially selected patent inventor as P2;
x19: using formulas
Figure FDA0002623256390000061
Calculating to obtain a reputation value R of the initially selected patent inventor;
x20: using formulas
Figure FDA0002623256390000062
Obtaining a pushing value TS of the initially selected patent;
step three: sorting the patents in a descending order according to the TS and sending the sorted patents to the user terminal;
step four: the user terminal browses the patent information through the browsing module and sends a browsing record to the server; the user behavior analysis module receives the browsing record transmitted by the server and analyzes the browsing record; obtaining the interest value P (x) of the user to the patent; the method comprises the following specific steps:
x31: acquiring browsing time in the browsing record, marking the browsing time as Hx, marking the duration as Rx, marking the comment behavior value as S (C), marking the forwarding behavior value as S (D), marking the collection behavior value as S (E), and marking the praise behavior value as S (F);
x32: obtaining the current time of the system, marking the current time as TV, and utilizing a formula
Figure FDA0002623256390000063
Calculating to obtain the aging value f (x) of the record; where g1 is a coefficient factor, the closer Hx is to TV, the greater the value of f (x); sigma is a preset factor;
x33: if the user has a comment on the patent, s (c) is 1, otherwise s (c) is 0; if the user has forwarding to the patent, s (d) is 1, otherwise s (d) is 0; if the user has a collection for the patent, s (e) ═ 1, otherwise s (e) ═ 0, if the user has a approval for the patent, s (f) ═ 1, otherwise s (f) ═ 0;
x34: using formulas
Figure FDA0002623256390000064
Calculating to obtain a user interest value P (x) of the patent;
step five: after the retrieval is completed, the user evaluates the retrieval service of the patent through an evaluation module, which comprises the following steps:
x41: marking the service score as Qs; acquiring the number of patents browsed by the user during the whole retrieval process and marking it as Cs;
x42: assigning weights to the service score Qs, the number Cs of browsed patents, the retrieval time RT1 and the resolution time RT2 according to the importance that users attach to each in the big data;
assigning the weight D1 to the service score Qs, the weight D2 to the number Cs of browsed patents, the weight D3 to the retrieval time RT1, and the weight D4 to the resolution time RT2, where D1 + D2 + D3 + D4 = 1 and D1 > D2 > D3 > D4;
x43: using the formula
Figure FDA0002623256390000071
calculating the user's retrieval satisfaction value QR;
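The two constraints that x42 places on the weights (they sum to 1 and are strictly decreasing) can be checked with a short sketch such as the one below. The function name and the sample weights are hypothetical. The formula for QR exists only as a figure that is not reproduced in this text, so it is not implemented here.

```python
import math


def validate_weights(d1, d2, d3, d4):
    """Check the x42 constraints: D1 + D2 + D3 + D4 = 1 and D1 > D2 > D3 > D4."""
    # isclose() tolerates floating-point rounding in the sum.
    if not math.isclose(d1 + d2 + d3 + d4, 1.0, abs_tol=1e-9):
        raise ValueError("weights must sum to 1")
    if not (d1 > d2 > d3 > d4):
        raise ValueError("weights must satisfy D1 > D2 > D3 > D4")
    return (d1, d2, d3, d4)


# Made-up weights that satisfy both constraints.
weights = validate_weights(0.4, 0.3, 0.2, 0.1)
```

A weight set such as (0.25, 0.25, 0.25, 0.25) would be rejected, because the ordering is required to be strict.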
step six: the server timestamps the retrieval satisfaction value QR, stores it in a storage module, and transmits it to a display module for real-time display.
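A minimal sketch of the timestamp-and-store part of step six follows. The `SatisfactionRecord` and `InMemoryStore` classes are hypothetical stand-ins for the storage module, the use of UTC is an assumption, and the transmission to the display module is omitted.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class SatisfactionRecord:
    qr: float             # retrieval satisfaction value QR
    stamped_at: datetime  # time at which the server stamped the value


class InMemoryStore:
    """Hypothetical stand-in for the storage module."""

    def __init__(self):
        self.records = []

    def save(self, record):
        self.records.append(record)


def stamp_and_store(qr, store, now=None):
    """Attach a timestamp to QR, save the record, and return it."""
    record = SatisfactionRecord(qr, now or datetime.now(timezone.utc))
    store.save(record)
    return record


store = InMemoryStore()
record = stamp_and_store(0.82, store)  # 0.82 is a made-up QR value
```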
CN202010789517.9A 2020-08-07 2020-08-07 Intellectual property information retrieval software management system and method based on big data Active CN112131459B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010789517.9A CN112131459B (en) 2020-08-07 2020-08-07 Intellectual property information retrieval software management system and method based on big data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010789517.9A CN112131459B (en) 2020-08-07 2020-08-07 Intellectual property information retrieval software management system and method based on big data

Publications (2)

Publication Number Publication Date
CN112131459A (en) 2020-12-25
CN112131459B (en) 2021-06-01

Family

ID=73850262

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010789517.9A Active CN112131459B (en) 2020-08-07 2020-08-07 Intellectual property information retrieval software management system and method based on big data

Country Status (1)

Country Link
CN (1) CN112131459B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112348602A (en) * 2021-01-07 2021-02-09 浙江争游网络科技有限公司 Automatic advertisement putting management system based on big data
CN113011798A (en) * 2021-05-24 2021-06-22 江苏荣泽信息科技股份有限公司 Product detection information processing system based on block chain
CN113111333A (en) * 2021-04-15 2021-07-13 广东省林业科学研究院 Remote interaction system for quick inspection platform
CN114925266A (en) * 2022-04-22 2022-08-19 北京奇艺世纪科技有限公司 Information recommendation method and device, electronic equipment and storage medium
CN118096267A (en) * 2024-04-29 2024-05-28 山东铂明网络科技有限公司 Personalized advertisement delivery system and method based on data analysis
CN118861439A (en) * 2024-09-27 2024-10-29 福建省君诺科技成果转化服务有限公司 A method for pushing information on intellectual property platform

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8010527B2 (en) * 2007-06-29 2011-08-30 Fuji Xerox Co., Ltd. System and method for recommending information resources to user based on history of user's online activity
CN102930052A (en) * 2012-11-19 2013-02-13 西北大学 Interest resource recommendation method based on multi-dimensional attribute attention
CN105630871A (en) * 2015-12-16 2016-06-01 广州神马移动信息科技有限公司 Search result display method and device as well as search system
CN109783740A (en) * 2019-01-24 2019-05-21 北京字节跳动网络技术有限公司 Pay close attention to the sort method and device of the page

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8010527B2 (en) * 2007-06-29 2011-08-30 Fuji Xerox Co., Ltd. System and method for recommending information resources to user based on history of user's online activity
CN102930052A (en) * 2012-11-19 2013-02-13 西北大学 Interest resource recommendation method based on multi-dimensional attribute attention
CN105630871A (en) * 2015-12-16 2016-06-01 广州神马移动信息科技有限公司 Search result display method and device as well as search system
CN109783740A (en) * 2019-01-24 2019-05-21 北京字节跳动网络技术有限公司 Pay close attention to the sort method and device of the page

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112348602A (en) * 2021-01-07 2021-02-09 浙江争游网络科技有限公司 Automatic advertisement putting management system based on big data
CN112348602B (en) * 2021-01-07 2021-04-06 浙江争游网络科技有限公司 Automatic advertisement putting management system based on big data
CN113111333A (en) * 2021-04-15 2021-07-13 广东省林业科学研究院 Remote interaction system for quick inspection platform
CN113111333B (en) * 2021-04-15 2022-03-04 广东省林业科学研究院 A remote interactive system for fast inspection platform
CN113011798A (en) * 2021-05-24 2021-06-22 江苏荣泽信息科技股份有限公司 Product detection information processing system based on block chain
CN113011798B (en) * 2021-05-24 2021-08-13 江苏荣泽信息科技股份有限公司 Product detection information processing system based on block chain
CN114925266A (en) * 2022-04-22 2022-08-19 北京奇艺世纪科技有限公司 Information recommendation method and device, electronic equipment and storage medium
CN114925266B (en) * 2022-04-22 2025-01-28 北京奇艺世纪科技有限公司 Information recommendation method, device, electronic device and storage medium
CN118096267A (en) * 2024-04-29 2024-05-28 山东铂明网络科技有限公司 Personalized advertisement delivery system and method based on data analysis
CN118861439A (en) * 2024-09-27 2024-10-29 福建省君诺科技成果转化服务有限公司 A method for pushing information on intellectual property platform

Also Published As

Publication number Publication date
CN112131459B (en) 2021-06-01

Similar Documents

Publication Publication Date Title
CN112131459B (en) Intellectual property information retrieval software management system and method based on big data
CN112348602B (en) Automatic advertisement putting management system based on big data
KR101993771B1 (en) Chatbot searching system and program
US10395271B2 (en) System and method for normalizing campaign data gathered from a plurality of advertising platforms
CN100440224C (en) An automatic processing method for search engine performance evaluation
TWI390421B (en) Method and apparatus for estimating the performance of an information package
CN110222267A (en) A kind of gaming platform information-pushing method, system, storage medium and equipment
CN104881738B (en) Intelligence system applied to ideology and politics teaching
CN103905486B (en) A kind of psychological health states appraisal procedure
US11042899B2 (en) System and method for tracking users across a plurality of media platforms
CN1996316A (en) Search engine searching method based on web page correlation
JP2002334104A (en) Information distribution system, information distribution server, client, information transmitting method, receiving method and program
CN112765326B (en) Method, system and application for expert recommendation in question-and-answer community
CN109543840A (en) A kind of Dynamic recommendation design method based on multidimensional classification intensified learning
CN105183925A (en) Content association recommending method and content association recommending device
JP2011154467A (en) Retrieval result ranking method and system
CN108876058A (en) A kind of media event influence force prediction method based on microblogging
CN117972257A (en) A visual website construction promotion method and promotion system
CN107977452A (en) A kind of information retrieval system and method based on big data
CN116320626A (en) Method and system for calculating live broadcast heat of electronic commerce
WO2021134944A1 (en) Mobile news client-based evaluation method and system therefor
CN114065054A (en) A method and device for pushing information
CN118395004A (en) A method and system for intelligent data collection and information push based on big data
CN113806410A (en) Service recommendation experiment system for scientific and technological service
CN118691369A (en) Internet-based office equipment marketing and promotion service system

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20230531

Address after: Unit A204/A205, 2nd Floor, Building A, Zima Technology Innovation Industrial Park, No. 18 Xinhua Street, Xincun, East District, Zhongshan City, Guangdong Province, 528400

Patentee after: Zhongshan Kewo Technology Service Co.,Ltd.

Address before: 509 Kangrui Times Square, Keyuan Business Building, 39 Huarong Road, Gaofeng Community, Dalang Street, Longhua District, Shenzhen, Guangdong Province, 518000

Patentee before: Shenzhen lizhuan Technology Transfer Center Co.,Ltd.

Effective date of registration: 20230531

Address after: 509 Kangrui Times Square, Keyuan Business Building, 39 Huarong Road, Gaofeng Community, Dalang Street, Longhua District, Shenzhen, Guangdong Province, 518000

Patentee after: Shenzhen lizhuan Technology Transfer Center Co.,Ltd.

Address before: 510800 Haibu, Shiling Town, Huadu District, Guangzhou City, Guangdong Province

Patentee before: GUANGZHOU College OF TECHNOLOGY AND BUSINESS

CP01 Change in the name or title of a patent holder

Address after: Unit A204/A205, 2nd Floor, Building A, Zima Technology Innovation Industrial Park, No. 18 Xinhua Street, Xincun, East District, Zhongshan City, Guangdong Province, 528400

Patentee after: Guangdong Kewo Technology Service Co.,Ltd.

Address before: Unit A204/A205, 2nd Floor, Building A, Zima Technology Innovation Industrial Park, No. 18 Xinhua Street, Xincun, East District, Zhongshan City, Guangdong Province, 528400

Patentee before: Zhongshan Kewo Technology Service Co.,Ltd.
