CN120386801B

CN120386801B - Data analysis system and method based on distributed caching technology

Info

Publication number: CN120386801B
Application number: CN202510779687.1A
Authority: CN
Inventors: 王陶; 温砚
Original assignee: Jiangsu Zhimeng Intelligent Technology Co ltd
Current assignee: Jiangsu Zhimeng Intelligent Technology Co ltd
Priority date: 2025-06-12
Filing date: 2025-06-12
Publication date: 2025-10-14
Anticipated expiration: 2045-06-12
Also published as: CN120386801A

Abstract

The invention discloses a data analysis system and a data analysis method based on a distributed caching technology, which relate to the technical field of data analysis and are used for positioning service nodes based on target service data to acquire service node state data; the method comprises the steps of analyzing the service node data request demand degree in real time based on the service node state data, judging the database access state, constructing data cache nodes for each service node according to the database access state judgment result, analyzing cache memory allocation based on the corresponding service node data request demand data, analyzing the real-time cache node memory utilization degree according to the service node period data reading state of the cache nodes, carrying out periodic memory dynamic regulation on the cache nodes, generating regulation and control instructions according to the cache node period memory dynamic regulation and control data and feeding back to a management port, and realizing flexible regulation and analysis on cache demands in different demand scenes in the service node data calling process.

Description

Data analysis system and method based on distributed caching technology

Technical Field

The invention relates to the field of data analysis, in particular to a data analysis system and method based on a distributed caching technology.

Background

The distributed caching technology is a technical scheme for coping with high concurrency and big data scenes, can improve system throughput and remarkably reduce data read-write delay and high concurrency requests by storing data in a plurality of server nodes in a scattered manner, but when the distribution and regulation of caching nodes are single, the caching nodes cannot be adaptively regulated and controlled according to dynamic node data request environments, the capacity regulation and node state regulation of the caching nodes cannot be carried out according to complex data change scenes, and the situation that memory resource waste is caused by unreasonable capacity resource allocation exists in most caching nodes at present is caused.

Disclosure of Invention

The invention aims to provide a data analysis system and a data analysis method based on a distributed caching technology, which are used for solving the problems in the prior art.

In order to achieve the above purpose, the present invention provides the following technical solutions:

A data analysis method based on a distributed caching technique, the method comprising the steps of:

determining target service data, positioning service nodes based on the target service data, and acquiring service node state data;

According to the judging result of the access state of the database, constructing data cache nodes for each service node, and analyzing cache memory allocation based on the data request requirement data of the corresponding service node;

Analyzing the memory utilization degree of the real-time cache node according to the data reading state of the cache node in each service node period, carrying out periodic memory dynamic regulation and control on the cache node according to the memory utilization degree of the real-time cache node, and generating a regulation and control instruction according to the periodic memory dynamic regulation and control data of the cache node and feeding back to the management port.

Further, the service data is searched on the basis of the management port to determine target service data, wherein the service data comprises service name data and service node data, and the service node data is a service node number;

and positioning the service node through the target service node data, and acquiring the state data of the corresponding service node through acquiring the request data of the corresponding service node, wherein the state data of the service node comprises the number of the request data packets and the request data packet capacity of the service node.

Further, based on the state data of each service node of the target service, a corresponding service node state set is constructed, the real-time data request demand of the target service is analyzed according to the service node state set, the real-time data request data capacity of the target service is obtained, and the calculation formula is as follows

;

P (t) is the data request demand capacity of the target service at the current time t, P is the single data packet capacity, mj is the data packet request quantity at the time t of the service node of the corresponding number j, n is the service node quantity of the target service, analysis data are analyzed based on the actual target service data request demand, the real-time access state of the database is judged and analyzed, the maximum response request data packet capacity at the single time of the database is calculated by acquiring the request data packet quantity of the maximum response at the single time of the database, the access state of the database at the current time is determined by comparing the data request demand of the target service at the single time with the maximum response request data packet capacity at the single time of the database, the access state comprises busy and idle, the specific comparison step is that the pre-warning coefficient k is determined for the maximum response request data packet capacity at the single time of the database, the response request data packet capacity at the single time is acquired, the real-time is calculated as P _e,s=k*P_max,s, the maximum response request data packet capacity at the single time of the database is calculated, the current time of the target service data request capacity P (t) is compared, and the current access state is judged as the busy state if the P (t) is more than or equal to the current state of the current access state is judged as busy;

when the access state of the database is busy, constructing cache nodes of each service node of the target service, carrying out capacity prediction analysis on each cache node based on the real-time service data request demand data of each service node to obtain the real-time capacity data of the cache node corresponding to each service node, and calculating as

;

The method comprises the steps of taking C (j, T) as a time T, numbering a capacity prediction value of a corresponding cache node of a service node, taking r as a single data packet cache capacity pre-storage coefficient, wherein the single data packet cache capacity pre-storage coefficient refers to the fact that extra storage space is needed to be carried out when a single data packet is cached and stored, preventing data storage loss, carrying out capacity prediction analysis on each cache node in an observation period through dividing the observation period T1, taking the maximum value as the storage capacity value of the corresponding service node period T1, carrying out type division and overall preparation according to request demand data in the corresponding service node period T1, respectively determining the quantity of various types of request data in the period T1, carrying out analysis on the request data quantity proportion of various types of request data in the period T1, introducing a proportion threshold according to analysis results, comparing the request data quantity proportion of various types of request data with the proportion threshold, determining high request type data and low request type data, otherwise judging the request data type corresponding to be high request type data, judging the request data type corresponding to the cache type data is the low request type data, and the low request type data is used for the first-level cache space and the second-level cache type data storage space, and the first-level cache type data is compared with the low request type data storage space.

Further, based on the capacity prediction analysis and data storage division results of the corresponding cache nodes of each service node of the target service, a cache node update period T2 is set, the calling condition analysis of the data stored by the cache nodes of each service node in the period is analyzed, the utilization degree of the primary cache space and the secondary cache space is analyzed in real time, the calling hit rate of the data stored in the primary cache space and the secondary cache space is analyzed respectively, and the calculation formula is that

;

The hit1 and hit2 correspond to the call hit rate in the data storage period T2 in the first-level storage space and the second-level storage space in the cache nodes of each service node respectively, m (1, T2) and m (1, T2) correspond to the call number in the data storage period T2 in the first-level storage space and the second-level storage space in the cache nodes of each service node respectively, and m (T2) is the call number in the data storage period T2 in the cache nodes of each service node;

Based on the data calling hit rate of the first-level storage space and the second-level storage space in the corresponding cache nodes, the memory occupation amount of the first-level storage space and the second-level storage space in the corresponding cache nodes in the period T2 is analyzed, and the calculation formula is as follows

;

Wherein C (1, j, T2) and C (2, j, T2) are respectively the period memory occupation amount of the first-level memory space and the second-level memory space in the cache nodes corresponding to the service nodes with the numbers j in the period T2, C (1, j) and C (2, j) are respectively the memory capacity of the first-level memory space and the second-level memory space in the cache nodes corresponding to the service nodes with the numbers j, S is a cache fragment rate parameter of the cache nodes, the memory update capacity Cg1 and the memory update capacity Cg2 of the second-level memory space in the cache nodes corresponding to the service nodes are analyzed, the memory occupation amount of the first-level memory space and the second-level memory space in the cache nodes corresponding to the service nodes with the numbers j is set, the memory prompt capacity Cv is set, the memory occupation amount of the first-level memory space and the second-level memory space in the cache nodes corresponding to the continuous time points in the period T2 is updated and judged, the memory capacity regulation strategy of the corresponding to the cache nodes is generated according to the judgment result, wherein when the memory occupation amount of the first-level memory space in the cache nodes corresponding to the first-level memory space is larger than the cache prompt capacity and smaller than or equal to the memory update capacity of the cache nodes in the certain period, the memory occupation amount of the first-level memory space is corresponding to the memory prompt capacity of the cache nodes corresponding to the first-level memory space, the corresponding to the first-level memory space is C (1, j, T2) is the memory capacity of the first-level memory storage space and the second-level memory space is larger than the free data is required to be the free state, and the first-level memory storage capacity is directly or the free data is judged to be the free from the first level storage and the storage capacity is calculated to be the storage and the first level storage and the storage capacity is larger than the storage and the storage is corresponding is larger and the free storage and the storage and is corresponding storage and is larger and is corresponding and a high. The memory regulation is not performed, wherein the calculation formulas of the memory update capacity Cg1 of the primary storage space and the memory update capacity Cg2 of the secondary storage space corresponding to the cache node are as follows

。

Further, the service node state data of the target service and the state data of the corresponding cache node are fed back through a visual window, wherein the cache node state data comprises capacity data of the cache node and data calling records of the response service node;

outputting the generated memory capacity regulating strategy corresponding to each cache node to a management port, and executing the memory capacity regulating strategy corresponding to each cache node.

The system comprises a node positioning module, a node analysis module, a dynamic regulation and control module and a data feedback module;

The node positioning module determines target service data, positions service nodes based on the target service data and acquires service node state data, the node analysis module analyzes service node data request demand degrees in real time based on the service node state data and analyzes data according to the service node data request demand, the node analysis module judges database access states, builds data cache nodes for the service nodes according to the database access state judgment results and analyzes cache memory allocation based on corresponding service node data request demand data, the dynamic regulation module analyzes real-time cache node memory utilization degrees according to data reading states of the cache nodes in each service node period and carries out periodic memory dynamic regulation on the cache nodes according to the real-time cache node memory utilization degrees, a regulation command is generated according to the cache node periodic memory dynamic regulation data and is fed back to the management port, and the data feedback module outputs the service node state data and the corresponding cache node state data by utilizing the visual window and executes the regulation command.

Further, the node positioning module comprises a target service determining unit and a service node positioning unit;

The target service determining unit determines target service data by searching the service data based on the management port, wherein the service data comprises service name data and service node data, and the service node data is a service node number;

The service node positioning unit positions the service node through target service node data, and acquires state data of the corresponding service node through acquiring request data of the corresponding service node, wherein the state data of the service node comprises the number of request data packets and the request data packet capacity of the service node.

Further, the node analysis module comprises a service node demand analysis unit and a cache node construction unit;

The service node demand analysis unit is used for constructing a corresponding service node status set based on each service node status data of a target service, analyzing real-time data request demands of the target service according to the service node status set to obtain real-time data request data capacity of the target service, judging and analyzing a real-time access state of a database based on actual target service data request demand analysis data, calculating the maximum response request data packet capacity of the database at a single moment by obtaining the maximum response request data packet number of the database at the single moment, and determining the access state of the database at the current moment by comparing the target service data request demand analysis data at the single moment with the maximum response request data packet capacity of the database at the single moment, wherein the access state comprises busy and idle;

The method comprises the steps of establishing a cache node for each service node of a target service when an access state of a database is busy, carrying out capacity prediction analysis on each cache node based on real-time service data request demand data of each service node to obtain cache node real-time capacity data corresponding to each service node, carrying out capacity prediction analysis on each cache node in an observation period through dividing the observation period T1 to obtain a maximum value which is a memory value of the cache node period T1 corresponding to the service node, carrying out type division according to the request demand data in the corresponding service node period T1, respectively determining the quantity of each type of request data in the period T1, carrying out analysis on the request data quantity proportion of each type of request data in the period T1, introducing a proportion threshold according to an analysis result, comparing the request data quantity proportion of each type of request data with the proportion threshold, determining high request type data and low request type data, carrying out cache space division on the cache nodes into a first-level cache space and a second-level cache space according to the comparison analysis data of each type of request data in the corresponding service nodes, wherein the first-level cache space is used for storing the high request type data and the second-level request data.

Further, the dynamic regulation and control module comprises a cache node memory analysis unit and a cache node memory regulation and control unit;

The cache node memory analysis unit sets a cache node update period T2 based on capacity prediction analysis and data storage division results of corresponding cache nodes of each service node of target service, analyzes the calling condition of each service node in the period for data stored in the cache nodes, and analyzes the utilization degree of a first-level cache space and a second-level cache space in real time;

the memory regulating and controlling unit of the cache node analyzes data based on the calling condition of each service node for the data stored in the cache node in a cache node updating period T2, dynamically regulates and controls and analyzes the memory of each service node for the cache node, analyzes the memory occupation amount of the corresponding first-level memory space and second-level memory space of the cache node in the period T2 based on the data calling hit rate of the first-level memory space and the second-level memory space of the corresponding cache node respectively, analyzes the memory updating capacity Cg1 and the memory updating capacity Cg2 of the corresponding first-level memory space and the memory occupation amount of the second-level memory space of the corresponding cache node in the cache node of each service node, sets a cache prompting capacity Cv, updates and judges the memory occupation amount of the first-level memory space and the second-level memory space in each cache node at continuous time points in the period, and generates a memory capacity regulating and controlling strategy of the corresponding cache node according to the judging result.

Further, the data feedback module comprises a visualization unit and a strategy execution unit;

The visualization unit feeds back the service node state data of the target service and the state data of the corresponding cache node through a visualization window, wherein the cache node state data comprises capacity data of the cache node and a data calling record of the response service node;

And the strategy executing unit outputs the memory capacity regulating strategy generated by corresponding each cache node to the management port and executes the memory capacity regulating strategy corresponding to each cache node.

Compared with the prior art, the invention has the beneficial effects that:

The application realizes flexible regulation and analysis of the cache demands under different demand scenes in the service node data calling process by carrying out node determination on the target service and a series of node request quantity analysis and database access state analysis and combining cache node construction and cache node memory prediction and regulation analysis, can realize adaptive cache node regulation and control on dynamic node data request environments, and can carry out cache capacity regulation and state regulation on complex data change scenes, thereby improving the condition of memory resource waste caused by unreasonable capacity resource allocation of most cache nodes at present.

Drawings

FIG. 1 is a schematic diagram of a data analysis system based on a distributed caching technique according to the present invention;

fig. 2 is a flow chart of a data analysis method based on a distributed caching technology according to the present invention.

Detailed Description

The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.

Embodiment as shown in fig. 1, the invention provides a technical scheme that:

the data analysis system based on the distributed caching technology comprises a node positioning module, a node analysis module, a dynamic regulation and control module and a data feedback module;

The node positioning module determines target service data, positions service nodes based on the target service data and acquires service node state data, the node analysis module analyzes service node data request demand degrees in real time based on the service node state data and judges database access states according to the service node data request demand analysis data, a data cache node is built for each service node according to the database access state judgment result and cache memory allocation is analyzed based on corresponding service node data request demand data, the dynamic regulation module analyzes real-time cache node memory utilization degrees according to data reading states of the cache nodes in each service node period and carries out periodic memory dynamic regulation on the cache nodes according to the real-time cache node memory utilization degrees, a regulation command is generated according to the cache node periodic memory dynamic regulation data and fed back to the management port, and the data feedback module outputs the service node state data and the state data of the corresponding cache nodes by using the visual window and executes the regulation command.

The target service determining unit determines target service data by searching the service data based on the management port, wherein the service data comprises service name data and service node data;

The service node positioning unit positions the service node through the target service node data, and acquires the state data of the corresponding service node through acquiring the corresponding service node request data, wherein the state data of the service node comprises the number of the request data packets and the request data packet capacity of the service node.

The service node demand analysis unit is used for constructing a corresponding service node status set based on each service node status data of the target service, analyzing the real-time data request demand of the target service according to the service node status set to obtain the real-time data request data capacity of the target service, judging and analyzing the real-time access status of the database based on the real target service data request demand analysis data, calculating the maximum response request data packet capacity of the database at a single moment by obtaining the maximum response request data packet quantity of the database at a single moment, and determining the access status of the database at the current moment by comparing the target service data request demand analysis data at a single moment with the maximum response request data packet capacity of the database at a single moment;

The method comprises the steps of constructing a cache node for each service node of a target service when an access state of a database is busy, carrying out capacity prediction analysis on each cache node based on real-time service data request demand data of each service node to obtain real-time capacity data of the cache node corresponding to each service node, carrying out capacity prediction analysis on each cache node in an observation period through dividing the observation period T1, taking the maximum value as a memory value of the cache node period T1 of the corresponding service node, carrying out type division and overall according to the request demand data in the corresponding service node period T1, respectively determining the quantity of each type of request data in the period T1, carrying out analysis on the request data quantity proportion of each type of request data in the period T1, introducing a proportion threshold according to an analysis result, comparing the request data quantity proportion of each type of request data with the proportion threshold, determining high request type data and low request type data, carrying out cache space division on the cache nodes into a first-level cache space and a second-level cache space according to comparison analysis data of each type of request data in the corresponding service nodes, wherein the first-level cache space is used for storing the high request type data, and the second-level cache space is used for storing the low request type data.

The cache node memory analysis unit sets a cache node update period T2 based on capacity prediction analysis and data storage division results of corresponding cache nodes of each service node of the target service, analyzes the calling condition of the data stored by the cache nodes by each service node in the period, and analyzes the utilization degree of a first-level cache space and a second-level cache space in real time;

The method comprises the steps of analyzing the memory occupation amount of a first-level storage space and a second-level storage space of corresponding cache nodes in a period T2 based on the data of the cache node updating period T2, analyzing the data of the calling condition of each service node for the data stored in the cache nodes, dynamically adjusting and analyzing the memory occupation amount of the cache node by each service node, updating and judging the memory occupation amount of the first-level storage space and the second-level storage space in each cache node at continuous time points in the period according to the judging result, and generating a memory capacity adjusting strategy of the corresponding cache nodes.

The strategy execution unit outputs the generated memory capacity regulation strategy corresponding to each cache node to the management port, and executes the memory capacity regulation strategy corresponding to each cache node;

As shown in fig. 2, the present invention provides another technical solution:

Further, the service data is searched on the basis of the management port to determine target service data, wherein the service data comprises service name data and service node data;

;

P (t) is the data request demand capacity of the target service at the current time t, P is the single data packet capacity, mj is the data packet request quantity at the time t of the service node of the corresponding number j, n is the service node quantity of the target service, based on the actual target service data request demand analysis data, judgment and analysis are carried out on the real-time access state of the database, the maximum response request data packet capacity of the database at the single time is calculated by acquiring the request data packet quantity of the maximum response of the database at the single time, the access state of the database at the current time is determined by comparing the target service data request demand analysis data at the single time with the maximum response request data packet capacity of the database at the single time, the access state comprises busy and idle, the specific comparison steps are that the request data packet capacity P _e,s of the database at the single time is acquired by carrying out early warning coefficient k determination on the maximum response request data packet capacity of the database at the single time, P _e,s=k*P_max,s is calculated, wherein P _max,s is the maximum response request data packet capacity at the single time, the current time is compared with the request data capacity P (t) at the maximum response request data packet capacity at the single time, and if P (t) is more than or equal to the current P _e,s, the current access state is judged to be busy, otherwise, the access state of the database is judged to be idle;

;

The method comprises the steps of taking a time C (j, T) as a time T, numbering a service node according to a capacity prediction value of the corresponding buffer node, taking r as a single data packet buffer capacity pre-storage coefficient, comparing a request data volume ratio of each type of request data with a ratio threshold according to analysis results, determining high request type data and low request type data, judging the request data type of each buffer node in an observation period as high request type data by dividing the observation period T1, taking the maximum value as the content value of the buffer node period T1 of the corresponding service node, carrying out type division and overall formulation according to request data in the corresponding service node period T1, respectively determining the quantity of each type of request data in the period T1, analyzing the request data volume ratio of each type of request data in the period T1, introducing the ratio threshold according to analysis results, comparing the request data volume ratio of each type of request data with the ratio threshold, and determining the high request type data and the low request type data, otherwise judging the request type data with the ratio of the ratio threshold, and judging the request data type of each type of request data in the corresponding type of request data as low request type data, and the buffer node is used for the first-level buffer space storage of the low request type data.

;

。

It will be evident to those skilled in the art that the invention is not limited to the details of the foregoing illustrative embodiments, and that the present invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The present embodiments are, therefore, to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein. Any reference sign in a claim should not be construed as limiting the claim concerned.

Claims

1. A data analysis method based on distributed caching technology, characterized in that the method comprises the following steps:

Determine target business data, locate business nodes based on the target business data, and obtain business node status data;

Analyze the data request demand level of each business node in real time based on the status data of each business node, and judge the database access status based on the data analysis of the business node data request demand level; build a data cache node for each business node based on the database access status judgment result, and analyze the cache memory allocation based on the corresponding business node data request demand level;

Analyze the real-time cache node memory utilization based on the data reading status of each business node periodically, and dynamically adjust the cache node memory periodically based on the real-time cache node memory utilization; generate adjustment instructions based on the cache node periodic memory dynamic adjustment data and feed them back to the management port;

Furthermore, based on the status data of each service node of the target service, a corresponding service node status set is constructed; the real-time data request requirements of the target service are analyzed according to the service node status set to obtain the real-time data request data capacity of the target service; the calculation formula is:

P(t) is the data request demand capacity of the target service at the current moment t; p is the capacity of a single data packet; Mj is the number of data packet requests corresponding to service node number j at moment t; and n is the number of service nodes of the target service. Based on the actual target service data request demand analysis data, the real-time access status of the database is judged and analyzed. By obtaining the maximum number of request packets that the database responds to at a single moment, the maximum capacity of the database's single-moment response request packet is calculated. The current database access status is determined by comparing the single-moment target service data request demand analysis data with the database's single-moment maximum capacity of the database's single-moment response request packet. Access status includes busy and idle. The specific comparison step is to calculate the single-moment database response request packet warning capacity P _e,s based on the database's single-moment maximum capacity of the response request packet and the warning coefficient k, specifically P _e,s = k*P _max,s , where P _max,s is the database's single-moment maximum capacity of the response request packet. This is compared with the current target service data request demand capacity P(t). If P(t) ≥ P _e,s , the current database access status is determined to be busy; otherwise, the current database access status is determined to be idle.

2. The data analysis method based on distributed caching technology according to claim 1, characterized in that:

Determine target business data by searching business data based on the management port; the business data includes business name data and business node data; the business node data is the business node number;

The service node is located by the target service node data, and the status data of the corresponding service node is obtained by obtaining the corresponding service node request data; the status data of the service node includes the number of request data packets and the capacity of the request data packets of the service node.

3. The data analysis method based on distributed caching technology according to claim 2, characterized in that:

Based on the status data of each business node of the target business, a corresponding business node status set is constructed; the real-time data request demand of the target business is analyzed according to the business node status set to obtain the real-time data request data capacity of the target business; based on the actual target business data request demand analysis data, the real-time access status of the database is judged and analyzed; based on the maximum number of request data packets responded by the database at a single moment, the single-moment database response request data packet warning capacity is calculated; by comparing the single-moment target business data request demand analysis data with the single-moment database maximum response request data packet warning capacity, the current moment database access status is determined; the access status includes busy and idle;

When the database access status is busy, cache nodes are constructed for each business node of the target business. Capacity forecast analysis is performed on each cache node based on the real-time business data request demand data of each business node to obtain the real-time capacity data of the cache node corresponding to each business node. The calculation is:

C(j,t)＝ _Mj *p*(1+r);

Where C(j,t) is the predicted capacity of the cache node corresponding to service node number j at time t; r is the single data packet cache capacity pre-storage coefficient; the single data packet cache capacity pre-storage coefficient refers to the additional storage space required for caching a single data packet to prevent data loss;

By dividing the observation period T1, the capacity prediction analysis of each cache node in the observation period is coordinated, and the maximum value is taken as the cache node period T1 memory capacity value of the corresponding business node; according to the request demand data in the corresponding business node period T1, the type is divided and coordinated, and the number of each type of request data in the period T1 is determined respectively, and the proportion of the request data volume of each type of request data in the period T1 is analyzed; according to the analysis result, a proportion threshold is introduced, and the proportion of the request data volume of each type of request data is compared with the proportion threshold to determine high request type data and low request type data; according to the comparative analysis data of each type of request data in the corresponding business node, the cache space of the cache node is divided into a first-level cache space and a second-level cache space; wherein the first-level cache space is used to store high request type data; the second-level cache space is used to store low request type data.

4. The data analysis method based on distributed caching technology according to claim 3, characterized in that:

Based on the capacity forecast analysis and data storage partitioning results of the cache nodes corresponding to each business node of the target business, a cache node update cycle T2 is set. The call status of each business node to the cache node stored data within the cycle is analyzed, and the utilization of the first-level cache space and the second-level cache space is analyzed in real time. This is done by analyzing the call hit rate of the data stored in the first-level cache space and the second-level cache space respectively.

The calculation formula is

Among them, hit1 and hit2 correspond to the call hit rates of the primary storage space and secondary storage space in the cache node of each business node within the storage data period T2; m(1, T2) and m(2, T2) correspond to the number of calls to the primary storage space and secondary storage space in the cache node of each business node within the storage data period T2; m(T2) is the number of calls to the cache node of each business node within the storage data period T2;

Based on the analysis data of the call status of each business node to the cache node storage data within the cache node update cycle T2, the dynamic regulation and analysis of each business node's use of the cache node storage memory is performed; based on the data call hit rate of the first-level storage space and second-level storage space in the corresponding cache node, the memory usage of the first-level storage space and second-level storage space of the corresponding cache node within the cycle T2 is analyzed; the calculation formula is:

Among them, C(1,j,T2) and C(2,j,T2) are the periodic memory occupancy of the primary storage space and the secondary storage space in the cache node corresponding to the business node numbered j in period T2; C(1,j) and C(2,j) are the memory capacities of the primary storage space and the secondary storage space in the cache node corresponding to the business node numbered j respectively; S is the cache fragmentation rate parameter of the cache node; by analyzing the memory update capacity Cg1 of the primary storage space and the memory update capacity Cg2 of the secondary storage space in the cache node of each business node, and setting the cache prompt capacity Cv, the memory occupancy of the primary storage space and the secondary storage space in each cache node at continuous time points in the period is updated and judged, and the memory capacity control strategy of the corresponding cache node is generated according to the judgment result; among them, when the memory capacity of the corresponding primary storage space in the cache node at all continuous time points in a certain period is The memory occupancy is greater than the cache prompt capacity and less than or equal to the cache node memory update capacity, which corresponds to Cv＜C(1,j,T2)≤Cg1, then the corresponding primary storage space memory capacity is adjusted to the cache node memory update capacity Cg; if C(1,j,T2)≤Cv, it means that the current primary storage space is idle; according to the above judgment method, the secondary storage space occupancy is judged and the memory capacity is adjusted; if the primary storage space and the secondary storage space of a cache node are both idle, a cache node sleep instruction is generated, and the business node requests data directly with the database for data call; and when the primary storage space or the secondary storage space is greater than the cache node update capacity, no memory adjustment is performed; the calculation formula for the primary storage space memory update capacity Cg1 and the secondary storage space memory update capacity Cg2 corresponding to the cache node is

5. The data analysis method based on distributed caching technology according to claim 4, characterized in that:

Feedback of the target service's service node status data and the corresponding cache node status data through a visualization window; the cache node status data includes the cache node's capacity data and the data retrieval record of the responding service node;

The memory capacity control policy generated for each cache node is output to the management port, and the memory capacity control policy for each cache node is executed.

6. A data analysis system based on distributed caching technology, used to implement the data analysis method based on distributed caching technology according to any one of claims 1 to 5, characterized in that: the system includes a node positioning module, a node analysis module, a dynamic control module, and a data feedback module;

The node positioning module determines the target business data, locates the business node based on the target business data, and obtains the business node status data; the node analysis module analyzes the business node data request demand level in real time based on the business node status data, and judges the database access status based on the business node data request demand level analysis data; according to the database access status judgment result, a data cache node is constructed for each business node, and the cache memory allocation is analyzed based on the corresponding business node data request demand level; the dynamic control module analyzes the real-time cache node memory utilization level based on the data reading status of the cache node of each business node cycle, and performs periodic memory dynamic control on the cache node based on the real-time cache node memory utilization level; generates control instructions based on the cache node periodic memory dynamic control data and feeds them back to the management port; the data feedback module uses a visualization window to output the business node status data and the status data of the corresponding cache node and executes the control instructions.

7. A data analysis system based on distributed cache technology according to claim 6, characterized in that: the node positioning module includes a target service determination unit and a service node positioning unit;

The target service determination unit determines target service data by retrieving service data based on the management port; the service data includes service name data and service node data; the service node data is a service node number;

The service node positioning unit locates the service node through the target service node data, and obtains the status data of the corresponding service node by obtaining the corresponding service node request data; the status data of the service node includes the number of request data packets and the capacity of the request data packets of the service node.

8. A data analysis system based on distributed cache technology according to claim 7, characterized in that: the node analysis module includes a service node demand analysis unit and a cache node construction unit;

The service node demand analysis unit constructs a corresponding service node state set based on the status data of each service node of the target service; analyzes the real-time data request demand of the target service according to the service node state set to obtain the real-time data request data capacity of the target service; judges and analyzes the real-time access state of the database based on the actual target service data request demand analysis data; calculates the maximum single-time response request data packet warning capacity of the database based on the maximum number of request data packets responded by the database at a single moment; determines the access state of the database at the current moment by comparing the single-time target service data request demand analysis data with the maximum single-time response request data packet warning capacity of the database; the access state includes busy and idle;

When the access status of the database is busy, the cache node construction unit constructs a cache node for each business node of the target business; performs capacity prediction analysis on each cache node based on the real-time business data request demand data of each business node, and obtains the real-time capacity data of the cache node corresponding to each business node; divides the observation period T1, coordinates the capacity prediction analysis of each cache node within the observation period, and takes the maximum value as the cache node period T1 memory capacity value of the corresponding business node; divides and coordinates the types according to the request demand data within the corresponding business node period T1, determines the number of each type of request data within the period T1, and analyzes the proportion of the request data volume of each type of request data within the period T1; introduces a proportion threshold based on the analysis result, compares the request data volume proportion of each type of request data with the proportion threshold, and determines high request type data and low request type data; divides the cache space of the cache node into a first-level cache space and a second-level cache space based on the comparative analysis data of each type of request data in the corresponding business node; wherein the first-level cache space is used to store high request type data; the second-level cache space is used to store low request type data.

9. A data analysis system based on distributed cache technology according to claim 8, characterized in that: the dynamic control module includes a cache node memory analysis unit and a cache node memory control unit;

The cache node memory analysis unit sets a cache node update period T2 based on the capacity prediction analysis and data storage partitioning results of the cache nodes corresponding to each business node of the target business, analyzes the call status of each business node to the cache node stored data within the period, and analyzes the utilization level of the first-level cache space and the second-level cache space in real time; it analyzes the call hit rate of the data stored in the first-level cache space and the second-level cache space respectively;

The cache node memory control unit dynamically controls and analyzes the cache node storage memory of each business node based on the call analysis data of the cache node storage data of each business node within the cache node update cycle T2; it analyzes the memory occupancy of the first-level storage space and the second-level storage space of the corresponding cache node within the cycle T2 based on the data call hit rate of the first-level storage space and the second-level storage space in the corresponding cache node respectively; by analyzing the corresponding first-level storage space memory update capacity Cg1 and the second-level storage space memory update capacity Cg2 in the cache node of each business node, and setting the cache prompt capacity Cv, it updates and judges the memory occupancy of the first-level storage space and the second-level storage space in each cache node at continuous time points within the cycle, and generates a memory capacity control strategy for the corresponding cache node based on the judgment result.

10. A data analysis system based on distributed cache technology according to claim 9, characterized in that: the data feedback module includes a visualization unit and a strategy execution unit;

The visualization unit feeds back the service node status data of the target service and the status data of the corresponding cache node through the visualization window; the cache node status data includes the capacity data of the cache node and the data retrieval record of the response service node;

The policy execution unit outputs the memory capacity control policy generated for each cache node to the management port, and executes the memory capacity control policy for each cache node.