CN107656810B

CN107656810B - A method for ensuring quality of service for latency-sensitive programs in a data center environment

Info

Publication number: CN107656810B
Application number: CN201710756689.4A
Authority: CN
Inventors: 王琳; 耿世超; 张化祥
Original assignee: Shandong Normal University
Current assignee: Shandong Normal University
Priority date: 2017-08-29
Filing date: 2017-08-29
Publication date: 2021-01-08
Anticipated expiration: 2037-08-29
Also published as: CN107656810A

Abstract

The invention discloses a method for ensuring the quality of service of a delay-sensitive program in a data center environment. The method includes the following steps: respectively setting a number of resource limit points for a processor and a memory, and calculating the resource sensitivity of each delay-sensitive program to be executed at the resource limit point. degree, draw the resource sensitivity plane of the delay-sensitive program; assume that each server node executes a delay-sensitive program and several batch programs, calculate the resources occupied by the delay-sensitive program under the condition of satisfying the service quality requirements; set the delay-sensitive program The resources occupied by the set delay-sensitive programs are removed, and the remaining resources are allocated to batch programs as needed for use; according to the set results, delay-sensitive programs and batch programs are executed. The beneficial effects of the invention are as follows: the performance when the delay-sensitive program and the batch program are jointly executed can be guaranteed, and the resource utilization rate of the system can be improved as much as possible at the same time.

Description

Method for ensuring service quality of delay sensitive program under data center environment

Technical Field

The invention relates to parallel and distributed computing, in particular to a method for ensuring the service quality of a delay sensitive program in a data center environment.

Background

Data centers have gone from concept to maturity. In a data center, a large number of programs are executed on as few servers as possible in order to improve resource utilization. In a server node, there are multiple programs executing on a single node. The advantage of multiple programs executing together is that the utilization of the server can be increased, and the problem is that the performance of the program is reduced. The degree of performance degradation of a program depends on program characteristics, and for some programs, performance degradation is not significant when the program is executed with other programs, and for some programs, performance degradation is significant when the program is executed with other programs.

At the same time, a large number of delay sensitive programs are running in the data center. Customers execute delay sensitive programs in the data center and have certain quality of service requirements on the programs, such as program performance that cannot be lower than 90% of that when executed alone. When a delay sensitive program is executed together with other programs, it is easy to cause severe performance degradation due to performance interference, and thus, the quality of service requirements of customers cannot be satisfied. This is a problem that must be solved. Therefore, a method is needed to ensure the service quality of the delay-sensitive program on the basis of improving the utilization rate of system resources as much as possible.

Disclosure of Invention

The invention aims to provide a method for ensuring the service quality of a delay sensitive program in a data center environment, which can ensure the performance of the delay sensitive program and batch programs when the delay sensitive program and the batch programs are executed together, and simultaneously improve the resource utilization rate of a system as much as possible. According to the method, a resource isolation tool Cgroups is used for dividing resources for delay sensitive programs, and the performance of the delay sensitive programs is guaranteed. Meanwhile, other resources are distributed to batch programs for use. And the utilization rate and the throughput of system resources are improved by using a batch program.

In order to achieve the purpose, the invention adopts the following technical scheme:

a method for ensuring the service quality of a delay sensitive program in a data center environment comprises the following steps:

step (1): respectively setting a plurality of resource limit points for a processor and a memory, calculating the resource sensitivity of each delay sensitive program to be executed at the resource limit points, and drawing a resource sensitivity plane of the delay sensitive program;

step (2): supposing that one delay sensitive program and a plurality of batch programs are executed on each server node, and calculating resources occupied by the delay sensitive program under the condition of meeting the service quality requirement;

and (3): setting the occupied resource of the delay sensitive program according to the calculation result of the step (2);

and (4): removing the resources occupied by the delay sensitive program set in the step (3), and distributing the rest resources to batch programs for use as required; setting the occupied resources of the batch programs according to the distribution result;

and (5): and (4) executing the delay sensitive program according to the result set in the step (3), and executing the batch program according to the result set in the step (4).

The step of respectively setting a plurality of resource limit points for the processor and the memory comprises the following steps:

setting M resource limit points in an occupancy rate range M of processor resources when a delay sensitive program runs, setting N resource limit points in an occupancy rate range N of memory resources when the delay sensitive program runs, and recording the execution time of the delay sensitive program when the delay sensitive program is executed on each resource limit point (i, j), wherein the value range of i is 1 to M, and the value range of j is 1 to N.

The step of calculating the resource sensitivity of each delay-sensitive program to be executed at the resource limit point comprises the following steps:

and calculating the ratio of the execution time of the delay-sensitive program under the condition that the resources are not limited to the execution time of the delay-sensitive program at each resource limit point, wherein the ratio is the program resource sensitivity of the delay-sensitive program at each resource limit point.

The step of drawing the sensitivity plane of the delay sensitive program comprises the following steps:

the resource occupancy rate of a processor when a delay sensitive program runs is taken as an X axis, the resource occupancy rate of a memory when the delay sensitive program runs is taken as a Y axis, the resource sensitivity of the program is taken as a Z axis, a space rectangular coordinate system is established, all program resource sensitivity point values are fitted into a curved surface, and the curved surface is a sensitivity plane of the delay sensitive program.

The steps of setting a plurality of resource limit points for the processor and the memory, calculating the resource sensitivity of each delay sensitive program to be executed at the resource limit points, and drawing the sensitivity plane of the delay sensitive program further comprise:

acquiring resource occupancy rates M of the delay sensitive program to a processor and N of the delay sensitive program to a memory respectively under the condition that resources are not limited by a performance analysis tool; acquiring the resource occupation amount of each batch program to the processor and the resource occupation amount of each batch program to the memory under the condition that the resources are not limited by a performance analysis tool;

the step (2) calculates the resources occupied by the delay sensitive program under the condition of meeting the service quality requirement:

assuming that the service quality requirement of the delay-sensitive program is not lower than 90% of the performance of the delay-sensitive program when the delay-sensitive program is executed alone, for the sensitivity plane of the delay-sensitive program, a new plane parallel to both the X axis and the Y axis is drawn with reference to 90% of the performance of the delay-sensitive program when the delay-sensitive program is executed alone; the resource combination of the processor and the memory occupied by the delay-sensitive program is the node above the new plane and closest to the new plane in the positive Z-axis direction, and it is assumed that the processor selected by the delay-sensitive program is C_latThe selected memory node is M_lat。

For example: taking z as an interface, wherein the value of z is less than 0.9 and is below a new plane, and the value of z is more than 0.9 and is above the new plane;

and (3) setting occupied resources of the delay sensitive program by using a Linux resource isolation tool Cgroups.

In the step (3), C is added_latAnd M_latWrite control group set Cgroups; the delay sensitive program is placed into the control group for execution.

The step (4) comprises the following steps:

step (41): the processor that the batch program can occupy is C_all-C_latThe memory resource that can be occupied by the batch program is M_all-M_lat；C_allAs a total amount of processor resources, C_latSetting processor resources occupied by delay sensitive programs; m_allFor total amount of memory resources, M_latSetting memory resources occupied by delay sensitive programs;

step (42): the requirements for the number of executable batch programs are as follows: all batch programs occupy no more processor resources than (C)_all-C_lat) 160%, and all the batch programs occupy no more memory resources than (M)_all-M_lat)*160％；

Step (43): and (4) executing the batch program according to the calculation result of the step (42).

The invention has the beneficial effects that:

the invention can improve the resource utilization rate and the throughput of the system as much as possible while ensuring the service quality of the program.

The invention calculates the resource occupation when the service quality of the program needs to be ensured through the resource sensitivity, and the resource occupation value calculated by the resource sensitivity has the advantages that: the value is as small as possible and the quality of service of the program can be guaranteed.

Drawings

Fig. 1 is a flowchart of a resource partitioning method.

Detailed Description

The invention is further described with reference to the following figures and examples.

Fig. 1 depicts a resource partitioning method.

Firstly, the method comprises the following steps: working off-line

For each program needing to be executed, the resource occupation information of the program on a processor and a memory is obtained by utilizing a performance analysis tool. And simultaneously acquiring resource sensitivity information of the program. The performance analysis tools, such as collectll;

II, secondly: working on-line

For delay sensitive programs:

and calculating the resources which should be occupied by the network. The calculation method is as follows: assuming that the service quality requirement of the program is not less than 90% of the performance when performed alone, for the sensitivity plane of the program, a plane parallel to the X-axis and the Y-axis is drawn with reference to 90% of the performance of the program.

The processor and memory resource combination occupied by the delay sensitive program may be a node above the plane. And dividing resources for the delay sensitive program by using a Linux resource isolation tool Cgroups, and placing the delay sensitive program into a control group for execution.

For batch procedures:

and calculating the resource occupation of the batch programs. The resource occupancy value is (total resource value of server-resource occupancy value of delay sensitive program) × 160%.

The batch program is placed outside the control group for execution.

Although the embodiments of the present invention have been described with reference to the accompanying drawings, it is not intended to limit the scope of the present invention, and it should be understood by those skilled in the art that various modifications and variations can be made without inventive efforts by those skilled in the art based on the technical solution of the present invention.

Claims

1. A method for ensuring the quality of service of a delay-sensitive program in a data center environment, characterized in that it comprises the following steps:

Step (1): In the case of offline work in the data center,

For each delay-sensitive program that needs to be executed, use performance analysis tools to obtain information on the resource consumption of the processor and memory;

For the case of data center working online,

Set a number of resource limit points for the processor and memory respectively, calculate the resource sensitivity of each delay-sensitive program that needs to be executed at the resource limit point, and draw the resource-sensitivity plane of the delay-sensitive program;

The steps of drawing the resource-sensitivity plane of the delay-sensitive program are:

Taking the resource occupancy rate of the processor on the X-axis when the delay-sensitive program is running, the resource occupancy rate of the memory when the delay-sensitive program is running on the Y-axis, and taking the resource sensitivity of the program as the Z-axis, a space rectangular coordinate system is established, and the All program resource sensitivity point values are fitted into a curved surface, and the curved surface is the sensitivity plane of delay-sensitive programs;

Step (2): Assuming that each server node executes a delay-sensitive program and several batch programs, calculate the resources occupied by the delay-sensitive program under the condition that the service quality requirements are met, as follows:

Assuming that the service quality requirement of the delay-sensitive program is not lower than 90% of the performance when the delay-sensitive program is executed alone, for the resource sensitivity plane of the sensitive program obtained in step (1), the performance of the delay-sensitive program when executed alone is 90%. Draw a new plane parallel to the X axis and the Y axis as the benchmark, then the combination of processor and memory resources occupied by the delay-sensitive program is the node on the positive direction of the Z axis and the closest node to the new plane above the new plane;

Step (3): according to the calculation result of step (2), set the occupied resources of the delay-sensitive program;

Step (4): remove the resources occupied by the delay-sensitive program set in step (3), and the remaining resources are allocated to the batch programs as needed for use; the occupied resources of the batch programs are set according to the assignment result;

Step (5): according to the result set in step (3), execute the delay-sensitive program, and at the same time, according to the result set in step (4), execute the batch program.

2. the method for guaranteeing delay-sensitive program quality of service under the data center environment as claimed in claim 1, is characterized in that, the described step that processor and memory are respectively set some resource limit points is:

Set m resource limit points within the processor resource occupancy range M when the delay-sensitive program is running, set n resource limit points within the memory resource occupancy range N when the delay-sensitive program is running, record the delay-sensitive program in each When executing on the resource limit point (i, j), the execution time of the delay-sensitive program, where the value range of i is 1 to m, and the value range of j is 1 to n.

3. the method for guaranteeing delay-sensitive program quality of service under the data center environment as claimed in claim 1, is characterized in that, the step of described calculating the resource sensitivity of each delay-sensitive program that needs to be executed at the resource limit point is:

Calculate the ratio of the execution time of the delay-sensitive program without resource constraints to the execution time of the delay-sensitive program at each resource constraint point, the ratio being the program resource sensitivity of the delay sensitive program at each resource constraint point.

4. the method for guaranteeing delay-sensitive program quality of service under the data center environment as claimed in claim 1, it is characterized in that, described to processor and memory are respectively set some resource limit points, calculate the delay-sensitive program that each needs to execute in Resource Sensitivity at Resource Constraint Points. Before drawing the sensitivity plane for delay-sensitive programs, the steps also include:

Use the performance analysis tool to obtain the resource occupancy rate M of the processor and the memory resource occupancy rate N of the delay-sensitive program when the resources are not limited; use the performance analysis tool to obtain the resource occupancy rate of each batch program when the resources are not limited. The resource occupancy of the processor and the resource occupancy of the memory are respectively listed below.

5. The method for ensuring quality of service of delay-sensitive programs under the data center environment as claimed in claim 1, wherein the step (3) utilizes Linux resource isolation tool Cgroups to set the occupied resources of delay-sensitive programs.

6. the method for guaranteeing delay-sensitive program quality of service under the data center environment as claimed in claim 1, is characterized in that, in described step (3), C _lat and M _lat are written into the control group that Cgroups is set; Delay-sensitive The program is placed in the control group for execution.

7. the method for guaranteeing delay-sensitive program quality of service under the data center environment as claimed in claim 1, is characterized in that, the step of described step (4) is:

Step (41): The processors that can be occupied by batch programs are _Call -C _lat , and the memory resources that can be occupied by batch programs are M _all -M _lat ; _Call is the total amount of processor resources, and C _lat is the set delay-sensitive program Occupied processor resources; M _all is the total amount of memory resources, M _lat is the memory resources occupied by delay-sensitive programs;

Step (42): The requirements for the executable number of batch programs are: the processor resources occupied by all batch programs do not exceed (C _all -C _lat )*160%, and at the same time, the memory resources occupied by all batch programs do not exceed (M _all -M _lat )*160%;

Step (43): According to the calculation result of step (42), execute the batch program.