CN107832165A - A kind of method for lifting distributed system processing request stability - Google Patents
A kind of method for lifting distributed system processing request stability Download PDFInfo
- Publication number
- CN107832165A CN107832165A CN201711179984.4A CN201711179984A CN107832165A CN 107832165 A CN107832165 A CN 107832165A CN 201711179984 A CN201711179984 A CN 201711179984A CN 107832165 A CN107832165 A CN 107832165A
- Authority
- CN
- China
- Prior art keywords
- request
- node
- result
- distributed system
- service
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/0703—Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
- G06F11/0706—Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
- G06F11/0709—Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment in a distributed system consisting of a plurality of standalone computer nodes, e.g. clusters, client-server systems
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/0703—Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
- G06F11/0793—Remedial or corrective actions
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/10—Protocols in which an application is distributed across nodes in the network
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Quality & Reliability (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Computer Hardware Design (AREA)
- Multi Processors (AREA)
- Computer And Data Communications (AREA)
Abstract
The present invention relates to Development of Distributed System technical field, particularly relates to a kind of method for lifting distributed system processing request stability.Method of the present invention is that service request into processing request first occur to some node of distributed system, then confirms to send to other nodes again after handling successfully and ask;Send and ask to another node if failure;Refusal performs request in the node of system if failing again.The problem of present invention is handled with request of the method soundd out to distributed system, and the service for avoiding servicing in large scale distributed system caused by frequent iteration residual defects is collapsed on a large scale.
Description
Technical field
The present invention relates to Development of Distributed System technical field, and particularly relating to a kind of lifting distributed system processing please seek to stable
Qualitatively method.
Background technology
With the development of Distributed Calculation and the rise of big data analysis mining technology, the amount of calculation of single request processing
Increasing, the node scale of cooperated computing required for handling single request is increasing, for example has part searches engine at present
In order to improve speed, search result is often all placed in internal memory, and the limited memory of single node, it is therefore desirable in multiple nodes
Upper parallel execution search process.At the same time, with the change of program development pattern, the iteration cycle of program is shorter and shorter, journey
Some defects in sequence unavoidably be present, if it is some serious the defects of performed on multinode, may cause whole to calculate collection
The collapse of group, and then influence the stability of whole distributed system processing request.In this case, how rejected program is wrong
By mistake, distributed treatment request stability is improved just to be particularly important.It is general at present that mistake is excluded using the method for resetting daily record
By mistake, general step is as follows:
1st, recent daily record is reset;
2nd, the daily record that system is hung up in daily record is searched for;
3rd, by the details of request before checking that system is hung up come Wrong localization reason.
The above method is meaningful for programmer, but problems be present:
1st, the sprawling of program error can not be prevented, also cannot really improve the stability of whole distributed system;
2nd, by checking that the time needed for a large amount of daily record debugs is longer, specific aim is not strong;
Therefore, it is necessary to design a kind of effective method for improving stability when distributed system processing is asked.
The content of the invention
Present invention solves the technical problem that it is to propose a kind of method for lifting distributed system processing request stability.
The present invention solve above-mentioned technical problem technical scheme be:
Described method is that service request into processing request first occur to some node of distributed system, then confirms to handle
Send and ask to other nodes again after success;Send and ask to another node if failure;Refusal is in system if failing again
Node perform request.
Described method specifically includes following steps:
Step 1:The background service of distributed system is operated on the cluster of multiple node compositions, and service end first takes to backstage
One of node where business sends request;
Step 2:If request is successfully executed, then request transmits the request on all the nodes all without error
On the whole cluster of background service;
Step 3:If request failure, then select the node where a background service again, be sent to ask, test again
The result that card request performs;
Step 4:If request performs failure again, refusal performs request on any node, while the letter of request
Breath write-in daily record, it is used to further debug in the future.
The background service of the distributed system refers to the repeatable cluster service for performing request, and each background service has
Several nodes of identical.
The request for asking to be successfully executed finger does not cause node procedure on background service node in implementation procedure
Throw exception, and implementing result can be returned, including the result of correct result and mistake;After the wrong result refers to
Platform service routine can be exited normally, but the Program path performed is non-normal route;
The request unsuccessfully refers to causing node procedure abnormal including rear after request performs on background service node
Platform service crashes returning result and do not return to abnormal result.
The refusal performs request on any node and refers to that identical request is just intercepted in service end, is not sent to
On the node of any one background service.
Beneficial effects of the present invention:
It is overall that the method for the present invention can substantially shorten large-scale distributed system in the case where increased performance cost is few
Response time.The present invention can lift the stability of distributed system processing request.
Brief description of the drawings
The present invention is further described below in conjunction with the accompanying drawings:
Fig. 1 is the flow chart of the present invention.
Embodiment
Below by taking the akka frameworks under Scala as an example, a kind of method that the present invention is realized thereon of description.
A normal Web request responding process is defined first
When the first request performs failure, the code of embedded processing:
Modification is carried out as follows in the code of akka frameworks:
The request of the present invention can perform on multiple nodes parallel, and respective result merges to obtain most after the completion of execution
Result afterwards.But the inventive method is first without using parallel characteristic, but first wherein on the node where a background service
Perform request;Handled with request of the method for exploration to distributed system, avoid taking in large scale distributed system
It is engaged in the service caused by frequent iteration residual defects the problem of collapsing on a large scale.
Claims (6)
- A kind of 1. method for lifting distributed system processing request stability, it is characterised in that described method is first will service Ask that processing request occurs to some node of distributed system, then confirm to send to other nodes again after handling successfully and ask; Send and ask to another node if failure;Refusal performs request in the node of system if failing again.
- 2. according to the method for claim 1, it is characterised in that:Described method specifically includes following steps:Step 1:The background service of distributed system is operated on the cluster of multiple node compositions, and service end is first to background service institute One of node send request;Step 2:If request is successfully executed, then request all without error, transmits the request to backstage on all the nodes On the whole cluster of service;Step 3:If request failure, then select the node where a background service again, be sent to ask, checking again please Seek the result of execution;Step 4:If request performs failure again, refusal performs request on any node, while the information of request is write Enter daily record, be used to further debug in the future.
- 3. according to the method for claim 2, it is characterised in that the background service of the distributed system refers to repeating The cluster service of request is performed, each background service there are several nodes of identical.
- 4. according to the method for claim 2, it is characterised in that the request, which is successfully executed, refers to that request takes on backstage Do not cause node procedure throw exception in implementation procedure on business node, and implementing result can be returned, including correct result With the result of mistake;The wrong result refers to that background service program can be exited normally, but the Program path performed is Improper path;The request unsuccessfully refers to causing node procedure abnormal after request performs on background service node, including backstage takes Business collapses not returning result and returns to abnormal result.
- 5. according to the method for claim 3, it is characterised in that the request, which is successfully executed, refers to that request takes on backstage Do not cause node procedure throw exception in implementation procedure on business node, and implementing result can be returned, including correct result With the result of mistake;The wrong result refers to that background service program can be exited normally, but the Program path performed is Improper path;The request unsuccessfully refers to causing node procedure abnormal after request performs on background service node, including backstage takes Business collapses not returning result and returns to abnormal result.
- 6. according to the method described in any one of claim 2 to 5, it is characterised in that the refusal performs on any node please Ask and refer to that identical request is just intercepted in service end, be not sent on the node of any one background service.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711179984.4A CN107832165A (en) | 2017-11-23 | 2017-11-23 | A kind of method for lifting distributed system processing request stability |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711179984.4A CN107832165A (en) | 2017-11-23 | 2017-11-23 | A kind of method for lifting distributed system processing request stability |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107832165A true CN107832165A (en) | 2018-03-23 |
Family
ID=61652341
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711179984.4A Withdrawn CN107832165A (en) | 2017-11-23 | 2017-11-23 | A kind of method for lifting distributed system processing request stability |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107832165A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112948108A (en) * | 2021-01-26 | 2021-06-11 | 北京字节跳动网络技术有限公司 | Request processing method and device and electronic equipment |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104468174A (en) * | 2013-09-25 | 2015-03-25 | 北京新媒传信科技有限公司 | Method and device for cluster servers to execute task |
CN106506490A (en) * | 2016-11-03 | 2017-03-15 | 深圳智高点知识产权运营有限公司 | A kind of Distributed Calculation control method and distributed computing system |
CN106874109A (en) * | 2016-12-29 | 2017-06-20 | 朗新科技股份有限公司 | A kind of distributed job distribution processing method and system |
US9785521B2 (en) * | 2013-12-02 | 2017-10-10 | Qbase, LLC | Fault tolerant architecture for distributed computing systems |
-
2017
- 2017-11-23 CN CN201711179984.4A patent/CN107832165A/en not_active Withdrawn
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104468174A (en) * | 2013-09-25 | 2015-03-25 | 北京新媒传信科技有限公司 | Method and device for cluster servers to execute task |
US9785521B2 (en) * | 2013-12-02 | 2017-10-10 | Qbase, LLC | Fault tolerant architecture for distributed computing systems |
CN106506490A (en) * | 2016-11-03 | 2017-03-15 | 深圳智高点知识产权运营有限公司 | A kind of Distributed Calculation control method and distributed computing system |
CN106874109A (en) * | 2016-12-29 | 2017-06-20 | 朗新科技股份有限公司 | A kind of distributed job distribution processing method and system |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112948108A (en) * | 2021-01-26 | 2021-06-11 | 北京字节跳动网络技术有限公司 | Request processing method and device and electronic equipment |
CN112948108B (en) * | 2021-01-26 | 2024-03-15 | 北京字节跳动网络技术有限公司 | Request processing method and device and electronic equipment |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111897724B (en) | Automatic testing method and device suitable for cloud platform | |
CN106294222A (en) | A kind of method and device determining PCIE device and slot corresponding relation | |
KR101860674B1 (en) | Method, Server and Computer Program for Crash Report Grouping | |
CN111767213B (en) | Database check point testing method and device, electronic equipment and storage medium | |
CN109710476B (en) | System interface robustness testing method and device | |
CN107832165A (en) | A kind of method for lifting distributed system processing request stability | |
JP2019020864A (en) | Arithmetic unit | |
CN113868137A (en) | Method, device and system for processing buried point data and server | |
CN110706108B (en) | Method and apparatus for concurrently executing transactions in a blockchain | |
CN112380127A (en) | Test case regression method, device, equipment and storage medium | |
CN117828823A (en) | Equipment life assessment method, device, electronic equipment and storage medium | |
CN114840400B (en) | Abnormal component identification method, device, equipment, storage medium and program product | |
KR102256894B1 (en) | Method, Server and Computer Program for Crash Report Grouping | |
CN116723085A (en) | Service conflict processing method and device, storage medium and electronic device | |
CN115658470A (en) | Distributed system-oriented failure recovery mechanism automatic testing method and device | |
CN115525660A (en) | Data table verification method, device, equipment and medium | |
CN116048959A (en) | Website testing method, device, equipment and storage medium | |
CN114610644A (en) | Method and device for testing database | |
CN109857675B (en) | A Method of Locating Program Errors Using Statement Types | |
CN108920378B (en) | Data layering method and system based on interface test | |
CN112380136A (en) | Data cleaning method and device, test equipment and storage medium | |
CN115640236B (en) | A script quality detection method and computing device | |
CN115048296B (en) | Method for verifying validity of memory barrier instruction and related product | |
CN119167368B (en) | Conditional competition vulnerability detection method and device | |
JP7447723B2 (en) | Information processing program, information processing method, and information processing device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WW01 | Invention patent application withdrawn after publication |
Application publication date: 20180323 |
|
WW01 | Invention patent application withdrawn after publication |