
CN107832165A - A method for improving the stability of request processing in a distributed system - Google Patents

A method for improving the stability of request processing in a distributed system

Info

Publication number
CN107832165A
Authority
CN
China
Prior art keywords
request
node
result
distributed system
service
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201711179984.4A
Other languages
Chinese (zh)
Inventor
莫展鹏
季统凯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
G Cloud Technology Co Ltd
Original Assignee
G Cloud Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by G Cloud Technology Co Ltd
Priority to CN201711179984.4A
Publication of CN107832165A
Legal status: Withdrawn

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation, the processing taking place on a specific hardware platform or in a specific software environment
    • G06F11/0709 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation, the processing taking place on a specific hardware platform or in a specific software environment, in a distributed system consisting of a plurality of standalone computer nodes, e.g. clusters, client-server systems
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0793 Remedial or corrective actions
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/01 Protocols
    • H04L67/10 Protocols in which an application is distributed across nodes in the network

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computer Hardware Design (AREA)
  • Multi Processors (AREA)
  • Computer And Data Communications (AREA)

Abstract

The present invention relates to the technical field of distributed system development, and in particular to a method for improving the stability of request processing in a distributed system. In the method, a service request is first sent for processing to one node of the distributed system; once processing is confirmed to have succeeded, the request is sent to the other nodes. If processing fails, the request is sent to another node; if it fails again, execution of the request is refused on the nodes of the system. By handling requests in this exploratory manner, the invention avoids the large-scale service crashes that residual defects from frequent iteration can cause in large-scale distributed systems.

Description

A method for improving the stability of request processing in a distributed system
Technical field
The present invention relates to the technical field of distributed system development, and in particular to a method for improving the stability of request processing in a distributed system.
Background
With the development of distributed computing and the rise of big-data analysis and mining technology, the computation required to process a single request keeps growing, and so does the number of nodes that must cooperate to process it. For example, some search engines currently keep their search results entirely in memory to improve speed; because the memory of a single node is limited, the search must be executed in parallel on multiple nodes. At the same time, as program development models change, iteration cycles grow ever shorter, and some defects in a program are unavoidable. If a serious defect is triggered on multiple nodes, it may crash the entire computing cluster and thereby affect the stability of request processing across the whole distributed system. In this situation, excluding program errors and improving the stability of distributed request processing become particularly important. At present, errors are generally diagnosed by replaying logs; the general steps are as follows:
1. Replay the recent logs;
2. Search the logs for entries recording that the system went down;
3. Locate the cause of the error by examining the details of the requests made before the system went down.
The above method is useful to programmers, but it has the following problems:
1. It cannot stop a program error from spreading, so it does not actually improve the stability of the distributed system as a whole;
2. Troubleshooting by inspecting a large volume of logs takes a long time and is poorly targeted.
Therefore, an effective method is needed to improve stability when a distributed system processes requests.
Summary of the invention
The technical problem solved by the present invention is to provide a method for improving the stability of request processing in a distributed system.
The technical solution by which the present invention solves the above technical problem is as follows:
In the method, a service request is first sent for processing to one node of the distributed system; once processing is confirmed to have succeeded, the request is sent to the other nodes. If processing fails, the request is sent to another node; if it fails again, execution of the request is refused on the nodes of the system.
The method specifically comprises the following steps:
Step 1: The background service of the distributed system runs on a cluster composed of multiple nodes; the service end first sends the request to one of the nodes on which the background service runs;
Step 2: If the request executes successfully, it is expected to execute without error on all nodes, so the request is sent to the entire cluster of the background service;
Step 3: If the request fails, another node on which the background service runs is selected, the request is sent to it, and the execution result is verified again;
Step 4: If the request fails again, execution of the request is refused on any node, and the request information is written to a log for later debugging.
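The four steps above can be sketched in a few lines. The following Python sketch is illustrative only and is not the Akka code of the embodiment: `Node`, `RequestRefused`, and `dispatch` are hypothetical names, a node is modelled as a plain callable that raises an exception on failure, and reusing the probe node's result instead of re-sending the request to that node is an assumption.

```python
import logging
from typing import Callable, List, Sequence

logger = logging.getLogger("probe_dispatch")

# Hypothetical model: a node is a callable that executes the request and
# returns a result, or raises an exception if execution fails.
Node = Callable[[str], str]


class RequestRefused(Exception):
    """Raised when the request failed on two probe nodes (Step 4)."""


def dispatch(request: str, nodes: Sequence[Node]) -> List[str]:
    """Probe one node, retry once on another node, then fan out or refuse."""
    if not nodes:
        raise ValueError("the cluster has no nodes")
    # Steps 1 and 3: probe at most two distinct nodes, one at a time.
    for probe_index, probe in enumerate(nodes[:2]):
        try:
            probe_result = probe(request)
        except Exception as exc:
            logger.warning("probe %d failed for %r: %s", probe_index, request, exc)
            continue
        # Step 2: the probe succeeded, so send the request to the other nodes.
        # Assumption: the probe node's own result is reused, not recomputed.
        others = [
            node(request)
            for index, node in enumerate(nodes)
            if index != probe_index
        ]
        return [probe_result] + others
    # Step 4: both probes failed; refuse the request and log it for debugging.
    logger.error("refusing request %r after failed probes", request)
    raise RequestRefused(request)
```

With this shape, a request that crashes its executor reaches at most two nodes, so the remaining nodes of the cluster never see it.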
The background service of the distributed system refers to a cluster service that can execute requests repeatedly; each background service has several identical nodes.
Successful execution of a request means that, while executing on a background service node, the request does not cause the node program to throw an exception and an execution result can be returned, whether that result is correct or wrong. A wrong result means that the background service program exits normally but the program path executed is not the normal path.
Failure of a request means that executing the request on a background service node causes the node program to behave abnormally, including the background service crashing without returning a result and the background service returning an abnormal result.
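One way to make this success/failure distinction concrete is sketched below in Python. This is a hedged illustration, not the patent's implementation: `Outcome` and `execute_and_classify` are hypothetical names, and treating "crashed without returning a result" as a timeout on the caller's side is an assumption made only for the sketch.

```python
from concurrent.futures import ThreadPoolExecutor
from enum import Enum
from typing import Any, Callable, Optional, Tuple


class Outcome(Enum):
    SUCCESS = "success"
    FAILURE = "failure"


def execute_and_classify(
    node: Callable[[str], Any], request: str, timeout_s: float = 1.0
) -> Tuple[Outcome, Optional[Any]]:
    """Run the request on one node and classify the outcome."""
    pool = ThreadPoolExecutor(max_workers=1)
    future = pool.submit(node, request)
    try:
        result = future.result(timeout=timeout_s)
    except Exception:
        # Failure: the node raised an exception, or no result arrived within
        # the timeout (assumed here to stand for a crash without a result).
        return Outcome.FAILURE, None
    finally:
        # Do not block on a node that is still running.
        pool.shutdown(wait=False)
    # Success: a result came back, even if the result itself is wrong.
    return Outcome.SUCCESS, result
```

Note that a returned but incorrect result still counts as success under this definition; the probe guards against crashes, not against wrong answers.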
Refusing to execute the request on any node means that identical requests are intercepted at the service end and are not sent to any node of the background service.
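Interception at the service end could, for instance, be kept as a set of fingerprints of refused requests. The Python sketch below is an assumption-laden illustration: `RefusedRequests` and `send_if_allowed` are hypothetical names, and "identical request" is taken to mean a byte-for-byte equal payload, which the source does not specify.

```python
import hashlib
from typing import Callable, Set


class RefusedRequests:
    """Service-end registry of requests that must not reach any node."""

    def __init__(self) -> None:
        self._fingerprints: Set[str] = set()

    @staticmethod
    def fingerprint(payload: bytes) -> str:
        # Assumption: two requests are identical if their payload bytes match.
        return hashlib.sha256(payload).hexdigest()

    def refuse(self, payload: bytes) -> None:
        self._fingerprints.add(self.fingerprint(payload))

    def is_refused(self, payload: bytes) -> bool:
        return self.fingerprint(payload) in self._fingerprints


def send_if_allowed(
    payload: bytes, registry: RefusedRequests, send: Callable[[bytes], None]
) -> bool:
    """Forward the payload unless it was refused earlier; report whether it was sent."""
    if registry.is_refused(payload):
        return False  # intercepted at the service end
    send(payload)
    return True
```

Storing a hash rather than the payload keeps the registry small; the full request details would still go to the log mentioned in Step 4.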
Beneficial effects of the present invention:
The method of the present invention can markedly shorten the overall response time of a large-scale distributed system at little added performance cost. The present invention can improve the stability of request processing in a distributed system.
Brief description of the drawings
The present invention is further described below with reference to the accompanying drawing:
Fig. 1 is the flow chart of the present invention.
Detailed description
Taking the Akka framework under Scala as an example, one way of implementing the present invention on it is described below.
First, a normal Web request-response flow is defined:
When the first request fails, the following handling code is embedded:
The following modification is made in the code of the Akka framework:
A request in the present invention can be executed on multiple nodes in parallel, with the individual results merged after execution to obtain the final result. The method of the invention, however, does not use this parallelism at first; instead, it first executes the request on a single node on which the background service runs. Handling requests in this exploratory manner avoids the large-scale service crashes that residual defects from frequent iteration can cause in large-scale distributed systems.
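The relationship between the exploratory first execution and the later parallel execution with merged results can be illustrated as follows. This is a minimal Python sketch, not the Akka code of the embodiment: `make_shard` and `probe_then_parallel` are hypothetical names, each node is assumed to hold one in-memory shard of documents, and merging by concatenating and sorting the hits is an assumption.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Sequence

# Hypothetical model: each node searches its own in-memory shard.
Shard = Callable[[str], List[str]]


def make_shard(documents: Sequence[str]) -> Shard:
    def search(query: str) -> List[str]:
        return [doc for doc in documents if query in doc]
    return search


def probe_then_parallel(query: str, shards: Sequence[Shard]) -> List[str]:
    """Run the query on one shard first, then on the rest in parallel."""
    # Exploratory step: if this raises, no other shard ever sees the query.
    probe_hits = shards[0](query)
    # Parallel step: only reached after the probe has returned a result.
    with ThreadPoolExecutor(max_workers=max(1, len(shards) - 1)) as pool:
        other_hits = list(pool.map(lambda shard: shard(query), shards[1:]))
    # Merge the per-node results into the final result.
    merged = list(probe_hits)
    for hits in other_hits:
        merged.extend(hits)
    return sorted(merged)
```

The cost of the probe is one extra sequential round trip per request, which is the performance trade-off the method accepts in exchange for containing crashes.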

Claims (6)

  1. A method for improving the stability of request processing in a distributed system, characterized in that a service request is first sent for processing to one node of the distributed system, and once processing is confirmed to have succeeded the request is sent to the other nodes; if processing fails, the request is sent to another node; if it fails again, execution of the request is refused on the nodes of the system.
  2. The method according to claim 1, characterized in that the method specifically comprises the following steps:
    Step 1: The background service of the distributed system runs on a cluster composed of multiple nodes; the service end first sends the request to one of the nodes on which the background service runs;
    Step 2: If the request executes successfully, it is expected to execute without error on all nodes, so the request is sent to the entire cluster of the background service;
    Step 3: If the request fails, another node on which the background service runs is selected, the request is sent to it, and the execution result is verified again;
    Step 4: If the request fails again, execution of the request is refused on any node, and the request information is written to a log for later debugging.
  3. The method according to claim 2, characterized in that the background service of the distributed system refers to a cluster service that can execute requests repeatedly, each background service having several identical nodes.
  4. The method according to claim 2, characterized in that successful execution of a request means that, while executing on a background service node, the request does not cause the node program to throw an exception and an execution result can be returned, whether that result is correct or wrong; a wrong result means that the background service program exits normally but the program path executed is not the normal path;
    failure of a request means that executing the request on a background service node causes the node program to behave abnormally, including the background service crashing without returning a result and the background service returning an abnormal result.
  5. The method according to claim 3, characterized in that successful execution of a request means that, while executing on a background service node, the request does not cause the node program to throw an exception and an execution result can be returned, whether that result is correct or wrong; a wrong result means that the background service program exits normally but the program path executed is not the normal path;
    failure of a request means that executing the request on a background service node causes the node program to behave abnormally, including the background service crashing without returning a result and the background service returning an abnormal result.
  6. The method according to any one of claims 2 to 5, characterized in that refusing to execute the request on any node means that identical requests are intercepted at the service end and are not sent to any node of the background service.
CN201711179984.4A 2017-11-23 2017-11-23 A method for improving the stability of request processing in a distributed system Withdrawn CN107832165A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711179984.4A CN107832165A (en) 2017-11-23 2017-11-23 A method for improving the stability of request processing in a distributed system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711179984.4A CN107832165A (en) 2017-11-23 2017-11-23 A method for improving the stability of request processing in a distributed system

Publications (1)

Publication Number Publication Date
CN107832165A true CN107832165A (en) 2018-03-23

Family

ID=61652341

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711179984.4A Withdrawn CN107832165A (en) 2017-11-23 2017-11-23 A method for improving the stability of request processing in a distributed system

Country Status (1)

Country Link
CN (1) CN107832165A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112948108A (en) * 2021-01-26 2021-06-11 北京字节跳动网络技术有限公司 Request processing method and device and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104468174A (en) * 2013-09-25 2015-03-25 北京新媒传信科技有限公司 Method and device for cluster servers to execute task
CN106506490A (en) * 2016-11-03 2017-03-15 深圳智高点知识产权运营有限公司 A kind of Distributed Calculation control method and distributed computing system
CN106874109A (en) * 2016-12-29 2017-06-20 朗新科技股份有限公司 A kind of distributed job distribution processing method and system
US9785521B2 (en) * 2013-12-02 2017-10-10 Qbase, LLC Fault tolerant architecture for distributed computing systems

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104468174A (en) * 2013-09-25 2015-03-25 北京新媒传信科技有限公司 Method and device for cluster servers to execute task
US9785521B2 (en) * 2013-12-02 2017-10-10 Qbase, LLC Fault tolerant architecture for distributed computing systems
CN106506490A (en) * 2016-11-03 2017-03-15 深圳智高点知识产权运营有限公司 A kind of Distributed Calculation control method and distributed computing system
CN106874109A (en) * 2016-12-29 2017-06-20 朗新科技股份有限公司 A kind of distributed job distribution processing method and system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112948108A (en) * 2021-01-26 2021-06-11 北京字节跳动网络技术有限公司 Request processing method and device and electronic equipment
CN112948108B (en) * 2021-01-26 2024-03-15 北京字节跳动网络技术有限公司 Request processing method and device and electronic equipment

Similar Documents

Publication Publication Date Title
CN111897724B (en) Automatic testing method and device suitable for cloud platform
CN106294222A (en) A kind of method and device determining PCIE device and slot corresponding relation
KR101860674B1 (en) Method, Server and Computer Program for Crash Report Grouping
CN111767213B (en) Database check point testing method and device, electronic equipment and storage medium
CN109710476B (en) System interface robustness testing method and device
CN107832165A (en) A kind of method for lifting distributed system processing request stability
JP2019020864A (en) Arithmetic unit
CN113868137A (en) Method, device and system for processing buried point data and server
CN110706108B (en) Method and apparatus for concurrently executing transactions in a blockchain
CN112380127A (en) Test case regression method, device, equipment and storage medium
CN117828823A (en) Equipment life assessment method, device, electronic equipment and storage medium
CN114840400B (en) Abnormal component identification method, device, equipment, storage medium and program product
KR102256894B1 (en) Method, Server and Computer Program for Crash Report Grouping
CN116723085A (en) Service conflict processing method and device, storage medium and electronic device
CN115658470A (en) Distributed system-oriented failure recovery mechanism automatic testing method and device
CN115525660A (en) Data table verification method, device, equipment and medium
CN116048959A (en) Website testing method, device, equipment and storage medium
CN114610644A (en) Method and device for testing database
CN109857675B (en) A Method of Locating Program Errors Using Statement Types
CN108920378B (en) Data layering method and system based on interface test
CN112380136A (en) Data cleaning method and device, test equipment and storage medium
CN115640236B (en) A script quality detection method and computing device
CN115048296B (en) Method for verifying validity of memory barrier instruction and related product
CN119167368B (en) Conditional competition vulnerability detection method and device
JP7447723B2 (en) Information processing program, information processing method, and information processing device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication

Application publication date: 20180323

WW01 Invention patent application withdrawn after publication