CN1567889A - A method of automatic primary standby equipment switching - Google Patents
A method of automatic primary standby equipment switching Download PDFInfo
- Publication number
- CN1567889A CN1567889A CN 03137486 CN03137486A CN1567889A CN 1567889 A CN1567889 A CN 1567889A CN 03137486 CN03137486 CN 03137486 CN 03137486 A CN03137486 A CN 03137486A CN 1567889 A CN1567889 A CN 1567889A
- Authority
- CN
- China
- Prior art keywords
- node
- equipment
- standby
- parameter
- active
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Landscapes
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
本发明提供了一种主备设备自动切换的方法,通过设置数据传输环网中传输设备不同的状态,以保证当主设备失效后,备节点自动切换自身参数的状态,并使该备设备替换失效的主设备继续工作,从而实现了对主设备数据的保护,保障了数据传输环网的稳定性与安全性。数据传输环网中的所有主备设备的切换动作均自动完成,且由于本发明的方案不需要预留特定的带宽,因而降低了成本;同时,本发明对数据传输环网中的其它设备没有其它额外的需求,因而满足通用的情况,且本发明的保护动作对用户透明。应用本发明,简单可靠。主备服务器切换过程中不需要人工的干预,倒换时间可在50ms内完成。
The present invention provides a method for automatic switching of active and standby equipment. By setting different states of the transmission equipment in the data transmission ring network, it is ensured that when the main equipment fails, the standby node automatically switches the state of its own parameters, and the standby equipment is replaced and invalidated. The master device continues to work, thereby realizing the protection of the data of the master device and ensuring the stability and security of the data transmission ring network. The switching actions of all active and standby devices in the data transmission ring network are automatically completed, and because the solution of the present invention does not need to reserve a specific bandwidth, the cost is reduced; at the same time, the present invention has no impact on other devices in the data transmission ring network. Other additional requirements, thus satisfying common situations, and the protection action of the present invention is transparent to the user. The application of the present invention is simple and reliable. No manual intervention is required during the switching process of the active and standby servers, and the switching time can be completed within 50ms.
Description
技术领域technical field
本发明涉及主备切换技术领域,特别是指一种环网上的主备设备自动切换的方法。The invention relates to the technical field of master-standby switchover, in particular to a method for automatic switchover of master-standby devices on a ring network.
背景技术Background technique
随着通信事业的发展,人们的学习、生活和工作越来越离不开网络。图1所示为数据接入因特网的示意图。用户访问因特网的数据首先接入接入服务器,如数字用户线接入服务器(DSLAM)等设备,用户的数据通过接入服务器的处理后,进入数据传输环网,由数据传输环网将用户的数据传送到服务器,如宽带接入服务器(BAS)、核心交换机等设备,服务器对用户数据完成相关的认证等工作后,提供接入因特网的服务,网管中心完成对整个网络的配置、性能和告警的监控,以及管理等功能。图中共有N台DSLAM设备、M台主用BAS设备和一个备用BAS设备,其中DSLAM1、DSLAM2和DSALM3通过BAS1接入因特网,DSLAM4、DSLAM5......DSLAMN分别使用BAS2~M接入因特网。在使用过程中如果出现某一主用BAS损坏,则在网管中心的调配下,由备用BAS替换损坏的主用BAS进行工作。With the development of the communication industry, people's study, life and work are increasingly inseparable from the network. Figure 1 shows a schematic diagram of data access to the Internet. The data of the user accessing the Internet is first connected to the access server, such as a digital subscriber line access server (DSLAM) and other equipment. After the user's data is processed by the access server, it enters the data transmission ring network, and the user's data The data is transmitted to the server, such as broadband access server (BAS), core switch and other equipment. After the server completes relevant authentication and other work for user data, it provides access to the Internet. The network management center completes the configuration, performance and alarm of the entire network. monitoring and management functions. In the figure, there are N DSLAM devices, M primary BAS devices and a backup BAS device, among which DSLAM1, DSLAM2 and DSALM3 access the Internet through BAS1, and DSLAM4, DSLAM5...DSLAMN use BAS2~M to access the Internet respectively . If a main BAS is damaged during use, under the deployment of the network management center, the damaged main BAS will be replaced by the backup BAS to work.
由于数据传输网络对人们生活和工作的影响越来越大,因而其安全性和稳定性也越来越重要。在一些特殊行业如:金融、电力和政府等重要部门,数据传输网的中断往往会带来不可估量的损失。提高数据网运行的稳定性,确保在数据网出现故障的情况下仍能保证数据的正常通信是十分重要的,现有应用方案主要有以下两种:As the data transmission network has more and more influence on people's life and work, its security and stability are becoming more and more important. In some special industries such as: financial, electric power and government and other important departments, the interruption of data transmission network often brings immeasurable losses. It is very important to improve the stability of the data network operation and ensure the normal communication of data in the event of a data network failure. The existing application solutions mainly include the following two types:
1)光纤直连方式。如图2所示,每台DSLAM设备同时提供两套光缆,分别直接接入主备BAS,图中101、102代表接入主BAS的数据流,201、202代表接入备BAS的数据流。如果主BAS或A点的光纤出现故障,则在主BAS和DSLAM之间都可以检测出来,此时,DSLAM的备用通道将业务流传递给备BAS,由备BAS提供接入因特网的服务。1) Optical fiber direct connection. As shown in Figure 2, each DSLAM device provides two sets of optical cables at the same time, which are directly connected to the main and standby BAS respectively. In the figure, 101 and 102 represent the data flow connected to the main BAS, and 201 and 202 represent the data flow connected to the standby BAS. If the main BAS or the optical fiber at point A fails, it can be detected between the main BAS and the DSLAM. At this time, the backup channel of the DSLAM transmits the business flow to the backup BAS, and the backup BAS provides the service of accessing the Internet.
上述方案的缺陷在于:虽然主备BAS的切换过程可自动完成,但由于每一个DSLAM都需要提供两套光缆分别与主备BAS相连,且每一个BAS与所有的DSLAM都直接连接,因此,不但光缆资源浪费较大,而且严重浪费了BAS的端口资源,如果BAS和DSLAM之间的距离较远,而它们之间又没有进行汇聚处理,则光缆资源浪费问题尤显突出,致使在一些光缆资源紧张的城市无法采用该方案。The defect of the above scheme is that although the switching process of the active and standby BAS can be completed automatically, since each DSLAM needs to provide two sets of optical cables to be connected to the active and standby BAS respectively, and each BAS is directly connected to all DSLAMs, therefore, not only The waste of optical cable resources is large, and the port resources of the BAS are seriously wasted. If the distance between the BAS and the DSLAM is long, and there is no aggregation process between them, the waste of optical cable resources is particularly prominent, resulting in some optical cable resources. Stressed cities cannot adopt this option.
2)数据传输环网方式。在BAS和DSLAM之间利用数据传输网络进行数据流的传输。如图3所示,数据传输环网能够提供业务的传输、汇聚和保护功能,图中的传输设备是用于完成数据流传输的同步数字体系(SDH)或异步转移模式(ATM)等设备,实线103为组网线。经DSLAM1接入的数据流101,同时向主备两个BAS进行传输,经DSLAM2接入的数据流102也同时向主备两个BAS进行传输。无论是主BAS,或A点光纤还是与主BAS相连的传输设备出现故障,网管中心都会发现问题,此时,由BAS网管人员通知数据传输环网的网管人员主BAS不能提供接入因特网的服务,数据传输环网的管理人员重新进行业务配置,手动完成主备BAS的倒换,由备BAS提供接入因特网的服务,以上倒换时间一般要30分钟左右。2) Data transmission ring network mode. Between BAS and DSLAM, the data transmission network is used to transmit the data flow. As shown in Figure 3, the data transmission ring network can provide service transmission, convergence and protection functions. The transmission equipment in the figure is a synchronous digital hierarchy (SDH) or asynchronous transfer mode (ATM) equipment used to complete data stream transmission. The
上述方案的缺陷在于:由于倒换过程需人工介入,不但存在使业务恢复时间很长,而且可能存在由于操作不当引起其它网上事故的可能,同时,由于每一个DSLAM都需要占用两条路径来传递被保护的业务,浪费了大量的传输带宽。The defect of the above scheme is that: because the switching process requires manual intervention, not only the service recovery time is very long, but also there may be other online accidents caused by improper operation. At the same time, because each DSLAM needs to occupy two paths to transmit the received Protected services waste a lot of transmission bandwidth.
发明内容Contents of the invention
有鉴于此,本发明的目的在于基于数据传输环网提供一种主备设备自动切换的方法,同时达到应用一台备用设备保护一台或多台主用设备中的数据,或应用多台备用设备保护多台主用设备中数据的目的。In view of this, the purpose of the present invention is to provide a method for automatic switching of master and backup devices based on the data transmission ring network, and at the same time achieve the protection of data in one or more master devices by using one backup device, or use multiple backup devices The purpose of the device protecting data in multiple active devices.
为达到上述目的本发明的技术方案是这样实现的:For achieving the above object, technical scheme of the present invention is achieved in that way:
一种主备设备自动切换的方法,该方法包括以下步骤:A method for automatic switching of active and standby equipment, the method comprising the following steps:
a、为所有主备设备设置标识该主备设备状态的参数;a. Set parameters identifying the state of the master and backup devices for all master and backup devices;
b、根据所获取的主设备的工作状态,判断主设备是否工作正常,如果是,则主设备继续正常处理所收到的数据流,备设备透传用户的数据流,否则,主备设备自动改变自身参数的状态,备设备切换为主设备处理所收到的数据流,主设备切换为备设备透传用户的数据流。b. According to the obtained working status of the master device, judge whether the master device is working normally. If yes, the master device continues to process the received data flow normally, and the backup device transparently transmits the user's data stream. Otherwise, the master device and the backup device automatically Change the state of its own parameters, the backup device switches to the master device to process the received data stream, and the master device switches to the backup device to transparently transmit the user's data stream.
较佳地,所述主设备为与环网挂接的主挂接设备和作为环网节点的传输设备的组合,所述备设备为与环网挂接的备挂接设备和作为环网节点的传输设备的组合,主备设备中所包含的环网节点分别为主备节点,Preferably, the master device is a combination of a master mount device connected to the ring network and a transmission device serving as a ring network node, and the backup device is a standby mount device connected to the ring network and a ring network node The combination of the transmission equipment, the ring network nodes contained in the main and standby equipment are respectively the main and standby nodes,
所述步骤a由网管中心为所有主备节点设置参数状态并进行初始化配置;In step a, the network management center sets parameter states for all active and standby nodes and performs initial configuration;
所述步骤b为:备节点定时向所有主节点发送管理消息,并根据在规定的时间内是否收主节点的响应消息来判断主用设备是否工作正常。The step b is: the standby node regularly sends management messages to all the master nodes, and judges whether the master device works normally according to whether it receives a response message from the master node within a specified time.
较佳地,所述主备设备为构成环的节点传输设备,主备设备分别为主备节点,且主备节点位于包含控制模块的同一设备中,由控制模块指定主节点和备节点,Preferably, the primary and backup devices are node transmission devices forming a ring, the primary and secondary devices are respectively primary and secondary nodes, and the primary and secondary nodes are located in the same device containing the control module, and the control module specifies the primary node and the secondary node,
所述步骤a该设备中的控制模块为所有主备节点设置参数状态并进行初始化配置;In step a, the control module in the device sets parameter states for all active and standby nodes and performs initial configuration;
所述步骤b为:由控制模块直接确定主用设备是否工作正常。The step b is: the control module directly determines whether the master device is working normally.
较佳地,所述参数至少包括节点业务处理模式参数、节点业务主备模式参数、节点工作地址参数和节点保护地址参数,所述参数设置进一步包括:Preferably, the parameters at least include node business processing mode parameters, node business active/standby mode parameters, node working address parameters and node protection address parameters, and the parameter settings further include:
设置主节点的节点业务处理模式参数为正常模式,节点业务主备模式参数为工作模式;Set the node business processing mode parameter of the master node to normal mode, and the node business master/standby mode parameter to work mode;
设置备节点的节点业务处理模式参数为穿通模式,节点业务主备模式参数为备份模式。Set the node business processing mode parameter of the standby node to pass-through mode, and the node business master/standby mode parameter to backup mode.
较佳地,步骤b所述主备设备自动改变自身参数状态进一步包括:Preferably, the active and standby devices in step b automatically changing their own parameter states further include:
主节点设置其自身的节点业务处理模式参数为穿通模式,节点业务主备模式参数为备份模式;The master node sets its own node business processing mode parameter as the pass-through mode, and the node business master/standby mode parameter as the backup mode;
备节点设置其自身的节点业务处理模式参数为正常模式,节点业务主备模式参数为工作模式,且设置自身的节点工作地址为待替换节点的地址。The standby node sets its own node business processing mode parameter to normal mode, the node business active standby mode parameter to work mode, and sets its own node working address to the address of the node to be replaced.
较佳地,所述步骤b进一步包括:备节点根据未收到的响应消息,确定出现故障的主节点的地址,将备设备替换该出现故障的主设备处理所收到的数据流。Preferably, the step b further includes: the backup node determines the address of the failed master node according to the unreceived response message, and replaces the failed master node with the backup device to process the received data flow.
较佳地,该方法进一步包括:判断是否有一个以上主设备同时出现故障,如果是,则根据预先设定的主设备的优先级别进行主备切换,否则,直接进行主备切换。Preferably, the method further includes: judging whether more than one master device fails at the same time, and if so, performing master-standby switchover according to the preset priority level of the master device, otherwise, directly performing master-standby switchover.
较佳地,系统中存在一个以上备设备,该方法进一步包括:备挂接设备之间通过服务消息的交互,确定对方当前的使用状态。Preferably, there is more than one standby device in the system, and the method further includes: determining the current usage status of the other party through the interaction of service messages between the standby hook-up devices.
较佳地,该方法进一步包括:判断是否有一个以上主设备,或与主设备直接相连的挂接设备,或主设备和与主设备直接相连的挂接设备之间的光缆出现故障,如果是,则根据预先设定的主设备的优先级别进行主备切换,否则,直接进行主备切换。Preferably, the method further includes: judging whether there is more than one master device, or the mount device directly connected to the master device, or the optical cable between the master device and the mount device directly connected to the master device fails, if , the master-standby switchover will be performed according to the preset priority level of the master device, otherwise, the master-standby switchover will be performed directly.
较佳地,所述环网为弹性分组环网,或数据分组环网,或具有数据分组功能的普通环网。Preferably, the ring network is an elastic packet ring network, or a data packet ring network, or an ordinary ring network with a data packet function.
应用本发明,通过设置数据传输环网中传输设备不同的状态,以保证当主设备失效后,备设备切换自身的状态,并使该备设备替换失效的主设备继续工作,从而实现了对主设备数据的保护,保障了数据传输环网的稳定性与安全性。数据传输环网中的所有主备设备的切换动作均自动完成,且由于本发明的方案不需要预留特定的带宽,因而降低了成本;同时,本发明对数据传输环网中的其它设备没有其它额外的需求,因而满足通用的情况,且本发明的保护动作对用户透明。应用本发明,简单可靠。主备设备切换过程中不需要人工的干预,倒换时间可在50ms内完成。By applying the present invention, by setting different states of the transmission equipment in the data transmission ring network, it is ensured that when the main equipment fails, the standby equipment switches its own state, and the standby equipment replaces the failed main equipment to continue working, thereby realizing the control of the main equipment. Data protection ensures the stability and security of the data transmission ring network. The switching actions of all active and standby devices in the data transmission ring network are automatically completed, and because the solution of the present invention does not need to reserve a specific bandwidth, the cost is reduced; at the same time, the present invention has no impact on other devices in the data transmission ring network. Other additional requirements, thus satisfying common situations, and the protection action of the present invention is transparent to the user. The application of the present invention is simple and reliable. No manual intervention is required during the switchover process of the active and standby equipment, and the switchover time can be completed within 50ms.
附图说明Description of drawings
图1所示为数据接入因特网的示意图;Figure 1 shows a schematic diagram of data access to the Internet;
图2所示为现有技术的方案一的示意图;FIG. 2 is a schematic diagram of a scheme 1 of the prior art;
图3所示为现有技术的方案二的示意图;FIG. 3 is a schematic diagram of a second scheme of the prior art;
图4所示为应用本发明的业务数据流流向的示意图。FIG. 4 is a schematic diagram of the flow direction of service data applying the present invention.
具体实施方式Detailed ways
下面结合附图及具体实施例对本发明再作进一步详细的说明。The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.
本发明基于数据环网,利用数据环网的传输、汇聚等特性,将接入的数据通过环网的传输设备传送到挂接在环网上的主用设备上,由主用设备对所收到的数据进行相应的处理。The present invention is based on the data ring network, and utilizes the characteristics of transmission and aggregation of the data ring network to transmit the accessed data to the main device connected to the ring network through the transmission equipment of the ring network, and the main device is responsible for receiving the received data. The data are processed accordingly.
图4所示为应用本发明的一实施例。本实施例以具有弹性分组特性的数据环网以及挂接在该环网上的数据服务器为例进行具体说明,并将主备数据服务器和分别与主备数据服务器直接相连的环网上的传输设备称为主备设备,且主备设备中的传输设备分别称为主备节点。Figure 4 shows an embodiment of the application of the present invention. In this embodiment, the data ring network with elastic grouping characteristics and the data servers attached to the ring network are taken as an example for specific description, and the active and standby data servers and the transmission devices on the ring network directly connected to the active and standby data servers are called The active and standby devices, and the transmission devices in the active and standby devices are respectively called the active and standby nodes.
在本实施例中,整个系统由网管中心完成对整个网络的配置、性能和告警的监控,以及管理等功能。In this embodiment, the network management center completes the configuration, performance, alarm monitoring and management of the entire network in the entire system.
网管中心对所有与主备服务器直接相连的节点,如图4中的节点1和节点2,进行参数设置,每一节点有以下几个参数:The network management center sets parameters for all nodes directly connected to the active and standby servers, such as node 1 and node 2 in Figure 4, and each node has the following parameters:
节点业务处理模式(ProcessMode)参数,该参数有正常模式(normal)和穿通模式(passthrough)两种状态,其中,正常模式(normal)是指该节点对经过的数据流进行上行和下行的传递;穿通模式(passthrough)是指数据流只能从该节点穿通,而该节点不能实现对数据流进行上下行传递的控制,但该节点可识别所传消息的标志位,并截留下控制本节点的消息,而对其它消息则进行透传。Node business processing mode (ProcessMode) parameter, which has two states: normal mode (normal) and passthrough mode (passthrough), wherein, normal mode (normal) means that the node performs uplink and downlink transmission of the passing data stream; The passthrough mode means that the data flow can only pass through the node, and the node cannot realize the control of the uplink and downlink transmission of the data flow, but the node can recognize the flag bit of the transmitted message and intercept the control node. message, while other messages are transparently transmitted.
节点业务主备模式(ProtectMode)参数,该参数有工作模式(work)和备份模式(back)两种状态,其中,工作模式(work)是指该节点能够正常处理数据流,控制数据流在本节点进行上行和下行传递;备份模式(back)是指该节点工作在备份方式,只对数据流完成穿通功能,不能控制数据流进行上下行传递。Node business master mode (ProtectMode) parameter, this parameter has two states of work mode (work) and backup mode (back), among them, work mode (work) means that the node can process data flow normally, control data flow in this The node performs uplink and downlink transmission; the backup mode (back) means that the node works in the backup mode, and only completes the pass-through function for the data flow, and cannot control the data flow for uplink and downlink transmission.
节点工作地址(NodeNormalAddress)参数,是指在正常工作时该节点所在数据传输环网上的被标记的环节点地址。The node working address (NodeNormalAddress) parameter refers to the marked ring node address on the data transmission ring network where the node is located during normal operation.
节点保护地址(NodeProtectAddress[N])参数,是指在实施保护时处于备份工作模式的节点切换为待保护的数据传输环网上被标记的环节点地址。其中,N为待保护的数目。The node protection address (NodeProtectAddress[N]) parameter refers to the marked ring node address on the data transmission ring network to be protected when the node in the backup working mode switches to the data transmission ring network to be protected when the protection is implemented. Wherein, N is the number to be protected.
备节点定时向主节点发送管理消息,其可通过固定通道或其它方式,以检测主节点的工作状态是否正常,该管理消息类似相互间心跳的检查;主节点收到备节点发送来的管理消息后给予响应。上述管理消息中,包括检测信息还可包括主服务器的数据备份信息,从而使主备服务器中的数据保持一致。备节点可对管理消息的标志位进行识别,并对其进行处理,而不会透传到下一个节点上。The backup node regularly sends management messages to the master node, which can check whether the working status of the master node is normal through a fixed channel or other methods. The management messages are similar to mutual heartbeat checks; the master node receives the management messages sent by the backup node Then give a response. In the above management message, the detection information may also include the data backup information of the master server, so that the data in the master server and the slave server are consistent. The standby node can identify the flag bit of the management message and process it without transparently transmitting it to the next node.
如果备节点和主节点间能够周期性的收到相互的管理消息,则认为主服务器一切正常,如果备节点在规定的时间内没有收到主节点的消息响应,则认为主服务器出现故障,则备节点自动改变其自身的参数状态,以替换主节点正常工作,同时,主节点将自身设置为穿通状态,对流至的数据流不再做处理,而备节点将自身设置为正常状态,对流经的数据流不再穿通,而进行相应的处理。在本实施例中,接入服务器以DSLAM设备为例,服务器以BAS设备为例,并为叙述简便将数据传输环网中的传输设备定义为节点。If the backup node and the master node can periodically receive mutual management messages, it is considered that the master server is normal. If the slave node does not receive the message response from the master node within the specified time, it is considered that the master server has failed, The standby node automatically changes its own parameter status to replace the master node to work normally. The data flow is no longer passed through, and the corresponding processing is carried out. In this embodiment, a DSLAM device is used as an example for the access server, and a BAS device is used as an example for the server, and the transmission device in the data transmission ring network is defined as a node for simplicity of description.
本发明的备份方案可分为1:1备份方案、1:N备份方案和M:N备份方案三种情况,其中,M为备用设备的数目,N为主用设备的数目,且M必须小于等于N(M<=N),M和N的值根据具体的设计要求确定,下面分别进行说明。The backup scheme of the present invention can be divided into three cases: 1:1 backup scheme, 1:N backup scheme and M:N backup scheme, wherein, M is the number of backup equipment, N is the number of active equipment, and M must be less than It is equal to N (M<=N), and the values of M and N are determined according to specific design requirements, which will be described respectively below.
1、1:1备份方案1. 1:1 backup scheme
图4所示为应用本发明的业务数据流流向的示意图。图中节点1~4是具备弹性分组环功能或类似功能的传输设备,上述传输设备组成弹性分组环网,实线103为组网线。假定节点1为主节点,节点2为备节点,则节点1正常处理数据流,控制数据流在本节点进行上行和下行传递,节点2只完成穿通功能,不对数据流做任何处理。那么,对节点1和节点2的所有参数的具体配置为:FIG. 4 is a schematic diagram of the flow direction of service data applying the present invention. Nodes 1 to 4 in the figure are transmission devices with an RPR function or similar functions. The above-mentioned transmission devices form a RPR network, and the
节点1的参数配置:Parameter configuration of node 1:
ProcessMode=normal ProtectMode=workProcessMode=normal ProtectMode=work
NodeNormalAddress=Node1 NodeProtectAddress=Node2NodeNormalAddress=Node1 NodeProtectAddress=Node2
节点2的参数配置:Parameter configuration of node 2:
ProcessMode=passthrough ProtectMode=backProcessMode=passthrough ProtectMode=back
NodeNormalAddress=Node2 NodeProtectAddress=Node1NodeNormalAddress=Node2 NodeProtectAddress=Node1
在正常情况下,经DSLAM1接入的数据流101经过节点4、节点1传送到主BAS,由主BAS提供接入因特网的服务;经DSLAM2接入的数据流102经过节点3、节点4、节点1传送到主BAS,由主BAS提供接入因特网的服务。同时,备节点2定时向主节点1发送管理消息,以检测主节点1的工作状态是否正常,该管理消息类似相互间心跳的检查,管理消息发送的频率建议为10ms;主节点1收到备节点2发送来的管理消息后给予响应。上述管理消息中,既包括检测信息还可包括主BAS的数据备份信息,从而使主备BAS中的数据保持一致。备节点2可对管理消息的标志位进行识别,并对其进行处理,而不会透传到下一个节点上。Under normal circumstances, the
如果备节点2和主节点1间能够周期性的收到相互的管理消息,则认为主服务器一切正常,如果备节点2在规定的时间内,如100ms,没有收到主节点1的消息响应,则认为主服务器出现故障,该故障可以是主BAS发生故障,或节点1发生故障,或节点1与主BAS之间的光纤发生故障,此时,节点2自动改变其自身的所有参数设置:将工作地址置为Node1,即NodeNormalAddress=Node1,节点主备模式置为工作模式,即ProtectMode=work,节点业务处理模式置为正常模式,即ProcessMode=normal,同时,节点1的业务处理模式参数设置为穿通状态,即ProcessMode=passthrough,这样,由节点2自动处理本应由主节点1完成的工作,并由备BAS将数据流接入因特网,从而自动完成主备服务器的切换。如果上述故障出现在节点1,即由于节点1损坏而不能完成数据流穿通功能,则由于弹性分组环网可检测错误并使每个受影响的节点启动故障覆盖(fail-over)功能的,该功能能够把一个数据流从故障路径改向到保护路径并进行重发,因此,备节点2可处理到所有应由主节点1完成的工作。If the standby node 2 and the primary node 1 can periodically receive mutual management messages, it is considered that the primary server is normal. If the secondary node 2 does not receive the message response from the primary node 1 within the specified time, such as 100ms, Then it is considered that the main server has a failure. The failure can be the failure of the main BAS, or the failure of node 1, or the failure of the optical fiber between node 1 and the main BAS. At this time, node 2 automatically changes all its own parameter settings: set The working address is set to Node1, that is, NodeNormalAddress=Node1, the node active/standby mode is set to the working mode, that is, ProtectMode=work, the node business processing mode is set to the normal mode, that is, ProcessMode=normal, and meanwhile, the business processing mode parameter of node 1 is set to Passthrough state, that is, ProcessMode=passthrough, so that node 2 automatically handles the work that should be done by the master node 1, and the backup BAS connects the data flow to the Internet, thereby automatically completing the switchover between the master and backup servers. If the above-mentioned fault occurs at node 1, that is, the data flow pass-through function cannot be completed due to node 1 being damaged, since the resilient packet ring network can detect errors and enable each affected node to start the fail-over function, the The function can redirect a data flow from the failure path to the protection path and retransmit it, so the backup node 2 can handle all the work that should be done by the master node 1.
图4所示的组网也可为双归属形式,定义主BAS处于工作状态,备BAS处于备份状态,节点1处于工作模式,节点2处于穿通模式下。节点1和节点2为两个地理上分离的节点,其节点号可相同,由其所处的工作状态进行区分。正常工作下,节点3、节点4的数据向主BAS汇聚,节点2处于穿通的工作模式下,不接收和发送数据,但处于热备份工作中。当主BAS发生故障,或节点1发生故障,或主BAS与节点1之间的连接线路发生故障时,节点1可以将自身的工作状态设置为穿通状态,节点2将自身的工作状态设置为工作状态,并改变其相应的参数,环网上的数据流转而从节点2汇聚到备BAS,从而实现了在RPR环网上的1:1双归保护。The networking shown in Figure 4 can also be in the form of dual-homing. It is defined that the active BAS is in the working state, the standby BAS is in the backup state, node 1 is in the working mode, and node 2 is in the pass-through mode. Node 1 and Node 2 are two geographically separated nodes, and their node numbers can be the same, and they are distinguished by their working status. Under normal operation, the data of nodes 3 and 4 are aggregated to the main BAS, and node 2 is in the pass-through working mode, does not receive or send data, but is in hot backup work. When the main BAS fails, or node 1 fails, or the connection line between the main BAS and node 1 fails, node 1 can set its own working state to pass-through state, and node 2 can set its own working state to working state , and change its corresponding parameters, the data flow on the ring network is converged from node 2 to the standby BAS, thus realizing the 1:1 dual-homing protection on the RPR ring network.
2、1:N主备保护方案2. 1:N active/standby protection scheme
该方案能够实现一套备用服务器同时保护N套主用服务器的情况。其具体的处理方式如下:This scheme can realize the situation that one set of standby servers protects N sets of active servers at the same time. The specific processing method is as follows:
与备用服务器直接相连的节点同时向与N套主用服务器直接相连的节点分别发送管理消息,与N套主用服务器直接相连的节点分别向与备用服务器直接相连的节点发送管理消息的响应消息。The nodes directly connected to the standby server send management messages to the nodes directly connected to the N sets of active servers at the same time, and the nodes directly connected to the N sets of active servers respectively send management message response messages to the nodes directly connected to the standby servers.
如果在规定的时间内与备用服务器直接相连的节点能够周期性的收到所有管理消息的响应消息,则认为主用服务器一切正常;否则,认为主用服务器出现故障,该故障可以是主用服务器发生故障,或与该主用服务器直接相连的节点发生故障,或主用服务器与节点之间的光纤发生故障,与备用服务器直接相连的节点根据未收到的响应消息判断出现故障的主用节点地址,并将备节点的工作地址置为待保护节点地址,即NodeNormalAddress[X],其中,X为出现故障的主用节点号,同时设置该备用节点的业务处理模式参数为正常模式,节点主备模式参数为工作模式,这样,由备节点自动处理本应由主节点X完成的工作,并由备服务器将数据流接入因特网,从而自动完成主备服务器的切换。如果上述故障出现在节点X,即由于节点X损坏而不能完成数据流穿通功能,则由于弹性分组环网具有故障覆盖(fail-over)功能,备节点仍可处理到所有应由主节点X完成的工作。If the node directly connected to the standby server can periodically receive the response messages of all management messages within the specified time, it is considered that the active server is normal; otherwise, the active server is considered to be faulty, and the fault can be the If a failure occurs, or the node directly connected to the active server fails, or the optical fiber between the active server and the node fails, the node directly connected to the standby server judges the failed active node according to the unreceived response message address, and set the working address of the standby node as the address of the node to be protected, that is, NodeNormalAddress[X], where X is the number of the active node that fails, and set the business processing mode parameter of the standby node to normal mode, and the active node The standby mode parameter is the working mode. In this way, the standby node automatically handles the work that should be done by the primary node X, and the standby server connects the data flow to the Internet, thereby automatically completing the switchover between the active and standby servers. If the above-mentioned failure occurs on node X, that is, the data flow pass-through function cannot be completed due to node X being damaged, the backup node can still handle all the data that should be completed by the master node X due to the fail-over function of the resilient packet ring network. work.
如果在工作中出现多个主用节点同时出现故障的情况,则可以预先设置主用节点的优先级别,根据优先级别进行主备切换。If multiple active nodes fail at the same time during work, you can pre-set the priority level of the active node, and perform active-standby switchover according to the priority level.
3、M:N主备保护方案3. M:N active/standby protection scheme
该方案能够实现M套备用服务器同时保护N套主用服务器的情况。其具体的处理方式如下:This scheme can realize the situation that M sets of standby servers protect N sets of active servers at the same time. The specific processing method is as follows:
与M套备用服务器直接相连的节点同时分别向与N套主用服务器直接相连的节点发送管理消息,与N套主用服务器直接相连的节点根据收到的消息分别向与M套备用服务器直接相连的节点发送响应消息。同时,M套备用服务器之间可以通过服务消息进行相互通信,以了解备用服务器的使用情况,如某个或某几个备用服务器已替换主用服务器进行工作,或某个或某几个备用服务器仍处于备用状态,该消息与管理消息类似,备节点可对该服务消息的标志位进行识别,并对其进行处理,而不会透传到下一个节点上。Nodes directly connected to M sets of standby servers send management messages to nodes directly connected to N sets of active servers at the same time, and nodes directly connected to N sets of active servers send management messages to M sets of standby servers according to the received messages. The node sends a response message. At the same time, M sets of backup servers can communicate with each other through service messages to understand the usage of the backup servers, such as one or several backup servers have replaced the main server for work, or one or several backup servers Still in the standby state, this message is similar to the management message, and the standby node can identify the flag bit of the service message and process it without transparently transmitting it to the next node.
如果在规定的时间内与M套备用服务器直接相连的节点能够周期性的收到所有管理消息的响应消息,则认为主用服务器一切正常;否则,认为主用服务器出现故障,该故障可以是主用服务器发生故障,或与该主用服务器直接相连的节点发生故障,或主用服务器与节点之间的光纤发生故障,与M套备用服务器直接相连的节点根据未收到的响应消息判断出现故障的主用节点地址,并按照预先策略选用一套备用服务器,该预先策略可以是轮循方式、或查表方式或设置优先级等方式,将与该套备用服务器直接相连的节点的工作地址置为待保护地址,即NodeNormalAddress[X],其中,X为出现故障的主用节点号,同时设置该备用节点的业务处理模式参数为正常模式,节点主备模式参数为工作模式,这样,由备节点自动处理本应由主节点X完成的工作,并由备服务器将数据流接入因特网,从而自动完成主备服务器的切换。如果上述故障出现在节点X,即由于节点X损坏而不能完成数据流穿通功能,则由于弹性分组环网具有故障覆盖(fail-over)功能,备节点仍可处理到所有应由主节点X完成的工作。If the nodes directly connected to M sets of standby servers can periodically receive the response messages of all management messages within the specified time, it is considered that the active server is normal; otherwise, it is considered that the active server fails, and the failure can be the main If the active server fails, or the node directly connected to the active server fails, or the optical fiber between the active server and the node fails, the node directly connected to M sets of standby servers judges that there is a failure based on the unreceived response message The address of the active node, and select a set of standby servers according to the pre-strategy. The pre-strategy can be round robin, table look-up, or priority setting. The working address of the node directly connected to the set of standby servers is set is the address to be protected, that is, NodeNormalAddress[X], where X is the number of the active node that has failed, and at the same time set the business processing mode parameter of the backup node to normal mode, and the node master/standby mode parameter to work mode. In this way, the standby node The node automatically handles the work that should be done by the primary node X, and the standby server connects the data flow to the Internet, thereby automatically completing the switchover between the primary and secondary servers. If the above-mentioned failure occurs on node X, that is, the data flow pass-through function cannot be completed due to node X being damaged, the backup node can still handle all the data that should be completed by the master node X due to the fail-over function of the resilient packet ring network. work.
本发明的另一实施例是:将具有弹性分组环网特性的传输设备作为一个模块,多个具有弹性分组环网特性的模块位于同一装置中且构成环,由该装置中的控制模块对每个具有弹性分组环网特性的模块进行控制,为每一个具有弹性分组环网特性的模块设置标识该模块状态的参数,并确定其工作状态。将构成环的处于工作状态的传输设备称为主设备即主节点,将构成环的处于非工作状态的传输设备称为备设备即备节点。当主设备出现故障时,切换备节点的参数状态以及主节点的参数状态,以使备设备替换出现故障的主设备继续进行正常工作。Another embodiment of the present invention is: the transmission equipment with the characteristics of the elastic packet ring network is used as a module, a plurality of modules with the characteristics of the elastic packet ring network are located in the same device and form a ring, and the control module in the device controls each Each module with the characteristics of the elastic packet ring network is controlled, and a parameter identifying the status of the module is set for each module with the characteristics of the elastic packet ring network, and its working status is determined. The transmission equipment in the working state forming the ring is called the main equipment, that is, the master node, and the transmission equipment in the non-working state forming the ring is called the standby equipment, that is, the standby node. When the master device fails, switch the parameter status of the backup node and the parameter status of the master node, so that the backup device replaces the failed master device and continues to work normally.
主备设备可同时连接于一个外挂设备上,也可分别连接于不同的外挂设备上。The main and standby devices can be connected to one external device at the same time, or they can be connected to different external devices respectively.
以上所述仅为利用弹性分组环网实现的较佳实施例,所述的数据环网也可以是数据分组环网,或具有数据分组功能的普通数据环网。The above is only a preferred embodiment implemented by using the elastic packet ring network, and the data ring network may also be a data packet ring network, or an ordinary data ring network with a data packet function.
以上所述仅为本发明的较佳实施例而已,并不用以限制本发明,凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。The above descriptions are only preferred embodiments of the present invention, and are not intended to limit the present invention. Any modifications, equivalent replacements, improvements, etc. made within the spirit and principles of the present invention shall be included in the scope of the present invention. within the scope of protection.
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNB031374867A CN100493017C (en) | 2003-06-25 | 2003-06-25 | A method of automatic switching for primary standby equipment |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNB031374867A CN100493017C (en) | 2003-06-25 | 2003-06-25 | A method of automatic switching for primary standby equipment |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN1567889A true CN1567889A (en) | 2005-01-19 |
| CN100493017C CN100493017C (en) | 2009-05-27 |
Family
ID=34470434
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CNB031374867A Expired - Fee Related CN100493017C (en) | 2003-06-25 | 2003-06-25 | A method of automatic switching for primary standby equipment |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN100493017C (en) |
Cited By (21)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2006105722A1 (en) * | 2005-04-04 | 2006-10-12 | Huawei Technologies Co., Ltd. | A method for implementing network protection combining the network element double-adscription and the ringnet protection |
| CN100432946C (en) * | 2005-12-31 | 2008-11-12 | 华为技术有限公司 | A device and method for realizing protection switching control |
| CN100446499C (en) * | 2006-04-14 | 2008-12-24 | 华为技术有限公司 | A protection switching method and device |
| CN1984120B (en) * | 2006-05-15 | 2010-05-12 | 华为技术有限公司 | A method to ensure the consistency between the bill file and the original data in the process of dual-machine switching |
| CN101127653B (en) * | 2006-08-14 | 2010-06-23 | 中兴通讯股份有限公司 | The Ethernet ring network device with backup and the realization method of its master device backup |
| CN1964248B (en) * | 2006-11-17 | 2011-04-20 | 中兴通讯股份有限公司 | A device and method to realize backup protection of data service |
| CN101252426B (en) * | 2007-09-11 | 2011-05-11 | 北京东土科技股份有限公司 | Method for realizing high-reliability distributed redundant looped network |
| CN102143011A (en) * | 2010-08-23 | 2011-08-03 | 华为技术有限公司 | Device and method for realizing network protection |
| CN102292938A (en) * | 2009-08-13 | 2011-12-21 | 三星Sds株式会社 | Electronic repair device, network system, and operation method in network system |
| CN104253731A (en) * | 2013-06-25 | 2014-12-31 | 罗伯特·博世有限公司 | Method for running communication device |
| CN104317208A (en) * | 2014-10-17 | 2015-01-28 | 安徽立卓智能电网科技有限公司 | Method for user machine for automatically identifying switch between active director and standby director |
| CN104866395A (en) * | 2014-02-24 | 2015-08-26 | 中国移动通信集团福建有限公司 | Data backup method, data backup system, main-backup switching method and main-backup switching system |
| CN107145400A (en) * | 2010-09-22 | 2017-09-08 | 罗塞塔石碑有限公司 | Computerization task is performed using different device |
| CN107291787A (en) * | 2016-04-13 | 2017-10-24 | 中兴通讯股份有限公司 | Master/slave data storehouse switching method and apparatus |
| CN109471770A (en) * | 2018-09-11 | 2019-03-15 | 华为技术有限公司 | A system management method and device |
| CN109831325A (en) * | 2019-01-22 | 2019-05-31 | 武汉精立电子技术有限公司 | A kind of server switching system and method for sentencing again |
| CN110956758A (en) * | 2019-12-19 | 2020-04-03 | 中邮智递科技有限公司 | Express delivery cabinet lock accuse board communication system and intelligent express delivery cabinet |
| CN111464348A (en) * | 2020-03-30 | 2020-07-28 | 中国电子科技集团公司第五十四研究所 | A hierarchical multi-dimensional system monitoring method |
| WO2023280041A1 (en) * | 2021-07-09 | 2023-01-12 | 三维通信股份有限公司 | Active-standby switching processing method and system, and electronic apparatus and storage medium |
| CN116476898A (en) * | 2023-05-10 | 2023-07-25 | 中铁第四勘察设计院集团有限公司 | A method and system for automatically switching redundant ground electronic units of a railway train control system |
| CN117376060A (en) * | 2023-10-07 | 2024-01-09 | 武汉船舶通信研究所(中国船舶集团有限公司第七二二研究所) | A site business replacement method |
-
2003
- 2003-06-25 CN CNB031374867A patent/CN100493017C/en not_active Expired - Fee Related
Cited By (31)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2006105722A1 (en) * | 2005-04-04 | 2006-10-12 | Huawei Technologies Co., Ltd. | A method for implementing network protection combining the network element double-adscription and the ringnet protection |
| CN1848714B (en) * | 2005-04-04 | 2010-06-23 | 华为技术有限公司 | A Method for Realizing Network Protection Through the Combination of Network Element Dual Homing and Ring Network Protection |
| US7764597B2 (en) | 2005-04-04 | 2010-07-27 | Huawei Technologies Co., Ltd. | Method for implementing network protection combining network element dual homing and ring network protection |
| CN100432946C (en) * | 2005-12-31 | 2008-11-12 | 华为技术有限公司 | A device and method for realizing protection switching control |
| CN100446499C (en) * | 2006-04-14 | 2008-12-24 | 华为技术有限公司 | A protection switching method and device |
| CN1984120B (en) * | 2006-05-15 | 2010-05-12 | 华为技术有限公司 | A method to ensure the consistency between the bill file and the original data in the process of dual-machine switching |
| CN101127653B (en) * | 2006-08-14 | 2010-06-23 | 中兴通讯股份有限公司 | The Ethernet ring network device with backup and the realization method of its master device backup |
| CN1964248B (en) * | 2006-11-17 | 2011-04-20 | 中兴通讯股份有限公司 | A device and method to realize backup protection of data service |
| CN101252426B (en) * | 2007-09-11 | 2011-05-11 | 北京东土科技股份有限公司 | Method for realizing high-reliability distributed redundant looped network |
| CN102292938A (en) * | 2009-08-13 | 2011-12-21 | 三星Sds株式会社 | Electronic repair device, network system, and operation method in network system |
| CN102292938B (en) * | 2009-08-13 | 2014-10-29 | 三星Sds株式会社 | An electronic repair device |
| US9369337B2 (en) | 2009-08-13 | 2016-06-14 | Samsung Sds Co., Ltd. | Electronic patch device, network system, and operation method in network system |
| CN102143011A (en) * | 2010-08-23 | 2011-08-03 | 华为技术有限公司 | Device and method for realizing network protection |
| CN102143011B (en) * | 2010-08-23 | 2013-11-06 | 华为技术有限公司 | Device and method for realizing network protection |
| CN107145400B (en) * | 2010-09-22 | 2020-09-15 | 罗塞塔石碑有限公司 | Performing computerized tasks using different devices |
| CN107145400A (en) * | 2010-09-22 | 2017-09-08 | 罗塞塔石碑有限公司 | Computerization task is performed using different device |
| CN104253731A (en) * | 2013-06-25 | 2014-12-31 | 罗伯特·博世有限公司 | Method for running communication device |
| CN104866395B (en) * | 2014-02-24 | 2017-11-21 | 中国移动通信集团福建有限公司 | Data back up method and system, main/standby switching method and system |
| CN104866395A (en) * | 2014-02-24 | 2015-08-26 | 中国移动通信集团福建有限公司 | Data backup method, data backup system, main-backup switching method and main-backup switching system |
| CN104317208A (en) * | 2014-10-17 | 2015-01-28 | 安徽立卓智能电网科技有限公司 | Method for user machine for automatically identifying switch between active director and standby director |
| CN104317208B (en) * | 2014-10-17 | 2017-08-25 | 安徽立卓智能电网科技有限公司 | A kind of method that subscriber computer automatic identification commands machine active-standby switch |
| CN107291787A (en) * | 2016-04-13 | 2017-10-24 | 中兴通讯股份有限公司 | Master/slave data storehouse switching method and apparatus |
| CN107291787B (en) * | 2016-04-13 | 2020-08-28 | 中兴通讯股份有限公司 | Active-standby database switching method and device |
| CN109471770A (en) * | 2018-09-11 | 2019-03-15 | 华为技术有限公司 | A system management method and device |
| CN109831325A (en) * | 2019-01-22 | 2019-05-31 | 武汉精立电子技术有限公司 | A kind of server switching system and method for sentencing again |
| CN109831325B (en) * | 2019-01-22 | 2021-09-24 | 武汉精立电子技术有限公司 | Server switching system and method for re-judgment |
| CN110956758A (en) * | 2019-12-19 | 2020-04-03 | 中邮智递科技有限公司 | Express delivery cabinet lock accuse board communication system and intelligent express delivery cabinet |
| CN111464348A (en) * | 2020-03-30 | 2020-07-28 | 中国电子科技集团公司第五十四研究所 | A hierarchical multi-dimensional system monitoring method |
| WO2023280041A1 (en) * | 2021-07-09 | 2023-01-12 | 三维通信股份有限公司 | Active-standby switching processing method and system, and electronic apparatus and storage medium |
| CN116476898A (en) * | 2023-05-10 | 2023-07-25 | 中铁第四勘察设计院集团有限公司 | A method and system for automatically switching redundant ground electronic units of a railway train control system |
| CN117376060A (en) * | 2023-10-07 | 2024-01-09 | 武汉船舶通信研究所(中国船舶集团有限公司第七二二研究所) | A site business replacement method |
Also Published As
| Publication number | Publication date |
|---|---|
| CN100493017C (en) | 2009-05-27 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN1567889A (en) | A method of automatic primary standby equipment switching | |
| CN1072869C (en) | method and system for data protection in a digital telecommunications system | |
| US8886831B2 (en) | System and methodology for fast link failover based on remote upstream failures | |
| CN100391191C (en) | Switch and method for rerouting traffic due to detection of failed link | |
| US8117337B2 (en) | Method and device for implementing link pass through in point-to-multipoint network | |
| US8830825B2 (en) | Method and system for priority based (1:1)n ethernet protection | |
| US20030031126A1 (en) | Bandwidth reservation reuse in dynamically allocated ring protection and restoration technique | |
| CN102014019B (en) | Aggregated link switching method and device | |
| WO2002065661A1 (en) | System and method for fast-rerouting of data in a data communication network | |
| CN1941730A (en) | Method for protecting RPR bridge redundancy | |
| WO2009082894A1 (en) | Automatic protection switching implementation method, system and board equipment for main and standby boards | |
| CN1934561A (en) | Restoration mechanism for network topologies | |
| CN100352223C (en) | Method for protecting data service in metropolitan area transmission network | |
| CN1870480B (en) | Ring network protection method of optical transmission network | |
| CN102231126B (en) | Method and system for implementing inter-core backup in multi-core processor | |
| JP4320602B2 (en) | Subscriber unit redundancy system and subscriber unit redundancy method | |
| CN1859219A (en) | Service take-over method based on device disaster tolerance, service switching device and backup machine | |
| CN1949700A (en) | Method and apparatus for mixed network protection | |
| CN101420351B (en) | Apparatus and method for implementing service protection on elastic packet ring | |
| CN102045600A (en) | Link protection method and device of Ethernet accessed to SDH (Synchronous Digital Hierarchy) optical network | |
| US7746949B2 (en) | Communications apparatus, system and method of creating a sub-channel | |
| JP2013192022A (en) | Network device, link aggregation system and redundancy method therefor | |
| CN1866866A (en) | Data transmission system and method between telecommunication equipments based on point-to-point connection | |
| CN101136899B (en) | Ethernet physical layer transmission protecting method, and device and equipment for implementing the same | |
| CN101299703B (en) | Elastic packet ring as well as link condition transference method for the same |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| CF01 | Termination of patent right due to non-payment of annual fee | ||
| CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20090527 Termination date: 20200625 |