CN106817267A - A kind of fault detection method and equipment - Google Patents
A kind of fault detection method and equipment Download PDFInfo
- Publication number
- CN106817267A CN106817267A CN201510851553.2A CN201510851553A CN106817267A CN 106817267 A CN106817267 A CN 106817267A CN 201510851553 A CN201510851553 A CN 201510851553A CN 106817267 A CN106817267 A CN 106817267A
- Authority
- CN
- China
- Prior art keywords
- fault detection
- frequency
- detection message
- device sends
- threshold
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/10—Active monitoring, e.g. heartbeat, ping or trace-route
- H04L43/103—Active monitoring, e.g. heartbeat, ping or trace-route with adaptive polling, i.e. dynamically adapting the polling rate
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/08—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
- H04L43/0805—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
- H04L43/0811—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking connectivity
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Environmental & Geological Engineering (AREA)
- Health & Medical Sciences (AREA)
- Cardiology (AREA)
- General Health & Medical Sciences (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
本发明实施例涉及通信技术领域,尤其涉及一种故障检测方法和设备,用以减轻网络负荷,降低网络开销。本发明实施例中,确定第一设备与第二设备对应的所有用户设备之间未建立业务连接,确定第一设备发送故障检测报文的频率,在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值;和/或所述确定所述第二设备发送故障检测报文的频率;在确定所述第二设备发送故障检测报文的频率不小于第二阈值时,调整所述第二设备发送故障检测报文的频率,以使调整后的所述第二设备发送故障检测报文的频率小于第二阈值。
The embodiments of the present invention relate to the field of communication technologies, and in particular to a fault detection method and equipment for reducing network load and network overhead. In the embodiment of the present invention, it is determined that no service connection has been established between all user equipment corresponding to the first device and the second device, and the frequency at which the first device sends a fault detection message is determined. After determining the frequency at which the first device sends a fault detection message When not less than the first threshold, adjust the frequency of the first device to send the fault detection message, so that the adjusted frequency of the first device to send the fault detection message is less than the first threshold; and/or said determining that the second device The frequency of sending fault detection messages; when it is determined that the frequency of sending fault detection messages from the second device is not less than the second threshold, adjust the frequency of sending fault detection messages from the second device so that the adjusted The frequency at which the second device sends the fault detection packet is less than the second threshold.
Description
技术领域technical field
本发明实施例涉及通信领域,尤其涉及一种故障检测方法和设备。The embodiments of the present invention relate to the communication field, and in particular, to a fault detection method and device.
背景技术Background technique
在通信领域中,为保障网络设备之间的数据传输质量,需对网络设备之间的通信故障进行快速检测,以在出现故障时及时建立新的通信链路,或切换到其它通信链路上去。目前,双向故障检测(Bidirectional Forwarding Detection,简称BFD)方法,是一种应用最广泛的网络故障检测技术。BFD的特点是:轻负荷、持续时间短,且能对任何类型的通信链路进行快速,如直接的物理链路、虚拟电路、多跳路由通道等。In the field of communication, in order to ensure the quality of data transmission between network devices, it is necessary to quickly detect communication failures between network devices, so as to establish a new communication link in time when a failure occurs, or switch to other communication links. . At present, a bidirectional fault detection (Bidirectional Forwarding Detection, BFD for short) method is a most widely used network fault detection technology. The characteristics of BFD are: light load, short duration, and it can quickly implement any type of communication link, such as direct physical link, virtual circuit, multi-hop routing channel, etc.
在BFD会话建立后,BFD会话双方在会话通道上,周期性地向对端发送BFD报文;同时,在该会话通道上,也周期性地检测对端发送的BFD报文。如果在检测时间内,没有收到对端发送的BFD报文,则确定BFD会话双方之间的通信链路出现了故障。After a BFD session is established, both parties in the BFD session periodically send BFD packets to the peer end on the session channel. At the same time, they also periodically check the BFD packets sent by the peer end on the session channel. If no BFD packet is received from the peer within the detection time, it is determined that the communication link between the two parties in the BFD session is faulty.
图1示出了现有技术中一种典型的数据中心组网的架构示意图。如图1所示,数据中心组网中包括多个用于转发数据的虚拟机(Virtual Machine,简称VM),多个虚拟机连接二层设备,多个二层设备连接多个三层设备,二层设备可为路由器,三层设备可为交换机,也就是说多个虚拟机通过交换机连接一个路由器,多个路由器也可通过交换机连接多个虚拟机。现有技术中,为了保证数据传输的可靠性,需要在各个设备之间发送用于检测链路是否发生故障的探测报文,比如在虚拟机和路由器之间发送BFD报文,通过探测报文检测到链路或设备发生故障之后,可以触发链路或者设备更换,从而保证数据传输的可靠性,减少数据传输中断时间。FIG. 1 shows a schematic diagram of a typical data center network architecture in the prior art. As shown in Figure 1, the data center network includes multiple virtual machines (Virtual Machines, referred to as VMs) for forwarding data. Multiple virtual machines are connected to Layer 2 devices, and multiple Layer 2 devices are connected to multiple Layer 3 devices. A layer-2 device can be a router, and a layer-3 device can be a switch, that is, multiple virtual machines are connected to one router through a switch, and multiple routers can also be connected to multiple virtual machines through a switch. In the existing technology, in order to ensure the reliability of data transmission, it is necessary to send detection packets between each device to detect whether a link is faulty, such as sending a BFD packet between a virtual machine and a router, and through the detection packet After a link or device failure is detected, the replacement of the link or device can be triggered to ensure the reliability of data transmission and reduce the interruption time of data transmission.
现有技术中,各个设备之间通常周期性发送BFD报文,周期为毫秒级,比如10ms,此时,一个虚拟机在1秒内需发送100个BFD报文。进一步由于多个VM通过交换机连接一个路由器,此时,路由器在1秒内需要处理的BFD则会达到10kpps级别,远远超出普通的路由器的能力,普通的路由器通过CPU处理数据时,通常仅能达到1kpps级别。可见,设备周期性发送BFD报文会加大网络负荷,增加网络开销。In the prior art, BFD packets are usually sent periodically between devices, and the period is millisecond level, such as 10ms. At this time, a virtual machine needs to send 100 BFD packets within 1 second. Furthermore, because multiple VMs are connected to a router through a switch, at this time, the BFD that the router needs to process within 1 second will reach the level of 10kpps, which is far beyond the capabilities of ordinary routers. When ordinary routers process data through the CPU, they can usually only Reach the 1kpps level. It can be seen that the periodic sending of BFD packets by the device will increase the load on the network and increase the network overhead.
发明内容Contents of the invention
本发明实施例提供一种故障检测方法和设备,用以减轻网络负荷,降低网络开销。Embodiments of the present invention provide a fault detection method and equipment, which are used to reduce network load and reduce network overhead.
本发明实施例提供一种故障检测方法,包括:An embodiment of the present invention provides a fault detection method, including:
确定第一设备与第二设备对应的所有用户设备之间未建立业务连接;其中所述第二设备对应的用户设备的业务数据通过第二设备和第一设备进行传输;Determining that no service connection has been established between the first device and all user equipment corresponding to the second device; wherein the service data of the user equipment corresponding to the second device is transmitted through the second device and the first device;
在确定所述第一设备发送故障检测报文的频率不小于第一阈值时,调整所述第一设备发送故障检测报文的频率,以使调整后的所述第一设备发送故障检测报文的频率小于第一阈值;和/或When it is determined that the frequency at which the first device sends the fault detection message is not less than the first threshold, adjusting the frequency at which the first device sends the fault detection message, so that the adjusted first device sends the fault detection message The frequency of is less than the first threshold; and/or
在确定所述第二设备发送故障检测报文的频率不小于第二阈值时,调整所述第二设备发送故障检测报文的频率,以使调整后的所述第二设备发送故障检测报文的频率小于第二阈值。When it is determined that the frequency at which the second device sends the fault detection message is not less than the second threshold, adjust the frequency at which the second device sends the fault detection message, so that the adjusted second device sends the fault detection message The frequency of is less than the second threshold.
由于在确定第一设备与第二设备对应的所有用户设备之间未建立业务连接时,第二设备对应的用户设备不会向第一设备发送业务数据,因此,此时无需以高频率发送故障检测报文,此时在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值,和/或在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值,即降低了故障检测报文的发送频率,从而减轻网络负荷,降低网络开销,节省了网络资源。Since the user equipment corresponding to the second device will not send service data to the first device when it is determined that no service connection has been established between the first device and all user equipment corresponding to the second device, there is no need to send faults at a high frequency at this time Detecting messages, at this time, when it is determined that the frequency of sending fault detection messages of the first device is not less than the first threshold, adjust the frequency of sending fault detection messages of the first device, so that the adjusted first device sends fault detection messages is less than the first threshold, and/or when it is determined that the frequency at which the second device sends fault detection messages is not less than the second threshold, adjust the frequency at which the second device sends fault detection messages so that the adjusted second device sends The frequency of the fault detection message is less than the second threshold, that is, the frequency of sending the fault detection message is reduced, thereby reducing network load, reducing network overhead, and saving network resources.
第二设备对应的用户设备与第一设备和第二设备的连接方法有多种,比如用户设备通过其它设备与第二设备连接,第二设备通过其它设备与第一设备连接,或者用户设备直接与第二设备连接,第二设备直接与第一设备连接。无论第二设备对应的用户设备如何与第一设备和第二设备的连接,用户设备的业务数据均需要经过第一设备和第二设备进行传输。There are many ways to connect the user equipment corresponding to the second device to the first device and the second device. For example, the user equipment connects to the second device through other devices, the second device connects to the first device through other devices, or the user equipment directly Connect with the second device, the second device is directly connected with the first device. No matter how the user equipment corresponding to the second device is connected to the first device and the second device, the service data of the user equipment needs to be transmitted through the first device and the second device.
用户设备与第一设备之间建立业务连接具体是指用户设备通过第二设备以及第一设备传输请求与另一个用户设备建立业务连接的请求,在业务连接成功建立之后,用户设备与第二设备、用户设备与第一设备、以及用户设备与另一个用户设备之间均成功建立业务连接,可通过该建立成功的业务连接链路进行通信,即传输业务数据。Establishing a service connection between the user equipment and the first device specifically means that the user equipment transmits a request to establish a service connection with another user equipment through the second device and the first device. After the service connection is successfully established, the user equipment and the second device A service connection is successfully established between the user equipment and the first device, and between the user equipment and another user equipment, and communication can be performed through the successfully established service connection links, that is, service data can be transmitted.
本发明实施例中,确定第一设备发送故障检测报文的频率,具体有多种方式,比如通过获取用于确定第一设备发送故障检测报文的频率的定时器的参数,将该定时器的时间参数进行更改,以便达到调整第一设备发送故障检测报文的频率的目的。In the embodiment of the present invention, there are many ways to determine the frequency of the first device sending the fault detection message. For example, by obtaining the parameters of the timer used to determine the frequency of the first device sending the fault detection message, the timer The time parameter of the device is changed, so as to achieve the purpose of adjusting the frequency of sending the fault detection message by the first device.
可选地,还包括:Optionally, also include:
在确定所述第二设备对应的至少一个用户设备与所述第一设备之间建立业务连接时,确定出所述第一设备在预设时长内接收到的所述第二设备发送的业务数据的数量;When determining that at least one user equipment corresponding to the second device establishes a service connection with the first device, determine the service data sent by the second device received by the first device within a preset time period quantity;
在确定出所述业务数据的数量为零时,确定所述第一设备发送故障检测报文的频率,在确定所述第一设备发送故障检测报文的频率小于第一阈值时,调整所述第一设备发送故障检测报文的频率,以使调整后的所述第一设备发送故障检测报文的频率不小于第一阈值;和/或When it is determined that the amount of service data is zero, determine the frequency at which the first device sends a fault detection message, and when it is determined that the frequency at which the first device sends a fault detection message is less than a first threshold, adjust the The frequency at which the first device sends the fault detection message, so that the adjusted frequency at which the first device sends the fault detection message is not less than the first threshold; and/or
在确定出所述业务数据的数量为零时,确定所述第二设备发送故障检测报文的频率,在确定所述第二设备发送故障检测报文的频率小于第二阈值时,调整所述第二设备发送故障检测报文的频率,以使调整后的所述第二设备发送故障检测报文的频率不小于第二阈值。When it is determined that the amount of service data is zero, determine the frequency at which the second device sends a fault detection message, and when it is determined that the frequency at which the second device sends a fault detection message is less than a second threshold, adjust the The frequency at which the second device sends the fault detection message is such that the adjusted frequency at which the second device sends the fault detection message is not less than the second threshold.
由于在确定第二设备对应的至少一个用户设备与第一设备之间建立业务连接,因此,用户设备一定会通过第二设备和第一设备进行业务数据传输,如果在预设时长内接收到的第二设备发送的业务数据的数量为零,则说明在预设时长内第二设备未向第一设备传输用户设备所对应的业务数据,此时判定第二设备可能出现故障,因此,可将故障检测报文的频率提高。如此,则在设备或链路出现故障时,提高故障检测速率,进而减少了在设备或链路出现故障时数据传输的中断时间。Since at least one user equipment corresponding to the second device is determined to establish a service connection with the first device, the user equipment must transmit service data through the second device and the first device, if the received The number of service data sent by the second device is zero, which means that the second device has not transmitted the service data corresponding to the user equipment to the first device within the preset time period. At this time, it is determined that the second device may be faulty. Therefore, the The frequency of fault detection messages is increased. In this way, when a device or link fails, the fault detection rate is increased, thereby reducing the interruption time of data transmission when the device or link fails.
进一步,当第二设备故障时,第二设备上的业务数据传输中断,此时在某个接口上发现第二设备上传输的业务数据中断后,立即提高故障检测报文的发送频率,第一设备通过该接口发送故障检测报文。由于多数情况下都是单个设备发生故障,因此个别设备提高故障检测报文的发送频率,也不会为网络带了较大的负荷,且也提高了故障检测速度。Furthermore, when the second device fails, the transmission of service data on the second device is interrupted. At this time, when the service data transmitted on the second device is found to be interrupted on a certain interface, the frequency of sending fault detection messages is immediately increased. The first The device sends fault detection packets through this interface. Since most of the time a single device fails, increasing the frequency of sending fault detection packets by individual devices will not bring a large load to the network, and the fault detection speed will also be improved.
可选地,所述在确定所述第二设备对应的至少一个用户设备与所述第一设备之间建立业务连接时,确定出所述第一设备在预设时长内接收到的所述第二设备发送的业务数据的数量之后,还包括:Optionally, when determining that at least one user equipment corresponding to the second equipment establishes a service connection with the first equipment, it is determined that the first equipment received by the first equipment within a preset duration After the number of business data sent by the second device, it also includes:
在确定出所述业务数据的数量不为零时,且所述第一设备在向所述第二设备发送故障检测报文时,调整所述第一设备发送故障检测报文的频率,以使调整后的所述第一设备停止向所述第二设备发送故障检测报文;和/或When it is determined that the quantity of service data is not zero, and when the first device sends a fault detection message to the second device, adjust the frequency of sending the fault detection message by the first device, so that After the adjustment, the first device stops sending fault detection messages to the second device; and/or
在确定出所述业务数据的数量不为零时,且所述第二设备在向所述第一设备发送故障检测报文时,调整所述第二设备发送故障检测报文的频率,以使调整后的所述第二设备停止向所述第一设备发送故障检测报文。When it is determined that the quantity of service data is not zero, and when the second device sends a fault detection message to the first device, adjust the frequency of sending the fault detection message by the second device, so that The adjusted second device stops sending fault detection packets to the first device.
由于在确定第二设备对应的至少一个用户设备与第一设备之间建立业务连接,因此,用户设备一定会通过第二设备和第一设备进行业务数据传输,如果在预设时长内接收到的第二设备发送的业务数据的数量为不为零,则说明第一设备和第二设备之间的链路及设备均正常,此时可停止发送故障检测报文,以便减轻网络负荷,降低网络开销,节省了网络资源。Since at least one user equipment corresponding to the second device is determined to establish a service connection with the first device, the user equipment must transmit service data through the second device and the first device, if the received If the number of business data sent by the second device is not zero, it means that the links and devices between the first device and the second device are normal. At this time, you can stop sending fault detection messages to reduce the network load and reduce the network load. overhead, saving network resources.
可选地,所述在确定所述第二设备对应的至少一个用户设备与所述第一设备之间建立业务连接时,确定出所述第一设备在预设时长内接收到的所述第二设备发送的业务数据的数量之后,还包括:Optionally, when determining that at least one user equipment corresponding to the second equipment establishes a service connection with the first equipment, it is determined that the first equipment received by the first equipment within a preset duration After the number of business data sent by the second device, it also includes:
在确定出所述业务数据的数量不为零时,确定所述第一设备发送故障检测报文的频率;在确定所述第一设备发送故障检测报文的频率不小于第一阈值时,调整所述第一设备发送故障检测报文的频率,以使调整后的所述第一设备发送故障检测报文的频率小于第一阈值;和/或When it is determined that the number of business data is not zero, determine the frequency at which the first device sends a fault detection message; when it is determined that the frequency at which the first device sends a fault detection message is not less than a first threshold, adjust The frequency at which the first device sends a fault detection message, so that the adjusted frequency at which the first device sends a fault detection message is less than a first threshold; and/or
在确定出所述业务数据的数量不为零时,确定所述第二设备发送故障检测报文的频率;在确定所述第二设备发送故障检测报文的频率不小于第二阈值时,调整所述第二设备发送故障检测报文的频率,以使调整后的所述第二设备发送故障检测报文的频率小于第二阈值。When it is determined that the amount of business data is not zero, determine the frequency at which the second device sends a fault detection message; when it is determined that the frequency at which the second device sends a fault detection message is not less than a second threshold, adjust The frequency at which the second device sends fault detection packets is such that the adjusted frequency at which the second device sends fault detection packets is less than a second threshold.
由于在确定第二设备对应的至少一个用户设备与第一设备之间建立业务连接,因此,用户设备一定会通过第二设备和第一设备进行业务数据传输,如果在预设时长内接收到的第二设备发送的业务数据的数量为不为零,则说明第一设备和第二设备之间的链路及设备均正常,此时可降低发送故障检测报文的频率,一方面持续对链路以及设备进行检测,另一方面减轻网络负荷,降低网络开销,节省了网络资源。Since at least one user equipment corresponding to the second device is determined to establish a service connection with the first device, the user equipment must transmit service data through the second device and the first device, if the received If the number of business data sent by the second device is not zero, it means that the link and equipment between the first device and the second device are normal. At this time, the frequency of sending fault detection messages can be reduced. On the other hand, it reduces network load, reduces network overhead, and saves network resources.
可选地,所述在确定所述第一设备发送故障检测报文的频率不小于第一阈值时,调整所述第一设备发送故障检测报文的频率,具体包括:Optionally, when it is determined that the frequency of sending fault detection messages by the first device is not less than a first threshold, adjusting the frequency of sending fault detection messages by the first device specifically includes:
在确定所述第一设备发送故障检测报文的频率不小于第一阈值时,根据所述所述业务数据的数量调整所述第一设备发送故障检测报文的频率;其中,所述业务数据的数量越小,调整后所述第一设备发送故障检测报文的频率越大;When it is determined that the frequency at which the first device sends a fault detection message is not less than a first threshold, adjust the frequency at which the first device sends a fault detection message according to the amount of the service data; wherein the service data The smaller the number, the greater the frequency at which the first device sends fault detection messages after adjustment;
所述在确定所述第一设备发送故障检测报文的频率不小于第一阈值时,调整所述第二设备发送故障检测报文的频率,具体包括:When determining that the frequency of sending fault detection messages by the first device is not less than the first threshold, adjusting the frequency of sending fault detection messages by the second device specifically includes:
在确定所述第二设备发送故障检测报文的频率不小于第二阈值时,根据所述所述业务数据的数量调整所述第二设备发送故障检测报文的频率;其中,所述业务数据的数量越小,调整后所述第二设备发送故障检测报文的频率越大。When it is determined that the frequency at which the second device sends a fault detection message is not less than a second threshold, adjust the frequency at which the second device sends a fault detection message according to the amount of the service data; wherein the service data The smaller the number is, the higher the frequency at which the second device sends the fault detection message after adjustment is.
本发明实施例提供一种故障检测设备,包括:An embodiment of the present invention provides a fault detection device, including:
确定单元,用于确定第一设备与第二设备对应的所有用户设备之间未建立业务连接;其中所述第二设备对应的用户设备的业务数据通过第二设备和第一设备进行传输;A determining unit, configured to determine that no service connection has been established between the first device and all user equipment corresponding to the second device; wherein the service data of the user equipment corresponding to the second device is transmitted through the second device and the first device;
处理单元,用于在通过确定单元确定所述第一设备发送故障检测报文的频率不小于第一阈值时,调整所述第一设备发送故障检测报文的频率,以使调整后的所述第一设备发送故障检测报文的频率小于第一阈值;和/或A processing unit, configured to adjust the frequency of sending fault detection messages by the first device when it is determined by the determining unit that the frequency of sending fault detection messages by the first device is not less than a first threshold, so that the adjusted The frequency at which the first device sends a fault detection message is less than a first threshold; and/or
在通过确定单元确定所述第二设备发送故障检测报文的频率不小于第二阈值时,调整所述第二设备发送故障检测报文的频率,以使调整后的所述第二设备发送故障检测报文的频率小于第二阈值。When it is determined by the determination unit that the frequency of sending fault detection messages by the second device is not less than the second threshold, adjust the frequency of sending fault detection messages by the second device, so that the adjusted second device sends fault detection messages The frequency of detecting packets is less than the second threshold.
由于在确定第一设备与第二设备对应的所有用户设备之间未建立业务连接时,第二设备对应的用户设备不会向第一设备发送业务数据,因此,此时无需以高频率发送故障检测报文,此时在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值,和/或在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值,即降低了故障检测报文的发送频率,从而减轻网络负荷,降低网络开销,节省了网络资源。Since the user equipment corresponding to the second device will not send service data to the first device when it is determined that no service connection has been established between the first device and all user equipment corresponding to the second device, there is no need to send faults at a high frequency at this time Detecting messages, at this time, when it is determined that the frequency of sending fault detection messages of the first device is not less than the first threshold, adjust the frequency of sending fault detection messages of the first device, so that the adjusted first device sends fault detection messages is less than the first threshold, and/or when it is determined that the frequency at which the second device sends fault detection messages is not less than the second threshold, adjust the frequency at which the second device sends fault detection messages so that the adjusted second device sends The frequency of the fault detection message is less than the second threshold, that is, the frequency of sending the fault detection message is reduced, thereby reducing network load, reducing network overhead, and saving network resources.
第二设备对应的用户设备与第一设备和第二设备的连接方法有多种,比如用户设备通过其它设备与第二设备连接,第二设备通过其它设备与第一设备连接,或者用户设备直接与第二设备连接,第二设备直接与第一设备连接。无论第二设备对应的用户设备如何与第一设备和第二设备的连接,用户设备的业务数据均需要经过第一设备和第二设备进行传输。There are many ways to connect the user equipment corresponding to the second device to the first device and the second device. For example, the user equipment connects to the second device through other devices, the second device connects to the first device through other devices, or the user equipment directly Connect with the second device, the second device is directly connected with the first device. No matter how the user equipment corresponding to the second device is connected to the first device and the second device, the service data of the user equipment needs to be transmitted through the first device and the second device.
用户设备与第一设备之间建立业务连接具体是指用户设备通过第二设备以及第一设备传输请求与另一个用户设备建立业务连接的请求,在业务连接成功建立之后,用户设备与第二设备、用户设备与第一设备、以及用户设备与另一个用户设备之间均成功建立业务连接,可通过该建立成功的业务连接链路进行通信,即传输业务数据。Establishing a service connection between the user equipment and the first device specifically means that the user equipment transmits a request to establish a service connection with another user equipment through the second device and the first device. After the service connection is successfully established, the user equipment and the second device A service connection is successfully established between the user equipment and the first device, and between the user equipment and another user equipment, and communication can be performed through the successfully established service connection links, that is, service data can be transmitted.
本发明实施例中,确定第一设备发送故障检测报文的频率,具体有多种方式,比如通过获取用于确定第一设备发送故障检测报文的频率的定时器的参数,将该定时器的时间参数进行更改,以便达到调整第一设备发送故障检测报文的频率的目的。In the embodiment of the present invention, there are many ways to determine the frequency of the first device sending the fault detection message. For example, by obtaining the parameters of the timer used to determine the frequency of the first device sending the fault detection message, the timer The time parameter of the device is changed, so as to achieve the purpose of adjusting the frequency of sending the fault detection message by the first device.
可选地,所述确定单元,还用于:Optionally, the determining unit is further configured to:
在确定所述第二设备对应的至少一个用户设备与所述第一设备之间建立业务连接时,确定出所述第一设备在预设时长内接收到的所述第二设备发送的业务数据的数量;When determining that at least one user equipment corresponding to the second device establishes a service connection with the first device, determine the service data sent by the second device received by the first device within a preset time period quantity;
所述处理单元,还用于:The processing unit is also used for:
在通过所述确定单元确定出所述业务数据的数量为零时,确定所述第一设备发送故障检测报文的频率,在确定所述第一设备发送故障检测报文的频率小于第一阈值时,调整所述第一设备发送故障检测报文的频率,以使调整后的所述第一设备发送故障检测报文的频率不小于第一阈值;和/或When it is determined by the determination unit that the quantity of the service data is zero, determine the frequency at which the first device sends a fault detection message, and determine that the frequency at which the first device sends a fault detection message is less than a first threshold , adjusting the frequency at which the first device sends the fault detection message, so that the adjusted frequency at which the first device sends the fault detection message is not less than the first threshold; and/or
在通过所述确定单元确定出所述业务数据的数量为零时,确定所述第二设备发送故障检测报文的频率,在确定所述第二设备发送故障检测报文的频率小于第二阈值时,调整所述第二设备发送故障检测报文的频率,以使调整后的所述第二设备发送故障检测报文的频率不小于第二阈值。When it is determined by the determination unit that the quantity of the service data is zero, determine the frequency at which the second device sends a fault detection message, and determine that the frequency at which the second device sends a fault detection message is less than a second threshold , adjusting the frequency at which the second device sends the fault detection message, so that the adjusted frequency at which the second device sends the fault detection message is no less than a second threshold.
由于在确定第二设备对应的至少一个用户设备与第一设备之间建立业务连接,因此,用户设备一定会通过第二设备和第一设备进行业务数据传输,如果在预设时长内接收到的第二设备发送的业务数据的数量为零,则说明在预设时长内第二设备未向第一设备传输用户设备所对应的业务数据,此时判定第二设备可能出现故障,因此,可将故障检测报文的频率提高。如此,则在设备或链路出现故障时,提高故障检测速率,进而减少了在设备或链路出现故障时数据传输的中断时间。Since at least one user equipment corresponding to the second device is determined to establish a service connection with the first device, the user equipment must transmit service data through the second device and the first device, if the received The number of service data sent by the second device is zero, which means that the second device has not transmitted the service data corresponding to the user equipment to the first device within the preset time period. At this time, it is determined that the second device may be faulty. Therefore, the The frequency of fault detection messages is increased. In this way, when a device or link fails, the fault detection rate is increased, thereby reducing the interruption time of data transmission when the device or link fails.
进一步,当第二设备故障时,第二设备上的业务数据传输中断,此时在某个接口上发现第二设备上传输的业务数据中断后,立即提高故障检测报文的发送频率,第一设备通过该接口发送故障检测报文。由于多数情况下都是单个设备发生故障,因此个别设备提高故障检测报文的发送频率,也不会为网络带了较大的负荷,且也提高了故障检测速度。Furthermore, when the second device fails, the transmission of service data on the second device is interrupted. At this time, when the service data transmitted on the second device is found to be interrupted on a certain interface, the frequency of sending fault detection messages is immediately increased. The first The device sends fault detection packets through this interface. Since most of the time a single device fails, increasing the frequency of sending fault detection packets by individual devices will not bring a large load to the network, and the fault detection speed will also be improved.
可选地,所述处理单元,还用于:Optionally, the processing unit is also used for:
在通过所述确定单元确定出所述业务数据的数量不为零时,且所述第一设备在向所述第二设备发送故障检测报文时,调整所述第一设备发送故障检测报文的频率,以使调整后的所述第一设备停止向所述第二设备发送故障检测报文;和/或When it is determined by the determination unit that the quantity of the service data is not zero, and the first device sends a fault detection message to the second device, adjusting the first device to send the fault detection message frequency, so that the adjusted first device stops sending fault detection messages to the second device; and/or
在通过所述确定单元确定出所述业务数据的数量不为零时,且所述第二设备在向所述第一设备发送故障检测报文时,调整所述第二设备发送故障检测报文的频率,以使调整后的所述第二设备停止向所述第一设备发送故障检测报文。When it is determined by the determining unit that the quantity of the service data is not zero, and the second device sends a fault detection message to the first device, adjusting the second device to send the fault detection message frequency, so that the adjusted second device stops sending fault detection packets to the first device.
由于在确定第二设备对应的至少一个用户设备与第一设备之间建立业务连接,因此,用户设备一定会通过第二设备和第一设备进行业务数据传输,如果在预设时长内接收到的第二设备发送的业务数据的数量为不为零,则说明第一设备和第二设备之间的链路及设备均正常,此时可停止发送故障检测报文,以便减轻网络负荷,降低网络开销,节省了网络资源。Since at least one user equipment corresponding to the second device is determined to establish a service connection with the first device, the user equipment must transmit service data through the second device and the first device, if the received If the number of business data sent by the second device is not zero, it means that the links and devices between the first device and the second device are normal. At this time, you can stop sending fault detection messages to reduce the network load and reduce the network load. overhead, saving network resources.
可选地,所述处理单元,还用于:Optionally, the processing unit is also used for:
在通过所述确定单元确定出所述业务数据的数量不为零时,确定所述第一设备发送故障检测报文的频率;在确定所述第一设备发送故障检测报文的频率不小于第一阈值时,调整所述第一设备发送故障检测报文的频率,以使调整后的所述第一设备发送故障检测报文的频率小于第一阈值;和/或When it is determined by the determination unit that the quantity of the service data is not zero, determine the frequency at which the first device sends a fault detection message; when it is determined that the frequency at which the first device sends a fault detection message is not less than the first When a threshold is reached, adjusting the frequency at which the first device sends a fault detection message, so that the adjusted frequency at which the first device sends a fault detection message is less than the first threshold; and/or
在通过所述确定单元确定出所述业务数据的数量不为零时,确定所述第二设备发送故障检测报文的频率;在确定所述第二设备发送故障检测报文的频率不小于第二阈值时,调整所述第二设备发送故障检测报文的频率,以使调整后的所述第二设备发送故障检测报文的频率小于第二阈值。When it is determined by the determination unit that the quantity of the service data is not zero, determine the frequency at which the second device sends the fault detection message; when it is determined that the frequency at which the second device sends the fault detection message is not less than the first When the second threshold is set, adjusting the frequency of sending fault detection packets by the second device, so that the adjusted frequency of sending fault detection packets by the second device is less than the second threshold.
由于在确定第二设备对应的至少一个用户设备与第一设备之间建立业务连接,因此,用户设备一定会通过第二设备和第一设备进行业务数据传输,如果在预设时长内接收到的第二设备发送的业务数据的数量为不为零,则说明第一设备和第二设备之间的链路及设备均正常,此时可降低发送故障检测报文的频率,一方面持续对链路以及设备进行检测,另一方面减轻网络负荷,降低网络开销,节省了网络资源。Since at least one user equipment corresponding to the second device is determined to establish a service connection with the first device, the user equipment must transmit service data through the second device and the first device, if the received If the number of business data sent by the second device is not zero, it means that the link and equipment between the first device and the second device are normal. At this time, the frequency of sending fault detection messages can be reduced. On the other hand, it reduces network load, reduces network overhead, and saves network resources.
可选地,所述处理单元,具体用于:Optionally, the processing unit is specifically configured to:
在确定所述第一设备发送故障检测报文的频率不小于第一阈值时,根据所述所述业务数据的数量调整所述第一设备发送故障检测报文的频率;其中,所述业务数据的数量越小,调整后所述第一设备发送故障检测报文的频率越大;When it is determined that the frequency at which the first device sends a fault detection message is not less than a first threshold, adjust the frequency at which the first device sends a fault detection message according to the amount of the service data; wherein the service data The smaller the number, the greater the frequency at which the first device sends fault detection messages after adjustment;
在确定所述第二设备发送故障检测报文的频率不小于第二阈值时,根据所述所述业务数据的数量调整所述第二设备发送故障检测报文的频率;其中,所述业务数据的数量越小,调整后所述第二设备发送故障检测报文的频率越大。When it is determined that the frequency at which the second device sends a fault detection message is not less than a second threshold, adjust the frequency at which the second device sends a fault detection message according to the amount of the service data; wherein the service data The smaller the number is, the higher the frequency at which the second device sends the fault detection message after adjustment is.
本发明实施例提供一种故障检测设备,包括:An embodiment of the present invention provides a fault detection device, including:
处理器,用于确定第一设备与第二设备对应的所有用户设备之间未建立业务连接;其中所述第二设备对应的用户设备的业务数据通过第二设备和第一设备进行传输;A processor, configured to determine that no service connection has been established between the first device and all user equipment corresponding to the second device; wherein the service data of the user equipment corresponding to the second device is transmitted through the second device and the first device;
在确定所述第一设备发送故障检测报文的频率不小于第一阈值时,调整所述第一设备发送故障检测报文的频率,以使调整后的所述第一设备发送故障检测报文的频率小于第一阈值;和/或When it is determined that the frequency at which the first device sends the fault detection message is not less than the first threshold, adjusting the frequency at which the first device sends the fault detection message, so that the adjusted first device sends the fault detection message The frequency of is less than the first threshold; and/or
在确定所述第二设备发送故障检测报文的频率不小于第二阈值时,调整所述第二设备发送故障检测报文的频率,以使调整后的所述第二设备发送故障检测报文的频率小于第二阈值;When it is determined that the frequency at which the second device sends the fault detection message is not less than the second threshold, adjust the frequency at which the second device sends the fault detection message, so that the adjusted second device sends the fault detection message The frequency of is less than the second threshold;
存储器,用于存储所述第一阈值和所述第二阈值。A memory, configured to store the first threshold and the second threshold.
由于在确定第一设备与第二设备对应的所有用户设备之间未建立业务连接时,第二设备对应的用户设备不会向第一设备发送业务数据,因此,此时无需以高频率发送故障检测报文,此时在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值,和/或在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值,即降低了故障检测报文的发送频率,从而减轻网络负荷,降低网络开销,节省了网络资源。Since the user equipment corresponding to the second device will not send service data to the first device when it is determined that no service connection has been established between the first device and all user equipment corresponding to the second device, there is no need to send faults at a high frequency at this time Detecting messages, at this time, when it is determined that the frequency of sending fault detection messages of the first device is not less than the first threshold, adjust the frequency of sending fault detection messages of the first device, so that the adjusted first device sends fault detection messages is less than the first threshold, and/or when it is determined that the frequency at which the second device sends fault detection messages is not less than the second threshold, adjust the frequency at which the second device sends fault detection messages so that the adjusted second device sends The frequency of the fault detection message is less than the second threshold, that is, the frequency of sending the fault detection message is reduced, thereby reducing network load, reducing network overhead, and saving network resources.
可选地,所述处理器,还用于:Optionally, the processor is also used for:
在确定所述第二设备对应的至少一个用户设备与所述第一设备之间建立业务连接时,确定出所述第一设备在预设时长内接收到的所述第二设备发送的业务数据的数量;When determining that at least one user equipment corresponding to the second device establishes a service connection with the first device, determine the service data sent by the second device received by the first device within a preset time period quantity;
在确定出所述业务数据的数量为零时,确定所述第一设备发送故障检测报文的频率,在确定所述第一设备发送故障检测报文的频率小于第一阈值时,调整所述第一设备发送故障检测报文的频率,以使调整后的所述第一设备发送故障检测报文的频率不小于第一阈值;和/或When it is determined that the amount of service data is zero, determine the frequency at which the first device sends a fault detection message, and when it is determined that the frequency at which the first device sends a fault detection message is less than a first threshold, adjust the The frequency at which the first device sends the fault detection message, so that the adjusted frequency at which the first device sends the fault detection message is not less than the first threshold; and/or
在确定出所述业务数据的数量为零时,确定所述第二设备发送故障检测报文的频率,在确定所述第二设备发送故障检测报文的频率小于第二阈值时,调整所述第二设备发送故障检测报文的频率,以使调整后的所述第二设备发送故障检测报文的频率不小于第二阈值。When it is determined that the amount of service data is zero, determine the frequency at which the second device sends a fault detection message, and when it is determined that the frequency at which the second device sends a fault detection message is less than a second threshold, adjust the The frequency at which the second device sends the fault detection message is such that the adjusted frequency at which the second device sends the fault detection message is not less than the second threshold.
由于在确定第二设备对应的至少一个用户设备与第一设备之间建立业务连接,因此,用户设备一定会通过第二设备和第一设备进行业务数据传输,如果在预设时长内接收到的第二设备发送的业务数据的数量为零,则说明在预设时长内第二设备未向第一设备传输用户设备所对应的业务数据,此时判定第二设备可能出现故障,因此,可将故障检测报文的频率提高。如此,则在设备或链路出现故障时,提高故障检测速率,进而减少了在设备或链路出现故障时数据传输的中断时间。Since at least one user equipment corresponding to the second device is determined to establish a service connection with the first device, the user equipment must transmit service data through the second device and the first device, if the received The number of service data sent by the second device is zero, which means that the second device has not transmitted the service data corresponding to the user equipment to the first device within the preset time period. At this time, it is determined that the second device may be faulty. Therefore, the The frequency of fault detection messages is increased. In this way, when a device or link fails, the fault detection rate is increased, thereby reducing the interruption time of data transmission when the device or link fails.
进一步,当第二设备故障时,第二设备上的业务数据传输中断,此时在某个接口上发现第二设备上传输的业务数据中断后,立即提高故障检测报文的发送频率,第一设备通过该接口发送故障检测报文。由于多数情况下都是单个设备发生故障,因此个别设备提高故障检测报文的发送频率,也不会为网络带了较大的负荷,且也提高了故障检测速度。Furthermore, when the second device fails, the transmission of service data on the second device is interrupted. At this time, when the service data transmitted on the second device is found to be interrupted on a certain interface, the frequency of sending fault detection messages is immediately increased. The first The device sends fault detection packets through this interface. Since most of the time a single device fails, increasing the frequency of sending fault detection packets by individual devices will not bring a large load to the network, and the fault detection speed will also be improved.
可选地,所述处理器,还用于:Optionally, the processor is also used for:
在确定出所述业务数据的数量不为零时,且所述第一设备在向所述第二设备发送故障检测报文时,调整所述第一设备发送故障检测报文的频率,以使调整后的所述第一设备停止向所述第二设备发送故障检测报文;和/或When it is determined that the quantity of service data is not zero, and when the first device sends a fault detection message to the second device, adjust the frequency of sending the fault detection message by the first device, so that After the adjustment, the first device stops sending fault detection messages to the second device; and/or
在确定出所述业务数据的数量不为零时,且所述第二设备在向所述第一设备发送故障检测报文时,调整所述第二设备发送故障检测报文的频率,以使调整后的所述第二设备停止向所述第一设备发送故障检测报文。When it is determined that the quantity of service data is not zero, and when the second device sends a fault detection message to the first device, adjust the frequency of sending the fault detection message by the second device, so that The adjusted second device stops sending fault detection packets to the first device.
由于在确定第二设备对应的至少一个用户设备与第一设备之间建立业务连接,因此,用户设备一定会通过第二设备和第一设备进行业务数据传输,如果在预设时长内接收到的第二设备发送的业务数据的数量为不为零,则说明第一设备和第二设备之间的链路及设备均正常,此时可停止发送故障检测报文,以便减轻网络负荷,降低网络开销,节省了网络资源。Since at least one user equipment corresponding to the second device is determined to establish a service connection with the first device, the user equipment must transmit service data through the second device and the first device, if the received If the number of business data sent by the second device is not zero, it means that the links and devices between the first device and the second device are normal. At this time, you can stop sending fault detection messages to reduce the network load and reduce the network load. overhead, saving network resources.
可选地,所述处理器,还用于:Optionally, the processor is also used for:
在确定出所述业务数据的数量不为零时,确定所述第一设备发送故障检测报文的频率;在确定所述第一设备发送故障检测报文的频率不小于第一阈值时,调整所述第一设备发送故障检测报文的频率,以使调整后的所述第一设备发送故障检测报文的频率小于第一阈值;和/或When it is determined that the number of business data is not zero, determine the frequency at which the first device sends a fault detection message; when it is determined that the frequency at which the first device sends a fault detection message is not less than a first threshold, adjust The frequency at which the first device sends a fault detection message, so that the adjusted frequency at which the first device sends a fault detection message is less than a first threshold; and/or
在确定出所述业务数据的数量不为零时,确定所述第二设备发送故障检测报文的频率;在确定所述第二设备发送故障检测报文的频率不小于第二阈值时,调整所述第二设备发送故障检测报文的频率,以使调整后的所述第二设备发送故障检测报文的频率小于第二阈值。When it is determined that the amount of business data is not zero, determine the frequency at which the second device sends a fault detection message; when it is determined that the frequency at which the second device sends a fault detection message is not less than a second threshold, adjust The frequency at which the second device sends fault detection packets is such that the adjusted frequency at which the second device sends fault detection packets is less than a second threshold.
由于在确定第二设备对应的至少一个用户设备与第一设备之间建立业务连接,因此,用户设备一定会通过第二设备和第一设备进行业务数据传输,如果在预设时长内接收到的第二设备发送的业务数据的数量为不为零,则说明第一设备和第二设备之间的链路及设备均正常,此时可降低发送故障检测报文的频率,一方面持续对链路以及设备进行检测,另一方面减轻网络负荷,降低网络开销,节省了网络资源。Since at least one user equipment corresponding to the second device is determined to establish a service connection with the first device, the user equipment must transmit service data through the second device and the first device, if the received If the number of business data sent by the second device is not zero, it means that the link and equipment between the first device and the second device are normal. At this time, the frequency of sending fault detection messages can be reduced. On the other hand, it reduces network load, reduces network overhead, and saves network resources.
可选地,所述处理器,具体用于:Optionally, the processor is specifically used for:
在确定所述第一设备发送故障检测报文的频率不小于第一阈值时,根据所述所述业务数据的数量调整所述第一设备发送故障检测报文的频率;其中,所述业务数据的数量越小,调整后所述第一设备发送故障检测报文的频率越大;When it is determined that the frequency at which the first device sends a fault detection message is not less than a first threshold, adjust the frequency at which the first device sends a fault detection message according to the amount of the service data; wherein the service data The smaller the number, the greater the frequency at which the first device sends fault detection messages after adjustment;
在确定所述第二设备发送故障检测报文的频率不小于第二阈值时,根据所述所述业务数据的数量调整所述第二设备发送故障检测报文的频率;其中,所述业务数据的数量越小,调整后所述第二设备发送故障检测报文的频率越大。When it is determined that the frequency at which the second device sends a fault detection message is not less than a second threshold, adjust the frequency at which the second device sends a fault detection message according to the amount of the service data; wherein the service data The smaller the number is, the higher the frequency at which the second device sends the fault detection message after adjustment is.
本发明实施例中,确定第一设备与第二设备对应的所有用户设备之间未建立业务连接;其中第二设备对应的用户设备的业务数据通过第二设备和第一设备进行传输;确定第一设备发送故障检测报文的频率,在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值;和/或确定第二设备发送故障检测报文的频率;在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值。由于在确定第一设备与第二设备对应的所有用户设备之间未建立业务连接时,第二设备对应的用户设备不会向第一设备发送业务数据,因此,此时无需以高频率发送故障检测报文,此时在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值,和/或在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值,即降低了故障检测报文的发送频率,从而减轻网络负荷,降低网络开销,节省了网络资源。In the embodiment of the present invention, it is determined that no service connection has been established between all user equipment corresponding to the first device and the second device; wherein the service data of the user equipment corresponding to the second device is transmitted through the second device and the first device; The frequency of sending fault detection messages by a device, when it is determined that the frequency of sending fault detection messages of the first device is not less than the first threshold, adjust the frequency of sending fault detection messages of the first device so that the adjusted first device sends The frequency of the fault detection message is less than the first threshold; and/or determine the frequency of the second device to send the fault detection message; when it is determined that the frequency of the second device to send the fault detection message is not less than the second threshold, adjust the second device to send The frequency of the fault detection packet, so that the adjusted frequency of the second device to send the fault detection packet is less than the second threshold. Since the user equipment corresponding to the second device will not send service data to the first device when it is determined that no service connection has been established between the first device and all user equipment corresponding to the second device, there is no need to send faults at a high frequency at this time Detecting messages, at this time, when it is determined that the frequency of sending fault detection messages of the first device is not less than the first threshold, adjust the frequency of sending fault detection messages of the first device, so that the adjusted first device sends fault detection messages is less than the first threshold, and/or when it is determined that the frequency at which the second device sends fault detection messages is not less than the second threshold, adjust the frequency at which the second device sends fault detection messages so that the adjusted second device sends The frequency of the fault detection message is less than the second threshold, that is, the frequency of sending the fault detection message is reduced, thereby reducing network load, reducing network overhead, and saving network resources.
附图说明Description of drawings
为了更清楚地说明本发明实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简要介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域的普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings that need to be used in the description of the embodiments will be briefly introduced below. Obviously, the drawings in the following description are only some embodiments of the present invention. For Those of ordinary skill in the art can also obtain other drawings based on these drawings without making creative efforts.
图1为现有技术中一种典型的数据中心组网的架构示意图;FIG. 1 is a schematic diagram of a typical data center network architecture in the prior art;
图2为本发明实施例提供的本发明实施例适用的一种系统架构示意图;FIG. 2 is a schematic diagram of a system architecture applicable to the embodiment of the present invention provided by the embodiment of the present invention;
图3为本发明实施例提供的一种故障检测方法流程示意图;FIG. 3 is a schematic flowchart of a fault detection method provided by an embodiment of the present invention;
图3a为本发明实施例提供的另一种故障检测方法的流程示意图;FIG. 3a is a schematic flowchart of another fault detection method provided by an embodiment of the present invention;
图3b为本发明实施例提供的一种故障检测报文的发送频率与业务数据的数量之间的关系示意图;FIG. 3b is a schematic diagram of the relationship between the sending frequency of a fault detection message and the quantity of service data provided by an embodiment of the present invention;
图4为本发明实施例提供的一种故障检测设备的结构示意图;FIG. 4 is a schematic structural diagram of a fault detection device provided by an embodiment of the present invention;
图5本发明实施例提供的另一种故障检测设备的结构示意图。FIG. 5 is a schematic structural diagram of another fault detection device provided by an embodiment of the present invention.
具体实施方式detailed description
为了使本发明的目的、技术方案及有益效果更加清楚明白,以下结合附图及实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。In order to make the object, technical solution and beneficial effects of the present invention more clear, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.
下面,为了便于理解,介绍本发明实施例涉及的基本概念。In the following, the basic concepts involved in the embodiments of the present invention are introduced for ease of understanding.
术语“用户设备”包括不限于具有无线通信功能的手持设备、车载设备、可穿戴设备、计算设备或连接到无线调制解调器的其它处理设备,以及各种形式的用户设备(User Equipment,简称UE),移动台(Mobile station,简称MS),用户设备(terminal),用户设备设备(Terminal Equipment)等等。为方便描述,本申请中,简称为用户设备。The term "user equipment" includes, but is not limited to, handheld devices with wireless communication functions, vehicle-mounted devices, wearable devices, computing devices or other processing devices connected to wireless modems, as well as various forms of user equipment (User Equipment, referred to as UE), Mobile station (Mobile station, MS for short), user equipment (terminal), user equipment equipment (Terminal Equipment) and so on. For convenience of description, in this application, it is referred to as user equipment for short.
术语“第一设备”或“第二设备”均为具有无线通信功能的处理设备,用于传输用户设备发送的数据。第一设备和/或第二设备包括不限于交换机、路由器、软交换、网络协议多媒体子系统(Internet Protocol Multimedia Subsystem,简称IMS)等设备。The term "first device" or "second device" is a processing device with a wireless communication function, and is used to transmit data sent by the user equipment. The first device and/or the second device include, but are not limited to, devices such as a switch, a router, a soft switch, and an Internet Protocol Multimedia Subsystem (Internet Protocol Multimedia Subsystem, IMS for short).
术语“业务数据”也可称“用户数据”,可包括多种,比如在业务连接成功建立之后用户设备之间通过传输链路传输的媒体流。具体来说,媒体流数据可为视频数据、语音数据、短信息数据等等。由于媒体流数据若出现中断,则会对用户造成很大的影响,用户感受较差,因此本发明实施例所提供的方法在业务连接状态,即用户设备之间可以正常传输媒体流的阶段,而经检测发现媒体流突然中断,则提高发送故障检测报文的频率,以便快速检测出发生故障的接口。The term "service data" may also be referred to as "user data", and may include various types, such as media streams transmitted between user equipments through a transmission link after a service connection is successfully established. Specifically, the media stream data may be video data, voice data, short message data, and so on. If the media stream data is interrupted, it will have a great impact on the user, and the user experience is poor. Therefore, the method provided by the embodiment of the present invention is in the service connection state, that is, the stage where the media stream can be normally transmitted between user equipment. If it is detected that the media stream is suddenly interrupted, the frequency of sending fault detection packets is increased, so as to quickly detect the faulty interface.
具体实施中,在未成功建立业务连接之前用户设备和网络之间不会传输媒体流数据,可能会传输一些为了建立业务连接的控制面信令,进而通过这些控制面的信令交互,成功建立媒体面的会话(session),即成功建立业务连接,之后,用户设备之间就可以传输媒体流数据了。由于在业务连接未成功建立时,用户设备之间不会传输媒体流数据,且仅仅传输少量的用于建立业务连接的控制面信令,且由于用于建立业务连接的控制面信令的交互过程非常快,因此本发明实施例中在业务连接未成功建立时,比如业务连接的建立过程中,使用较小的故障检测报文的发送频率即可,节省信令开销,降低了网络资源的消耗。In the specific implementation, before the business connection is successfully established, the media stream data will not be transmitted between the user equipment and the network, and some control plane signaling for establishing the business connection may be transmitted, and then through the signaling interaction of these control planes, the network is successfully established. A session (session) on the media plane means that a service connection is established successfully, after which media stream data can be transmitted between user equipments. Because when the business connection is not successfully established, the media stream data will not be transmitted between the user equipments, and only a small amount of control plane signaling for establishing the business connection is transmitted, and due to the interaction of the control plane signaling for establishing the business connection The process is very fast, so in the embodiment of the present invention, when the service connection is not successfully established, for example, in the process of establishing the service connection, it is enough to use a smaller sending frequency of the fault detection message, which saves signaling overhead and reduces network resource usage. consume.
术语“和/或”,仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。另外,本文中字符“/”,一般表示前后关联对象是一种“或”的关系。The term "and/or" is just an association relationship describing associated objects, which means that there can be three relationships, for example, A and/or B can mean: A exists alone, A and B exist at the same time, and B exists alone. three conditions. In addition, the character "/" in this article generally indicates that the contextual objects are an "or" relationship.
本发明实施例中的故障检测报文可为能够检测出设备,以及设备之间链路是否发生故障的报文,包括不限于BFD、地址分辨协议(Address ResolutionProtocol,简称APR)报文。The fault detection message in the embodiment of the present invention may be a message capable of detecting whether a device and a link between devices are faulty, including but not limited to BFD and Address Resolution Protocol (APR) messages.
下面以第一设备和第二设备为例介绍BFD报文的检测机制。The following uses the first device and the second device as examples to introduce the detection mechanism of BFD packets.
第一设备与第二设备通过控制信令进行协商,协商进行BFD会话的参数,比如会话标识符、期望的收到BFD报文,以及发送BFD报文的最小时间间隔、本端BFD会话状态等。控制信令封装在UDP斑纹中进行传输,协商成功后,第一设备和第二设备之间建立BFD会话。具体可包括两种方式。The first device and the second device negotiate through control signaling to negotiate the parameters of the BFD session, such as the session identifier, expected BFD packets received, the minimum time interval for sending BFD packets, and the status of the local BFD session. . The control signaling is encapsulated in UDP stripes for transmission. After the negotiation succeeds, a BFD session is established between the first device and the second device. Specifically, there are two ways.
方式一,第一设备周期性向第二设备发送BFD报文,第二设备周期性向第一设备发送BFD报文,若第一设备在一定时长内没有收到第二设备发送的BFD报文,则认为第二设备和/或第一设备和第二设备之间的链路发生了故障。第一设备和第二设备之间的链路发生故障具体可包括第一设备、第二设备、第一设备和第二设备之间的链路上的传输设备,以及第一设备和第二设备之间的传输线路中的任一项或任多项发生故障。为满足快速检测的需求,BFD协议规定BFD报文的发送间隔和接收间隔单位是微秒。但限于目前现有技术处理能力,大部分厂商的设备所配置BFD报文的发送和接收的时间间隔为毫秒级。Method 1: The first device periodically sends BFD packets to the second device, and the second device periodically sends BFD packets to the first device. If the first device does not receive the BFD packet sent by the second device within a certain period of time, then It is assumed that the second device and/or the link between the first device and the second device has failed. The failure of the link between the first device and the second device may specifically include the first device, the second device, the transmission device on the link between the first device and the second device, and the first device and the second device failure of any one or more of the transmission lines in between. To meet the requirement of fast detection, the BFD protocol stipulates that the unit of sending interval and receiving interval of BFD packets is microseconds. However, limited by the current processing capabilities of existing technologies, the time interval for sending and receiving BFD packets configured by most manufacturers' devices is at the millisecond level.
方式二,第一设备周期性向第二设备发送BFD报文,第二设备将该BFD报文的响应发送给第二设备,若第一设备在预设时长内未收到第二设备发送的响应,则认为第二设备和/或第一设备和第二设备之间的链路发生了故障。第一设备和第二设备之间的链路可包括第一设备和第二设备之间的设备,以及第一设备和第二设备之间的传输线路。Method 2: The first device periodically sends a BFD packet to the second device, and the second device sends the response of the BFD packet to the second device. If the first device does not receive the response sent by the second device within the preset time period , it is considered that the second device and/or the link between the first device and the second device has failed. The link between the first device and the second device may include a device between the first device and the second device, and a transmission line between the first device and the second device.
ARP用于实现以太网中三层网络协议(Internet Protocol,简称IP)地址与二层媒体访问控制(Medium Access Control,简称MAC)地址之间的映射。ARP相关协议为RFC826、RFC903等。The ARP is used to realize the mapping between the Layer 3 Internet Protocol (Internet Protocol, IP for short) address and the Layer 2 Media Access Control (Medium Access Control, MAC for short) address in the Ethernet. ARP-related protocols are RFC826, RFC903, etc.
ARP基于的原理为:在以太网的同一网段内以广播的方式查询某个IP地址对应的MAC地址,以实现三层IP地址与二层MAC地址之间的动态映射。这是任何以太网主机设备都支持的一个协议,也可称为ARP为2.5层协议。ARP is based on the principle that the MAC address corresponding to an IP address is queried by broadcasting in the same network segment of the Ethernet, so as to realize the dynamic mapping between the Layer 3 IP address and the Layer 2 MAC address. This is a protocol supported by any Ethernet host device, and it can also be called ARP as a layer 2.5 protocol.
下面以第一设备和第二设备为例介绍ARP报文的检测机制。The following uses the first device and the second device as examples to introduce the detection mechanism of the ARP packet.
第一设备通过广播方式发送一个APR请求,该APR请求的目标IP为第二设备的IP。包括第二设备在内的以太网设备都会接收到该APR请求,但是以太网设备中仅有第二设备会响应该APR请求。第二设备向第一设备发出一个包括第二设备的MAC地址的ARP响应报文,第一设备接收到该ARP响应报文之后,第一设备可以使用第二设备的MAC地址与第二设备进行通信。该过程中,第一设备成功收到第二设备发送的APR响应,即判定第二设备无故障。The first device sends an APR request in a broadcast manner, and the target IP of the APR request is the IP of the second device. All Ethernet devices including the second device will receive the APR request, but only the second device among the Ethernet devices will respond to the APR request. The second device sends an ARP response message including the MAC address of the second device to the first device. After the first device receives the ARP response message, the first device can use the MAC address of the second device to communicate with the second device. communication. In this process, the first device successfully receives the APR response sent by the second device, that is, it determines that the second device is not faulty.
通过上述方式判定出第二设备或第一设备与第二设备之间的链路上出现故障,则系统会及时切换至备用网络,以便提高数据传输的可靠性。If it is determined in the above manner that a fault occurs on the second device or the link between the first device and the second device, the system will switch to the standby network in time to improve the reliability of data transmission.
基于上述内容,下面介绍本发明实施例适用的系统架构。图2示例性示出了本发明实施例适用的一种系统架构示意图。如图2所示,包括用户A使用的用户设备201,与用户设备连接的路由器202,路由器202连接交换机203和交换机204,交换机203和交换机204连接,交换机203和交换机204连接核心网设备205和核心网设备206,核心网设备205和核心网设备206之间连接,核心网设备205和核心网设备206连接交换机207和交换机208,交换机207和交换机208之间连接,交换机207和交换机208连接路由器209,路由器209连接用户B的用户设备210。可选地,本发明实施例中的用户设备可连接多个路由器,路由器也可连接多个用户设备。在数据中心,核心网设备通常为虚拟机。Based on the above content, the system architecture applicable to the embodiment of the present invention is introduced below. Fig. 2 exemplarily shows a schematic diagram of a system architecture applicable to an embodiment of the present invention. As shown in FIG. 2 , it includes user equipment 201 used by user A, router 202 connected to the user equipment, router 202 connected to switch 203 and switch 204, switch 203 connected to switch 204, switch 203 and switch 204 connected to core network equipment 205 and The core network device 206 is connected between the core network device 205 and the core network device 206, the core network device 205 and the core network device 206 are connected to the switch 207 and the switch 208, the switch 207 is connected to the switch 208, and the switch 207 and the switch 208 are connected to the router 209. The router 209 connects the user equipment 210 of user B. Optionally, the user equipment in this embodiment of the present invention may be connected to multiple routers, and the router may also be connected to multiple user equipments. In a data center, core network devices are usually virtual machines.
本发明实施例中第一设备或第二设备中的任一个设备可为上述路由器、交换机、核心网设备中的任一个,第一设备和第二设备为两个不同的设备。本发明实施例中的故障检测设备可安装于第一设备,或安装于第二设备,或者第一设备和第二设备中分别安装一个故障检测设备。用户设备201传输的数据可通过多条链路传输给用户设备210,比如一条链路为用户设备201依次连接路由器202、交换机203、核心网设备205、交换机207、路由器209,以及用户设备210。假设第一设备为核心网设备205,第二设备为路由器209,第一设备和第二设备之间可以做故障检测,在检测到该条链路或该条链路上的设备出现故障时,向上层应用上报,并且切换至备用链路,比如经检测发现第二设备与第一设备的链路上的交换机207出现故障,此时切换至新链路,比如用户设备201依次连接路由器202、交换机203、核心网设备205、交换机208、路由器209,以及用户设备210。如此,可提高该系统架构的可靠性。Any one of the first device or the second device in the embodiment of the present invention may be any one of the aforementioned routers, switches, and core network devices, and the first device and the second device are two different devices. The fault detection device in the embodiment of the present invention may be installed in the first device, or in the second device, or one fault detection device is respectively installed in the first device and the second device. The data transmitted by the user equipment 201 can be transmitted to the user equipment 210 through multiple links. For example, a link connects the user equipment 201 to the router 202, the switch 203, the core network device 205, the switch 207, the router 209, and the user equipment 210. Assuming that the first device is a core network device 205, and the second device is a router 209, fault detection can be performed between the first device and the second device, and when it is detected that the link or a device on the link fails, Report to the upper-layer application and switch to the backup link. For example, it is detected that the switch 207 on the link between the second device and the first device is faulty. At this time, switch to a new link. For example, the user equipment 201 is connected to the router 202, Switch 203 , core network device 205 , switch 208 , router 209 , and user equipment 210 . In this way, the reliability of the system architecture can be improved.
基于上述内容,以及图2所示的系统架构,图3示出了本发明实施例提供的一种故障检测方法流程示意图,如图3所示,包括:Based on the above content and the system architecture shown in FIG. 2, FIG. 3 shows a schematic flowchart of a fault detection method provided by an embodiment of the present invention, as shown in FIG. 3, including:
步骤301,确定第一设备与第二设备对应的所有用户设备之间未建立业务连接;其中第二设备对应的用户设备的业务数据通过第二设备和第一设备进行传输;Step 301, determining that no service connection has been established between the first device and all user equipment corresponding to the second device; wherein the service data of the user equipment corresponding to the second device is transmitted through the second device and the first device;
步骤302,确定第一设备发送故障检测报文的频率,在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值;和/或Step 302: Determine the frequency at which the first device sends the fault detection message, and when it is determined that the frequency at which the first device sends the fault detection message is not less than the first threshold, adjust the frequency at which the first device sends the fault detection message so that after adjustment The frequency at which the first device sends the fault detection message is less than the first threshold; and/or
确定第二设备发送故障检测报文的频率;在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值。可选地,第一阈值可等于第二阈值,也可根据具体的实施环境动态确定第一阈值和第二阈值,第一阈值可大于,也可小于第二阈值。Determine the frequency at which the second device sends the fault detection message; when determining that the frequency at which the second device sends the fault detection message is not less than the second threshold, adjust the frequency at which the second device sends the fault detection message so that the adjusted second The frequency at which the device sends fault detection packets is less than the second threshold. Optionally, the first threshold may be equal to the second threshold, or the first threshold and the second threshold may be dynamically determined according to a specific implementation environment, and the first threshold may be greater than or less than the second threshold.
上述步骤301中,第二设备对应的用户设备与第一设备和第二设备的连接方法有多种,比如用户设备通过其它设备与第二设备连接,第二设备通过其它设备与第一设备连接,或者用户设备直接与第二设备连接,第二设备直接与第一设备连接。无论第二设备对应的用户设备如何与第一设备和第二设备的连接,用户设备的业务数据均需要经过第一设备和第二设备进行传输。In the above step 301, there are many ways to connect the user equipment corresponding to the second device to the first device and the second device, for example, the user equipment is connected to the second device through other devices, and the second device is connected to the first device through other devices , or the user equipment is directly connected to the second device, and the second device is directly connected to the first device. No matter how the user equipment corresponding to the second device is connected to the first device and the second device, the service data of the user equipment needs to be transmitted through the first device and the second device.
上述步骤301中,用户设备与第一设备之间建立业务连接具体是指用户设备通过第二设备以及第一设备传输请求与另一个用户设备建立业务连接的请求,在业务连接成功建立之后,用户设备与第二设备、用户设备与第一设备、以及用户设备与另一个用户设备之间均成功建立业务连接,可通过该建立成功的业务连接链路进行通信,即传输业务数据。In the above step 301, establishing a service connection between the user equipment and the first device specifically means that the user equipment transmits a request to establish a service connection with another user equipment through the second device and the first device. After the service connection is successfully established, the user A service connection is successfully established between the device and the second device, the user equipment and the first device, and the user equipment and another user equipment, and communication can be performed through the successfully established service connection link, that is, service data can be transmitted.
本发明实施例中,确定第一设备发送故障检测报文的频率,具体有多种方式,比如通过获取用于确定第一设备发送故障检测报文的频率的定时器的参数,将该定时器的时间参数进行更改,以便达到调整第一设备发送故障检测报文的频率的目的。确定第二设备发送故障检测报文的频率,可通过传输信令获取用于确定第二设备发送故障检测报文的频率的定时器的参数,将该定时器的时间参数进行更改,以便达到调整第二设备发送故障检测报文的频率的目的。举个例子,用于确定第一设备发送故障检测报文的频率的定时器的预定时间为10毫秒,则第一设备每隔10毫秒发送一次故障检测报文,将该定时器的时间设置为20毫秒,则第一设备每隔20毫秒发送一次故障检测报文,如此,第一设备发送故障检测报文的频率则降低。In the embodiment of the present invention, there are many ways to determine the frequency of the first device sending the fault detection message. For example, by obtaining the parameters of the timer used to determine the frequency of the first device sending the fault detection message, the timer The time parameter of the device is changed, so as to achieve the purpose of adjusting the frequency of sending the fault detection message by the first device. To determine the frequency at which the second device sends the fault detection message, the parameters of the timer used to determine the frequency at which the second device sends the fault detection message can be obtained through transmission signaling, and the time parameter of the timer is changed to achieve adjustment The purpose of the frequency of sending the fault detection message by the second device. For example, if the preset time of the timer used to determine the frequency of sending the fault detection message by the first device is 10 milliseconds, then the first device sends a fault detection message every 10 milliseconds, and the time of the timer is set to 20 milliseconds, the first device sends a fault detection message every 20 milliseconds, so that the frequency of the first device to send a fault detection message is reduced.
上述步骤302中,可更改第一设备向第二设备发送的故障检测报文的频率,第二设备接收到该故障检测报文之后可以回响应,如此,第一设备发送的故障检测报文的频率降低,则第二设备发送响应的频率也降低,因此,减少了数据传输,减轻了网络负荷。In the above step 302, the frequency of the fault detection message sent by the first device to the second device can be changed, and the second device can respond after receiving the fault detection message. In this way, the frequency of the fault detection message sent by the first device If the frequency is reduced, the frequency of sending the response by the second device is also reduced, thus reducing data transmission and reducing network load.
另一种实现方式为可更改第二设备向第一设备发送的故障检测报文的频率,第一设备接收到该故障检测报文之后可以回响应,如此,第二设备发送的故障检测报文的频率降低,则第一设备发送响应的频率也降低,因此,减少了数据传输,减轻了网络负荷。Another implementation method is to change the frequency of the fault detection message sent by the second device to the first device, and the first device can respond after receiving the fault detection message. In this way, the fault detection message sent by the second device If the frequency of the first device is reduced, the frequency of sending the response by the first device is also reduced. Therefore, the data transmission is reduced and the network load is reduced.
第三种实现方式可为同时更改第一设备向第二设备发送的故障检测报文的频率,以及第二设备向第一设备发送的故障检测报文的频率,如此,第一设备发送的故障检测报文的频率降低,第二设备发送故障检测报文的频率也降低,因此,减少了数据传输,减轻了网络负荷。The third way of implementation can be to change the frequency of the fault detection message sent by the first device to the second device at the same time, and the frequency of the fault detection message sent by the second device to the first device. In this way, the fault detection message sent by the first device The frequency of the detection message is reduced, and the frequency of sending the fault detection message by the second device is also reduced. Therefore, the data transmission is reduced and the network load is reduced.
由于在确定第一设备与第二设备对应的所有用户设备之间未建立业务连接时,第二设备对应的用户设备不会向第一设备发送业务数据,因此,此时无需以高频率发送故障检测报文,此时在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值,和/或在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值,即降低了故障检测报文的发送频率,从而减轻网络负荷,降低网络开销,节省了网络资源。Since the user equipment corresponding to the second device will not send service data to the first device when it is determined that no service connection has been established between the first device and all user equipment corresponding to the second device, there is no need to send faults at a high frequency at this time Detecting messages, at this time, when it is determined that the frequency of sending fault detection messages of the first device is not less than the first threshold, adjust the frequency of sending fault detection messages of the first device, so that the adjusted first device sends fault detection messages is less than the first threshold, and/or when it is determined that the frequency at which the second device sends fault detection messages is not less than the second threshold, adjust the frequency at which the second device sends fault detection messages so that the adjusted second device sends The frequency of the fault detection message is less than the second threshold, that is, the frequency of sending the fault detection message is reduced, thereby reducing network load, reducing network overhead, and saving network resources.
可选地,在确定第二设备对应的至少一个用户设备与第一设备之间建立业务连接时,确定出在预设时长内接收到的第二设备发送的业务数据的数量;其中,预设时长可根据具体情况来确定,比如根据经验设置为1秒或10秒等;Optionally, when it is determined that at least one user equipment corresponding to the second device establishes a service connection with the first device, the number of service data sent by the second device received within a preset time period is determined; wherein, the preset The duration can be determined according to the specific situation, such as setting it to 1 second or 10 seconds based on experience;
在确定出业务数据的数量为零时,确定第一设备发送故障检测报文的频率,在确定第一设备发送故障检测报文的频率小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率不小于第一阈值;和/或When it is determined that the number of business data is zero, determine the frequency of the first device to send the fault detection message, and when it is determined that the frequency of the first device to send the fault detection message is less than the first threshold, adjust the first device to send the fault detection message frequency, so that the adjusted frequency of the first device sending the fault detection message is not less than the first threshold; and/or
在确定出业务数据的数量为零时,确定第二设备发送故障检测报文的频率,在确定第二设备发送故障检测报文的频率小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率不小于第二阈值。When it is determined that the amount of business data is zero, determine the frequency of the second device sending the fault detection message, and when it is determined that the frequency of the second device sending the fault detection message is less than the second threshold, adjust the second device to send the fault detection message frequency, so that the adjusted frequency of the second device sending the fault detection message is no less than the second threshold.
由于在确定第二设备对应的至少一个用户设备与第一设备之间建立业务连接,因此,用户设备一定会通过第二设备和第一设备进行业务数据传输,如果在预设时长内接收到的第二设备发送的业务数据的数量为零,则说明在预设时长内第二设备未向第一设备传输用户设备所对应的业务数据,此时判定第二设备可能出现故障,因此,可将故障检测报文的频率提高。如此,则在设备或链路出现故障时,提高故障检测速率,进而减少了在设备或链路出现故障时数据传输的中断时间。Since at least one user equipment corresponding to the second device is determined to establish a service connection with the first device, the user equipment must transmit service data through the second device and the first device, if the received The number of service data sent by the second device is zero, which means that the second device has not transmitted the service data corresponding to the user equipment to the first device within the preset time period. At this time, it is determined that the second device may be faulty. Therefore, the The frequency of fault detection messages is increased. In this way, when a device or link fails, the fault detection rate is increased, thereby reducing the interruption time of data transmission when the device or link fails.
进一步,当第二设备故障时,第二设备上的业务数据传输中断,此时在某个接口上发现第二设备上传输的业务数据中断后,立即提高故障检测报文的发送频率,第一设备通过该接口发送故障检测报文。由于多数情况下都是单个设备发生故障,因此个别设备提高故障检测报文的发送频率,也不会为网络带了较大的负荷,且也提高了故障检测速度。Furthermore, when the second device fails, the transmission of service data on the second device is interrupted. At this time, when the service data transmitted on the second device is found to be interrupted on a certain interface, the frequency of sending fault detection messages is immediately increased. The first The device sends fault detection packets through this interface. Since most of the time a single device fails, increasing the frequency of sending fault detection packets by individual devices will not bring a large load to the network, and the fault detection speed will also be improved.
可选地,在确定第二设备连接的至少一个用户设备与第一设备之间建立业务连接时,确定出在预设时长内接收到的第二设备发送的业务数据的数量时;Optionally, when it is determined that at least one user equipment connected to the second device establishes a service connection with the first device, the amount of service data sent by the second device received within a preset time period is determined;
在确定出业务数据的数量不为零时,且第一设备在向第二设备发送故障检测报文时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备停止向第二设备发送故障检测报文;和/或When it is determined that the quantity of business data is not zero, and when the first device sends a fault detection message to the second device, adjust the frequency of the first device to send the fault detection message, so that the adjusted first device stops sending the fault detection message to the second device. The second device sends a fault detection message; and/or
在确定出业务数据的数量不为零时,且第二设备在向第一设备发送故障检测报文时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备停止向第一设备发送故障检测报文。When it is determined that the quantity of business data is not zero, and when the second device sends a fault detection message to the first device, adjust the frequency of the second device to send the fault detection message, so that the adjusted second device stops sending the fault detection message to the first device. The first device sends a fault detection packet.
由于在确定第二设备对应的至少一个用户设备与第一设备之间建立业务连接,因此,用户设备一定会通过第二设备和第一设备进行业务数据传输,如果在预设时长内接收到的第二设备发送的业务数据的数量为不为零,则说明第一设备和第二设备之间的链路及设备均正常,此时可停止发送故障检测报文,以便减轻网络负荷,降低网络开销,节省了网络资源。Since at least one user equipment corresponding to the second device is determined to establish a service connection with the first device, the user equipment must transmit service data through the second device and the first device, if the received If the number of business data sent by the second device is not zero, it means that the links and devices between the first device and the second device are normal. At this time, you can stop sending fault detection messages to reduce the network load and reduce the network load. overhead, saving network resources.
另一种可选的实施方式为,在确定第二设备连接的至少一个用户设备与第一设备之间建立业务连接时,确定出在预设时长内接收到的第二设备发送的业务数据的数量时;Another optional implementation manner is that when it is determined that at least one user equipment connected to the second device establishes a service connection with the first device, it is determined that the service data sent by the second device is received within a preset time period. quantity;
在确定出业务数据的数量不为零时,确定第一设备发送故障检测报文的频率;在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值;和/或When it is determined that the amount of business data is not zero, determine the frequency at which the first device sends a fault detection message; when it is determined that the frequency at which the first device sends a fault detection message is not less than the first threshold, adjust the first device to send a fault detection The frequency of the message, so that the adjusted frequency of the first device sending the fault detection message is less than the first threshold; and/or
在确定出业务数据的数量不为零时,确定第二设备发送故障检测报文的频率;在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值。When it is determined that the amount of business data is not zero, determine the frequency at which the second device sends the fault detection message; when it is determined that the frequency at which the second device sends the fault detection message is not less than the second threshold, adjust the second device to send the fault detection message The frequency of the message, so that the adjusted frequency of the second device sending the fault detection message is less than the second threshold.
由于在确定第二设备对应的至少一个用户设备与第一设备之间建立业务连接,因此,用户设备一定会通过第二设备和第一设备进行业务数据传输,如果在预设时长内接收到的第二设备发送的业务数据的数量为不为零,则说明第一设备和第二设备之间的链路及设备均正常,此时可降低发送故障检测报文的频率,一方面持续对链路以及设备进行检测,另一方面减轻网络负荷,降低网络开销,节省了网络资源。Since at least one user equipment corresponding to the second device is determined to establish a service connection with the first device, the user equipment must transmit service data through the second device and the first device, if the received If the number of business data sent by the second device is not zero, it means that the link and equipment between the first device and the second device are normal. At this time, the frequency of sending fault detection messages can be reduced. On the other hand, it reduces network load, reduces network overhead, and saves network resources.
在上述实施方式中,可选地,在确定第一设备发送故障检测报文的频率不小于第一阈值时,根据业务数据的数量调整第一设备发送故障检测报文的频率;其中,业务数据的数量越小,调整后第一设备发送故障检测报文的频率越大;在确定第二设备发送故障检测报文的频率不小于第二阈值时,根据业务数据的数量调整第二设备发送故障检测报文的频率;其中,业务数据的数量越小,调整后第二设备发送故障检测报文的频率越大。In the above embodiment, optionally, when it is determined that the frequency at which the first device sends the fault detection message is not less than the first threshold, the frequency at which the first device sends the fault detection message is adjusted according to the amount of service data; wherein, the service data The smaller the number, the greater the frequency of the first device to send fault detection packets after adjustment; when it is determined that the frequency of the second device to send fault detection packets is not less than the second threshold, adjust the second device according to the amount of business data. The frequency of the detection message; wherein, the smaller the amount of service data is, the higher the frequency of the adjusted second device is to send the fault detection message.
如此,可灵活的根据用户设备对应的业务数据的发送量确定故障检测报文的发送频率,更加有效的实现了对链路以及设备的检测,且更有效的减轻网络负荷,降低网络开销,节省了网络资源。In this way, the sending frequency of fault detection messages can be flexibly determined according to the sending volume of service data corresponding to the user equipment, which can more effectively realize the detection of links and devices, and more effectively reduce network load, reduce network overhead, and save network resources.
为了更清楚的介绍上述方法流程,图3a示例性示出了本发明实施例提供的另一种故障检测方法的流程示意图,如图3a所示,包括:In order to more clearly introduce the flow of the above method, FIG. 3a exemplarily shows a schematic flow diagram of another fault detection method provided by an embodiment of the present invention, as shown in FIG. 3a, including:
步骤3101,确定第二设备对应的所有用户设备与第一设备之间是否建立业务连接,若否,则执行步骤3102,若是,则执行步骤3103;Step 3101, determine whether service connections are established between all user equipment corresponding to the second device and the first device, if not, perform step 3102, and if yes, perform step 3103;
步骤3102,确定第一设备发送故障检测报文的频率,在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值;和/或Step 3102: Determine the frequency at which the first device sends the fault detection message, and when it is determined that the frequency at which the first device sends the fault detection message is not less than the first threshold, adjust the frequency at which the first device sends the fault detection message so that after adjustment The frequency at which the first device sends the fault detection message is less than the first threshold; and/or
确定第二设备发送故障检测报文的频率;在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值。Determine the frequency at which the second device sends the fault detection message; when determining that the frequency at which the second device sends the fault detection message is not less than the second threshold, adjust the frequency at which the second device sends the fault detection message so that the adjusted second The frequency at which the device sends fault detection packets is less than the second threshold.
步骤3103,确定出在预设时长内接收到的第二设备发送的业务数据的数量,并判断业务数据的数量是否为零;若是,则执行步骤3104;若否,则执行步骤3105或步骤3106;Step 3103, determine the number of business data sent by the second device received within the preset time length, and judge whether the number of business data is zero; if yes, execute step 3104; if not, execute step 3105 or step 3106 ;
步骤3104,确定第一设备发送故障检测报文的频率,在确定第一设备发送故障检测报文的频率小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率不小于第一阈值;和/或Step 3104: Determine the frequency at which the first device sends the fault detection message, and when it is determined that the frequency at which the first device sends the fault detection message is less than the first threshold, adjust the frequency at which the first device sends the fault detection message so that the adjusted The frequency at which the first device sends the fault detection message is not less than the first threshold; and/or
确定第二设备发送故障检测报文的频率,在确定第二设备发送故障检测报文的频率小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率不小于第二阈值。Determine the frequency at which the second device sends the fault detection message, and adjust the frequency at which the second device sends the fault detection message when it is determined that the frequency at which the second device sends the fault detection message is less than the second threshold, so that the adjusted second device The frequency of sending the fault detection message is not less than the second threshold.
步骤3105,确定在向第二设备发送故障检测报文时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备停止向第二设备发送故障检测报文;和/或Step 3105, determining that when sending the fault detection message to the second device, adjust the frequency of sending the fault detection message by the first device, so that the adjusted first device stops sending the fault detection message to the second device; and/or
确定第二设备在向第一设备发送故障检测报文时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备停止向第一设备发送故障检测报文。It is determined that when the second device sends the fault detection message to the first device, the frequency of sending the fault detection message by the second device is adjusted, so that the adjusted second device stops sending the fault detection message to the first device.
步骤3106,确定第一设备发送故障检测报文的频率;在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值;和/或Step 3106, determine the frequency of the first device to send the fault detection message; when it is determined that the frequency of the first device to send the fault detection message is not less than the first threshold, adjust the frequency of the first device to send the fault detection message, so that after adjustment The frequency at which the first device sends the fault detection message is less than the first threshold; and/or
确定第二设备发送故障检测报文的频率;在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值。Determine the frequency at which the second device sends the fault detection message; when determining that the frequency at which the second device sends the fault detection message is not less than the second threshold, adjust the frequency at which the second device sends the fault detection message so that the adjusted second The frequency at which the device sends fault detection packets is less than the second threshold.
为更加清晰描述上述方法,图3b示例性实例了本发明实施例提供的一种故障检测报文的发送频率与业务数据的数量之间的关系示意图。如图3b所示,粗实线表示业务数据的数量,点画线表示故障检测报文的发送频率。当业务连接未建立时,第一设备和第二设备之间的业务数据数量为零,此时,以较小的频率发送故障检测报文,比如以小于第一阈值的频率发送故障检测报文。当业务连接已建立时,业务数据的数量与故障检测报文的发送频率成反比,也就是说,业务数据的数量越大,故障检测报文的发送频率越低,优选地,在业务数据的数量较大时,可停止发送故障检测报文。在业务连接已建立,但业务数据数量为零时,加大故障检测报文的发送频率,以便迅速的定位出发送故障的设备或传输线路。In order to describe the above method more clearly, Fig. 3b exemplarily illustrates a schematic diagram of the relationship between the sending frequency of the fault detection message and the quantity of service data provided by the embodiment of the present invention. As shown in Figure 3b, the thick solid line represents the quantity of service data, and the dotted line represents the frequency of sending fault detection messages. When the service connection is not established, the amount of service data between the first device and the second device is zero, at this time, send the fault detection message with a relatively small frequency, for example, send the fault detection message with a frequency less than the first threshold . When the business connection has been established, the amount of business data is inversely proportional to the frequency of sending fault detection messages, that is to say, the larger the amount of business data, the lower the frequency of sending fault detection messages. When the number is large, the sending of fault detection packets can be stopped. When the service connection has been established but the amount of service data is zero, increase the frequency of sending fault detection messages so as to quickly locate the faulty equipment or transmission line.
从上述内容可看出:本发明实施例中,确定第一设备与第二设备对应的所有用户设备之间未建立业务连接;其中第二设备对应的用户设备的业务数据通过第二设备和第一设备进行传输;确定第一设备发送故障检测报文的频率,在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值;和/或确定第二设备发送故障检测报文的频率;在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值。由于在确定第一设备与第二设备对应的所有用户设备之间未建立业务连接时,第二设备对应的用户设备不会向第一设备发送业务数据,因此,此时无需以高频率发送故障检测报文,此时在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值,和/或在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值,即降低了故障检测报文的发送频率,从而减轻网络负荷,降低网络开销,节省了网络资源。It can be seen from the above that in the embodiment of the present invention, it is determined that no service connection has been established between all user equipment corresponding to the first device and the second device; wherein the service data of the user equipment corresponding to the second device passes through the second device and the second device A device transmits; determine the frequency at which the first device sends the fault detection message, and adjust the frequency at which the first device sends the fault detection message when it is determined that the frequency at which the first device sends the fault detection message is not less than the first threshold, so that The adjusted frequency of the first device sending the fault detection message is less than the first threshold; and/or determining the frequency of the second device sending the fault detection message; after determining that the frequency of the second device sending the fault detection message is not less than the second threshold , adjust the frequency at which the second device sends the fault detection packet, so that the adjusted frequency at which the second device sends the fault detection packet is less than the second threshold. Since the user equipment corresponding to the second device will not send service data to the first device when it is determined that no service connection has been established between the first device and all user equipment corresponding to the second device, there is no need to send faults at a high frequency at this time Detecting messages, at this time, when it is determined that the frequency of sending fault detection messages of the first device is not less than the first threshold, adjust the frequency of sending fault detection messages of the first device, so that the adjusted first device sends fault detection messages is less than the first threshold, and/or when it is determined that the frequency at which the second device sends fault detection messages is not less than the second threshold, adjust the frequency at which the second device sends fault detection messages so that the adjusted second device sends The frequency of the fault detection message is less than the second threshold, that is, the frequency of sending the fault detection message is reduced, thereby reducing network load, reducing network overhead, and saving network resources.
图4示例性示出了本发明实施例提供的一种故障检测设备的结构示意图。Fig. 4 exemplarily shows a schematic structural diagram of a fault detection device provided by an embodiment of the present invention.
基于相同构思,本发明实施例提供一种故障检测设备400,如图4所示,包括确定单元401和处理单元402:Based on the same idea, an embodiment of the present invention provides a fault detection device 400, as shown in FIG. 4 , including a determination unit 401 and a processing unit 402:
确定单元,用于确定第一设备与第二设备对应的所有用户设备之间未建立业务连接;其中第二设备对应的用户设备的业务数据通过第二设备和第一设备进行传输;A determining unit, configured to determine that no service connection has been established between the first device and all user equipment corresponding to the second device; wherein the service data of the user equipment corresponding to the second device is transmitted through the second device and the first device;
处理单元,用于在通过确定单元确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值;和/或A processing unit, configured to adjust the frequency of the first device to send the fault detection message when the determination unit determines that the frequency of the first device to send the fault detection message is not less than the first threshold, so that the adjusted first device sends the fault detection message The frequency of the message is less than the first threshold; and/or
在通过确定单元确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值。When it is determined by the determination unit that the frequency of sending fault detection messages by the second device is not less than the second threshold, adjust the frequency of sending fault detection messages by the second device so that the adjusted frequency of sending fault detection messages by the second device is less than second threshold.
可选地,确定单元,还用于:Optionally, determine the unit, also for:
在确定第二设备对应的至少一个用户设备与第一设备之间建立业务连接时,确定出第一设备在预设时长内接收到的第二设备发送的业务数据的数量;When determining that at least one user equipment corresponding to the second device establishes a service connection with the first device, determine the amount of service data sent by the second device received by the first device within a preset time period;
处理单元,还用于:processing unit, also used for:
在通过确定单元确定出业务数据的数量为零时,确定第一设备发送故障检测报文的频率,在确定第一设备发送故障检测报文的频率小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率不小于第一阈值;和/或When it is determined by the determination unit that the number of service data is zero, determine the frequency at which the first device sends a fault detection message, and when it is determined that the frequency at which the first device sends a fault detection message is less than the first threshold, adjust the first device to send a fault detection message. The frequency of the detection message, so that the adjusted frequency of the first device sending the fault detection message is not less than the first threshold; and/or
在通过确定单元确定出业务数据的数量为零时,确定第二设备发送故障检测报文的频率,在确定第二设备发送故障检测报文的频率小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率不小于第二阈值。When it is determined by the determination unit that the number of service data is zero, determine the frequency of the second device sending fault detection messages, and when it is determined that the frequency of the second device sending fault detection messages is less than the second threshold, adjust the second device to send fault detection messages The frequency of detection packets, so that the adjusted frequency of sending fault detection packets by the second device is no less than the second threshold.
可选地,处理单元,还用于:Optionally, the processing unit is also used for:
在通过确定单元确定出业务数据的数量不为零时,且第一设备在向第二设备发送故障检测报文时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备停止向第二设备发送故障检测报文;和/或When it is determined by the determination unit that the quantity of service data is not zero, and when the first device sends a fault detection message to the second device, adjust the frequency of the first device to send the fault detection message, so that the adjusted first The device stops sending fault detection packets to the second device; and/or
在通过确定单元确定出业务数据的数量不为零时,且第二设备在向第一设备发送故障检测报文时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备停止向第一设备发送故障检测报文。When it is determined by the determination unit that the number of service data is not zero, and the second device sends a fault detection message to the first device, adjust the frequency of the second device to send the fault detection message, so that the adjusted second The device stops sending fault detection packets to the first device.
可选地,处理单元,还用于:Optionally, the processing unit is also used for:
在通过确定单元确定出业务数据的数量不为零时,确定第一设备发送故障检测报文的频率;在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值;和/或When it is determined by the determining unit that the quantity of service data is not zero, determine the frequency at which the first device sends the fault detection message; when it is determined that the frequency at which the first device sends the fault detection message is not less than the first threshold, adjust the first device The frequency of sending the fault detection message, so that the adjusted frequency of the first device to send the fault detection message is less than the first threshold; and/or
在通过确定单元确定出业务数据的数量不为零时,确定第二设备发送故障检测报文的频率;在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值。When it is determined by the determining unit that the quantity of business data is not zero, determine the frequency of the second device sending the fault detection message; when it is determined that the frequency of the second device sending the fault detection message is not less than the second threshold, adjust the second device The frequency of sending the fault detection packet, so that the adjusted frequency of the second device for sending the fault detection packet is less than the second threshold.
可选地,处理单元,具体用于:Optionally, a processing unit, specifically for:
在确定第一设备发送故障检测报文的频率不小于第一阈值时,根据业务数据的数量调整第一设备发送故障检测报文的频率;其中,业务数据的数量越小,调整后第一设备发送故障检测报文的频率越大;When it is determined that the frequency at which the first device sends fault detection packets is not less than the first threshold, adjust the frequency at which the first device sends fault detection packets according to the amount of business data; wherein, the smaller the amount of business data, the adjusted first device The higher the frequency of sending fault detection messages;
在确定第二设备发送故障检测报文的频率不小于第二阈值时,根据业务数据的数量调整第二设备发送故障检测报文的频率;其中,业务数据的数量越小,调整后第二设备发送故障检测报文的频率越大。When it is determined that the frequency at which the second device sends fault detection packets is not less than the second threshold, adjust the frequency at which the second device sends fault detection packets according to the amount of business data; wherein, the smaller the amount of business data, the adjusted second device The higher the frequency of sending fault detection packets.
从上述内容可看出:本发明实施例中,确定第一设备与第二设备对应的所有用户设备之间未建立业务连接;其中第二设备对应的用户设备的业务数据通过第二设备和第一设备进行传输;确定第一设备发送故障检测报文的频率,在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值;和/或确定第二设备发送故障检测报文的频率;在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值。由于在确定第一设备与第二设备对应的所有用户设备之间未建立业务连接时,第二设备对应的用户设备不会向第一设备发送业务数据,因此,此时无需以高频率发送故障检测报文,此时在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值,和/或在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值,即降低了故障检测报文的发送频率,从而减轻网络负荷,降低网络开销,节省了网络资源。It can be seen from the above that in the embodiment of the present invention, it is determined that no service connection has been established between all user equipment corresponding to the first device and the second device; wherein the service data of the user equipment corresponding to the second device passes through the second device and the second device A device transmits; determine the frequency at which the first device sends the fault detection message, and adjust the frequency at which the first device sends the fault detection message when it is determined that the frequency at which the first device sends the fault detection message is not less than the first threshold, so that The adjusted frequency of the first device sending the fault detection message is less than the first threshold; and/or determining the frequency of the second device sending the fault detection message; after determining that the frequency of the second device sending the fault detection message is not less than the second threshold , adjust the frequency at which the second device sends the fault detection packet, so that the adjusted frequency at which the second device sends the fault detection packet is less than the second threshold. Since the user equipment corresponding to the second device will not send service data to the first device when it is determined that no service connection has been established between the first device and all user equipment corresponding to the second device, there is no need to send faults at a high frequency at this time Detecting messages, at this time, when it is determined that the frequency of sending fault detection messages of the first device is not less than the first threshold, adjust the frequency of sending fault detection messages of the first device, so that the adjusted first device sends fault detection messages is less than the first threshold, and/or when it is determined that the frequency at which the second device sends fault detection messages is not less than the second threshold, adjust the frequency at which the second device sends fault detection messages so that the adjusted second device sends The frequency of the fault detection message is less than the second threshold, that is, the frequency of sending the fault detection message is reduced, thereby reducing network load, reducing network overhead, and saving network resources.
图5示例性示出了本发明实施例提供的另一种故障检测设备的结构示意图。Fig. 5 exemplarily shows a schematic structural diagram of another fault detection device provided by an embodiment of the present invention.
基于相同构思,本发明实施例提供另一种故障检测设备500,如图5所示,包括收发器510、存储器520、处理器530:Based on the same idea, an embodiment of the present invention provides another fault detection device 500, as shown in FIG. 5 , including a transceiver 510, a memory 520, and a processor 530:
处理器,用于读取存储器中的程序,执行下列过程:The processor, which reads the program in the memory, performs the following processes:
确定第一设备与第二设备对应的所有用户设备之间未建立业务连接;其中第二设备对应的用户设备的业务数据通过第二设备和第一设备进行传输;Determining that no service connection has been established between the first device and all user equipment corresponding to the second device; where the service data of the user equipment corresponding to the second device is transmitted through the second device and the first device;
在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值;和/或When it is determined that the frequency at which the first device sends the fault detection message is not less than the first threshold, adjust the frequency at which the first device sends the fault detection message so that the adjusted frequency at which the first device sends the fault detection message is less than the first threshold ;and / or
在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值;When it is determined that the frequency at which the second device sends the fault detection message is not less than the second threshold, adjust the frequency at which the second device sends the fault detection message, so that the adjusted frequency at which the second device sends the fault detection message is less than the second threshold ;
存储器,用于存储第一阈值和第二阈值,以及上述方法流程执行过程中其它相关数据;A memory for storing the first threshold and the second threshold, and other relevant data during the execution of the above method;
收发器,用于发送和接收信令。Transceivers for sending and receiving signaling.
可选地,处理器,还用于:Optionally, the processor is also used to:
在确定第二设备对应的至少一个用户设备与第一设备之间建立业务连接时,确定出第一设备在预设时长内接收到的第二设备发送的业务数据的数量;When determining that at least one user equipment corresponding to the second device establishes a service connection with the first device, determine the amount of service data sent by the second device received by the first device within a preset time period;
在确定出业务数据的数量为零时,确定第一设备发送故障检测报文的频率,在确定第一设备发送故障检测报文的频率小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率不小于第一阈值;和/或When it is determined that the number of business data is zero, determine the frequency of the first device to send the fault detection message, and when it is determined that the frequency of the first device to send the fault detection message is less than the first threshold, adjust the first device to send the fault detection message frequency, so that the adjusted frequency of the first device sending the fault detection message is not less than the first threshold; and/or
在确定出业务数据的数量为零时,确定第二设备发送故障检测报文的频率,在确定第二设备发送故障检测报文的频率小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率不小于第二阈值。When it is determined that the amount of business data is zero, determine the frequency of the second device sending the fault detection message, and when it is determined that the frequency of the second device sending the fault detection message is less than the second threshold, adjust the second device to send the fault detection message frequency, so that the adjusted frequency of the second device sending the fault detection message is no less than the second threshold.
可选地,处理器,还用于:Optionally, the processor is also used to:
在确定出业务数据的数量不为零时,且第一设备在向第二设备发送故障检测报文时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备停止向第二设备发送故障检测报文;和/或When it is determined that the quantity of business data is not zero, and when the first device sends a fault detection message to the second device, adjust the frequency of the first device to send the fault detection message, so that the adjusted first device stops sending the fault detection message to the second device. The second device sends a fault detection message; and/or
在确定出业务数据的数量不为零时,且第二设备在向第一设备发送故障检测报文时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备停止向第一设备发送故障检测报文。When it is determined that the quantity of business data is not zero, and when the second device sends a fault detection message to the first device, adjust the frequency of the second device to send the fault detection message, so that the adjusted second device stops sending the fault detection message to the first device. The first device sends a fault detection packet.
可选地,处理器,还用于:Optionally, the processor is also used to:
在确定出业务数据的数量不为零时,确定第一设备发送故障检测报文的频率;在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值;和/或When it is determined that the amount of business data is not zero, determine the frequency at which the first device sends a fault detection message; when it is determined that the frequency at which the first device sends a fault detection message is not less than the first threshold, adjust the first device to send a fault detection The frequency of the message, so that the adjusted frequency of the first device sending the fault detection message is less than the first threshold; and/or
在确定出业务数据的数量不为零时,确定第二设备发送故障检测报文的频率;在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值。When it is determined that the amount of business data is not zero, determine the frequency at which the second device sends the fault detection message; when it is determined that the frequency at which the second device sends the fault detection message is not less than the second threshold, adjust the second device to send the fault detection message The frequency of the message, so that the adjusted frequency of the second device sending the fault detection message is less than the second threshold.
可选地,处理器,具体用于:Optionally, a processor, specifically for:
在确定第一设备发送故障检测报文的频率不小于第一阈值时,根据业务数据的数量调整第一设备发送故障检测报文的频率;其中,业务数据的数量越小,调整后第一设备发送故障检测报文的频率越大;When it is determined that the frequency at which the first device sends fault detection packets is not less than the first threshold, adjust the frequency at which the first device sends fault detection packets according to the amount of business data; wherein, the smaller the amount of business data, the adjusted first device The higher the frequency of sending fault detection messages;
在确定第二设备发送故障检测报文的频率不小于第二阈值时,根据业务数据的数量调整第二设备发送故障检测报文的频率;其中,业务数据的数量越小,调整后第二设备发送故障检测报文的频率越大。When it is determined that the frequency at which the second device sends fault detection packets is not less than the second threshold, adjust the frequency at which the second device sends fault detection packets according to the amount of business data; wherein, the smaller the amount of business data, the adjusted second device The higher the frequency of sending fault detection packets.
其中,在图5中,总线架构可以包括任意数量的互联的总线和桥,具体由处理器代表的一个或多个处理器和存储器代表的存储器的各种电路链接在一起。总线架构还可以将诸如外围设备、稳压器和功率管理电路等之类的各种其他电路链接在一起,这些都是本领域所公知的,因此,本文不再对其进行进一步描述。总线接口提供接口。收发器可以是多个元件,即包括发送机和收发机,提供用于在传输介质上与各种其他装置通信的单元。处理器负责管理总线架构和通常的处理,存储器可以存储处理器在执行操作时所使用的数据。Wherein, in FIG. 5 , the bus architecture may include any number of interconnected buses and bridges, specifically one or more processors represented by a processor and various circuits of a memory represented by a memory are linked together. The bus architecture can also link together various other circuits such as peripherals, voltage regulators, and power management circuits, etc., which are well known in the art and therefore will not be further described herein. The bus interface provides the interface. A transceiver may be a plurality of elements, ie, including a transmitter and a transceiver, providing means for communicating with various other devices over a transmission medium. The processor is responsible for managing the bus architecture and general processing, and the memory stores data that the processor uses when performing operations.
从上述内容可看出:本发明实施例中,确定第一设备与第二设备对应的所有用户设备之间未建立业务连接;其中第二设备对应的用户设备的业务数据通过第二设备和第一设备进行传输;确定第一设备发送故障检测报文的频率,在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值;和/或确定第二设备发送故障检测报文的频率;在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值。由于在确定第一设备与第二设备对应的所有用户设备之间未建立业务连接时,第二设备对应的用户设备不会向第一设备发送业务数据,因此,此时无需以高频率发送故障检测报文,此时在确定第一设备发送故障检测报文的频率不小于第一阈值时,调整第一设备发送故障检测报文的频率,以使调整后的第一设备发送故障检测报文的频率小于第一阈值,和/或在确定第二设备发送故障检测报文的频率不小于第二阈值时,调整第二设备发送故障检测报文的频率,以使调整后的第二设备发送故障检测报文的频率小于第二阈值,即降低了故障检测报文的发送频率,从而减轻网络负荷,降低网络开销,节省了网络资源。It can be seen from the above that in the embodiment of the present invention, it is determined that no service connection has been established between all user equipment corresponding to the first device and the second device; wherein the service data of the user equipment corresponding to the second device passes through the second device and the second device A device transmits; determine the frequency at which the first device sends the fault detection message, and adjust the frequency at which the first device sends the fault detection message when it is determined that the frequency at which the first device sends the fault detection message is not less than the first threshold, so that The adjusted frequency of the first device sending the fault detection message is less than the first threshold; and/or determining the frequency of the second device sending the fault detection message; after determining that the frequency of the second device sending the fault detection message is not less than the second threshold , adjust the frequency at which the second device sends the fault detection packet, so that the adjusted frequency at which the second device sends the fault detection packet is less than the second threshold. Since the user equipment corresponding to the second device will not send service data to the first device when it is determined that no service connection has been established between the first device and all user equipment corresponding to the second device, there is no need to send faults at a high frequency at this time Detecting messages, at this time, when it is determined that the frequency of sending fault detection messages of the first device is not less than the first threshold, adjust the frequency of sending fault detection messages of the first device, so that the adjusted first device sends fault detection messages is less than the first threshold, and/or when it is determined that the frequency at which the second device sends fault detection messages is not less than the second threshold, adjust the frequency at which the second device sends fault detection messages so that the adjusted second device sends The frequency of the fault detection message is less than the second threshold, that is, the frequency of sending the fault detection message is reduced, thereby reducing network load, reducing network overhead, and saving network resources.
本领域内的技术人员应明白,本发明的实施例可提供为方法、或计算机程序产品。因此,本发明可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本发明可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art should understand that the embodiments of the present invention may be provided as methods or computer program products. Accordingly, the present invention can take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein.
本发明是参照根据本发明实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的设备。The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It should be understood that each procedure and/or block in the flowchart and/or block diagram, and a combination of procedures and/or blocks in the flowchart and/or block diagram can be realized by computer program instructions. These computer program instructions may be provided to a general purpose computer, special purpose computer, embedded processor, or processor of other programmable data processing equipment to produce a machine such that the instructions executed by the processor of the computer or other programmable data processing equipment produce a A device for realizing the functions specified in one or more steps of the flowchart and/or one or more blocks of the block diagram.
这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令设备的制造品,该指令设备实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing device to operate in a specific manner, such that the instructions stored in the computer-readable memory produce an article of manufacture comprising an instruction device, the instructions The device realizes the function specified in one or more procedures of the flowchart and/or one or more blocks of the block diagram.
这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded onto a computer or other programmable data processing device, causing a series of operational steps to be performed on the computer or other programmable device to produce a computer-implemented process, thereby The instructions provide steps for implementing the functions specified in the flow chart or blocks of the flowchart and/or the block or blocks of the block diagrams.
尽管已描述了本发明的优选实施例,但本领域内的技术人员一旦得知了基本创造性概念,则可对这些实施例作出另外的变更和修改。所以,所附权利要求意欲解释为包括优选实施例以及落入本发明范围的所有变更和修改。While preferred embodiments of the invention have been described, additional changes and modifications to these embodiments can be made by those skilled in the art once the basic inventive concept is appreciated. Therefore, it is intended that the appended claims be construed to cover the preferred embodiment as well as all changes and modifications which fall within the scope of the invention.
显然,本领域的技术人员可以对本发明进行各种改动和变型而不脱离本发明的精神和范围。这样,倘若本发明的这些修改和变型属于本发明权利要求及其等同技术的范围之内,则本发明也意图包含这些改动和变型在内。Obviously, those skilled in the art can make various changes and modifications to the present invention without departing from the spirit and scope of the present invention. Thus, if these modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalent technologies, the present invention also intends to include these modifications and variations.
Claims (15)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201510851553.2A CN106817267B (en) | 2015-11-27 | 2015-11-27 | Fault detection method and equipment |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201510851553.2A CN106817267B (en) | 2015-11-27 | 2015-11-27 | Fault detection method and equipment |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN106817267A true CN106817267A (en) | 2017-06-09 |
| CN106817267B CN106817267B (en) | 2020-05-19 |
Family
ID=59156227
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201510851553.2A Active CN106817267B (en) | 2015-11-27 | 2015-11-27 | Fault detection method and equipment |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN106817267B (en) |
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109150644A (en) * | 2017-06-28 | 2019-01-04 | 阿里巴巴集团控股有限公司 | A kind of pair of server carries out the method and device of health detection |
| CN112564931A (en) * | 2019-09-25 | 2021-03-26 | 华为技术有限公司 | Fault processing method, device and storage medium |
| CN113055921A (en) * | 2021-03-29 | 2021-06-29 | 中国联合网络通信集团有限公司 | Troubleshooting method and terminal |
| CN115529255A (en) * | 2021-06-25 | 2022-12-27 | 华为技术有限公司 | A connection detection method, client and server |
| CN116723134A (en) * | 2023-07-10 | 2023-09-08 | 迈普通信技术股份有限公司 | PFC deadlock detection self-adaption method and device, storage medium and electronic equipment |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1968527A (en) * | 2006-09-27 | 2007-05-23 | 蒋晓海 | Method for intelligent connection between mobile phone client and server |
| US20070233855A1 (en) * | 2006-04-03 | 2007-10-04 | International Business Machines Corporation | Adaptible keepalive for enterprise extenders |
| CN101110848A (en) * | 2006-07-20 | 2008-01-23 | 华为技术有限公司 | A method of detecting channel failure |
| CN104159306A (en) * | 2014-07-22 | 2014-11-19 | 华为技术有限公司 | A method, device and system for controlling air interface resources |
| CN104812094A (en) * | 2015-04-21 | 2015-07-29 | 广东欧珀移动通信有限公司 | Method and apparatus for establishing connection between devices |
-
2015
- 2015-11-27 CN CN201510851553.2A patent/CN106817267B/en active Active
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20070233855A1 (en) * | 2006-04-03 | 2007-10-04 | International Business Machines Corporation | Adaptible keepalive for enterprise extenders |
| CN101110848A (en) * | 2006-07-20 | 2008-01-23 | 华为技术有限公司 | A method of detecting channel failure |
| CN1968527A (en) * | 2006-09-27 | 2007-05-23 | 蒋晓海 | Method for intelligent connection between mobile phone client and server |
| CN104159306A (en) * | 2014-07-22 | 2014-11-19 | 华为技术有限公司 | A method, device and system for controlling air interface resources |
| CN104812094A (en) * | 2015-04-21 | 2015-07-29 | 广东欧珀移动通信有限公司 | Method and apparatus for establishing connection between devices |
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109150644A (en) * | 2017-06-28 | 2019-01-04 | 阿里巴巴集团控股有限公司 | A kind of pair of server carries out the method and device of health detection |
| CN112564931A (en) * | 2019-09-25 | 2021-03-26 | 华为技术有限公司 | Fault processing method, device and storage medium |
| CN112564931B (en) * | 2019-09-25 | 2022-08-19 | 华为技术有限公司 | Fault processing method, device and storage medium |
| CN113055921A (en) * | 2021-03-29 | 2021-06-29 | 中国联合网络通信集团有限公司 | Troubleshooting method and terminal |
| CN113055921B (en) * | 2021-03-29 | 2022-09-23 | 中国联合网络通信集团有限公司 | Fault processing method and terminal |
| CN115529255A (en) * | 2021-06-25 | 2022-12-27 | 华为技术有限公司 | A connection detection method, client and server |
| CN116723134A (en) * | 2023-07-10 | 2023-09-08 | 迈普通信技术股份有限公司 | PFC deadlock detection self-adaption method and device, storage medium and electronic equipment |
Also Published As
| Publication number | Publication date |
|---|---|
| CN106817267B (en) | 2020-05-19 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN108574614B (en) | Message processing method, device and network system | |
| EP2878105B1 (en) | System and method using rsvp hello suppression for graceful restart capable neighbors | |
| US10587494B2 (en) | Network control method and apparatus | |
| US20200259738A1 (en) | Data Transmission Method, Apparatus, and Device | |
| US9692697B2 (en) | Control channel establishing method, forwarding point, and controller | |
| EP3188450B1 (en) | Reducing false alarms when using network keep-alive messages | |
| CN106817264B (en) | Method, device and system for detecting link fault | |
| CN100531071C (en) | Method and system for link failure detection and device for link failure detection | |
| US10516599B1 (en) | Link priority for loop-protect | |
| US9871690B2 (en) | Indicating a master network controller in a virtual router redundancy protocol | |
| CN106817267B (en) | Fault detection method and equipment | |
| CN108206759A (en) | A kind of method to E-Packet, equipment and system | |
| CN102142978B (en) | Data backup transmission processing method, device and system | |
| WO2012088910A1 (en) | Method and system for detecting connectivity fault | |
| CN107465613A (en) | link aggregation interface communication state switching method and device | |
| CN102006268A (en) | Method, equipment and system for switching main interface and standby interface | |
| US20230344747A1 (en) | Method and apparatus for detecting state of bgp session, and network device | |
| WO2022007861A1 (en) | Failure notification method and electronic device | |
| CN108462612A (en) | Adjust method, apparatus, electronic equipment and the storage medium of RTP media flow transmissions | |
| CN105681187A (en) | VRRP (Virtual Router Redundancy Protocol) backup set management method and related device | |
| CN106921553A (en) | The method and system of High Availabitity are realized in virtual network | |
| CN104618021B (en) | Optical fiber based data transmission method, device and system | |
| CN103414637B (en) | A kind of method of traffic forwarding and relevant apparatus | |
| WO2018040916A1 (en) | Message forwarding method and device | |
| CN101686153A (en) | Method for transmitting VRRP notice news by communication equipment and device thereof |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |