CN1744567A - A Method of Smooth Restart of Router Realizing Fast Routing Convergence - Google Patents
A Method of Smooth Restart of Router Realizing Fast Routing Convergence Download PDFInfo
- Publication number
- CN1744567A CN1744567A CNA2005100909481A CN200510090948A CN1744567A CN 1744567 A CN1744567 A CN 1744567A CN A2005100909481 A CNA2005100909481 A CN A2005100909481A CN 200510090948 A CN200510090948 A CN 200510090948A CN 1744567 A CN1744567 A CN 1744567A
- Authority
- CN
- China
- Prior art keywords
- router
- steadily
- restarting
- lsa
- link
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Landscapes
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
Description
技术领域technical field
本发明涉及一种实现路由快速收敛的路由器平稳重启的方法,属于数据通信技术领域。The invention relates to a method for stably restarting a router for realizing rapid route convergence, and belongs to the technical field of data communication.
背景技术Background technique
在发生链路状态改变时,IP网的传统路由协议的收敛时间一般在十几秒以上,在收敛期间会产生路由环路和黑洞,导致数据包无法正确传递。随着技术的发展,IP网正在越来越多地承载话音、视频等实时业务,这些业务明显不同于以往的电子邮件、网页浏览等业务的服务质量(QoS)要求,可以容忍的业务中断时间通常都希望在1至2秒以内,因此要求IP网络的收敛时间必须加快。实现快速路由收敛的途径主要包括:缩短故障检测时间、调整路由协议默认定时器参数、优化路由计算的算法等,在目前技术水平下,典型网络中的路由协议快速收敛时间已经可以达到1秒左右。When the link state changes, the convergence time of the traditional routing protocol on the IP network is generally more than ten seconds. During the convergence period, routing loops and black holes will occur, resulting in incorrect delivery of data packets. With the development of technology, IP networks are carrying more and more real-time services such as voice and video. These services are obviously different from the previous services such as e-mail and web browsing. The quality of service (QoS) requirements and the tolerable service interruption time It is usually expected to be within 1 to 2 seconds, so the convergence time of the IP network must be accelerated. The ways to achieve fast routing convergence mainly include: shortening the fault detection time, adjusting the default timer parameters of routing protocols, optimizing routing calculation algorithms, etc. Under the current technical level, the fast convergence time of routing protocols in a typical network can reach about 1 second .
路由协议平稳重启(Graceful Restart)的目的是增强IP网的稳定性。当前主流路由器都已经实现控制模块和数据转发模块的分离,当发生软件升级等情况需要重启控制软件时,可以不影响数据的正常转发,但需要相应路由协议的支持,IETF制定了一些相关标准。The purpose of routing protocol graceful restart (Graceful Restart) is to enhance the stability of IP network. The current mainstream routers have realized the separation of the control module and the data forwarding module. When the software upgrade needs to be restarted, the normal forwarding of the data can not be affected, but the support of the corresponding routing protocol is required. IETF has formulated some related standards.
目前,常用的路由协议包括:最短路径优先内部网关协议OSPF(OpenShortest Path First IGP)、中间系统路由协议IS-IS(IntermediateSystem-to-intermediate System)、边界网关协议BGP(Border Gateway Protocol)这些协议分别有各自对应的平稳重启机制,工作原理大体相似。这里以OSPF为例(参见图1)进行说明。Currently, commonly used routing protocols include: OSPF (OpenShortest Path First IGP), IS-IS (Intermediate System-to-intermediate System), and BGP (Border Gateway Protocol). There are corresponding smooth restart mechanisms, and the working principles are generally similar. Here OSPF is taken as an example (see Figure 1) for illustration.
步骤(1)握手阶段:需要平稳重启的路由器X与邻接的路由器Y相互发送握手信息,X告诉Y:它将进入平稳重启进程,以及重启所需的时间长度;Step (1) Handshake phase: Router X that needs to restart smoothly and the adjacent router Y send handshake information to each other, X tells Y: it will enter the process of smooth restart, and the length of time required for restart;
步骤(2)重启阶段:平稳重启路由器X进入控制软件重启阶段,在该阶段,虽然邻接路由器Y收不到重启路由器X的任何OSPF协议包,但仍然认为X是正常的,并正常地向网络中其它路由器刷新重启路由器X的相关状态;Step (2) Restart stage: restart router X smoothly and enter the control software restart stage. In this stage, although adjacent router Y does not receive any OSPF protocol packets to restart router X, it still considers X to be normal and sends information to the network normally. Refresh and restart the relevant status of router X in other routers;
步骤(3)同步阶段:重启路由器X的控制软件恢复正常,向邻接路由器Y请求链路状态信息,Y将自己状态信息库中的所有链路状态公告LSA发送给重启路由器X,以实现该两个路由器X与Y的链路状态信息库相互同步;Step (3) synchronization stage: the control software of restarting router X returns to normal, and requests link state information from adjacent router Y, and Y sends all link state advertisement LSAs in its own state information database to restarting router X, so as to realize the two The link state information databases of routers X and Y are synchronized with each other;
步骤(4)结束阶段:重启路由器X收到邻接路由器Y发来的LSA后,通过对比,发现其与邻接路由器Y的状态信息库实现同步后,告诉邻接路由器Y:平稳重启进程正常结束;Step (4) end stage: restart router X after receiving the LSA sent by adjacent router Y, through comparison, after finding that it is synchronized with the state information database of adjacent router Y, tell adjacent router Y: the smooth restart process ends normally;
步骤(5)正常阶段:重启路由器X与邻接路由器Y已建立邻接关系,分别进入标准OSPF交互状态。Step (5) Normal stage: restart router X and adjacent router Y have established adjacency relationship, respectively enter the standard OSPF interactive state.
以上是正常的平稳重启流程,如果在此期间,全网中的某处链路状态发生了改变,则邻接路由器Y将立刻退出平稳重启的辅助状态,并立即宣告自己与重启路由器X之间的链路不可用,这样做的目的是防止网络产生路由环路和黑洞。The above is the normal smooth restart process. If the link status somewhere in the entire network changes during this period, the adjacent router Y will immediately exit the auxiliary state of smooth restart and immediately announce the link between itself and the restarting router X. The link is unavailable. The purpose of this is to prevent routing loops and black holes in the network.
参见图2,介绍在目前的平稳重启机制中,如果发生链路状态改变时的路由收敛流程:Refer to Figure 2, which introduces the routing convergence process when the link state changes in the current smooth restart mechanism:
首先,路由器B检测到B-C段链路故障后发布链路状态更新,以“洪泛”(Flooding)方式向相应网络区域扩散LSA(B-C fail),通告B-C段链路不可用。所谓“洪泛”是OSPF协议中规定的一种标准化处理进程,用于路由器之间发布和同步链路状态信息库。经过“洪泛”后,除了当时恰好处于重启过程中的重启路由器X以外,网络区域内的所有其它路由器都生成了完全一致的新的链路状态信息库,并利用该新的信息库重新计算路由转发表。由于重启路由器X当时正处于重新启动过程中,其路由进程已停止工作,因此它无法了解B-C段链路发生了故障,即:X仍然保留着以前的链路状态信息库及相应路由转发表。这样一来,网络区域内各路由器就有了不同步的链路状态信息库,由链路状态路由协议的工作原理可知,这种状况将可能产生路由环路和黑洞。First, after router B detects the failure of the link on the B-C segment, it issues a link status update, floods the LSA (B-C fail) to the corresponding network area in a "flooding" manner, and notifies that the link on the B-C segment is unavailable. The so-called "flooding" is a standardized processing process stipulated in the OSPF protocol, which is used for publishing and synchronizing link state information databases between routers. After "flooding", except for the restarting router X that happened to be in the process of restarting at that time, all other routers in the network area have generated a completely consistent new link state information base, and use the new information base to recalculate Routing and forwarding table. Because the restart router X was in the process of restarting at that time, its routing process had stopped working, so it could not understand that the link in the B-C segment had failed, that is: X still retained the previous link state information base and corresponding routing forwarding table. In this way, each router in the network area has an asynchronous link state information database. According to the working principle of the link state routing protocol, this situation may cause routing loops and black holes.
为了避免路由环路和黑洞的产生,标准规定,与重启路由器X连接的邻接路由器Y在接收到LSA(B-C fail)后,除了正常将其“洪泛”外,还需要立即产生LSA(X-Y fail),并将其“洪泛”出去,宣告X-Y段链路不可用。这样其它路由器就可以将重启路由器X的相关链路从链路状态信息库中剔除,从而实现网络内所有活动路由器的同步,避免了环路和黑洞的发生。In order to avoid the generation of routing loops and black holes, the standard stipulates that after receiving the LSA (B-C fail), the adjacent router Y connected to the restarting router X needs to immediately generate an LSA (X-Y fail) in addition to "flooding" it normally. ), and "flood" it out, declaring that the X-Y segment link is unavailable. In this way, other routers can remove the relevant link of restarting router X from the link state information database, thereby realizing the synchronization of all active routers in the network, and avoiding the occurrence of loops and black holes.
但是,这样处理的结果是:网络中其它路由器将在短期内连续收到两处链路状态更新,以重启路由器A为例:由于邻接路由器Y只是在收到LSA(B-Cfail)后才产生LSA(X-Y fail),因此绝大多数情况下,重启路由器A会先收到LSA(B-C fail),在快速路由收敛机制中,该重启路由器A将立即更新链路状态信息库并重新计算路由表;然而,正在进行计算过程中或者刚刚计算结束后,它又将收到LSA((X-Y fail),这样该重启路由器A将需要再一次更新链路状态信息库及重新计算路由表。However, the result of this processing is that other routers in the network will receive two consecutive link state updates in a short period of time. Take the restart of router A as an example: because the adjacent router Y only generates LSA (B-Cfail) after receiving the LSA (B-Cfail). LSA (X-Y fail), so in most cases, restarting router A will receive LSA (B-C fail) first. In the fast route convergence mechanism, restarting router A will immediately update the link state information database and recalculate the routing table However, during the calculation process or just after the calculation, it will receive LSA ((X-Y fail), so that the restart router A will need to update the link state information base and recalculate the routing table again.
如果网络中同时有多个路由器正处于平稳重启过程中,则某处发生单个链路状态改变后,将会导致每个路由器在短期内分别连续收到多个不同的链路状态消息,在快速路由收敛的情况下可能导致这些路由器多次更新其链路状态信息库及重新计算路由表,影响了路由收敛的速度。这个问题引起了业内人士的关注和重视。If multiple routers in the network are in the process of graceful restart at the same time, a single link status change somewhere will cause each router to receive multiple different link status messages consecutively in a short period of time. In the case of route convergence, these routers may update their link state information databases and recalculate routing tables many times, which affects the speed of route convergence. This issue has aroused the attention and attention of people in the industry.
发明内容Contents of the invention
有鉴于此,本发明的目的是提供一种实现路由快速收敛的路由器平稳重启的方法,该方法是对现有路由协议平稳重启过程的扩展和改进,以便能够与路由快速收敛协调工作,解决现有技术中存在的路由器多次更新其链路状态信息库以及重新计算路由表,影响了路由收敛速度的问题。In view of this, the purpose of the present invention is to provide a method for smooth restart of a router that realizes rapid route convergence. In the prior art, routers update their link state information databases and recalculate routing tables multiple times, which affects the routing convergence speed.
为了达到上述目的,本发明提供了一种实现路由快速收敛的路由器平稳重启的方法,包括下列操作阶段:握手、重启、同步、结束和恢复正常;其特征在于:在所述流程的重启之前,增加一个通告阶段:路由器在平稳重启前,先产生一个向本区域内“洪泛”的不透明的链路状态公告Opaque LSA,向网络中其它路由器通告:自己准备进行平稳重启;远端路由器收到该Opaque LSA后,将该平稳重启路由器的相关链路标记为“潜在不可用”,但是不执行其它相关操作;直到远端路由器收到该平稳重启路由器已经结束平稳重启状态的OpaqueLSA后,才删除有关该平稳重启路由器链路的“潜在不可用”的状态标记;但是,如果在所述平稳重启过程中的重启、同步、或结束阶段中,网络发生链路状态变化时,远端路由器将立即把该平稳重启路由器的相关链路从“潜在不可用”状态标记为“不可用”状态,并更新链路状态信息库和重新计算路由,而不必等待接收到由该平稳重启路由器的邻接路由器发出的相关链路不可用的LSA后才作相应处理。In order to achieve the above-mentioned purpose, the present invention provides a kind of method that realizes the router steady restart of routing fast convergence, comprises following operation stages: handshake, restart, synchronization, end and return to normal; It is characterized in that: before the restart of described process, Add an announcement stage: before the router restarts smoothly, it first generates an opaque link state announcement Opaque LSA that is "flooded" in the area, and notifies other routers in the network that it is ready to restart smoothly; the remote router receives After the Opaque LSA, the relevant link of the graceful restart router is marked as "potentially unavailable", but no other related operations are performed; the remote router will not delete until the remote router receives the OpaqueLSA that the smooth restart router has ended the graceful restart state A "potentially unavailable" state flag for the graceful restart router link; however, if a link state change occurs on the network during the restart, synchronization, or end phases of the graceful restart process, the remote router Mark the relevant links of the graceful restart router from the "potentially unavailable" state to the "unavailable" state, and update the link state information base and recalculate the route without waiting for the received Corresponding processing will be performed only after the relevant link is unavailable LSA.
所述平稳重启路由器向本区域内“洪泛”的是不透明的链路状态公告Opaque LSA,所述Opaque LSA是IETF标准化组织在标准RFC2370中定义的一种可选LSA类型,用于携带最短路径优先内部网关协议OSPF可选功能所需要的相关信息;由于可选功能所需要的相关信息只在Opaque LSA中携带,因此避免了对标准LSA的改动,以最大程度地保障标准OSPF路由协议功能的稳定性。The "flooding" of the smooth restart router into the local area is an opaque link state advertisement Opaque LSA. The Opaque LSA is an optional LSA type defined by the IETF standardization organization in the standard RFC2370, and is used to carry the shortest path Prioritize the relevant information required by the optional functions of the interior gateway protocol OSPF; since the relevant information required by the optional functions is only carried in the Opaque LSA, changes to the standard LSA are avoided to maximize the protection of the standard OSPF routing protocol functions stability.
所述不透明的链路状态公告Opaque LSA的链路状态类型值为10,其有效“洪泛”范围为本网络区域(Area-Local),即该Opaque LSA信息只在本网络区域的路由器之间传递和扩散,而不传递到其它的网络区域中去。The link state type value of the opaque link state announcement Opaque LSA is 10, and its effective "flooding" range is the network area (Area-Local), that is, the Opaque LSA information is only between routers in the network area Delivery and diffusion, but not delivery to other network areas.
包括下列具体操作步骤:Including the following specific steps:
(1)握手阶段:各个路由器相互之间发送握手信息,其中平稳重启路由器告诉邻接路由器:它将要进入平稳重启进程;(1) Handshake phase: each router sends handshake information to each other, and the smooth restart router tells the adjacent router that it will enter the smooth restart process;
(2)通告阶段:该平稳重启路由器产生一个向本区域内“洪泛”的不透明的链路状态公告Opaque LSA,向网络中的其它路由器通告:自己准备进行平稳重启;远端路由器收到该不透明的链路状态公告Opaque LSA后,将该平稳重启路由器的相关链路标记为“潜在不可用”,但是不执行更新链路状态信息库和重新计算路由的后续操作;(2) Announcement stage: the smooth restart router generates an opaque link state announcement Opaque LSA that "floods" in the local area, and notifies other routers in the network that it is ready to perform a smooth restart; the remote router receives the opaque LSA After the opaque link state advertises Opaque LSA, the relevant link of the smooth restart router is marked as "potentially unavailable", but the follow-up operations of updating the link state information base and recalculating the route are not performed;
(3)重启阶段:该平稳重启路由器进入控制软件重启阶段,在此阶段,邻接路由器没有收到该平稳重启路由器的最短路径优先内部网关协议OSPF的任何数据包,但仍然认为该平稳重启路由器工作正常,并向网络中其它路由器正常刷新该平稳重启路由器的相关的标准链路状态公告Standard LSA信息,以便与标准平稳重启协议实现兼容;(3) Restart stage: The graceful restart router enters the control software restart stage. At this stage, the adjacent routers have not received any data packets of the smooth restart router’s Shortest Path First Interior Gateway Protocol OSPF, but still think that the smooth restart router is working Normal, and refresh the relevant standard link state announcement Standard LSA information of the graceful restart router to other routers in the network normally, so as to be compatible with the standard graceful restart protocol;
(4)同步阶段:该平稳重启路由器的控制软件恢复正常,向邻接路由器请求链路状态信息,邻接路由器将自己状态信息库中的所有标准链路状态公告Standard LSA信息都发送给该平稳重启路由器,以实现该平稳重启路由器与邻接路由器的链路状态信息库同步;(4) Synchronization phase: the control software of this smooth restart router returns to normal, and requests link state information from adjacent routers, and adjacent routers send all standard link state announcement Standard LSA information in their own state information databases to this smooth restart router , to realize the synchronization of the link state information base of the smooth restart router and the adjacent router;
(5)结束阶段:该平稳重启路由器收到邻接路由器发来的LSA后,通过对比,发现其与邻接路由器的状态信息库已经实现同步后,告诉邻接路由器:平稳重启状态已经正常结束;同时,该平稳重启路由器还将产生一个向本区域内“洪泛”的不透明的链路状态公告Opaque LSA,向网络中其它路由器通告:自己已经结束平稳重启状态;远端路由器收到该Opaque LSA后删除以前建立的有关该平稳重启路由器链路的“潜在不可用”的状态标记;(5) End stage: after the smooth restart router receives the LSA sent by the adjacent router, after comparing and finding that it has synchronized with the state information database of the adjacent router, it tells the adjacent router that the smooth restart state has ended normally; at the same time, The smooth restart router will also generate an opaque link state announcement Opaque LSA that "floods" in the area, and announces to other routers in the network that it has ended the smooth restart state; the remote router deletes the Opaque LSA after receiving the Opaque LSA A previously established status flag of "potentially unavailable" for the graceful restart router link;
(6)恢复正常阶段:该平稳重启路由器与邻接路由器建立邻接关系,远端路由器也获悉该平稳重启路由器已经恢复正常,全网进入标准的最短路径优先内部网关协议OSPF交互状态。(6) Return to normal stage: the smooth restart router establishes an adjacency relationship with the adjacent router, and the remote router also learns that the smooth restart router has returned to normal, and the whole network enters the standard shortest path first interior gateway protocol OSPF interactive state.
所述步骤(3)重启阶段、(4)同步阶段、或(5)结束阶段都进一步包括下列操作:如果网络中发生了链路状态改变,远端路由器将立即把该平稳重启路由器的相关链路状态从“潜在不可用”转变为“不可用”,并更新链路状态信息库以及重新计算路由。The step (3) restart stage, (4) synchronization stage, or (5) end stage further includes the following operations: if a link state change occurs in the network, the remote router will immediately reset the relevant link of the smooth restart router. The link state changes from "potentially unavailable" to "unavailable", and the link state information base is updated and the route is recalculated.
所述方法允许使用只能识别和处理标准OSPF路由协议的传统路由器与支持所述Opaque LSA的路由器混合组网。The method allows traditional routers that can only recognize and process standard OSPF routing protocols and routers that support the Opaque LSA to form a hybrid network.
由于传统路由器不支持本发明,可能存在多次路由计算;以及其在收到Opaque LSA后做丢弃处理,可能影响平稳重启路由器产生的Opaque LSA的“洪泛”,使该Opaque LSA无法“洪泛”到本区域内所有的路由器,导致某些支持所述Opaque LSA路由器也因收不到相应的Opaque LSA而无法启动相应扩展机制,其行为等同于传统路由器;因此,所述方法建议所有路由器都能够支持Opaque LSA。Because the traditional router does not support the present invention, there may be multiple routing calculations; and it discards after receiving the Opaque LSA, which may affect the "flooding" of the Opaque LSA that restarts the router smoothly, so that the Opaque LSA cannot be "flooded". ” to all routers in this area, causing some routers that support the Opaque LSA to fail to start the corresponding extension mechanism because they cannot receive the corresponding Opaque LSA, and their behavior is equivalent to that of traditional routers; therefore, the method suggested that all routers Can support Opaque LSA.
所述方法适用于最短路径优先内部网关协议OSPF、中间系统路由协议IS-IS、或边界网关协议BGP。The method is applicable to the Shortest Path First Interior Gateway Protocol OSPF, Intermediate System Routing Protocol IS-IS, or Border Gateway Protocol BGP.
本发明是一种实现路由快速收敛的路由器平稳重启的方法,该方法是在现有的平稳重启机制基础上进行扩展而实现的,相对于现有的平稳重启机制,本发明的特点是使得远端路由器也知道网络中某台路由器正在进行平稳重启,并将该平稳重启路由器的相关链路都标记为“潜在不可用”,但是不执行其它相关操作;如果在这个期间网络某处发生故障,各个路由器立即将正在重启的路由器的所有相关链路都标记为“不可用”,并更新链路状态信息库和重新计算路由,而不必等待接收到由该平稳重启路由器的邻接路由器发出的相关链路不可用的LSA后才作相应处理;从而只需一次路由状态更新和重新计算,加快了网络的收敛速度。The present invention is a method for smooth restart of routers that realizes rapid route convergence. The method is realized by extending the existing smooth restart mechanism. The end router also knows that a certain router in the network is in the process of restarting smoothly, and marks the relevant links of the restarting router as "potentially unavailable", but does not perform other related operations; if a failure occurs somewhere in the network during this period, Each router immediately marks all relevant links of the restarting router as unavailable, updates the link state information base, and recomputes routes without waiting to receive relevant link information from the gracefully restarting router's neighbor routers. Corresponding processing is performed only after the unavailable LSA is routed; thus, only one routing status update and recalculation is required, which speeds up the convergence speed of the network.
本发明优点是:避免了平稳重启对路由协议快速收敛的不利影响,提高了路由收敛速度;通过以Opaque LSA公告平稳重启状态的起始和结束,避免了对已经稳定的标准路由协议的修改,且对现有协议和重启方法只作很小的扩展,简单易行,不会对路由器设备的处理能力造成不利影响;后向兼容性好,本发明既不会影响原有机制的正常工作,也不要求同一网络中的所有路由器都必须支持本发明的扩展方法,允许支持本发明的路由器和传统路由器可以混合组网,便于推广应用;再者,本发明方法不仅适用于OSPF路由协议,同样适用于IS-IS和BGP路由协议。因此,本发明是对现有平稳重启方法的扩展和改进,能够与快速路由收敛机制很好地协调工作,具有很好的应用前景。The present invention has the advantages of: avoiding the adverse effect of smooth restart on the rapid convergence of routing protocols, and improving the routing convergence speed; by using the Opaque LSA to announce the start and end of the smooth restart state, avoiding the modification of the stable standard routing protocol, And the existing protocol and restart method are only slightly expanded, simple and easy to implement, and will not adversely affect the processing capability of the router device; the backward compatibility is good, and the present invention will neither affect the normal operation of the original mechanism, Also do not require that all routers in the same network must support the extension method of the present invention, allowing the routers and traditional routers supporting the present invention to be mixed and networked, which is convenient for popularization and application; moreover, the method of the present invention is not only applicable to the OSPF routing protocol, but also Applies to IS-IS and BGP routing protocols. Therefore, the present invention is an extension and improvement of the existing smooth restart method, can coordinate well with the fast route convergence mechanism, and has a good application prospect.
附图说明Description of drawings
图1是现有的平稳重启方法的示意图。FIG. 1 is a schematic diagram of an existing graceful restart method.
图2是在平稳重启期间的各个路由器之间的链路状态变化示意图。Fig. 2 is a schematic diagram of link state changes between routers during a graceful restart.
图3是本发明实现路由快速收敛的路由器平稳重启的方法流程方框图。Fig. 3 is a flow block diagram of a method for realizing smooth restart of routers for fast route convergence in the present invention.
图4是本发明实现路由快速收敛的路由器平稳重启流程的实施例示意图。FIG. 4 is a schematic diagram of an embodiment of a router smooth restart process for realizing rapid route convergence in the present invention.
具体实施方式Detailed ways
为使本发明的目的、技术方案和优点更加清楚,下面结合附图对本发明作进一步的详细描述。In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings.
目前的平稳重启机制中,只有与重启路由器相邻的路由器才知道该路由器正在进行平稳重启,而远端路由器并不知道这一信息,认为那台路由器仍然在正常工作。这样,当平稳重启期间网络某处发生了链路状态改变时,为了避免链路状态信息库不同步而可能造成的路由环路和黑洞,正在平稳重启的路由器的邻居必需将其宣告为不可用,对于远端路由器而言,这又发生了一次链路状态改变,即两个不同的链路状态改变消息(LSA)先后到达远端路由器。在网络启用快速路由收敛的情况下,由于路由器收到LSA后将立即开始重新计算路由(大约只有1毫秒延时),造成在平稳重启期间网络某处发生了链路状态改变时,将导致多次路由状态的更新和重新计算,影响了网络的收敛速度。In the current graceful restart mechanism, only the routers adjacent to the restarting router know that the router is performing a smooth restart, while the remote router does not know this information and thinks that router is still working normally. Thus, when a link state change occurs somewhere in the network during graceful restart, neighbors of the router that is gracefully restarting must declare it unavailable, in order to avoid possible routing loops and black holes caused by out-of-sync link state information bases , for the remote router, another link state change occurs, that is, two different link state change messages (LSAs) arrive at the remote router successively. When the fast route convergence is enabled on the network, since the router will start recalculating the route immediately after receiving the LSA (only about 1 millisecond delay), when a link state change somewhere in the network during the smooth restart period, it will cause multiple failures. The update and recalculation of the secondary routing status affect the convergence speed of the network.
为了减少上述情况下的计算次数,通常采用两种方法:In order to reduce the number of calculations in the above cases, two methods are generally adopted:
(1)设定路由重新计算的等待时间:当路由器收到某个LSA后,并不马上开始路由计算,而是等待一段时间后才开始计算,如果这个等待时间设置合理,路由器就会在等待时间结束前收到所有相关LSA,从而实现只进行一次路由计算。但是,该方法缺点明显:影响网络的收敛速度,与快速收敛的目的相违背。(1) Set the waiting time for route recalculation: When the router receives an LSA, it does not start routing calculation immediately, but waits for a period of time before starting calculation. If the waiting time is set reasonably, the router will wait All relevant LSAs are received before the time expires, so that only one route calculation is performed. However, this method has obvious disadvantages: it affects the convergence speed of the network, which is contrary to the purpose of fast convergence.
(2)通过扩展平稳重启机制,使远端路由器也知道网络某处正在发生平稳重启,并将那台重启路由器的所有链路标记为“潜在不可用”,暂时认为这些链路依然正常工作,但当网络某处发生故障时(收到相关LSA),立即将这些链路状态转为“不可用”,并立即进行下一步的链路状态信息库更新和路由重计算操作,实现快速路由收敛。该方法的优点是不会降低快速收敛的速度。(2) By extending the smooth restart mechanism, the remote router also knows that a smooth restart is taking place somewhere in the network, and marks all the links of the restarted router as "potentially unavailable", temporarily considering that these links are still working normally, However, when a fault occurs somewhere in the network (related LSA is received), the state of these links will be changed to "unavailable" immediately, and the next link state information database update and route recalculation operation will be performed immediately to achieve fast route convergence . The advantage of this approach is that it does not slow down fast convergence.
本发明采用上述第2种方法,它是在现有传统的平稳重启流程中增加一个“通告阶段”,并对其它阶段的操作进行相应的扩展处理。The present invention adopts the above-mentioned second method, which adds a "notification stage" to the existing traditional smooth restart process, and performs corresponding expansion processing on the operations of other stages.
参见图3和图4,具体介绍本发明的各个操作步骤:Referring to Fig. 3 and Fig. 4, each operation step of the present invention is specifically introduced:
(1)握手阶段:各个路由器相互之间发送握手信息,其中平稳重启路由器X告诉邻接路由器Y:它将要进入平稳重启进程;(1) Handshake phase: each router sends handshake information to each other, wherein the smooth restart router X tells the adjacent router Y: it will enter the smooth restart process;
(2)通告阶段:该平稳重启路由器X产生一个向本区域内“洪泛”的不透明的链路状态公告Opaque LSA,向网络中的其它路由器通告:自己准备进行平稳重启;远端路由器Z收到该不透明的链路状态公告Opaque LSA后,将该平稳重启路由器X的相关链路标记为“潜在不可用”,但是不执行更新链路状态信息库和重新计算路由的后续操作;(2) Announcement phase: The smooth restart router X generates an opaque link state announcement Opaque LSA that "floods" to the local area, and notifies other routers in the network that it is ready to perform a smooth restart; the remote router Z receives After receiving the opaque link state advertisement Opaque LSA, mark the relevant link of the smooth restart router X as "potentially unavailable", but do not perform the follow-up operations of updating the link state information base and recalculating the route;
(3)重启阶段:该平稳重启路由器X进入控制软件重启阶段,在此阶段,邻接路由器Y没有收到该平稳重启路由器X的最短路径优先内部网关协议OSPF的任何数据包,但仍然认为该平稳重启路由器X工作正常,并向网络中其它路由器正常刷新该平稳重启路由器X的相关的标准链路状态公告StandardLSA信息,以便实现后向兼容;(3) Restart stage: the gracefully restarting router X enters the control software restart stage, at this stage, the adjacent router Y does not receive any data packets of the shortest path first interior gateway protocol OSPF of the gracefully restarting router X, but still considers that the smooth restarting router X Restarting router X works normally, and refreshes the relevant standard link state advertisement StandardLSA information of this smoothly restarting router X to other routers in the network, so as to achieve backward compatibility;
(4)同步阶段:该平稳重启路由器X的控制软件恢复正常,向邻接路由器Y请求链路状态信息,邻接路由器Y将自己状态信息库中的所有标准链路状态公告Standard LSA信息都发送给该平稳重启路由器X,以实现该平稳重启路由器X与邻接路由器Y的链路状态信息库同步;(4) Synchronization stage: the control software of this smooth restart router X returns to normal, and requests link state information from adjacent router Y, and adjacent router Y sends all standard link state advertisement Standard LSA information in its own state information database to this Restart router X smoothly, so as to realize the link state information base synchronization between router X and adjacent router Y;
(5)结束阶段:该平稳重启路由器X收到邻接路由器Y发来的LSA后,通过对比,发现其与邻接路由器Y的状态信息库已经实现同步后,告诉邻接路由器Y:平稳重启状态已经正常结束;同时,该平稳重启路由器X还将产生一个向本区域内“洪泛”的Opaque LSA,向网络中其它路由器通告:自己已经结束平稳重启状态;远端路由器Z收到该Opaque LSA后,删除以前建立的有关该平稳重启路由器X相关链路的“潜在不可用”的状态标记;(5) End stage: After receiving the LSA sent by the adjacent router Y, the smooth restart router X finds that the state information database of the adjacent router Y has been synchronized through comparison, and then tells the adjacent router Y that the smooth restart state is normal At the same time, the smooth restart router X will also generate an Opaque LSA that "floods" to the local area, and announces to other routers in the network that it has ended the smooth restart state; after the remote router Z receives the Opaque LSA, Delete the previously established "potentially unavailable" status flag for the link associated with this graceful restart router X;
(6)恢复正常阶段:该平稳重启路由器X与邻接路由器Y建立邻接关系,远端路由器Z也获悉该平稳重启路由器X已经恢复正常,全网进入标准的最短路径优先内部网关协议OSPF交互状态。(6) Restoration to normal stage: the smooth restart router X establishes an adjacency relationship with the adjacent router Y, and the remote router Z also learns that the smooth restart router X has returned to normal, and the whole network enters the standard shortest path first interior gateway protocol OSPF interactive state.
上述步骤(3)重启阶段、(4)同步阶段、或(5)结束阶段中都进一步包括下列操作:如果网络中发生了链路状态改变,远端路由器Z将立即把该平稳重启路由器X的相关链路状态从“潜在不可用”转变为“不可用”,并更新链路状态信息库以及重新计算路由。The above steps (3) restart stage, (4) synchronization stage, or (5) end stage all further include the following operations: if a link state change occurs in the network, the remote router Z will immediately reset the smooth restart router X The state of the associated link changes from "potentially unavailable" to "unavailable", and the link state information base is updated and the route is recalculated.
本发明已经进行了实施试验,所采取的方法主要是在路由器的协议处理软件中增加本发明的平稳重启扩展处理的子模块,试验的结果是成功的,实现了发明目的。The present invention has been implemented and tested, and the method adopted is mainly to add the sub-module of the present invention's smooth restart expansion processing in the protocol processing software of the router. The result of the test is successful, and the purpose of the invention is realized.
Claims (8)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNB2005100909481A CN100442767C (en) | 2005-08-22 | 2005-08-22 | A Method of Smooth Restart of Router Realizing Fast Routing Convergence |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNB2005100909481A CN100442767C (en) | 2005-08-22 | 2005-08-22 | A Method of Smooth Restart of Router Realizing Fast Routing Convergence |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN1744567A true CN1744567A (en) | 2006-03-08 |
| CN100442767C CN100442767C (en) | 2008-12-10 |
Family
ID=36139768
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CNB2005100909481A Expired - Lifetime CN100442767C (en) | 2005-08-22 | 2005-08-22 | A Method of Smooth Restart of Router Realizing Fast Routing Convergence |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN100442767C (en) |
Cited By (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2009082943A1 (en) * | 2007-12-21 | 2009-07-09 | Huawei Technologies Co., Ltd. | Intelligent database exchange for ospf |
| CN100558057C (en) * | 2007-07-20 | 2009-11-04 | 华为技术有限公司 | A processing method and device for a bidirectional forwarding detection session |
| WO2010099749A1 (en) * | 2009-03-04 | 2010-09-10 | 华为技术有限公司 | Method and apparatus for transferring virtual routing and forwarding (vrf), and vrf distributed communication system |
| CN101056270B (en) * | 2007-05-18 | 2010-10-06 | 华为技术有限公司 | Method and routing device for routing convergence |
| CN101984596A (en) * | 2010-11-04 | 2011-03-09 | 北京星网锐捷网络技术有限公司 | Method, device and routing equipment for avoiding black holes |
| CN101345698B (en) * | 2007-07-10 | 2011-05-25 | 中兴通讯股份有限公司 | Synchronization method for grace reboot |
| CN102571425A (en) * | 2011-12-28 | 2012-07-11 | 杭州华三通信技术有限公司 | Method and device for smoothly restarting border gateway protocol |
| WO2016177136A1 (en) * | 2015-07-31 | 2016-11-10 | 中兴通讯股份有限公司 | Method and device for implementing link state advertisement processing |
| CN107819679A (en) * | 2016-09-13 | 2018-03-20 | 南京中兴软件有限责任公司 | The method, apparatus and system of traffic messages forwarding |
| CN107925615A (en) * | 2015-05-13 | 2018-04-17 | 布尔有限公司 | The device network interconnected by the interchanger comprising routing table |
| CN115296981A (en) * | 2022-07-12 | 2022-11-04 | 珠海格力电器股份有限公司 | Router restarting method and device, electronic equipment and storage medium |
Family Cites Families (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2003209568A (en) * | 2002-01-15 | 2003-07-25 | Nippon Telegr & Teleph Corp <Ntt> | Node, packet communication network, packet communication method, program, and recording medium |
| US7209449B2 (en) * | 2002-03-27 | 2007-04-24 | Intel Corporation | Systems and methods for updating routing and forwarding information |
| KR100449488B1 (en) * | 2002-05-21 | 2004-09-22 | 한국전자통신연구원 | Network for transferring active packet and method for employing the same |
| CN1328889C (en) * | 2002-06-06 | 2007-07-25 | 中兴通讯股份有限公司 | Routing method based on link status |
| CN1214572C (en) * | 2002-07-16 | 2005-08-10 | 华为技术有限公司 | Routing calculation method based on opened shortest route priority routing protocol |
| ATE456224T1 (en) * | 2003-12-17 | 2010-02-15 | Nec Corp | NETWORK, ROUTER SETUP, ROUTE UPDATE SUPPRESSION METHOD THEREOF AND PROGRAM THEREOF |
-
2005
- 2005-08-22 CN CNB2005100909481A patent/CN100442767C/en not_active Expired - Lifetime
Cited By (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101056270B (en) * | 2007-05-18 | 2010-10-06 | 华为技术有限公司 | Method and routing device for routing convergence |
| US9461908B2 (en) | 2007-05-18 | 2016-10-04 | Huawei Technologies Co., Ltd. | Method of route convergence, routing device, and main control board in routing device |
| CN101345698B (en) * | 2007-07-10 | 2011-05-25 | 中兴通讯股份有限公司 | Synchronization method for grace reboot |
| CN100558057C (en) * | 2007-07-20 | 2009-11-04 | 华为技术有限公司 | A processing method and device for a bidirectional forwarding detection session |
| WO2009082943A1 (en) * | 2007-12-21 | 2009-07-09 | Huawei Technologies Co., Ltd. | Intelligent database exchange for ospf |
| US8045545B2 (en) | 2007-12-21 | 2011-10-25 | Futurewei Technologies, Inc. | Intelligent database exchange for OSPF |
| CN101663865B (en) * | 2007-12-21 | 2012-10-03 | 华为技术有限公司 | Intelligent database exchange for ospf |
| WO2010099749A1 (en) * | 2009-03-04 | 2010-09-10 | 华为技术有限公司 | Method and apparatus for transferring virtual routing and forwarding (vrf), and vrf distributed communication system |
| CN101984596A (en) * | 2010-11-04 | 2011-03-09 | 北京星网锐捷网络技术有限公司 | Method, device and routing equipment for avoiding black holes |
| US9225590B2 (en) | 2011-12-28 | 2015-12-29 | Hangzhou H3C Technologies Co., Ltd. | Graceful restart (GR) methods and devices |
| CN102571425B (en) * | 2011-12-28 | 2014-09-17 | 杭州华三通信技术有限公司 | Method and device for smoothly restarting border gateway protocol |
| CN102571425A (en) * | 2011-12-28 | 2012-07-11 | 杭州华三通信技术有限公司 | Method and device for smoothly restarting border gateway protocol |
| CN107925615A (en) * | 2015-05-13 | 2018-04-17 | 布尔有限公司 | The device network interconnected by the interchanger comprising routing table |
| WO2016177136A1 (en) * | 2015-07-31 | 2016-11-10 | 中兴通讯股份有限公司 | Method and device for implementing link state advertisement processing |
| CN106411729A (en) * | 2015-07-31 | 2017-02-15 | 中兴通讯股份有限公司 | Method and apparatus for processing link-state advertisement |
| CN106411729B (en) * | 2015-07-31 | 2020-04-28 | 中兴通讯股份有限公司 | Method and device for realizing link state notification processing |
| CN107819679A (en) * | 2016-09-13 | 2018-03-20 | 南京中兴软件有限责任公司 | The method, apparatus and system of traffic messages forwarding |
| CN107819679B (en) * | 2016-09-13 | 2021-04-20 | 中兴通讯股份有限公司 | Method, device and system for forwarding traffic message |
| CN115296981A (en) * | 2022-07-12 | 2022-11-04 | 珠海格力电器股份有限公司 | Router restarting method and device, electronic equipment and storage medium |
| CN115296981B (en) * | 2022-07-12 | 2023-09-26 | 珠海格力电器股份有限公司 | Router restarting method and device, electronic equipment and storage medium |
Also Published As
| Publication number | Publication date |
|---|---|
| CN100442767C (en) | 2008-12-10 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US7362700B2 (en) | Methods and systems for hitless restart of layer 3 packet forwarding | |
| US7532631B2 (en) | Method and apparatus for accelerating border gateway protocol convergence | |
| US7269133B2 (en) | IS-IS high availability design | |
| US9705782B2 (en) | Method and system for efficient graceful restart in an open shortest path first (OSPF) network | |
| US8254396B2 (en) | Fast border gateway protocol synchronization | |
| CN104917676B (en) | router and method for protocol process migration | |
| US8902766B2 (en) | Method and apparatus to improve LDP convergence using hierarchical label stacking | |
| CN102142970B (en) | Method for constructing multicast tree in two-layer Ethernet and network node equipment | |
| CN105978804B (en) | Method for routing and route device | |
| US20150043327A1 (en) | Method and apparatus for hitless failover in networking systems using single database | |
| US20140149819A1 (en) | Method and apparatus for protocol data unit recovery in an is-is system | |
| CN104205748A (en) | Inter-chassis redundancy with coordinated traffic direction | |
| EP2571208A1 (en) | Open shortest path first (OSPF) nonstop routing (NSR) with link derivation | |
| CN101471940A (en) | Method, system and device for synchronizing border gateway protocol route | |
| CN1744567A (en) | A Method of Smooth Restart of Router Realizing Fast Routing Convergence | |
| EP2575306B1 (en) | Ospf nonstop routing synchronization nack | |
| US7940649B2 (en) | Techniques for graceful restart in a multi-process operating system | |
| WO2014090083A1 (en) | Service bearing method and apparatus during distributed resilient network interconnect | |
| EP2638668A1 (en) | Prioritization of routing information updates | |
| CN102325080B (en) | Open shortest path first (OSPF) protocol graceful restarting method and device | |
| EP3157211B1 (en) | Isis-based flooding method and device | |
| US8958430B2 (en) | OSPF non-stop routing frozen standby | |
| EP2575304B1 (en) | OSPF nonstop routing (NSR) synchronization reduction | |
| WO2007036103A1 (en) | A method for recovering the service-forwarding route and the system thereof | |
| WO2019001487A1 (en) | Path data deletion method, and message forwarding method and apparatus |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| ASS | Succession or assignment of patent right |
Owner name: CHINA TELECOMMUNICATION STOCK CO., LTD. Free format text: FORMER OWNER: CHINA TELECOMMUNICATION STOCK CO., LTD. GUANGDONG ACADEME Effective date: 20091030 |
|
| C41 | Transfer of patent application or patent right or utility model | ||
| C56 | Change in the name or address of the patentee |
Owner name: CHINA TELECOMMUNICATION STOCK CO., LTD. GUANGDONG Free format text: FORMER NAME: GUANGDONG PROVINCE TELECOMMUNICATION CO., LTD. RESEARCH INSTITUTE |
|
| CP03 | Change of name, title or address |
Address after: Building 109, No. 20, Zhongshan Avenue, Tianhe District, Guangdong, Guangzhou Patentee after: GUANGDONG RESEARCH INSTITUTE, CHINA TELECOM Co.,Ltd. Address before: No. 109, Zhongshan Avenue, Tianhe District, Guangdong, Guangzhou Patentee before: Guangdong Telecommunication Co.,Ltd. Institude |
|
| TR01 | Transfer of patent right |
Effective date of registration: 20091030 Address after: No. 31, Finance Street, Beijing, Xicheng District Patentee after: CHINA TELECOM Corp.,Ltd. Address before: Building 109, No. 20, Zhongshan Avenue, Tianhe District, Guangdong, Guangzhou Patentee before: GUANGDONG RESEARCH INSTITUTE, CHINA TELECOM Co.,Ltd. |
|
| EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20060308 Assignee: Tianyiyun Technology Co.,Ltd. Assignor: CHINA TELECOM Corp.,Ltd. Contract record no.: X2024110000040 Denomination of invention: A method for achieving fast routing convergence and smooth router restart Granted publication date: 20081210 License type: Common License Record date: 20240914 |
|
| EE01 | Entry into force of recordation of patent licensing contract | ||
| CX01 | Expiry of patent term |
Granted publication date: 20081210 |
|
| CX01 | Expiry of patent term |