CN107623601A - A privatized cloud platform alarm scheme - Google Patents
A privatized cloud platform alarm scheme Download PDFInfo
- Publication number
- CN107623601A CN107623601A CN201710915960.4A CN201710915960A CN107623601A CN 107623601 A CN107623601 A CN 107623601A CN 201710915960 A CN201710915960 A CN 201710915960A CN 107623601 A CN107623601 A CN 107623601A
- Authority
- CN
- China
- Prior art keywords
- data
- cloud platform
- alarm
- privatized
- different
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Alarm Systems (AREA)
Abstract
Description
技术领域technical field
本发明公开一种告警方案,涉及云平台预警技术领域,具体的说是一种私有化云平台告警方案。The invention discloses an alarm scheme, relates to the technical field of cloud platform early warning, and specifically relates to a privatized cloud platform alarm scheme.
背景技术Background technique
在云计算时代,云平台的监控技术越来越成熟,性能指标数据的采集,日志的采集技术也越来越完善,但现有的系统、方案基本都是针对单一的性能数据,或者日志信息进行告警,且通知也比较单一,还没有完善的针对不同程度的告警采取不同的通知方式的方案。In the era of cloud computing, the monitoring technology of the cloud platform is becoming more and more mature, and the collection of performance index data and log collection technology is also becoming more and more perfect. However, the existing systems and solutions are basically aimed at a single performance data or log information. Alarms are issued, and the notification is relatively simple, and there is no perfect solution to adopt different notification methods for different levels of alarms.
基于上述问题,本发明提出了一种私有化云平台告警方案,包括监控和告警方面,可以首先监控部分主要针对私有化云平台所管理的服务、组件、云主机等资源,采集这些资源的关键性能数据,以及关键日志,并将这些数据存储起来;其次告警部分,基于监控的性能数据,或者关键日志数据,建立告警机制,并且当满足告警条件时,可以定制不同的告警通知方式:如系统事件、系统通知、短信通知、邮件通知等。Based on the above problems, the present invention proposes a privatized cloud platform alarm scheme, including monitoring and alarming aspects. The monitoring part can firstly focus on resources such as services, components, and cloud hosts managed by the privatized cloud platform, and collect the keys of these resources. Performance data, as well as key logs, and store these data; secondly, in the alarm part, based on the monitored performance data or key log data, an alarm mechanism is established, and when the alarm conditions are met, different alarm notification methods can be customized: such as system Events, system notifications, SMS notifications, email notifications, etc.
发明内容Contents of the invention
本发明针对目前技术发展的需求和不足之处,提供一种私有化云平台告警方案。The present invention provides a privatized cloud platform alarm scheme aiming at the needs and deficiencies of the current technological development.
一种私有化云平台告警方案,收集私有化云平台内资源的信息数据,并进行数据的格式化,将获取的格式化数据进行验证,并根据数据的类型,分别进行解析,存储,创建数据索引,A privatized cloud platform alarm scheme, which collects information data of resources in the privatized cloud platform, formats the data, verifies the acquired formatted data, and analyzes, stores, and creates data according to the type of data index,
通过收集的信息数据根据一定规则建立对应等级的告警机制,并对不同等级的告警机制进行循环告警评估,同时对告警机制按照数据索引进行评估检测;Based on the collected information and data, the alarm mechanism of the corresponding level is established according to certain rules, and the alarm mechanism of different levels is cyclically evaluated, and the alarm mechanism is evaluated and detected according to the data index;
当检测私有化云平台内资源的信息数据一旦满足告警机制的触发条件,则根据不同告警机制选择不同的通知方式对相关人员通知告警。Once the information data of resources in the privatized cloud platform is detected to meet the triggering conditions of the alarm mechanism, different notification methods are selected according to different alarm mechanisms to notify relevant personnel of the alarm.
所述的方案收集私有化云平台内资源的性能监控信息数据以及关键日志信息数据,进行数据的格式化后,采用消息中间件作为数据传输通道进行数据传输。The scheme collects the performance monitoring information data and key log information data of the resources in the privatized cloud platform, and after the data is formatted, the message middleware is used as the data transmission channel for data transmission.
所述的方案利用agent端收集私有化云平台内资源的性能监控信息数据以及关键日志信息数据,进行数据的格式化后,采用消息中间件作为传输通道进行数据传输,通过server端从传输通道中获取数据,进行验证,并根据数据的类型,分别进行解析,存储,创建数据索引。The solution uses the agent to collect the performance monitoring information data and key log information data of the resources in the privatized cloud platform. After the data is formatted, the message middleware is used as the transmission channel for data transmission, and the data is transmitted from the transmission channel through the server. Acquire data, verify it, and analyze, store, and create data indexes according to the type of data.
所述的方案,根据私有化云平台的信息数据对私有化云平台的影响严重程度不同,建立不同紧急等级的告警机制,针对不同级别的告警机制设定不同的通知方式。According to the scheme, according to the severity of the impact of the information data of the privatized cloud platform on the privatized cloud platform, alarm mechanisms with different emergency levels are established, and different notification methods are set for different levels of alarm mechanisms.
所述的方案,对不同级别的告警机制,进行循环告警评估的周期不同。According to the scheme, the cycles of cyclic alarm evaluation are different for different levels of alarm mechanisms.
所述的方案,告警机制的触发条件有设定的阈值条件、关键字条件或综合两种因素以上的条件。In the scheme, the triggering condition of the alarm mechanism includes a set threshold condition, a keyword condition or a combination of two or more factors.
所述的方案,通知方式包括特殊系统事件通知、系统通知、邮件通知、短信通知。In the solution, the notification methods include special system event notification, system notification, email notification, and short message notification.
本发明与现有技术相比具有的有益效果是:The beneficial effect that the present invention has compared with prior art is:
利用本发明的私有化云平台告警方案可以采集云平台内资源的主要性能指标,以及关键日志信息,并进行格式化等一系列解析操作,创建统一的数据索引,建立基于性能指标和日志信息的综合告警机制,并根据数据对云平台的影响严重程度对告警机制划分级别,同时对告警机制进行循环评估,还根据数据索引对警机制进行评估检测,保证告警的及时,并且针对不同的告警级别,设定不同的通知方式,便于通知相关人员及时了解、快速解决问题。The privatized cloud platform alarm scheme of the present invention can collect the main performance indicators and key log information of the resources in the cloud platform, and perform a series of analysis operations such as formatting, create a unified data index, and establish a system based on performance indicators and log information. Comprehensive alarm mechanism, and classify the alarm mechanism according to the severity of the impact of data on the cloud platform. At the same time, the alarm mechanism is evaluated cyclically, and the alarm mechanism is also evaluated and detected according to the data index to ensure the timely alarm, and for different alarm levels , set different notification methods, so that relevant personnel can be notified in a timely manner and solve problems quickly.
附图说明Description of drawings
图1本发明方法流程示意图。Fig. 1 schematic flow chart of the method of the present invention.
具体实施方式detailed description
本发明提供一种私有化云平台告警方案,收集私有化云平台内资源的信息数据,并进行数据的格式化,将获取的格式化数据进行验证,并根据数据的类型,分别进行解析,存储,创建数据索引,The present invention provides a privatized cloud platform alarm scheme, which collects information data of resources in the privatized cloud platform, formats the data, verifies the acquired formatted data, and analyzes and stores the data according to the type of data. , create a data index,
通过收集的信息数据根据一定规则建立对应等级的告警机制,并对不同等级的告警机制进行循环告警评估,同时对告警机制按照数据索引进行评估检测;Based on the collected information and data, the alarm mechanism of the corresponding level is established according to certain rules, and the alarm mechanism of different levels is cyclically evaluated, and the alarm mechanism is evaluated and detected according to the data index;
当检测私有化云平台内资源的信息数据一旦满足告警机制的触发条件,则根据不同告警机制选择不同的通知方式对相关人员通知告警。Once the information data of resources in the privatized cloud platform is detected to meet the triggering conditions of the alarm mechanism, different notification methods are selected according to different alarm mechanisms to notify relevant personnel of the alarm.
为使本发明的目的、技术方案和优点更加清楚明白,以下结合具体实施例,对本发明进一步详细说明。In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with specific examples.
利用本发明方法,进行数据采集,采用agent端主要采集私有化云平台的性能监控信息数据以及关键日志信息数据,并进行数据的格式化,然后通过rabbitmq,kafka等消息中间件作为传输通道进行数据传输;Utilize the method of the present invention, carry out data collection, adopt the agent end to mainly collect the performance monitoring information data of the privatized cloud platform and the key log information data, and carry out the format of the data, then use the message middleware such as rabbitmq, kafka as the transmission channel to carry out the data transmission;
server端从传输通道中获取数据,进行验证,并根据数据的类型,分别进行解析,存储,并创建数据索引;The server side obtains data from the transmission channel, performs verification, and according to the type of data, respectively parses, stores, and creates data indexes;
根据私有化云平台的性能监控信息数据以及关键日志信息数据对私有化云平台的影响严重程度不同,建立不同紧急等级的告警机制,不同级别的告警机制触发条件不同,其中触发条件可以是设定的阈值条件、关键字条件,或者综合两种因素以上的条件,比如设定阈值条件、关键字条件相结合的条件,满足一种即触发或者同时满足两种才触发;According to the severity of the impact of the performance monitoring information data of the privatized cloud platform and key log information data on the privatized cloud platform, alarm mechanisms with different emergency levels are established. The trigger conditions of different levels of alarm mechanisms are different. The trigger conditions can be set The threshold condition, keyword condition, or a combination of two or more factors, such as setting a combination of threshold conditions and keyword conditions, triggers when one is met or both are met at the same time;
针对告警机制,为保证告警的及时,进行循环告警评估,对紧急程度高的告警机制评估的周期短,对紧急程度一般的告警机制评估的周期适当延长,告警评估时读取数据索引内的数据进行告警评估检测。For the alarm mechanism, in order to ensure the timely alarm, the cycle alarm evaluation is carried out. The evaluation cycle of the alarm mechanism with high urgency is short, and the evaluation cycle of the alarm mechanism with general urgency is extended appropriately. The data in the data index is read during the alarm evaluation. Carry out alarm evaluation and detection.
当检测私有化云平台内资源的性能监控信息数据以及关键日志信息一旦满足告警机制的触发条件时,则根据不同级别的告警机制选择不同的通知方式对相关人员通知告警,并且同一告警机制可以选择一到多种告警通知方式,比如采用特殊系统事件通知、系统通知、邮件通知、短信通知等方式。Once the performance monitoring information data and key log information of resources in the privatized cloud platform are detected to meet the trigger conditions of the alarm mechanism, different notification methods are selected according to different levels of alarm mechanisms to notify relevant personnel of alarms, and the same alarm mechanism can be selected One or more alarm notification methods, such as special system event notification, system notification, email notification, SMS notification, etc.
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710915960.4A CN107623601A (en) | 2017-09-30 | 2017-09-30 | A privatized cloud platform alarm scheme |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710915960.4A CN107623601A (en) | 2017-09-30 | 2017-09-30 | A privatized cloud platform alarm scheme |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107623601A true CN107623601A (en) | 2018-01-23 |
Family
ID=61091210
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710915960.4A Pending CN107623601A (en) | 2017-09-30 | 2017-09-30 | A privatized cloud platform alarm scheme |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107623601A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112187807A (en) * | 2020-09-30 | 2021-01-05 | 新华三大数据技术有限公司 | Method, device and storage medium for monitoring branch network gateway |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101986274A (en) * | 2010-11-11 | 2011-03-16 | 东软集团股份有限公司 | Resource allocation system and resource allocation method in private cloud environment |
CN103152414A (en) * | 2013-03-01 | 2013-06-12 | 四川省电力公司信息通信公司 | High available system based on cloud calculation and implementation method thereof |
CN103873498A (en) * | 2012-12-11 | 2014-06-18 | 中国电信股份有限公司 | Cloud platform resource self-adaptive early warning method and system |
CN104092575A (en) * | 2014-07-29 | 2014-10-08 | 中国联合网络通信集团有限公司 | A resource monitoring method and system |
EP2924574A2 (en) * | 2014-03-26 | 2015-09-30 | Rockwell Automation Technologies, Inc. | Unified data ingestion adapter for migration of industrial data to a cloud platform |
CN106301919A (en) * | 2016-08-17 | 2017-01-04 | 浪潮电子信息产业股份有限公司 | Alarm system of privatized cloud platform and implementation method thereof |
CN106850295A (en) * | 2017-02-04 | 2017-06-13 | 郑州云海信息技术有限公司 | A kind of log collection monitoring method of privatization cloud platform |
CN106992876A (en) * | 2017-03-04 | 2017-07-28 | 郑州云海信息技术有限公司 | Cloud platform log management method and system |
-
2017
- 2017-09-30 CN CN201710915960.4A patent/CN107623601A/en active Pending
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101986274A (en) * | 2010-11-11 | 2011-03-16 | 东软集团股份有限公司 | Resource allocation system and resource allocation method in private cloud environment |
CN103873498A (en) * | 2012-12-11 | 2014-06-18 | 中国电信股份有限公司 | Cloud platform resource self-adaptive early warning method and system |
CN103152414A (en) * | 2013-03-01 | 2013-06-12 | 四川省电力公司信息通信公司 | High available system based on cloud calculation and implementation method thereof |
EP2924574A2 (en) * | 2014-03-26 | 2015-09-30 | Rockwell Automation Technologies, Inc. | Unified data ingestion adapter for migration of industrial data to a cloud platform |
CN104092575A (en) * | 2014-07-29 | 2014-10-08 | 中国联合网络通信集团有限公司 | A resource monitoring method and system |
CN106301919A (en) * | 2016-08-17 | 2017-01-04 | 浪潮电子信息产业股份有限公司 | Alarm system of privatized cloud platform and implementation method thereof |
CN106850295A (en) * | 2017-02-04 | 2017-06-13 | 郑州云海信息技术有限公司 | A kind of log collection monitoring method of privatization cloud platform |
CN106992876A (en) * | 2017-03-04 | 2017-07-28 | 郑州云海信息技术有限公司 | Cloud platform log management method and system |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112187807A (en) * | 2020-09-30 | 2021-01-05 | 新华三大数据技术有限公司 | Method, device and storage medium for monitoring branch network gateway |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108763957B (en) | Database security audit system, method and server | |
CN111459782B (en) | Method and device for monitoring service system, cloud platform system and server | |
CN106815125A (en) | A kind of log audit method and platform | |
CN113157521B (en) | Monitoring method and monitoring system for block chain full life cycle | |
CN107239388A (en) | A kind of monitoring alarm method and system | |
CN106100938A (en) | The monitoring of a kind of distributed cluster system and alarm method and system | |
GB2434670A (en) | Monitoring and management of distributed information systems | |
CN102546274A (en) | Alarm monitoring method and alarm monitoring equipment in communication service | |
JP2015028700A (en) | Failure detection device, failure detection method, failure detection program and recording medium | |
CN104574219A (en) | System and method for monitoring and early warning of operation conditions of power grid service information system | |
CN106161085A (en) | The monitoring system and method for messaging bus | |
CN110555004A (en) | Service monitoring method and device, computer equipment and storage medium | |
CN110929896A (en) | A safety analysis method and device for system equipment | |
CN110677271B (en) | Big data alarm method, device, equipment and storage medium based on ELK | |
CN108809720A (en) | The management method and device of alarming assignment in cloud data system | |
CN110784352B (en) | Data synchronous monitoring and alarming method and device based on Oracle golden gate | |
CN103856344A (en) | Alarm event information processing method and device | |
CN114356722A (en) | Monitoring alarm method, system, equipment and storage medium for server cluster | |
CN105589800A (en) | Application system for predicting faults of complex system | |
CN110971488A (en) | Data processing method, device, server and storage medium | |
US20210011793A1 (en) | Determining root-cause of failures based on machine-generated textual data | |
CN107623601A (en) | A privatized cloud platform alarm scheme | |
CN118861187A (en) | Interactive map system with multi-mode rendering | |
CN116781757B (en) | Data monitoring method, device, platform, electronic equipment and storage medium | |
CN104483943A (en) | Environment monitoring system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180123 |
|
RJ01 | Rejection of invention patent application after publication |