[go: up one dir, main page]

Wang et al., 2020 - Google Patents

Root-cause metric location for microservice systems via log anomaly detection

Wang et al., 2020

View PDF
Document ID
7903337456843097456
Author
Wang L
Zhao N
Chen J
Li P
Zhang W
Sui K
Publication year
Publication venue
2020 IEEE international conference on web services (ICWS)

External Links

Snippet

Microservice systems are typically fragile and failures are inevitable in them due to their complexity and large scale. However, it is challenging to localize the root-cause metric due to its complicated dependencies and the huge number of various metrics. Existing methods …
Continue reading at tjusail.github.io (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3409Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Error detection; Error correction; Monitoring responding to the occurence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • G06F11/0709Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment in a distributed system consisting of a plurality of standalone computer nodes, e.g. clusters, client-server systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3466Performance evaluation by tracing or monitoring
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/36Preventing errors by testing or debugging software
    • G06F11/3668Software testing
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3065Monitoring arrangements determined by the means or processing involved in reporting the monitored data
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F17/30705Clustering or classification
    • G06F17/3071Clustering or classification including class or cluster creation or modification
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management, e.g. organising, planning, scheduling or allocating time, human or machine resources; Enterprise planning; Organisational models
    • G06Q10/063Operations research or analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/875Monitoring of systems including the internet
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/86Event-based monitoring
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass

Similar Documents

Publication Publication Date Title
Wang et al. Root-cause metric location for microservice systems via log anomaly detection
Lee et al. Eadro: An end-to-end troubleshooting framework for microservices on multi-source data
Wu et al. Microdiag: Fine-grained performance diagnosis for microservice systems
CN116450399B (en) Microservice system fault diagnosis and root cause location method
US20160253232A1 (en) Contextual graph matching based anomaly detection
Yu et al. TraceRank: Abnormal service localization with dis‐aggregated end‐to‐end tracing data in cloud native systems
Zeng et al. Traceark: Towards actionable performance anomaly alerting for online service systems
US20190138542A1 (en) Classification of log data
Zhang et al. SentiLog: Anomaly detecting on parallel file systems via log-based sentiment analysis
Pham et al. Baro: Robust root cause analysis for microservices via multivariate bayesian online change point detection
CN115237717A (en) Micro-service abnormity detection method and system
Cai et al. A real-time trace-level root-cause diagnosis system in alibaba datacenters
Fawzy et al. Framework for automatic detection of anomalies in DevOps
US11055631B2 (en) Automated meta parameter search for invariant based anomaly detectors in log analytics
Jiang et al. Efficient fault detection and diagnosis in complex software systems with information-theoretic monitoring
Xie et al. Confidence guided anomaly detection model for anti-concept drift in dynamic logs
Zhang et al. Semi-supervised and unsupervised anomaly detection by mining numerical workflow relations from system logs
Zhu et al. HeMiRCA: Fine-grained root cause analysis for microservices with heterogeneous data sources
Jiang et al. Look deep into the microservice system anomaly through very sparse logs
Yu et al. Cmdiagnostor: An ambiguity-aware root cause localization approach based on call metric data
Wang et al. MADMM: microservice system anomaly detection via multi-modal data and multi-feature extraction
Yao et al. SparseRCA: Unsupervised Root Cause Analysis in Sparse Microservice Testing Traces
US20230306343A1 (en) Business process management system and method thereof
Floroiu et al. Anomaly detection and root cause analysis of microservices energy consumption
ZHANG et al. Approach to anomaly detection in microservice system with multi-source data streams