Beneventi et al., 2017 - Google Patents
Continuous learning of HPC infrastructure models using big data analytics and in-memory processing toolsBeneventi et al., 2017
View PDF- Document ID
- 7086271924173840434
- Author
- Beneventi F
- Bartolini A
- Cavazzoni C
- Benini L
- Publication year
- Publication venue
- Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017
External Links
Snippet
Exascale computing represents the next leap in the HPC race. Reaching this level of performance is subject to several engineering challenges such as energy consumption, equipment-cooling, reliability and massive parallelism. Model-based optimization is an …
- 235000010977 hydroxypropyl cellulose 0 title abstract description 20
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/48—Programme initiating; Programme switching, e.g. by interrupt
- G06F9/4806—Task transfer initiation or dispatching
- G06F9/4843—Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
- G06F9/4881—Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3409—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3466—Performance evaluation by tracing or monitoring
- G06F11/3495—Performance evaluation by tracing or monitoring for systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/54—Interprogramme communication; Intertask communication
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3065—Monitoring arrangements determined by the means or processing involved in reporting the monitored data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes
- G06F15/163—Interprocessor communication
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2201/00—Indexing scheme relating to error detection, to error correction, and to monitoring
- G06F2201/86—Event-based monitoring
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F1/00—Details of data-processing equipment not covered by groups G06F3/00 - G06F13/00, e.g. cooling, packaging or power supply specially adapted for computer application
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Beneventi et al. | Continuous learning of HPC infrastructure models using big data analytics and in-memory processing tools | |
| Kalavri et al. | Three steps is all you need: fast, accurate, automatic scaling decisions for distributed streaming dataflows | |
| Lin et al. | A taxonomy and survey of power models and power modeling for cloud servers | |
| Simmhan et al. | Karma2: Provenance management for data-driven workflows | |
| Fu et al. | DRS: Dynamic resource scheduling for real-time analytics over fast streams | |
| Lin et al. | A cloud server energy consumption measurement system for heterogeneous cloud environments | |
| Netti et al. | DCDB wintermute: Enabling online and holistic operational data analytics on HPC systems | |
| Calzarossa et al. | Workloads in the Clouds | |
| Rajachandrasekar et al. | Monitoring and predicting hardware failures in HPC clusters with FTB-IPMI | |
| Song et al. | Energy profiling and analysis of the hpc challenge benchmarks | |
| Colmant et al. | WattsKit: Software-defined power monitoring of distributed systems | |
| Stefanov et al. | A review of supercomputer performance monitoring systems | |
| Sang et al. | Precise, scalable, and online request tracing for multitier services of black boxes | |
| Patki et al. | Monitoring large scale supercomputers: A case study with the lassen supercomputer | |
| Thaler et al. | Hybrid approach to hpc cluster telemetry and hardware log analytics | |
| Brondolin et al. | PRESTO: a latency-aware power-capping orchestrator for cloud-native microservices | |
| Khan | Hadoop performance modeling and job optimization for big data analytics | |
| Anghel et al. | Quantifying communication in graph analytics | |
| Shatalin et al. | Root causing MPI workloads imbalance issues via scalable MPI Critical Path analysis | |
| Guo et al. | On the performance and power consumption analysis of elastic clouds | |
| Kunz | Hpc job-monitoring with slurm, prometheus and grafana | |
| Nikolaou et al. | Total Cost of Ownership Perspective of Cloud vs Edge Deployments of IoT Applications | |
| Scionti et al. | The green computing continuum: The opera perspective | |
| Stubbs et al. | Toward Smart Scheduling in Tapis | |
| Wang et al. | Model Construction and Data Management of Running Log in Supporting SaaS Software Performance Analysis. |