[go: up one dir, main page]

Beneventi et al., 2017 - Google Patents

Continuous learning of HPC infrastructure models using big data analytics and in-memory processing tools

Beneventi et al., 2017

View PDF
Document ID
7086271924173840434
Author
Beneventi F
Bartolini A
Cavazzoni C
Benini L
Publication year
Publication venue
Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017

External Links

Snippet

Exascale computing represents the next leap in the HPC race. Reaching this level of performance is subject to several engineering challenges such as energy consumption, equipment-cooling, reliability and massive parallelism. Model-based optimization is an …
Continue reading at www.research-collection.ethz.ch (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46Multiprogramming arrangements
    • G06F9/48Programme initiating; Programme switching, e.g. by interrupt
    • G06F9/4806Task transfer initiation or dispatching
    • G06F9/4843Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
    • G06F9/4881Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3409Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3466Performance evaluation by tracing or monitoring
    • G06F11/3495Performance evaluation by tracing or monitoring for systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46Multiprogramming arrangements
    • G06F9/54Interprogramme communication; Intertask communication
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3065Monitoring arrangements determined by the means or processing involved in reporting the monitored data
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/16Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes
    • G06F15/163Interprocessor communication
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/86Event-based monitoring
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F1/00Details of data-processing equipment not covered by groups G06F3/00 - G06F13/00, e.g. cooling, packaging or power supply specially adapted for computer application

Similar Documents

Publication Publication Date Title
Beneventi et al. Continuous learning of HPC infrastructure models using big data analytics and in-memory processing tools
Kalavri et al. Three steps is all you need: fast, accurate, automatic scaling decisions for distributed streaming dataflows
Lin et al. A taxonomy and survey of power models and power modeling for cloud servers
Simmhan et al. Karma2: Provenance management for data-driven workflows
Fu et al. DRS: Dynamic resource scheduling for real-time analytics over fast streams
Lin et al. A cloud server energy consumption measurement system for heterogeneous cloud environments
Netti et al. DCDB wintermute: Enabling online and holistic operational data analytics on HPC systems
Calzarossa et al. Workloads in the Clouds
Rajachandrasekar et al. Monitoring and predicting hardware failures in HPC clusters with FTB-IPMI
Song et al. Energy profiling and analysis of the hpc challenge benchmarks
Colmant et al. WattsKit: Software-defined power monitoring of distributed systems
Stefanov et al. A review of supercomputer performance monitoring systems
Sang et al. Precise, scalable, and online request tracing for multitier services of black boxes
Patki et al. Monitoring large scale supercomputers: A case study with the lassen supercomputer
Thaler et al. Hybrid approach to hpc cluster telemetry and hardware log analytics
Brondolin et al. PRESTO: a latency-aware power-capping orchestrator for cloud-native microservices
Khan Hadoop performance modeling and job optimization for big data analytics
Anghel et al. Quantifying communication in graph analytics
Shatalin et al. Root causing MPI workloads imbalance issues via scalable MPI Critical Path analysis
Guo et al. On the performance and power consumption analysis of elastic clouds
Kunz Hpc job-monitoring with slurm, prometheus and grafana
Nikolaou et al. Total Cost of Ownership Perspective of Cloud vs Edge Deployments of IoT Applications
Scionti et al. The green computing continuum: The opera perspective
Stubbs et al. Toward Smart Scheduling in Tapis
Wang et al. Model Construction and Data Management of Running Log in Supporting SaaS Software Performance Analysis.