Usui et al., 2016 - Google Patents

DASH: Deadline-aware high-performance memory scheduler for heterogeneous systems with hardware accelerators

Usui et al., 2016

Document ID: 442029164379248414
Author: Usui H; Subramanian L; Chang K; Mutlu O
Publication year: 2016
Publication venue: ACM Transactions on Architecture and Code Optimization (TACO)

External Links

Cited by

Snippet

Modern SoCs integrate multiple CPU cores and hardware accelerators (HWAs) that share the same main memory system, causing interference among memory requests from different agents. The result of this interference, if it is not controlled well, is missed deadlines for …

Continue reading at dl.acm.org (PDF) (other versions)

230000015654 memory 0 title abstract description 194

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/48—Programme initiating; Programme switching, e.g. by interrupt
- G06F9/4806—Task transfer initiation or dispatching
- G06F9/4843—Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
- G06F9/4881—Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5011—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/30—Arrangements for executing machine-instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline, look ahead
- G06F9/3836—Instruction issuing, e.g. dynamic instruction scheduling, out of order instruction execution
- G06F9/3851—Instruction issuing, e.g. dynamic instruction scheduling, out of order instruction execution from multiple instruction streams, e.g. multistreaming
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/52—Programme synchronisation; Mutual exclusion, e.g. by means of semaphores; Contention for resources among tasks
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/08—Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
- G06F12/0802—Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F13/00—Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
- G06F13/14—Handling requests for interconnection or transfer
- G06F13/16—Handling requests for interconnection or transfer for access to memory bus
- G06F13/1605—Handling requests for interconnection or transfer for access to memory bus based on arbitration
- G06F13/1642—Handling requests for interconnection or transfer for access to memory bus based on arbitration with request queuing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F13/00—Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
- G06F13/14—Handling requests for interconnection or transfer
- G06F13/16—Handling requests for interconnection or transfer for access to memory bus
- G06F13/1605—Handling requests for interconnection or transfer for access to memory bus based on arbitration
- G06F13/161—Handling requests for interconnection or transfer for access to memory bus based on arbitration with latency improvement
- G06F13/1626—Handling requests for interconnection or transfer for access to memory bus based on arbitration with latency improvement by reordering requests
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F1/00—Details of data-processing equipment not covered by groups G06F3/00 - G06F13/00, e.g. cooling, packaging or power supply specially adapted for computer application
- G06F1/26—Power supply means, e.g. regulation thereof
- G06F1/32—Means for saving power
- G06F1/3203—Power Management, i.e. event-based initiation of power-saving mode
- G06F1/3234—Action, measure or step performed to reduce power consumption
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements

Similar Documents

Publication	Publication Date	Title
Usui et al.	2016	DASH: Deadline-aware high-performance memory scheduler for heterogeneous systems with hardware accelerators
Ausavarungnirun et al.	2012	Staged memory scheduling: Achieving high performance and scalability in heterogeneous systems
Muralidhara et al.	2011	Reducing memory interference in multicore systems via application-aware memory channel partitioning
Jeong et al.	2012	A QoS-aware memory controller for dynamically balancing GPU and CPU bandwidth use in an MPSoC
Mutlu et al.	2007	Stall-time fair memory access scheduling for chip multiprocessors
US8898435B2 (en)	2014-11-25	Optimizing system throughput by automatically altering thread co-execution based on operating system directives
Kayıran et al.	2013	Neither more nor less: Optimizing thread-level parallelism for GPGPUs
Kim et al.	2014	Bounding memory interference delay in COTS-based multi-core systems
CN102609312B (en)	2015-08-19	Based on the SJF memory request dispatching method that fairness is considered
Jog et al.	2016	Exploiting core criticality for enhanced GPU performance
CN102081551A (en)	2011-06-01	Micro-architecture sensitive thread scheduling (MSTS) method
Gifford et al.	2021	Dna: Dynamic resource allocation for soft real-time multicore systems
Usui et al.	2015	Squash: Simple qos-aware high-performance memory scheduler for heterogeneous systems with hardware accelerators
Gupta et al.	2013	Timecube: A manycore embedded processor with interference-agnostic progress tracking
US12204774B2 (en)	2025-01-21	Allocation of resources when processing at memory level through memory request scheduling
Zhu et al.	2002	Fine-grain priority scheduling on multi-channel memory systems
Mutlu et al.	2009	Parallelism-aware batch scheduling: Enabling high-performance and fair shared memory controllers
Ausavarungnirun	2017	Techniques for shared resource management in systems with throughput processors
Elhelw et al.	2014	Time-based least memory intensive scheduling
Song et al.	2016	Single-tier virtual queuing: An efficacious memory controller architecture for MPSoCs with multiple realtime cores
Alhammad	2016	Memory Efficient Scheduling for Multicore Real-time Systems
Zhang et al.	2012	Fair memory access scheduling for quality of service guarantees via service curves
Jia et al.	2014	Combine thread with memory scheduling for maximizing performance in multi-core systems
Kim et al.	2017	Efficient GPU multitasking with latency minimization and cache boosting
Jatala et al.	2015	The more we share, the more we have: Improving GPU performance through register sharing