Li et al., 2013 - Google Patents

NUMA-aware algorithms: the case of data shuffling.

Li et al., 2013

Document ID: 14644577709428683445
Author: Li Y; Pandis I; Mueller R; Raman V; Lohman G
Publication year: 2013
Publication venue: CIDR

External Links

Cited by

Snippet

In recent years, a new breed of non-uniform memory access (NUMA) systems has emerged: multi-socket servers of multicores. This paper makes the case that data management systems need to employ designs that take into consideration the characteristics of modern …

Continue reading at www.pandis.net (PDF) (other versions)

230000015654 memory 0 abstract description 71

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/08—Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
- G06F12/0802—Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
- G06F12/0806—Multiuser, multiprocessor or multiprocessing cache systems
- G06F12/0815—Cache consistency protocols
- G06F12/0817—Cache consistency protocols using directory methods
- G06F12/0826—Limited pointers directories; State-only directories without pointers
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes
- G06F15/163—Interprocessor communication
- G06F15/173—Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star, snowflake
- G06F15/17356—Indirect interconnection networks
- G06F15/17368—Indirect interconnection networks non hierarchical topologies
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/52—Programme synchronisation; Mutual exclusion, e.g. by means of semaphores; Contention for resources among tasks
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30575—Replication, distribution or synchronisation of data between databases or within a distributed database; Distributed database system architectures therefor
- G06F17/30584—Details of data partitioning, e.g. horizontal or vertical partitioning
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored programme computers
- G06F15/78—Architectures of general purpose stored programme computers comprising a single central processing unit
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored programme computers
- G06F15/80—Architectures of general purpose stored programme computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
- G06F15/8007—Architectures of general purpose stored programme computers comprising an array of processing units with common control, e.g. single instruction multiple data processors single instruction multiple data [SIMD] multiprocessors
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Error detection; Error correction; Monitoring responding to the occurence of a fault, e.g. fault tolerance
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2212/00—Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
- G06F2212/25—Using a specific main memory architecture
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2201/00—Indexing scheme relating to error detection, to error correction, and to monitoring

Similar Documents

Publication	Publication Date	Title
Li et al.	2013	NUMA-aware algorithms: the case of data shuffling.
Shukur et al.	2020	Cache coherence protocols in distributed systems
Li et al.	2013	NUMA-aware shared-memory collective communication for MPI
Maass et al.	2017	Mosaic: Processing a trillion-edge graph on a single machine
Kim et al.	2014	Multi-GPU system design with memory networks
Ziegler et al.	2022	ScaleStore: A fast and cost-efficient storage engine using DRAM, NVMe, and RDMA
Li et al.	2016	MapReduce parallel programming model: a state-of-the-art survey
US6871219B2 (en)	2005-03-22	Dynamic memory placement policies for NUMA architecture
Ramos et al.	2017	Capability models for manycore memory systems: A case-study with Xeon Phi KNL
Kissinger et al.	2014	ERIS: A NUMA-aware in-memory storage engine for analytical workloads
Zhang et al.	2015	Exploiting commutativity to reduce the cost of updates to shared data in cache-coherent systems
Ma et al.	2013	Kernel-assisted and topology-aware MPI collective communications on multicore/many-core platforms
Li et al.	2014	Improved MPI collectives for MPI processes in shared address spaces
Luo et al.	2018	Adapt: An event-based adaptive collective communication framework
Morari et al.	2014	Scaling irregular applications through data aggregation and software multithreading
Balaji et al.	2019	Combining data duplication and graph reordering to accelerate parallel graph processing
Jatala et al.	2020	A study of graph analytics for massive datasets on distributed multi-gpus
Burkhardt et al.	2022	A cloud-based approach to big graphs
Mhembere et al.	2017	knor: a NUMA-optimized in-memory, distributed and semi-external-memory k-means library
Hashmi et al.	2020	Machine-agnostic and Communication-aware Designs for MPI on Emerging Architectures
Morari et al.	2012	Efficient sorting on the Tilera manycore architecture
Asaduzzaman et al.	2016	A novel directory based hybrid cache coherence protocol for shared memory multiprocessors
Krick et al.	1999	Data management in networks: Experimental evaluation of a provably good strategy
Liu et al.	2018	Topology‐Aware Strategy for MPI‐IO Operations in Clusters
Papagiannis et al.	2014	Hybrid address spaces: A methodology for implementing scalable high-level programming models on non-coherent many-core architectures