Talbi et al., 2013 - Google Patents
Metaheuristics on gpusTalbi et al., 2013
View PDF- Document ID
- 75741121932513312
- Author
- Talbi E
- Hasle G
- Publication year
- Publication venue
- J. Parallel Distributed Comput.
External Links
Snippet
Metaheuristics on GPU Page 1 1 Metaheuristics on GPU Thé Van Luong, Nouredine Melab and
El-Ghazali Talbi DOLPHIN Project Team April 2010 Page 2 2 Local search on GPU: From design
to implementation Page 3 3 Outline ❑ Parallel Local Search Metaheuristics (PLSM) ❑ …
- 238000011156 evaluation 0 description 29
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G06F9/5066—Algorithms for mapping a plurality of inter-dependent sub-tasks onto a plurality of physical CPUs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
- G06F9/505—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the load
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes
- G06F15/163—Interprocessor communication
- G06F15/173—Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star, snowflake
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/08—Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
- G06F12/0802—Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
- G06F12/0806—Multiuser, multiprocessor or multiprocessing cache systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored programme computers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformations of program code
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Talbi et al. | Metaheuristics on gpus | |
Lu et al. | Optimizing depthwise separable convolution operations on gpus | |
CN104461466B (en) | The method for improving calculating speed based on MPI and OpenMP Hybrid paradigm parallel computations | |
US8943011B2 (en) | Methods and systems for using map-reduce for large-scale analysis of graph-based data | |
Li et al. | MapReduce parallel programming model: a state-of-the-art survey | |
Satish et al. | Navigating the maze of graph analytics frameworks using massive graph datasets | |
Soman et al. | Fast community detection algorithm with gpus and multicore architectures | |
US8959138B2 (en) | Distributed data scalable adaptive map-reduce framework | |
Khoram et al. | Accelerating graph analytics by co-optimizing storage and access on an FPGA-HMC platform | |
CN104461467B (en) | The method for improving calculating speed using MPI and OpenMP hybrid parallels for SMP group systems | |
Zheng et al. | Architecture-based design and optimization of genetic algorithms on multi-and many-core systems | |
Meng et al. | A survey of distributed graph algorithms on massive graphs | |
Li et al. | A hybrid particle swarm optimization algorithm for load balancing of MDS on heterogeneous computing systems | |
Xiao et al. | Highly scalable parallel genetic algorithm on sunway many-core processors | |
Chavarria-Miranda et al. | Scaling graph community detection on the tilera many-core architecture | |
Wei et al. | Multi-core-, multi-thread-based optimization algorithm for large-scale traveling salesman problem | |
Zhang et al. | Low-latency mini-batch gnn inference on cpu-fpga heterogeneous platform | |
Jiao et al. | Communication optimizations for state-vector quantum simulator on CPU+ GPU clusters | |
CN109840306B (en) | Recursive-based parallel fast Fourier transform communication optimization method and system | |
Ueno et al. | 2d partitioning based graph search for the graph500 benchmark | |
Zhang et al. | A novel cloud model based data placement strategy for data-intensive application in clouds | |
Mirsadeghi et al. | PTRAM: A parallel topology-and routing-aware mapping framework for large-scale HPC systems | |
Lv et al. | Understanding parallelism in graph traversal on multi-core clusters | |
Wang et al. | NDPGNN: A Near-Data Processing Architecture for GNN Training and Inference Acceleration | |
Su et al. | Exploring pim architecture for high-performance graph pattern mining |