[go: up one dir, main page]

Talbi et al., 2013 - Google Patents

Metaheuristics on gpus

Talbi et al., 2013

View PDF
Document ID
75741121932513312
Author
Talbi E
Hasle G
Publication year
Publication venue
J. Parallel Distributed Comput.

External Links

Snippet

Metaheuristics on GPU Page 1 1 Metaheuristics on GPU Thé Van Luong, Nouredine Melab and El-Ghazali Talbi DOLPHIN Project Team April 2010 Page 2 2 Local search on GPU: From design to implementation Page 3 3 Outline ❑ Parallel Local Search Metaheuristics (PLSM) ❑ …
Continue reading at www.sintef.no (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • G06F9/5066Algorithms for mapping a plurality of inter-dependent sub-tasks onto a plurality of physical CPUs
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • G06F9/505Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the load
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/16Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes
    • G06F15/163Interprocessor communication
    • G06F15/173Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star, snowflake
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/08Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
    • G06F12/0802Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
    • G06F12/0806Multiuser, multiprocessor or multiprocessing cache systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/50Computer-aided design
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/76Architectures of general purpose stored programme computers
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/12Computer systems based on biological models using genetic models
    • G06N3/126Genetic algorithms, i.e. information processing using digital simulations of the genetic system
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/40Transformations of program code

Similar Documents

Publication Publication Date Title
Talbi et al. Metaheuristics on gpus
Lu et al. Optimizing depthwise separable convolution operations on gpus
CN104461466B (en) The method for improving calculating speed based on MPI and OpenMP Hybrid paradigm parallel computations
US8943011B2 (en) Methods and systems for using map-reduce for large-scale analysis of graph-based data
Li et al. MapReduce parallel programming model: a state-of-the-art survey
Satish et al. Navigating the maze of graph analytics frameworks using massive graph datasets
Soman et al. Fast community detection algorithm with gpus and multicore architectures
US8959138B2 (en) Distributed data scalable adaptive map-reduce framework
Khoram et al. Accelerating graph analytics by co-optimizing storage and access on an FPGA-HMC platform
CN104461467B (en) The method for improving calculating speed using MPI and OpenMP hybrid parallels for SMP group systems
Zheng et al. Architecture-based design and optimization of genetic algorithms on multi-and many-core systems
Meng et al. A survey of distributed graph algorithms on massive graphs
Li et al. A hybrid particle swarm optimization algorithm for load balancing of MDS on heterogeneous computing systems
Xiao et al. Highly scalable parallel genetic algorithm on sunway many-core processors
Chavarria-Miranda et al. Scaling graph community detection on the tilera many-core architecture
Wei et al. Multi-core-, multi-thread-based optimization algorithm for large-scale traveling salesman problem
Zhang et al. Low-latency mini-batch gnn inference on cpu-fpga heterogeneous platform
Jiao et al. Communication optimizations for state-vector quantum simulator on CPU+ GPU clusters
CN109840306B (en) Recursive-based parallel fast Fourier transform communication optimization method and system
Ueno et al. 2d partitioning based graph search for the graph500 benchmark
Zhang et al. A novel cloud model based data placement strategy for data-intensive application in clouds
Mirsadeghi et al. PTRAM: A parallel topology-and routing-aware mapping framework for large-scale HPC systems
Lv et al. Understanding parallelism in graph traversal on multi-core clusters
Wang et al. NDPGNN: A Near-Data Processing Architecture for GNN Training and Inference Acceleration
Su et al. Exploring pim architecture for high-performance graph pattern mining