Zhan et al., 2021

Field programmable gate array‐based all‐layer accelerator with quantization neural networks for sustainable cyber‐physical systems

Document ID: 17532692209079483723
Author: Zhan J; Zhou X; Jiang W
Publication year: 2021
Publication venue: Software: Practice and Experience

Snippet

Summary: Low‐Bit Neural Network (LBNN) is a promising technique for enriching intelligent applications running on sustainable Cyber‐Physical Systems (CPS). Although LBNN has the advantages of low memory usage, fast inference, and low power consumption, Low‐bit …

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30312—Storage and indexing structures; Management thereof
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/38—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
- G06F7/48—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored programme computers
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches

Similar Documents

Publication	Publication Date	Title
Kulkarni et al.	2021	Quantization friendly mobilenet (qf-mobilenet) architecture for vision based applications on embedded platforms
CN114503125B (en)	2025-08-05	Structured pruning method, system and computer-readable medium
CN113273082B (en)	2025-04-25	Neural network activation compression with outlier block floating point
CN111095302B (en)	2024-05-28	Compression of Sparse Deep Convolutional Network Weights
Mohaidat et al.	2024	A survey on neural network hardware accelerators
EP4036724A1 (en)	2022-08-03	Method for splitting neural network model by using multi-core processor, and related product
Li et al.	2024	Large language model inference acceleration: A comprehensive hardware perspective
Wang et al.	2022	TRC‐YOLO: A real‐time detection method for lightweight targets based on mobile devices
CN113986816B (en)	2023-05-02	Reconfigurable computing chip
Russo et al.	2021	DNN model compression for IoT domain-specific hardware accelerators
US20230146689A1 (en)	2023-05-11	Deep neural network
Shahshahani et al.	2018	Memory optimization techniques for fpga based cnn implementations
Li et al.	2024	Locmoe: A low-overhead moe for large language model training
Yan et al.	2020	FPGAN: an FPGA accelerator for graph attention networks with software and hardware co-optimization
Zhan et al.	2021	Field programmable gate array‐based all‐layer accelerator with quantization neural networks for sustainable cyber‐physical systems
He et al.	2020	Bis-km: Enabling any-precision k-means on fpgas
Raha et al.	2023	Efficient hardware acceleration of emerging neural networks for embedded machine learning: An industry perspective
Zhang et al.	2020	Cosine: compressive network embedding on large-scale information networks
Fuketa et al.	2018	Image-classifier deep convolutional neural network training by 9-bit dedicated hardware to realize validation accuracy and energy efficiency superior to the half precision floating point format
Moon et al.	2023	Multipurpose Deep-Learning Accelerator for Arbitrary Quantization With Reduction of Storage, Logic, and Latency Waste
Huang et al.	2022	An efficient loop tiling framework for convolutional neural network inference accelerators
Chen et al.	2023	Bitgnn: Unleashing the performance potential of binary graph neural networks on gpus
Zhang et al.	2020	XNORCONV: CNNs accelerator implemented on FPGA using a hybrid CNNs structure and an inter‐layer pipeline method
Yan et al.	2020	S-GAT: Accelerating graph attention networks inference on FPGA platform with shift operation
Lei et al.	2024	DT-SCNN: dual-threshold spiking convolutional neural network with fewer operations and memory access for edge applications