Elzohairy et al., 2022 - Google Patents

FedLesScan: Mitigating stragglers in serverless federated learning

Document ID: 648020169799306862
Author: Elzohairy M; Chadha M; Jindal A; Grafberger A; Gu J; Gerndt M; Abboud O
Publication year: 2022
Publication venue: 2022 IEEE International Conference on Big Data (Big Data)

Snippet

Federated Learning (FL) is a machine learning paradigm that enables the training of a shared global model across distributed clients while keeping the training data local. While most prior work on designing systems for FL has focused on using stateful always running …

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
- G06F17/30424—Query processing
- G06F17/30533—Other types of queries
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3409—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network-specific arrangements or communication protocols supporting networked applications
- H04L67/10—Network-specific arrangements or communication protocols supporting networked applications in which an application is distributed across nodes in the network
- H04L67/1002—Network-specific arrangements or communication protocols supporting networked applications in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers, e.g. load balancing
- H04L67/1004—Server selection in load balancing
- H04L67/1014—Server selection in load balancing based on the content of a request
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network-specific arrangements or communication protocols supporting networked applications
- H04L67/32—Network-specific arrangements or communication protocols supporting networked applications for scheduling or organising the servicing of application requests, e.g. requests for application data transmissions involving the analysis and optimisation of the required network resources
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance or administration or management of packet switching networks
- H04L41/50—Network service management, i.e. ensuring proper service fulfillment according to an agreement or contract between two parties, e.g. between an IT-provider and a customer
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices

Similar Documents

Publication	Publication Date	Title
Elzohairy et al.	2022	FedLesScan: Mitigating stragglers in serverless federated learning
US10841241B2 (en)	2020-11-17	Intelligent placement within a data center
CN115408151B (en)	2025-06-17	A method to accelerate federated learning training
US20200219028A1 (en)	2020-07-09	Systems, methods, and media for distributing database queries across a metered virtual network
Swarup et al.	2021	Task scheduling in cloud using deep reinforcement learning
US7793290B2 (en)	2010-09-07	Grip application acceleration by executing grid application based on application usage history prior to user request for application execution
Jain et al.	2023	QoS-aware task offloading in fog environment using multi-agent deep reinforcement learning
US10733190B2 (en)	2020-08-04	Method and device for deciding where to execute subqueries of an analytics continuous query
US20090300161A1 (en)	2009-12-03	Method and system for using feedback in accessing network services
Marcus et al.	2017	Releasing Cloud Databases from the Chains of Performance Prediction Models
US20220300323A1 (en)	2022-09-22	Job Scheduling Method and Job Scheduling Apparatus
US20210012225A1 (en)	2021-01-14	Machine learning based ranking of private distributed data, models and compute resources
CN103713935A (en)	2014-04-09	Method and device for managing Hadoop cluster resources in online manner
JP2024543115A (en)	2024-11-19	Method and apparatus for automatic parallel simulation of integrated circuits
Safavifar et al.	2021	Adaptive workload orchestration in pure edge computing: A reinforcement-learning model
Garg et al.	2021	Heuristic and reinforcement learning algorithms for dynamic service placement on mobile edge cloud
Lorido-Botran et al.	2022	ImpalaE: Towards an optimal policy for efficient resource management at the edge
Ogden et al.	2023	Layercake: Efficient inference serving with cloud and mobile resources
CN106502790A (en)	2017-03-15	A kind of task distribution optimization method based on data distribution
Shahhosseini et al.	2022	Hybrid learning for orchestrating deep learning inference in multi-user edge-cloud networks
US20220413896A1 (en)	2022-12-29	Selecting a node group of a work group for executing a target transaction of another work group to optimize parallel execution of steps of the target transaction
Gao et al.	2024	NebulaFL: Effective Asynchronous Federated Learning for JointCloud Computing
US20220413920A1 (en)	2022-12-29	Selecting a node of a work group for executing a target transaction of another work group to execute skippable steps prior to a predicted interruption
Mostafaei et al.	2022	Network-aware worker placement for wide-area streaming analytics
Mostafaei et al.	2021	SNR: Network-aware geo-distributed stream analytics