Elzohairy et al., 2022 - Google Patents
Fedlesscan: Mitigating stragglers in serverless federated learningElzohairy et al., 2022
View PDF- Document ID
- 648020169799306862
- Author
- Elzohairy M
- Chadha M
- Jindal A
- Grafberger A
- Gu J
- Gerndt M
- Abboud O
- Publication year
- Publication venue
- 2022 IEEE International Conference on Big Data (Big Data)
External Links
Snippet
Federated Learning (FL) is a machine learning paradigm that enables the training of a shared global model across distributed clients while keeping the training data local. While most prior work on designing systems for FL has focused on using stateful always running …
- 230000000116 mitigating 0 title abstract description 5
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
- G06F17/30424—Query processing
- G06F17/30533—Other types of queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3409—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network-specific arrangements or communication protocols supporting networked applications
- H04L67/10—Network-specific arrangements or communication protocols supporting networked applications in which an application is distributed across nodes in the network
- H04L67/1002—Network-specific arrangements or communication protocols supporting networked applications in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers, e.g. load balancing
- H04L67/1004—Server selection in load balancing
- H04L67/1014—Server selection in load balancing based on the content of a request
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network-specific arrangements or communication protocols supporting networked applications
- H04L67/32—Network-specific arrangements or communication protocols supporting networked applications for scheduling or organising the servicing of application requests, e.g. requests for application data transmissions involving the analysis and optimisation of the required network resources
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance or administration or management of packet switching networks
- H04L41/50—Network service management, i.e. ensuring proper service fulfillment according to an agreement or contract between two parties, e.g. between an IT-provider and a customer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Elzohairy et al. | Fedlesscan: Mitigating stragglers in serverless federated learning | |
US10841241B2 (en) | Intelligent placement within a data center | |
CN115408151B (en) | A method to accelerate federated learning training | |
US20200219028A1 (en) | Systems, methods, and media for distributing database queries across a metered virtual network | |
Swarup et al. | Task scheduling in cloud using deep reinforcement learning | |
US7793290B2 (en) | Grip application acceleration by executing grid application based on application usage history prior to user request for application execution | |
Jain et al. | QoS-aware task offloading in fog environment using multi-agent deep reinforcement learning | |
US10733190B2 (en) | Method and device for deciding where to execute subqueries of an analytics continuous query | |
US20090300161A1 (en) | Method and system for using feedback in accessing network services | |
Marcus et al. | Releasing Cloud Databases for the Chains of Performance Prediction Models. | |
US20220300323A1 (en) | Job Scheduling Method and Job Scheduling Apparatus | |
US20210012225A1 (en) | Machine learning based ranking of private distributed data, models and compute resources | |
CN103713935A (en) | Method and device for managing Hadoop cluster resources in online manner | |
JP2024543115A (en) | Method and apparatus for automatic parallel simulation of integrated circuits | |
Safavifar et al. | Adaptive workload orchestration in pure edge computing: A reinforcement-learning model | |
Garg et al. | Heuristic and reinforcement learning algorithms for dynamic service placement on mobile edge cloud | |
Lorido-Botran et al. | ImpalaE: Towards an optimal policy for efficient resource management at the edge | |
Ogden et al. | Layercake: Efficient inference serving with cloud and mobile resources | |
CN106502790A (en) | A kind of task distribution optimization method based on data distribution | |
Shahhosseini et al. | Hybrid learning for orchestrating deep learning inference in multi-user edge-cloud networks | |
US20220413896A1 (en) | Selecting a node group of a work group for executing a target transaction of another work group to optimize parallel execution of steps of the target transaction | |
Gao et al. | NebulaFL: Effective Asynchronous Federated Learning for JointCloud Computing | |
US20220413920A1 (en) | Selecting a node of a work group for executing a target transaction of another work group to execute skippable steps prior to a predicted interruption | |
Mostafaei et al. | Network-aware worker placement for wide-area streaming analytics | |
Mostafaei et al. | SNR: Network-aware geo-distributed stream analytics |