[go: up one dir, main page]

Elzohairy et al., 2022 - Google Patents

Fedlesscan: Mitigating stragglers in serverless federated learning

Elzohairy et al., 2022

View PDF
Document ID
648020169799306862
Author
Elzohairy M
Chadha M
Jindal A
Grafberger A
Gu J
Gerndt M
Abboud O
Publication year
Publication venue
2022 IEEE International Conference on Big Data (Big Data)

External Links

Snippet

Federated Learning (FL) is a machine learning paradigm that enables the training of a shared global model across distributed clients while keeping the training data local. While most prior work on designing systems for FL has focused on using stateful always running …
Continue reading at arxiv.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30386Retrieval requests
    • G06F17/30424Query processing
    • G06F17/30533Other types of queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3409Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network-specific arrangements or communication protocols supporting networked applications
    • H04L67/10Network-specific arrangements or communication protocols supporting networked applications in which an application is distributed across nodes in the network
    • H04L67/1002Network-specific arrangements or communication protocols supporting networked applications in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers, e.g. load balancing
    • H04L67/1004Server selection in load balancing
    • H04L67/1014Server selection in load balancing based on the content of a request
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network-specific arrangements or communication protocols supporting networked applications
    • H04L67/32Network-specific arrangements or communication protocols supporting networked applications for scheduling or organising the servicing of application requests, e.g. requests for application data transmissions involving the analysis and optimisation of the required network resources
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance or administration or management of packet switching networks
    • H04L41/50Network service management, i.e. ensuring proper service fulfillment according to an agreement or contract between two parties, e.g. between an IT-provider and a customer
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/04Inference methods or devices

Similar Documents

Publication Publication Date Title
Elzohairy et al. Fedlesscan: Mitigating stragglers in serverless federated learning
US10841241B2 (en) Intelligent placement within a data center
CN115408151B (en) A method to accelerate federated learning training
US20200219028A1 (en) Systems, methods, and media for distributing database queries across a metered virtual network
Swarup et al. Task scheduling in cloud using deep reinforcement learning
US7793290B2 (en) Grip application acceleration by executing grid application based on application usage history prior to user request for application execution
Jain et al. QoS-aware task offloading in fog environment using multi-agent deep reinforcement learning
US10733190B2 (en) Method and device for deciding where to execute subqueries of an analytics continuous query
US20090300161A1 (en) Method and system for using feedback in accessing network services
Marcus et al. Releasing Cloud Databases for the Chains of Performance Prediction Models.
US20220300323A1 (en) Job Scheduling Method and Job Scheduling Apparatus
US20210012225A1 (en) Machine learning based ranking of private distributed data, models and compute resources
CN103713935A (en) Method and device for managing Hadoop cluster resources in online manner
JP2024543115A (en) Method and apparatus for automatic parallel simulation of integrated circuits
Safavifar et al. Adaptive workload orchestration in pure edge computing: A reinforcement-learning model
Garg et al. Heuristic and reinforcement learning algorithms for dynamic service placement on mobile edge cloud
Lorido-Botran et al. ImpalaE: Towards an optimal policy for efficient resource management at the edge
Ogden et al. Layercake: Efficient inference serving with cloud and mobile resources
CN106502790A (en) A kind of task distribution optimization method based on data distribution
Shahhosseini et al. Hybrid learning for orchestrating deep learning inference in multi-user edge-cloud networks
US20220413896A1 (en) Selecting a node group of a work group for executing a target transaction of another work group to optimize parallel execution of steps of the target transaction
Gao et al. NebulaFL: Effective Asynchronous Federated Learning for JointCloud Computing
US20220413920A1 (en) Selecting a node of a work group for executing a target transaction of another work group to execute skippable steps prior to a predicted interruption
Mostafaei et al. Network-aware worker placement for wide-area streaming analytics
Mostafaei et al. SNR: Network-aware geo-distributed stream analytics