[go: up one dir, main page]

Gévay et al., 2021 - Google Patents

Efficient control flow in dataflow systems: When ease-of-use meets high performance

Gévay et al., 2021

View PDF
Document ID
18101428451283902510
Author
Gévay G
Rabl T
Breß S
Madai-Tahy L
Quiané-Ruiz J
Markl V
Publication year
Publication venue
2021 IEEE 37th International Conference on Data Engineering (ICDE)

External Links

Snippet

Modern data analysis tasks often involve control flow statements, such as iterations. Common examples are PageRank and K-means. To achieve scalability, developers usually implement data analysis tasks in distributed dataflow systems, such as Spark and Flink …
Continue reading at www.researchgate.net (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/40Transformations of program code
    • G06F8/41Compilation
    • G06F8/44Encoding
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30386Retrieval requests
    • G06F17/30424Query processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/36Preventing errors by testing or debugging software
    • G06F11/362Software debugging
    • G06F11/3636Software debugging by tracing the execution of the program
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/40Transformations of program code
    • G06F8/41Compilation
    • G06F8/42Syntactic analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/36Preventing errors by testing or debugging software
    • G06F11/3668Software testing
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/44Arrangements for executing specific programmes
    • G06F9/4421Execution paradigms
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/44Arrangements for executing specific programmes
    • G06F9/455Emulation; Software simulation, i.e. virtualisation or emulation of application or operating system execution engines
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46Multiprogramming arrangements
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/70Software maintenance or management
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/60Software deployment
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Error detection; Error correction; Monitoring responding to the occurence of a fault, e.g. fault tolerance
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/02Knowledge representation
    • G06N5/022Knowledge engineering, knowledge acquisition

Similar Documents

Publication Publication Date Title
Gévay et al. Efficient control flow in dataflow systems: When ease-of-use meets high performance
Pedreira et al. Velox: meta's unified execution engine
KR101687213B1 (en) Dynamically loading graph-based computations
Weimer et al. Machine learning in ScalOps, a higher order cloud computing language
JP2018510445A (en) Domain-specific system and method for improving program performance
Rimsa et al. Practical dynamic reconstruction of control flow graphs
Chen et al. A refined decompiler to generate C code with high readability
Wen et al. Runtime value numbering: A profiling technique to pinpoint redundant computations
Zhang et al. UDF to SQL translation through compositional lazy inductive synthesis
Onizuka et al. Optimization for iterative queries on MapReduce
Wu et al. HDM: A composable framework for big data processing
Ormenisan et al. Implicit provenance for machine learning artifacts
Xiao et al. Accelerating patch validation for program repair with interception-based execution scheduling
Gévay et al. The power of nested parallelism in big data processing–hitting three flies with one slap–
Dörre et al. Modeling and optimizing MapReduce programs
Wagner et al. Incremental Fusion: Unifying Compiled and Vectorized Query Execution
Li et al. Graph neural networks based memory inefficiency detection using selective sampling
Guo et al. Graphspy: Fused program semantic embedding through graph neural networks for memory efficiency
Zou et al. Lachesis: Automated generation of persistent partitionings for big data applications
US9032372B2 (en) Runtime environment and method for non-invasive monitoring of software applications
Gévay et al. Labyrinth: Compiling imperative control flow to parallel dataflows
Biringa et al. Automated user experience testing through multi-dimensional performance impact analysis
Sun et al. Graph-based detection of library api imitations
Gévay et al. Imperative or Functional Control Flow Handling: Why not the Best of Both Worlds?
Wu et al. Composable and efficient functional big data processing framework