[go: up one dir, main page]

Eghbali et al., 2024 - Google Patents

De-hallucinator: Iterative grounding for llm-based code completion

Eghbali et al., 2024

View PDF
Document ID
909688515627380167
Author
Eghbali A
Pradel M
Publication year
Publication venue
arXiv preprint arXiv:2401.01701

External Links

Snippet

Large languages models (LLMs) trained on datasets of publicly available source code have established a new state-of-the-art in code completion. However, these models are mostly unaware of the code that already exists within a specific project, preventing the models from …
Continue reading at jespereggers.com (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30386Retrieval requests
    • G06F17/30389Query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F17/30634Querying
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/40Transformations of program code
    • G06F8/41Compilation
    • G06F8/43Checking; Contextual analysis
    • G06F8/436Semantic checking
    • G06F8/437Type checking
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/36Preventing errors by testing or debugging software
    • G06F11/3668Software testing
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30861Retrieval from the Internet, e.g. browsers
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/70Software maintenance or management
    • G06F8/75Structural analysis for program understanding
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/36Preventing errors by testing or debugging software
    • G06F11/362Software debugging
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/30Creation or generation of source code
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/02Knowledge representation
    • G06N5/022Knowledge engineering, knowledge acquisition
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management

Similar Documents

Publication Publication Date Title
Eghbali et al. De-hallucinator: Iterative grounding for llm-based code completion
Raghothaman et al. SWIM: synthesizing what I mean: code search and idiomatic snippet synthesis
DeFreez et al. Path-based function embedding and its application to error-handling specification mining
Zhang et al. Pydex: Repairing bugs in introductory python assignments using llms
Wong et al. Clocom: Mining existing source code for automatic comment generation
Wei et al. Typet5: Seq2seq type inference using static analysis
US20070050343A1 (en) Semantic-based query techniques for source code
Dilhara et al. Unprecedented code change automation: The fusion of llms and transformation by example
Xiao et al. Bug localization with semantic and structural features using convolutional neural network and cascade forest
Eghbali et al. De-hallucinator: Mitigating llm hallucinations in code generation tasks via iterative grounding
Loyola et al. Bug localization by learning to rank and represent bug inducing changes
Xia et al. An effective change recommendation approach for supplementary bug fixes
Xu et al. Lmpa: Improving decompilation by synergy of large language model and program analysis
Cheng et al. A similarity integration method based information retrieval and word embedding in bug localization
Bui et al. Rambo: Enhancing rag-based repository-level method body completion
Chang et al. Bridging bug localization and issue fixing: A hierarchical localization framework leveraging large language models
Wang et al. Pre-implementation method name prediction for object-oriented programming
Sohrabizadeh et al. Nemotron-cortexa: Enhancing llm agents for software engineering tasks via improved localization and solution diversity
Santos et al. Finding and correcting syntax errors using recurrent neural networks
Zhang et al. Fusing multi-abstraction vector space models for concern localization
Stepanov et al. Type-centric Kotlin compiler fuzzing: Preserving test program correctness by preserving types
Zhang et al. Exploring metadata in bug reports for bug localization
Wang et al. Towards an understanding of context utilization in code intelligence
WO2025075780A1 (en) Automatic binary code understanding
Akram et al. Llm-based method name suggestion with automatically generated context-rich prompts