Eghbali et al., 2024 - Google Patents

De-hallucinator: Iterative grounding for llm-based code completion

Document ID: 909688515627380167
Author: Eghbali A; Pradel M
Publication year: 2024
Publication venue: arXiv preprint arXiv:2401.01701

Snippet

Large language models (LLMs) trained on datasets of publicly available source code have established a new state of the art in code completion. However, these models are mostly unaware of the code that already exists within a specific project, preventing the models from …

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
- G06F17/30389—Query formulation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformations of program code
- G06F8/41—Compilation
- G06F8/43—Checking; Contextual analysis
- G06F8/436—Semantic checking
- G06F8/437—Type checking
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/36—Preventing errors by testing or debugging software
- G06F11/3668—Software testing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/70—Software maintenance or management
- G06F8/75—Structural analysis for program understanding
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/36—Preventing errors by testing or debugging software
- G06F11/362—Software debugging
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/30—Creation or generation of source code
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management

Similar Documents

Publication	Publication Date	Title
Eghbali et al.	2024	De-hallucinator: Iterative grounding for llm-based code completion
Raghothaman et al.	2016	SWIM: synthesizing what I mean: code search and idiomatic snippet synthesis
DeFreez et al.	2018	Path-based function embedding and its application to error-handling specification mining
Zhang et al.	2024	Pydex: Repairing bugs in introductory python assignments using llms
Wong et al.	2015	Clocom: Mining existing source code for automatic comment generation
Wei et al.	2023	Typet5: Seq2seq type inference using static analysis
US20070050343A1 (en)	2007-03-01	Semantic-based query techniques for source code
Dilhara et al.	2024	Unprecedented code change automation: The fusion of llms and transformation by example
Xiao et al.	2018	Bug localization with semantic and structural features using convolutional neural network and cascade forest
Eghbali et al.	2024	De-hallucinator: Mitigating llm hallucinations in code generation tasks via iterative grounding
Loyola et al.	2018	Bug localization by learning to rank and represent bug inducing changes
Xia et al.	2017	An effective change recommendation approach for supplementary bug fixes
Xu et al.	2023	Lmpa: Improving decompilation by synergy of large language model and program analysis
Cheng et al.	2020	A similarity integration method based information retrieval and word embedding in bug localization
Bui et al.	2024	Rambo: Enhancing rag-based repository-level method body completion
Chang et al.	2025	Bridging bug localization and issue fixing: A hierarchical localization framework leveraging large language models
Wang et al.	2023	Pre-implementation method name prediction for object-oriented programming
Sohrabizadeh et al.	2025	Nemotron-cortexa: Enhancing llm agents for software engineering tasks via improved localization and solution diversity
Santos et al.	2017	Finding and correcting syntax errors using recurrent neural networks
Zhang et al.	2018	Fusing multi-abstraction vector space models for concern localization
Stepanov et al.	2021	Type-centric Kotlin compiler fuzzing: Preserving test program correctness by preserving types
Zhang et al.	2017	Exploring metadata in bug reports for bug localization
Wang et al.	2025	Towards an understanding of context utilization in code intelligence
WO2025075780A1 (en)	2025-04-10	Automatic binary code understanding
Akram et al.	2025	Llm-based method name suggestion with automatically generated context-rich prompts