Eghbali et al., 2024 - Google Patents
De-hallucinator: Iterative grounding for llm-based code completionEghbali et al., 2024
View PDF- Document ID
- 909688515627380167
- Author
- Eghbali A
- Pradel M
- Publication year
- Publication venue
- arXiv preprint arXiv:2401.01701
External Links
Snippet
Large languages models (LLMs) trained on datasets of publicly available source code have established a new state-of-the-art in code completion. However, these models are mostly unaware of the code that already exists within a specific project, preventing the models from …
- 239000008186 active pharmaceutical agent 0 abstract description 172
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
- G06F17/30389—Query formulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformations of program code
- G06F8/41—Compilation
- G06F8/43—Checking; Contextual analysis
- G06F8/436—Semantic checking
- G06F8/437—Type checking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/36—Preventing errors by testing or debugging software
- G06F11/3668—Software testing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/70—Software maintenance or management
- G06F8/75—Structural analysis for program understanding
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/36—Preventing errors by testing or debugging software
- G06F11/362—Software debugging
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/30—Creation or generation of source code
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Eghbali et al. | De-hallucinator: Iterative grounding for llm-based code completion | |
| Raghothaman et al. | SWIM: synthesizing what I mean: code search and idiomatic snippet synthesis | |
| DeFreez et al. | Path-based function embedding and its application to error-handling specification mining | |
| Zhang et al. | Pydex: Repairing bugs in introductory python assignments using llms | |
| Wong et al. | Clocom: Mining existing source code for automatic comment generation | |
| Wei et al. | Typet5: Seq2seq type inference using static analysis | |
| US20070050343A1 (en) | Semantic-based query techniques for source code | |
| Dilhara et al. | Unprecedented code change automation: The fusion of llms and transformation by example | |
| Xiao et al. | Bug localization with semantic and structural features using convolutional neural network and cascade forest | |
| Eghbali et al. | De-hallucinator: Mitigating llm hallucinations in code generation tasks via iterative grounding | |
| Loyola et al. | Bug localization by learning to rank and represent bug inducing changes | |
| Xia et al. | An effective change recommendation approach for supplementary bug fixes | |
| Xu et al. | Lmpa: Improving decompilation by synergy of large language model and program analysis | |
| Cheng et al. | A similarity integration method based information retrieval and word embedding in bug localization | |
| Bui et al. | Rambo: Enhancing rag-based repository-level method body completion | |
| Chang et al. | Bridging bug localization and issue fixing: A hierarchical localization framework leveraging large language models | |
| Wang et al. | Pre-implementation method name prediction for object-oriented programming | |
| Sohrabizadeh et al. | Nemotron-cortexa: Enhancing llm agents for software engineering tasks via improved localization and solution diversity | |
| Santos et al. | Finding and correcting syntax errors using recurrent neural networks | |
| Zhang et al. | Fusing multi-abstraction vector space models for concern localization | |
| Stepanov et al. | Type-centric Kotlin compiler fuzzing: Preserving test program correctness by preserving types | |
| Zhang et al. | Exploring metadata in bug reports for bug localization | |
| Wang et al. | Towards an understanding of context utilization in code intelligence | |
| WO2025075780A1 (en) | Automatic binary code understanding | |
| Akram et al. | Llm-based method name suggestion with automatically generated context-rich prompts |