Automatic part-of-speech tagging for security vulnerability descriptions
Yitagesu et al., 2021
- Document ID
- 11612484094906539953
- Author
- Yitagesu S
- Zhang X
- Feng Z
- Li X
- Xing Z
- Publication year
- 2021
- Publication venue
- 2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR)
Snippet
In this paper, we study the problem of part-of-speech (POS) tagging for security vulnerability descriptions (SVD). In contrast to newswire articles, SVD often contains a high-level natural language description of the text composed of mixed language studded with codes, domain …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
- G06F17/2217—Character encodings
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/274—Grammatical analysis; Style critique
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/24—Editing, e.g. insert/delete
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/36—Preventing errors by testing or debugging software
- G06F11/3668—Software testing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/70—Software maintenance or management
Similar Documents
Publication | Title |
---|---|
Yitagesu et al. | Automatic part-of-speech tagging for security vulnerability descriptions |
Haddow et al. | Survey of low-resource machine translation |
Meng et al. | A semantic-aware representation framework for online log analysis |
US10169706B2 (en) | Corpus quality analysis |
Zhong et al. | Inferring resource specifications from natural language API documentation |
Fu et al. | WASTK: A weighted abstract syntax tree kernel method for source code plagiarism detection |
US7849399B2 (en) | Method and system for tracking authorship of content in data |
Guo et al. | Detecting and augmenting missing key aspects in vulnerability descriptions |
CN108459874B (en) | An automated code summarization method that integrates deep learning and natural language processing |
Zhou et al. | User review-based change file localization for mobile applications |
Liu et al. | Syntax and domain aware model for unsupervised program translation |
CN113010679A (en) | Question and answer pair generation method, device and equipment and computer readable storage medium |
Liguori et al. | Can we generate shellcodes via natural language? An empirical study |
Ciurumelea et al. | Suggesting comment completions for python using neural language models |
Guo et al. | Key aspects augmentation of vulnerability description based on multiple security databases |
Takerngsaksiri et al. | Syntax-aware on-the-fly code completion |
Orosz et al. | PurePos: An Open Source Morphological Disambiguator |
Huang et al. | Api entity and relation joint extraction from text via dynamic prompt-tuned language model |
Jiang et al. | Automated expansion of abbreviations based on semantic relation and transfer expansion |
Althebeiti et al. | Enriching vulnerability reports through automated and augmented description summarization |
Das et al. | Zero-shot learning for named entity recognition in software specification documents |
Zhang et al. | SecLMNER: A framework for enhanced named entity recognition in multi-source cybersecurity data using large language models |
Han et al. | Do chase your tail! missing key aspects augmentation in textual vulnerability descriptions of long-tail software through feature inference |
Wang et al. | Difftech: Differencing similar technologies from crowd-scale comparison discussions |
EP4369246A1 (en) | Translation review suitability assessment |