MY192169A - System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository
- Publication number
- MY192169A
- Authority
- MY
- Malaysia
- Prior art keywords
- knowledge base
- entities
- duplicates
- module
- production knowledge
Classifications
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
- G06F16/215—Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Devices For Executing Special Programs (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Disclosed is a system and method for managing one or more duplicate entities based on a relationship cardinality in a production knowledge base repository. The method first performs a first-level detection of duplicates in the existing data of the production knowledge base repository through an object harmonisation module (202). The first-level detection identifies duplicates of one or more attribute objects within a specific entity, and the object harmonisation module (202) applies a sanitization and standardization operation to the identified attribute objects. The method then performs a second-level detection of duplicates between entities of a specific concept through a homogeneity recognition module (204), which identifies duplicates according to the base attributes of the specific concept, based on a predefined similarity threshold. Finally, the method enables a user to determine the similarity of the entities and to merge the similar entities through an entity conflation and merging module (206). (FIG. 2)
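The two detection levels and the merge step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names (`harmonise`, `similarity`, `candidate_duplicates`, `merge`), the entity layout (a dict mapping attribute names to lists of string values), the use of `difflib.SequenceMatcher` as the similarity measure, and the `0.85` threshold are all assumptions chosen for illustration. The relationship-cardinality aspect named in the title is not modelled here.

```python
from difflib import SequenceMatcher
from itertools import combinations


def harmonise(entity):
    """First level (assumed behaviour): sanitise and standardise attribute
    values, then drop duplicate values inside a single entity."""
    cleaned = {}
    for attr, values in entity.items():
        seen = []
        for value in values:
            # Trim, lower-case, and collapse runs of whitespace.
            norm = " ".join(value.strip().lower().split())
            if norm and norm not in seen:
                seen.append(norm)
        cleaned[attr] = seen
    return cleaned


def similarity(a, b, base_attributes):
    """Average string similarity over the base attributes of a concept."""
    scores = []
    for attr in base_attributes:
        left = " ".join(a.get(attr, []))
        right = " ".join(b.get(attr, []))
        scores.append(SequenceMatcher(None, left, right).ratio())
    return sum(scores) / len(scores) if scores else 0.0


def candidate_duplicates(entities, base_attributes, threshold=0.85):
    """Second level (assumed behaviour): compare every pair of entities of one
    concept and keep the pairs whose score reaches the threshold."""
    pairs = []
    for (id_a, a), (id_b, b) in combinations(entities.items(), 2):
        score = similarity(a, b, base_attributes)
        if score >= threshold:
            pairs.append((id_a, id_b, score))
    return pairs


def merge(a, b):
    """Union the attribute values of two entities, keeping first-seen order.
    In the described method this runs only after a user confirms the pair."""
    merged = {}
    for attr in list(a) + [k for k in b if k not in a]:
        merged[attr] = list(dict.fromkeys(a.get(attr, []) + b.get(attr, [])))
    return merged
```

In this sketch the candidate pairs returned by `candidate_duplicates` would be shown to a user, and `merge` would be called only for the pairs the user accepts, mirroring the user-driven conflation step in the abstract.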
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| MYPI2018001926A MY192169A (en) | 2018-11-14 | 2018-11-14 | System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository |
| PCT/MY2019/050093 WO2020101478A1 (en) | 2018-11-14 | 2019-11-14 | System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| MYPI2018001926A MY192169A (en) | 2018-11-14 | 2018-11-14 | System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| MY192169A (en) | 2022-08-03 |
Family
ID=70730534
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| MYPI2018001926A MY192169A (en) | 2018-11-14 | 2018-11-14 | System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository |
Country Status (2)
| Country | Link |
|---|---|
| MY (1) | MY192169A (en) |
| WO (1) | WO2020101478A1 (en) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112001451A (en) * | 2020-08-27 | 2020-11-27 | 上海擎感智能科技有限公司 | Data redundancy processing method, system, medium and device |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2014012576A1 (en) * | 2012-07-16 | 2014-01-23 | Qatar Foundation | A method and system for integrating data into a database |
| KR101740317B1 (en) * | 2013-04-10 | 2017-05-26 | 한국전자통신연구원 | Method and apparatus for memory management |
| US20150081668A1 (en) * | 2013-09-13 | 2015-03-19 | Nec Laboratories America, Inc. | Systems and methods for tuning multi-store systems to speed up big data query workload |
| KR20150121505A (en) * | 2014-04-21 | 2015-10-29 | 삼성전자주식회사 | Method and device for data deduplication |
- 2018-11-14: MY application MYPI2018001926A (MY192169A), status unknown
- 2019-11-14: WO application PCT/MY2019/050093 (WO2020101478A1), status not active (ceased)
Also Published As
| Publication number | Publication date |
|---|---|
| WO2020101478A1 (en) | 2020-05-22 |
Similar Documents
| Publication | Title |
|---|---|
| US20210201168A1 (en) | Method and Apparatus for Outputting Information, Device and Storage Medium |
| US10878000B2 (en) | Extracting graph topology from distributed databases |
| CN104361140B (en) | Dynamic generation data model configuration device and method |
| CN111414352B (en) | Method and device for managing database information |
| CN107608732B (en) | Bug searching and positioning method based on bug knowledge graph |
| WO2019183483A3 (en) | Facilitating queries of encrypted sensitive data via encrypted variant data objects |
| US20130339385A1 (en) | Leveraging graph databases in a federated database system |
| MX391550B (en) | Method and system for information extraction from document images using conversational interface and database querying |
| US20170220606A1 (en) | Unified data model for integration between relational and non-relational databases |
| GB2574969A (en) | Systems and methods of matching style attributes |
| PH12022551096A1 (en) | Method and apparatus for managing iot device, and server and storage medium thereof |
| ZA202306267B (en) | Systems and methods for accessing data entities managed by a data processing system |
| WO2022227764A1 (en) | Event detection method and apparatus, electronic device, and readable storage medium |
| CN110083639A (en) | A kind of method and device that the data blood relationship based on clustering is intelligently traced to the source |
| CN109582831B (en) | Graph database management system supporting unstructured data storage and query |
| CN103234549B (en) | A kind of differential data generation method for upgrading map |
| CN112231417A (en) | Data classification method and device, electronic equipment and storage medium |
| CN112000773A (en) | Data association relation mining method based on search engine technology and application |
| CN104516976A (en) | Intellectual property infringement reminding system based on cloud database |
| CN111666419A (en) | Knowledge graph construction method and device for legal data |
| Chen et al. | Development of foundation models for Internet of Things |
| CN105095436B (en) | Data source data method for automatic modeling |
| CN112632106A (en) | Knowledge graph query method, device, equipment and storage medium |
| MY192169A (en) | System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository |
| CN110929120B (en) | Method and apparatus for managing technical metadata |