[go: up one dir, main page]

MY192169A - System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository - Google Patents

System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository

Info

Publication number
MY192169A
MY192169A MYPI2018001926A MYPI2018001926A MY192169A MY 192169 A MY192169 A MY 192169A MY PI2018001926 A MYPI2018001926 A MY PI2018001926A MY PI2018001926 A MYPI2018001926 A MY PI2018001926A MY 192169 A MY192169 A MY 192169A
Authority
MY
Malaysia
Prior art keywords
knowledge base
entities
duplicates
module
production knowledge
Prior art date
Application number
MYPI2018001926A
Inventor
Binti Mohamed Sa'niah
Zarina Binti Ishak Ros'aleza
Stella Tabora Domingo Ma
Wooi Kin Goon
Raziq Ramesh Bin Abdullah Muhammad
Original Assignee
Mimos Berhad
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mimos Berhad filed Critical Mimos Berhad
Priority to MYPI2018001926A priority Critical patent/MY192169A/en
Priority to PCT/MY2019/050093 priority patent/WO2020101478A1/en
Publication of MY192169A publication Critical patent/MY192169A/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Devices For Executing Special Programs (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Disclosed is a system and method for managing one or more duplicate entities based on a relationship cardinality in a production knowledge base repository. The method comprises steps of performing a first level detection of duplicates in existing data present in the production knowledge base repository through an object harmonisation module (202). The first level detection identifies duplicates of one or more attribute objects within a specific entity. The object harmonisation module (202) implements a sanitization and standardization operation on the identified attribute objects. Then the method performs a second level detection of duplicates between entities of a specific concept through a homogeneity recognition module (204). The homogeneity recognition module (204) identifies duplicates according to base-attributes of the specific concept based on a predefined similarity threshold. The method then enables a user to determine the similarity of the entities and further enables the user to merge the similar entities through an entity conflation and merging module (206). (FIG. 2)
MYPI2018001926A 2018-11-14 2018-11-14 System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository MY192169A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
MYPI2018001926A MY192169A (en) 2018-11-14 2018-11-14 System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository
PCT/MY2019/050093 WO2020101478A1 (en) 2018-11-14 2019-11-14 System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
MYPI2018001926A MY192169A (en) 2018-11-14 2018-11-14 System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository

Publications (1)

Publication Number Publication Date
MY192169A true MY192169A (en) 2022-08-03

Family

ID=70730534

Family Applications (1)

Application Number Title Priority Date Filing Date
MYPI2018001926A MY192169A (en) 2018-11-14 2018-11-14 System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository

Country Status (2)

Country Link
MY (1) MY192169A (en)
WO (1) WO2020101478A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112001451A (en) * 2020-08-27 2020-11-27 上海擎感智能科技有限公司 Data redundancy processing method, system, medium and device

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014012576A1 (en) * 2012-07-16 2014-01-23 Qatar Foundation A method and system for integrating data into a database
KR101740317B1 (en) * 2013-04-10 2017-05-26 한국전자통신연구원 Method and apparatus for memory management
US20150081668A1 (en) * 2013-09-13 2015-03-19 Nec Laboratories America, Inc. Systems and methods for tuning multi-store systems to speed up big data query workload
KR20150121505A (en) * 2014-04-21 2015-10-29 삼성전자주식회사 Method and device for data deduplication

Also Published As

Publication number Publication date
WO2020101478A1 (en) 2020-05-22

Similar Documents

Publication Publication Date Title
US20210201168A1 (en) Method and Apparatus for Outputting Information, Device and Storage Medium
US10878000B2 (en) Extracting graph topology from distributed databases
CN104361140B (en) Dynamic generation data model configuration device and method
CN111414352B (en) Method and device for managing database information
CN107608732B (en) Bug searching and positioning method based on bug knowledge graph
WO2019183483A3 (en) Facilitating queries of encrypted sensitive data via encrypted variant data objects
US20130339385A1 (en) Leveraging graph databases in a federated database system
MX391550B (en) Method and system for information extraction from document images using conversational interface and database querying
US20170220606A1 (en) Unified data model for integration between relational and non-relational databases
GB2574969A (en) Systems and methods of matching style attributes
PH12022551096A1 (en) Method and apparatus for managing iot device, and server and storage medium thereof
ZA202306267B (en) Systems and methods for accessing data entities managed by a data processing system
WO2022227764A1 (en) Event detection method and apparatus, electronic device, and readable storage medium
CN110083639A (en) A kind of method and device that the data blood relationship based on clustering is intelligently traced to the source
CN109582831B (en) Graph database management system supporting unstructured data storage and query
CN103234549B (en) A kind of differential data generation method for upgrading map
CN112231417A (en) Data classification method and device, electronic equipment and storage medium
CN112000773A (en) Data association relation mining method based on search engine technology and application
CN104516976A (en) Intellectual property infringement reminding system based on cloud database
CN111666419A (en) Knowledge graph construction method and device for legal data
Chen et al. Development of foundation models for Internet of Things
CN105095436B (en) Data source data method for automatic modeling
CN112632106A (en) Knowledge graph query method, device, equipment and storage medium
MY192169A (en) System and method for managing duplicate entities based on a relationship cardinality in production knowledge base repository
CN110929120B (en) Method and apparatus for managing technical metadata