[go: up one dir, main page]

WO2010013473A1 - データ分類システム、データ分類方法、及びデータ分類プログラム - Google Patents

データ分類システム、データ分類方法、及びデータ分類プログラム Download PDF

Info

Publication number
WO2010013473A1
WO2010013473A1 PCT/JP2009/003602 JP2009003602W WO2010013473A1 WO 2010013473 A1 WO2010013473 A1 WO 2010013473A1 JP 2009003602 W JP2009003602 W JP 2009003602W WO 2010013473 A1 WO2010013473 A1 WO 2010013473A1
Authority
WO
WIPO (PCT)
Prior art keywords
classification
data classification
data
classification items
category axis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/JP2009/003602
Other languages
English (en)
French (fr)
Inventor
水口弘紀
立石健二
細見格
久寿居大
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to JP2010522626A priority Critical patent/JP5423676B2/ja
Priority to US13/056,031 priority patent/US9361367B2/en
Publication of WO2010013473A1 publication Critical patent/WO2010013473A1/ja
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/353Clustering; Classification into predefined classes

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

 本発明に係るデータ分類システムは、階層的な分類項目と、当該分類項目に対応するデータ群とに基づいて、当該データ群に対応する分類項目を複数選択して分類軸として出力する。該データ分類システムは、基準項目蓄積手段と、分類軸候補作成手段と、優先度計算手段と、を備える。基準項目蓄積手段は、分類項目を選択するための基準項目となる分類項目群を予め蓄積する。分類軸候補作成手段は、基準項目の子孫の分類項目においてデータに少なくとも1つ対応する分類項目の組合せに基づいて、分類軸候補を作成する。優先度計算手段は、分類階層における分類項目間の階層的な距離に基づいて、前記分類軸候補作成手段が作成した分類軸候補について、当該分類軸候補の優先度を計算する。
PCT/JP2009/003602 2008-07-30 2009-07-29 データ分類システム、データ分類方法、及びデータ分類プログラム Ceased WO2010013473A1 (ja)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2010522626A JP5423676B2 (ja) 2008-07-30 2009-07-29 データ分類システム、データ分類方法、及びデータ分類プログラム
US13/056,031 US9361367B2 (en) 2008-07-30 2009-07-29 Data classifier system, data classifier method and data classifier program

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2008-195896 2008-07-30
JP2008195896 2008-07-30

Publications (1)

Publication Number Publication Date
WO2010013473A1 true WO2010013473A1 (ja) 2010-02-04

Family

ID=41610187

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2009/003602 Ceased WO2010013473A1 (ja) 2008-07-30 2009-07-29 データ分類システム、データ分類方法、及びデータ分類プログラム

Country Status (3)

Country Link
US (1) US9361367B2 (ja)
JP (1) JP5423676B2 (ja)
WO (1) WO2010013473A1 (ja)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2011253449A (ja) * 2010-06-03 2011-12-15 Toshiba Corp 文書分析装置およびプログラム
JP5319829B1 (ja) * 2012-07-31 2013-10-16 楽天株式会社 情報処理装置、情報処理方法及び情報処理プログラム
JP2014219984A (ja) * 2013-05-09 2014-11-20 鴻海精密工業股▲ふん▼有限公司 ファイル分類システム及びその分類方法
WO2015025386A1 (ja) 2013-08-21 2015-02-26 株式会社日立製作所 データ処理システム、データ処理方法およびデータ処理装置
KR20210071147A (ko) * 2019-12-05 2021-06-16 한양대학교 산학협력단 기술 도메인에 대한 기술 발전도 생성 방법 및 장치

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010013472A1 (ja) 2008-07-30 2010-02-04 日本電気株式会社 データ分類システム、データ分類方法、及びデータ分類プログラム
AU2010202901B2 (en) 2010-07-08 2016-04-14 Patent Analytics Holding Pty Ltd A system, method and computer program for preparing data for analysis
US8639695B1 (en) * 2010-07-08 2014-01-28 Patent Analytics Holding Pty Ltd System, method and computer program for analysing and visualising data
US8650198B2 (en) * 2011-08-15 2014-02-11 Lockheed Martin Corporation Systems and methods for facilitating the gathering of open source intelligence
KR101510647B1 (ko) * 2011-10-07 2015-04-10 한국전자통신연구원 이슈 템플릿 추출 기반의 웹 동향 분석 방법 및 장치
US9053140B2 (en) * 2012-02-03 2015-06-09 Apple Inc. Enhanced B-trees with record merging
CN103366013B (zh) * 2013-07-29 2016-03-02 腾讯科技(深圳)有限公司 一种数据处理的方法及服务器
US10748116B2 (en) * 2015-10-16 2020-08-18 Dell Products L.P. Test vector generation from documentation
US10725800B2 (en) 2015-10-16 2020-07-28 Dell Products L.P. User-specific customization for command interface
US10432484B2 (en) * 2016-06-13 2019-10-01 Silver Peak Systems, Inc. Aggregating select network traffic statistics

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0991314A (ja) * 1995-07-14 1997-04-04 Fuji Xerox Co Ltd 情報探索装置
JP2005038162A (ja) * 2003-07-14 2005-02-10 Fuji Xerox Co Ltd 単語間類似度計算装置、方法およびプログラム
JP2006139518A (ja) * 2004-11-11 2006-06-01 Nec Corp 文書クラスタリング装置、クラスタリング方法及びクラスタリングプログラム

Family Cites Families (73)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5251144A (en) * 1991-04-18 1993-10-05 Texas Instruments Incorporated System and method utilizing a real time expert system for tool life prediction and tool wear diagnosis
JPH0573615A (ja) 1991-09-17 1993-03-26 Kobe Nippon Denki Software Kk 階層構造型情報の管理方式
JP3096353B2 (ja) 1992-04-22 2000-10-10 株式会社戸上電機製作所 データの分類方法
US5325445A (en) * 1992-05-29 1994-06-28 Eastman Kodak Company Feature classification using supervised statistical pattern recognition
EP0582885A3 (en) * 1992-08-05 1997-07-02 Siemens Ag Procedure to classify field patterns
US5353346A (en) * 1992-12-22 1994-10-04 Mpr Teltech, Limited Multi-frequency signal detector and classifier
US5640492A (en) * 1994-06-30 1997-06-17 Lucent Technologies Inc. Soft margin classifier
US5596993A (en) * 1994-09-21 1997-01-28 Beth Israel Hospital Fetal data processing system and method
US5561431A (en) * 1994-10-24 1996-10-01 Martin Marietta Corporation Wavelet transform implemented classification of sensor data
JPH0981585A (ja) 1995-09-14 1997-03-28 Ricoh Co Ltd 電子ファイリング装置
JP3670076B2 (ja) 1996-03-07 2005-07-13 松下電器産業株式会社 データ表示装置
US5765029A (en) * 1996-05-08 1998-06-09 Xerox Corporation Method and system for fuzzy image classification
US5983170A (en) * 1996-06-25 1999-11-09 Continuum Software, Inc System and method for generating semantic analysis of textual information
US5930392A (en) 1996-07-12 1999-07-27 Lucent Technologies Inc. Classification technique using random decision forests
US5933822A (en) * 1997-07-22 1999-08-03 Microsoft Corporation Apparatus and methods for an information retrieval system that employs natural language processing of search results to improve overall precision
US5956721A (en) * 1997-09-19 1999-09-21 Microsoft Corporation Method and computer program product for classifying network communication packets processed in a network stack
US6185328B1 (en) 1998-01-21 2001-02-06 Xerox Corporation Method and system for classifying and processing of pixels of image data
US6229923B1 (en) 1998-01-21 2001-05-08 Xerox Corporation Method and system for classifying and processing of pixels of image data
JPH11306187A (ja) 1998-04-20 1999-11-05 Nippon Telegr & Teleph Corp <Ntt> カテゴリ付文書の検索結果の提示処理方法およびその装置
US6304773B1 (en) 1998-05-21 2001-10-16 Medtronic Physio-Control Manufacturing Corp. Automatic detection and reporting of cardiac asystole
US6192360B1 (en) 1998-06-23 2001-02-20 Microsoft Corporation Methods and apparatus for classifying text and for building a text classifier
JP3665480B2 (ja) * 1998-06-24 2005-06-29 富士通株式会社 文書整理装置および方法
US6243670B1 (en) * 1998-09-02 2001-06-05 Nippon Telegraph And Telephone Corporation Method, apparatus, and computer readable medium for performing semantic analysis and generating a semantic structure having linked frames
US6185336B1 (en) 1998-09-23 2001-02-06 Xerox Corporation Method and system for classifying a halftone pixel based on noise injected halftone frequency estimation
US6421683B1 (en) 1999-03-31 2002-07-16 Verizon Laboratories Inc. Method and product for performing data transfer in a computer system
US6907566B1 (en) 1999-04-02 2005-06-14 Overture Services, Inc. Method and system for optimum placement of advertisements on a webpage
AU5448500A (en) * 1999-05-26 2000-12-28 Alok Batra Network element management system
US7363359B1 (en) * 1999-05-26 2008-04-22 Fujitsu Limited Element management system with automatic remote backup of network elements' local storage
US7185075B1 (en) * 1999-05-26 2007-02-27 Fujitsu Limited Element management system with dynamic database updates based on parsed snooping
US6490556B2 (en) 1999-05-28 2002-12-03 Intel Corporation Audio classifier for half duplex communication
JP2001043221A (ja) * 1999-07-29 2001-02-16 Matsushita Electric Ind Co Ltd 中国語単語分割装置
US6671680B1 (en) * 2000-01-28 2003-12-30 Fujitsu Limited Data mining apparatus and storage medium storing therein data mining processing program
JP2001216306A (ja) 2000-01-31 2001-08-10 Hitachi Ltd カテゴリ作成装置
US7325201B2 (en) 2000-05-18 2008-01-29 Endeca Technologies, Inc. System and method for manipulating content in a hierarchical data-driven search and navigation system
US7113934B2 (en) * 2000-05-25 2006-09-26 Fujitsu Limited Element management system with adaptive interfacing selected by last previous full-qualified managed level
US6459974B1 (en) 2001-05-30 2002-10-01 Eaton Corporation Rules-based occupant classification system for airbag deployment
US7028024B1 (en) 2001-07-20 2006-04-11 Vignette Corporation Information retrieval from a collection of information objects tagged with hierarchical keywords
AUPR824401A0 (en) 2001-10-15 2001-11-08 Silverbrook Research Pty. Ltd. Methods and systems (npw002)
JP2003141159A (ja) 2001-11-06 2003-05-16 Fujitsu Ltd 距離インデクスを用いた検索装置および方法
JP4404533B2 (ja) * 2002-08-30 2010-01-27 株式会社ニデック 眼内レンズの製造方法及び該方法にて得られる眼内レンズ
JP2004110161A (ja) * 2002-09-13 2004-04-08 Fuji Xerox Co Ltd テキスト文比較装置
JP4233836B2 (ja) 2002-10-16 2009-03-04 インターナショナル・ビジネス・マシーンズ・コーポレーション 文書自動分類システム、不要語判定方法、文書自動分類方法、およびプログラム
JP2005036162A (ja) 2003-07-18 2005-02-10 Sumitomo Bakelite Co Ltd 熱硬化性樹脂組成物
JP4451624B2 (ja) 2003-08-19 2010-04-14 富士通株式会社 情報体系対応付け装置および対応付け方法
US7877238B2 (en) * 2003-09-12 2011-01-25 Sysmex Corporation Data classification supporting method, computer readable storage medium, and data classification supporting apparatus
US7577655B2 (en) * 2003-09-16 2009-08-18 Google Inc. Systems and methods for improving the ranking of news articles
KR20050045746A (ko) 2003-11-12 2005-05-17 삼성전자주식회사 계층 구조의 가변 블록 크기를 이용한 움직임 추정 방법및 장치
JP2005202535A (ja) 2004-01-14 2005-07-28 Hitachi Ltd 文書集計方法及び装置並びにそれらに用いるプログラムを記憶した媒体
JP2005267604A (ja) * 2004-02-18 2005-09-29 Fuji Xerox Co Ltd 動作分類支援装置および動作分類装置
US7428528B1 (en) 2004-03-31 2008-09-23 Endeca Technologies, Inc. Integrated application for manipulating content in a hierarchical data-driven search and navigation system
US7710897B2 (en) * 2004-08-26 2010-05-04 Fujitsu Limited Automatic discovery of logical network elements from existing links in a network
US7693683B2 (en) * 2004-11-25 2010-04-06 Sharp Kabushiki Kaisha Information classifying device, information classifying method, information classifying program, information classifying system
JP2006171931A (ja) 2004-12-14 2006-06-29 Mitsubishi Electric Corp テキストマイニング装置およびテキストマイニングプログラム
TW200622402A (en) 2004-12-28 2006-07-01 Innolux Display Corp Liquid crystal panel and its cutting method
JP2006285419A (ja) * 2005-03-31 2006-10-19 Sony Corp 情報処理装置および方法、並びにプログラム
US7912871B2 (en) 2005-07-27 2011-03-22 Technion Research And Development Foundation Ltd. Incremental validation of key and keyref constraints
JP4992715B2 (ja) * 2005-08-04 2012-08-08 日本電気株式会社 データ処理装置、データ処理方法、データ処理プログラム
JP2007102309A (ja) 2005-09-30 2007-04-19 Mitsubishi Electric Corp 自動分類装置
FR2902913A1 (fr) 2006-06-21 2007-12-28 France Telecom Procede et dispositif de codage d'une note de similarite semantique et spatiale entre concepts d'une ontologie memorisee sous forme de treillis numerote hierarchiquement
US7873616B2 (en) 2006-07-07 2011-01-18 Ecole Polytechnique Federale De Lausanne Methods of inferring user preferences using ontologies
US8001130B2 (en) * 2006-07-25 2011-08-16 Microsoft Corporation Web object retrieval based on a language model
US7720830B2 (en) * 2006-07-31 2010-05-18 Microsoft Corporation Hierarchical conditional random fields for web extraction
US7921106B2 (en) 2006-08-03 2011-04-05 Microsoft Corporation Group-by attribute value in search results
US7912875B2 (en) 2006-10-31 2011-03-22 Business Objects Software Ltd. Apparatus and method for filtering data using nested panels
US8065307B2 (en) * 2006-12-20 2011-11-22 Microsoft Corporation Parsing, analysis and scoring of document content
WO2008092147A2 (en) * 2007-01-26 2008-07-31 Information Resources, Inc. Analytic platform
US20080221983A1 (en) 2007-03-06 2008-09-11 Siarhei Ausiannik Network information distribution system and a method of advertising and search for supply and demand of products/goods/services in any geographical location
CN101295305B (zh) * 2007-04-25 2012-10-31 富士通株式会社 图像检索装置
US8229881B2 (en) * 2007-07-16 2012-07-24 Siemens Medical Solutions Usa, Inc. System and method for creating and searching medical ontologies
KR100930799B1 (ko) 2007-09-17 2009-12-09 한국전자통신연구원 자동화된 클러스터링 방법 및 이를 이용한 이동통신환경에서 다중 경로의 클러스터링 방법 및 장치
JP4998237B2 (ja) * 2007-12-06 2012-08-15 富士通株式会社 論理構造モデル作成支援プログラム、論理構造モデル作成支援装置および論理構造モデル作成支援方法
WO2010013472A1 (ja) 2008-07-30 2010-02-04 日本電気株式会社 データ分類システム、データ分類方法、及びデータ分類プログラム
US9378202B2 (en) 2010-03-26 2016-06-28 Virtuoz Sa Semantic clustering

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0991314A (ja) * 1995-07-14 1997-04-04 Fuji Xerox Co Ltd 情報探索装置
JP2005038162A (ja) * 2003-07-14 2005-02-10 Fuji Xerox Co Ltd 単語間類似度計算装置、方法およびプログラム
JP2006139518A (ja) * 2004-11-11 2006-06-01 Nec Corp 文書クラスタリング装置、クラスタリング方法及びクラスタリングプログラム

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
AKIHIRO INOKUCHI ET AL.: "Text Bunseki no Tameno OLAP System", DATABASE TO WEB JOHO SYSTEM NI KANSURU SYMPOSIUM RONBUNSHU, vol. 2006, no. 16, - 30 November 2006 (2006-11-30), pages 251 - 258 *
KATSUJI BESSHO ET AL.: "Tangokan no Kaiso Kankei ni Motozuku Text Bunrui Hoshiki", IEICE TECHNICAL REPORT, 2007, PRMU2007-15, MI2007-15, May 2007 (2007-05-01), pages 79 - 84 *
WISAM DAKKA ET AL.: "Automatic Construction of Multifaceted Browsing Interfaces", PROC. OF CIKM'05, 5 November 2005 (2005-11-05), pages 768 - 775, Retrieved from the Internet <URL:http://pages.stern.nyu.edu/ "panos/publications/cikm2005.pdf> *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2011253449A (ja) * 2010-06-03 2011-12-15 Toshiba Corp 文書分析装置およびプログラム
JP5319829B1 (ja) * 2012-07-31 2013-10-16 楽天株式会社 情報処理装置、情報処理方法及び情報処理プログラム
WO2014020929A1 (ja) * 2012-07-31 2014-02-06 楽天株式会社 情報処理装置、情報処理方法及び情報処理プログラム
KR101427549B1 (ko) * 2012-07-31 2014-09-02 라쿠텐 인코포레이티드 정보 처리 장치 및 정보 처리 방법
JP2014219984A (ja) * 2013-05-09 2014-11-20 鴻海精密工業股▲ふん▼有限公司 ファイル分類システム及びその分類方法
WO2015025386A1 (ja) 2013-08-21 2015-02-26 株式会社日立製作所 データ処理システム、データ処理方法およびデータ処理装置
KR20210071147A (ko) * 2019-12-05 2021-06-16 한양대학교 산학협력단 기술 도메인에 대한 기술 발전도 생성 방법 및 장치
KR102351854B1 (ko) * 2019-12-05 2022-01-14 한양대학교 산학협력단 기술 도메인에 대한 기술 발전도 생성 방법 및 장치

Also Published As

Publication number Publication date
JP5423676B2 (ja) 2014-02-19
JPWO2010013473A1 (ja) 2012-01-05
US9361367B2 (en) 2016-06-07
US20110179037A1 (en) 2011-07-21

Similar Documents

Publication Publication Date Title
WO2010013473A1 (ja) データ分類システム、データ分類方法、及びデータ分類プログラム
Chen et al. A sensitivity analysis algorithm for hierarchical decision models
WO2010013472A1 (ja) データ分類システム、データ分類方法、及びデータ分類プログラム
WO2010120929A3 (en) Generating user-customized search results and building a semantics-enhanced search engine
WO2014137672A3 (en) Probabilistic parsing
WO2009111127A3 (en) Location determination
CA2879417A1 (en) Structured search queries based on social-graph information
WO2013130630A3 (en) Listing data objects using a hierarchical dispersed storage index
WO2009060454A3 (en) Multi-point detection on a single-point detection digitizer
WO2013025874A3 (en) Page reporting
GB201217271D0 (en) Traffic sensor management
WO2012169807A3 (ko) 데이터 웨어하우스를 이용한 데이터베이스 구축 방법 및 그 시스템
Li et al. Robust conjugate duality for convex optimization under uncertainty with application to data classification
Gu et al. A hierarchical prototype-based approach for classification
Kwon et al. A new augmented Lyapunov–Krasovskii functional approach to exponential passivity for neural networks with time-varying delays
Kanbur et al. Life cycle integrated thermoeconomic assessment method for energy conversion systems
Liu et al. Robust stability analysis of generalized neural networks with multiple discrete delays and multiple distributed delays
WO2012125295A3 (en) Generic data exchange method using hierarchical routing
Lim et al. Design process for formwork system in tall building construction using quality function deployment and TRIZ
Ramnath et al. Application of Analytical Hierarchy Process to Select Lean–Kitting Assembly for an Engine Assembly
Weijie et al. Research on intermediate configurations of multi-step inverse approach in sheet metal forming
Tcvetkov et al. A modified hausdorff distance between intuitionistic fuzzy sets
Lewis et al. Level set methods for finding critical points of mountain pass type
Kim et al. The Analysis of Vibration on the Guide Rail Installed with Manufacturing System of the Smart Phone Lens
Zheng et al. Fuzzy random continuous review inventory model with imperfect quality

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09802722

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 13056031

Country of ref document: US

Ref document number: 2010522626

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 09802722

Country of ref document: EP

Kind code of ref document: A1