JP2000048024A

JP2000048024A - Structured document processing device

Info

Publication number: JP2000048024A
Application number: JP10212814A
Authority: JP
Inventors: Atsushi Shimada; 敦史島田; Hitoshi Hayakawa; 仁志早川; Tomoyuki Umehara; 智之梅原
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1998-07-28
Filing date: 1998-07-28
Publication date: 2000-02-18

Abstract

PROBLEM TO BE SOLVED: To reduce the user's workload by automating the routine processing that follows the creation or editing of a tagged document. SOLUTION: The apparatus is provided with means 111 for monitoring the start and end of document editing; means 112 for analyzing and extracting, when editing of a document ends, the information necessary for subsequent processing by focusing on the tag portion of the edited document or a related document; and means 114 for automatically executing, based on the analysis and extraction result, prescribed processing on documents stored in a document database, or distribution of the created document.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

TECHNICAL FIELD The present invention relates to an apparatus for editing and managing tagged structured documents such as SGML, XML, and HTML documents, and in particular to a structured document processing apparatus that reduces the user's workload by automatically performing the subsequent routine processing when document editing ends.

[0002]

DESCRIPTION OF THE RELATED ART When an existing tagged document is changed, the change may affect other tagged documents or other data, depending on what was changed and where. For example, when the edited document is referenced from other documents, it may, depending on the content of the change, no longer be appropriate to reference it after the change. Furthermore, if the portion that should be referenced has been deleted, the reference relationship no longer holds. Conventionally, in such cases, there has been no means of reporting the effect on the other documents involved in the reference at the time of the deletion or change, even though the referenced portion was deleted or changed. Moreover, even if a person noticed the change, for example by opening the document, that person had to judge what was changed and where, and then delete or change the reference relationships between the documents by hand.

[0003] Documents are also created and edited for some purpose, and subsequent processing of a document may be performed according to that purpose. For example, when minutes have been edited, they must be distributed to the meeting attendees after editing ends. Conventionally, a person distributed the minutes to the intended recipients after editing ended, either by sending them by e-mail or by submitting them to a circulation workflow.

[0004]

PROBLEMS TO BE SOLVED BY THE INVENTION In the prior art, as described above, the subsequent processing that accompanies creating or editing a tagged document, such as deleting or changing reference relationships between documents when an existing document is changed, or distributing a created or edited document, is performed by a person, which makes the work inefficient. Some apparatuses have a function that performs operations on the user's behalf, but the intended task is not completed unless the user starts the appropriate proxy program and specifies the document data and so on, which makes the work cumbersome.

[0005] An object of the present invention is to reduce the user's workload and improve work efficiency in a structured document processing apparatus by automatically performing, when editing of a document ends, the subsequent routine processing and the like that accompanies creating or editing a tagged document.

[0006]

MEANS FOR SOLVING THE PROBLEMS To achieve the above object, the structured document processing apparatus of the present invention is provided with means that monitors the start and end of document editing; analyzes and extracts, at the time document editing ends, predefined specific elements, attribute values, and the like by focusing on the tag portion of the edited document or a related document; and automatically executes a predefined specific process based on the result. This means is referred to below as the post-editing document processing means or apparatus.

[0007]

DESCRIPTION OF THE PREFERRED EMBODIMENTS Embodiments of the present invention are described below with reference to the drawings. FIG. 1 is a block diagram showing one embodiment of the structured document processing apparatus of the present invention. In the figure, reference numeral 100 denotes an ordinary document editing/management apparatus, which includes an editor 101, a working memory 102, a document database 103, and so on. Reference numeral 110 denotes the post-editing document processing apparatus provided according to the present invention, which comprises a management agent unit 111, a document analysis unit 112, an analysis definition file 113, a process definition execution unit 114, a process definition file 115, and so on. The document editing/management apparatus 100 and the post-editing document processing apparatus 110 are connected by any suitable means (for example, a bus or a LAN), so that the post-editing document processing apparatus 110 can monitor the operation of the editor 101 and access the memory 102 and the document database 103. The document editing/management apparatus 100 and the post-editing document processing apparatus 110 may also be constructed as a single unit, with the individual units connected by, for example, an internal bus.

[0008] The document database 103 stores tagged documents (structured documents) such as SGML, XML, and HTML documents. The editor 101 loads a document to be processed from the document database 103 via the memory 102, performs editing such as adding and deleting text, and stores the document back in the document database 103 via the memory 102. When a new document is created, the editor 101 likewise stores the created document in the document database 103 via the memory 102.

[0009] The management agent unit 111 monitors the start and end of document editing in the editor 101. When document editing ends, it first starts the document analysis unit (document analysis program) 112 and passes it the relevant file name and so on. Using the file name and so on as a key, the document analysis unit 112 retrieves the relevant document and related documents from the memory 102 and the document database 103, analyzes and extracts specific elements and attribute values by focusing on the tag portion, and reports the analysis result to the management agent unit 111. The elements and attribute values to be extracted and the analysis method are defined in advance in the analysis definition file 113, and the document analysis unit 112 analyzes the tagged document whose editing has ended, and extracts from it, based on this analysis definition information. The management agent unit 111 then starts the process definition execution unit (process definition execution program) 114 and passes it the analysis result from the document analysis unit 112. Based on the analysis result, the process definition execution unit 114 executes a specific process on the documents in the document database 103 or starts a predetermined program. The processes to be performed by the process definition execution unit 114 are defined in advance in the process definition file 115 as process definition information.
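
By way of illustration only, the hand-off between units 111, 112, and 114 described above can be sketched in Python as follows. The names `ManagementAgent`, `on_edit_end`, `analyze`, `execute`, and `count_titles` are hypothetical and do not appear in the specification, and the two definition files are stood in for by plain callables.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical stand-ins: an "analysis definition" maps document text to an
# analysis result; a "process definition" acts on that result.
AnalysisDefinition = Callable[[str], Dict[str, List[str]]]
ProcessDefinition = Callable[[str, Dict[str, List[str]]], None]


@dataclass
class ManagementAgent:
    """Plays the role of management agent unit 111 (illustrative only)."""
    analyze: AnalysisDefinition   # stands in for unit 112 plus file 113
    execute: ProcessDefinition    # stands in for unit 114 plus file 115
    log: List[str] = field(default_factory=list)

    def on_edit_end(self, file_name: str, text: str) -> None:
        # First, start document analysis for the document just edited.
        result = self.analyze(text)
        self.log.append(f"analyzed {file_name}")
        # Then start process execution, passing on the analysis result.
        self.execute(file_name, result)
        self.log.append(f"executed {file_name}")


def count_titles(text: str) -> Dict[str, List[str]]:
    # Toy analysis: report how many <Title ...> start tags appear.
    return {"title_count": [str(text.count("<Title"))]}


received = []
agent = ManagementAgent(
    analyze=count_titles,
    execute=lambda name, result: received.append((name, result)),
)
agent.on_edit_end("docB.xml", '<Title ID="X1">a</Title>')
```

In this sketch the editor is assumed to call `on_edit_end` itself; how the management agent actually detects the end of editing is not specified in the text.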

[0010] FIG. 2 shows the overall processing flow of the post-editing document processing apparatus 110. In step 210, the management agent unit 111 monitors the processing state of documents, such as the start and end of editing of managed tagged documents in the editor 101. In steps 220 and 230, the document analysis unit 112, started by the management agent unit 111 when editing ends, reads the analysis definition information from the analysis definition file 113 and performs document analysis based on it. In this document analysis, specific elements and attributes are analyzed and extracted from the edited document or related documents by focusing on the tag portion of the document.

[0011] The user defines the analysis targets in advance according to the purpose. If the document type, such as minutes, a design document, or an operation manual, is to be used to decide the subsequent processing, the user defines the element or attribute that represents the document type as an analysis target. If the user wants to detect cases where, after an existing document is edited, the deleted content includes reference relationships, index terms, or the like that affect documents other than the edited one and change the relationships between documents, the user defines the elements or attribute values that represent the reference relationships or index terms. The analysis definition file 113 holds such user-defined analysis definition information in advance.

[0012] Executing step 230 yields, from the tagged document, the information needed for subsequent processing (the analysis result), such as the update difference. The management agent unit 111 instructs the process definition execution unit 114 to process the target documents based on this information. In step 240, on receiving the instruction from the management agent unit 111, the process definition execution unit 114 reads the definition information needed for the processing from the process definition file 115. In step 250, the process definition execution unit 114 executes processing on the target documents in accordance with the process definition information, based on the analysis result. For example, if the user defines in advance in the process definition file 115 process definition information such as "distribute minutes to the persons concerned by e-mail", the mail distribution that follows the creation of minutes can be automated.

[0013] Specific examples of processing by the apparatus of the present invention shown in FIG. 1 are described below. [Embodiment 1] In this embodiment, when an existing document in the document database 103 is edited on the document editing/management apparatus 100 and a portion of it that is referenced from other documents is changed, the post-editing document processing apparatus 110 deletes the corresponding portions of the target documents in the document database so that documents other than the edited one are not affected. FIG. 3 shows the detailed processing flow of the post-editing document processing apparatus 110 in this embodiment. Steps 310 to 350 in FIG. 3 correspond to steps 210 to 250 in FIG. 2.

[0014] FIG. 4 shows specific examples of the documents to be processed. In FIG. 4, tagged document A, shown at 401, has a reference to a specific portion, <Title ID="CO15101">, of tagged document B, shown at 402. Suppose that tagged document B (402) is edited and the portion referenced from tagged document A (401) is changed. The edited version of tagged document B, document B', is shown at 403. In this case, the corresponding portion of tagged document A (401) must be deleted. The present invention achieves this as follows.

[0015] When editing of document B in the editor 101 ends, the management agent unit 111 starts the document analysis unit (document analysis program) 112 (step 310). At this time, the management agent unit 111 indicates tagged document B to the document analysis unit 112 as the document to be analyzed, by its file name or the like. The document analysis unit 112 loads the analysis definition information held in advance in the analysis definition file 113 (step 320) and, based on that information, analyzes document B and extracts specific elements and attribute values, focusing on the tag portion (step 330).

[0016] FIG. 5 shows the processing performed by the document analysis unit 112 at this time. The analysis definition information in the analysis definition file 113 is defined as "Take the update difference. Extract the <Title> tag portions from the update difference." When editing of the document in the editor 101 ends, the updated document B' (403 in FIG. 4) is in the working memory 102, while the pre-update document B (402) is still in the document database 103 and has not yet been updated. The document analysis unit 112 therefore uses the file name indicated by the management agent unit 111 as a key to obtain the updated tagged document B' from the memory 102 and the pre-update tagged document B from the document database 103, and extracts the update difference between the two. As shown at 402 and 403 in FIG. 4, the portion that read <Title ID="C015101">Definition of document structure</Title> before editing is gone, and the data now reads <Title ID="C0512456">Types of document structure</Title>. The document analysis unit 112 extracts the <TITLE> portions from this update difference. In the example of 402 and 403 in FIG. 4, the difference contains <TITLE ID=XXXX> descriptions, so the update difference shown at 501 in FIG. 5 is ultimately extracted. The document analysis unit 112 returns this update difference extraction result 501 to the management agent unit 111 as the analysis result.
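
As a rough sketch of the analysis step just described, the following Python fragment takes a line-based update difference with the standard `difflib` module and pulls the `<Title>` ID values out of the removed and added lines. The surrounding document markup, the function name `title_id_changes`, and the use of a line-based diff are assumptions made for illustration; the specification does not say how the difference is computed.

```python
import difflib
import re

# Hypothetical before/after versions of document B (markup abbreviated).
before = """<Doc>
<Title ID="C015101">Definition of document structure</Title>
<P>Body text</P>
</Doc>"""

after = """<Doc>
<Title ID="C0512456">Types of document structure</Title>
<P>Body text</P>
</Doc>"""

# Matches a <Title> start tag and captures its ID attribute value.
TITLE_ID = re.compile(r'<Title\s+ID="([^"]+)"', re.IGNORECASE)


def title_id_changes(old: str, new: str):
    """Return (removed_ids, added_ids) for <Title> tags in the update difference."""
    removed, added = [], []
    for line in difflib.ndiff(old.splitlines(), new.splitlines()):
        # ndiff prefixes removed lines with "- " and added lines with "+ ";
        # unchanged ("  ") and hint ("? ") lines are ignored.
        if line.startswith("- "):
            removed += TITLE_ID.findall(line[2:])
        elif line.startswith("+ "):
            added += TITLE_ID.findall(line[2:])
    return removed, added


removed_ids, added_ids = title_id_changes(before, after)
```

Here `removed_ids` plays the part of the update difference extraction result 501 that is handed back to the management agent unit.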

[0017] After document editing ends, the editor 101 overwrites the pre-update document B in the document database 103 with the updated document B' in the memory 102, in response to a user instruction or the like. This completes the document editing process.

[0018] Meanwhile, the management agent unit 111 starts the process definition execution unit (process definition execution program) 114 and passes it the update difference extraction result 501 from the document analysis unit 112. The process definition execution unit 114 loads the process definition information held in advance in the process definition file 115 (step 340) and, based on the update difference extraction result 501, executes the process defined by the process definition information on the target documents in the document database 103 (step 350).

[0019] FIG. 6 shows the processing performed by the process definition execution unit 114 at this time. Corresponding to the analysis definition information in the analysis definition file 113 shown in FIG. 5, a process definition specifying what to do when a <TITLE> (reference relationship) is changed is registered in advance in the process definition file 115. As shown in FIG. 6, the process definition information here is defined as "Search for HREF attributes whose value equals the ID value of the <Title>, and delete the HREF attribute and its attribute value." The process definition execution unit 114 therefore searches the document database 103 based on the update difference extraction result 501 passed from the management agent unit 111, deletes the description <Anchor HREF="CO15101">, and thereby removes the reference relationship. This is repeated as long as target documents remain in the document database 103. As a result, after the process definition is executed, <Anchor HREF="CO15101"> has been deleted from document A (401 in FIG. 4), giving document A' (601 in FIG. 6).
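
The process definition quoted above can be approximated as follows. This is a sketch under stated assumptions: the dictionary `database` is a hypothetical in-memory stand-in for the document database 103, the document contents and the function name `remove_references` are invented for illustration, and only the HREF attribute and its value are removed, as the quoted definition states.

```python
import re

# Hypothetical in-memory stand-in for document database 103.
database = {
    "docA.xml": '<P>See <Anchor HREF="CO15101">document structure</Anchor>.</P>',
    "docD.xml": '<P>See <Anchor HREF="C9999999">another section</Anchor>.</P>',
}


def remove_references(db, removed_ids):
    """Delete HREF attributes whose value equals a removed <Title> ID.

    Returns the names of the documents that were changed.
    """
    changed = []
    for name, text in list(db.items()):
        new_text = text
        for ref_id in removed_ids:
            # Remove the attribute together with its leading whitespace.
            pattern = re.compile(r'\s+HREF="' + re.escape(ref_id) + '"')
            new_text = pattern.sub("", new_text)
        if new_text != text:
            db[name] = new_text
            changed.append(name)
    return changed


changed = remove_references(database, ["CO15101"])
```

A regular expression keeps the sketch short; a real implementation for SGML or HTML would more likely use a parser suited to the document type.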

[0020] Thus, according to Embodiment 1, when a <Title> tag with an ID attribute that serves as a reference target is removed, the description <Anchor HREF="XXXX"> in the target documents can be deleted automatically when document editing ends, without the user having to request each deletion individually.

[0021] If the process definition information in the process definition file 115 were instead defined as "Display a list of the tagged documents that reference the deleted reference target of the tagged document", the process definition execution program would display the document list. The user can thereby see that a reference relationship has changed and, while looking at the list, view and revise the referencing tagged documents as their content requires.
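
The list-display variant could look something like the sketch below, which reports the referencing documents instead of modifying them. As before, `database`, its contents, and `referencing_documents` are hypothetical names used only for illustration.

```python
import re

# Hypothetical in-memory stand-in for document database 103.
database = {
    "docA.xml": '<P>See <Anchor HREF="CO15101">document structure</Anchor>.</P>',
    "docD.xml": '<P>See <Anchor HREF="C9999999">another section</Anchor>.</P>',
    "docE.xml": "<P>No references here.</P>",
}

# Captures every HREF attribute value in a document.
HREF = re.compile(r'HREF="([^"]+)"')


def referencing_documents(db, removed_ids):
    """Return the sorted names of documents with an HREF equal to a removed ID."""
    targets = set(removed_ids)
    return sorted(
        name for name, text in db.items()
        if targets.intersection(HREF.findall(text))
    )


listing = referencing_documents(database, ["CO15101"])
```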

[0022] [Embodiment 2] In this embodiment, when a tagged document such as minutes is newly created on the document editing/management apparatus 100 and stored in the document database 103, the post-editing document processing apparatus 110 automatically distributes it to the persons concerned by e-mail. FIG. 7 shows the detailed flow of the post-editing document processing apparatus 110 in this embodiment. Steps 710 to 750 in FIG. 7 correspond to steps 210 to 250 in FIG. 2.

[0023] FIG. 8 shows a specific example of a newly created tagged document. Suppose that document C, shown at 801, is newly created in the editor 101 and stored in the document database 103 via the memory 102.

[0024] When editing of document C in the editor 101 ends (that is, when its creation is finished), the management agent unit 111 starts the document analysis unit (document analysis program) 112 and indicates the file name or the like of document C as the document to be analyzed (step 710). The document analysis unit 112 reads the analysis definition information defined in advance by the user from the analysis definition file 113 (step 720) and, based on that information, analyzes document C and extracts specific elements and attribute values, focusing on the tag portion (step 730).

[0025] FIG. 9 shows the processing performed by the document analysis unit 112 at this time. Here, the analysis definition information in the analysis definition file 113 is defined so as to extract the value of the TYPE attribute of the <DOCUMENT> tag from the document, and also to extract the character string content of the <Name> tags. When editing of the document ends, the document analysis unit 112 loads tagged document C (801 in FIG. 8), which is still in the memory 102, and, based on this analysis definition information, extracts from document C the TYPE attribute value of the <DOCUMENT> tag and the character strings of the <NAME> tags. The value of the TYPE attribute of the <DOCUMENT> tag represents the document type, and <Name> represents a meeting attendee. After processing by the document analysis unit 112, the extraction result shown at 901 in FIG. 9 is obtained. The document analysis unit 112 returns this extraction result 901 to the management agent unit 111 as the analysis result.
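
For a well-formed XML document, the extraction described above might be sketched with Python's standard `xml.etree.ElementTree` module as shown below. The content of `doc_c`, including the attendee names and the `<Attendees>` and `<Body>` elements, is invented for illustration and is not the content of FIG. 8; SGML or HTML input would need a different parser.

```python
import xml.etree.ElementTree as ET

# Hypothetical content for document C; FIG. 8 itself is not reproduced here.
doc_c = """<DOCUMENT TYPE="minutes">
  <Attendees>
    <Name>Sato</Name>
    <Name>Suzuki</Name>
  </Attendees>
  <Body>Agenda and decisions</Body>
</DOCUMENT>"""


def extract(text):
    """Extract the document type and attendee names from the tag portion."""
    root = ET.fromstring(text)
    return {
        # TYPE attribute of the root <DOCUMENT> element (the document type).
        "type": root.get("TYPE"),
        # Text content of every <Name> element (the meeting attendees).
        "names": [el.text for el in root.iter("Name")],
    }


result = extract(doc_c)
```

The dictionary `result` corresponds loosely to the extraction result 901 returned to the management agent unit.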

[0026] When the analysis result is returned from the document analysis unit 112, the management agent unit 111 starts the process definition execution unit (process definition execution program) 114 and passes it the analysis result (the extraction result 901 in FIG. 9). The process definition execution unit 114 loads the process definition information held in advance in the process definition file 115 (step 740) and, based on the extraction result 901 passed from the management agent unit 111, executes the process defined by the process definition information (step 750).

[0027] FIG. 10 shows the processing performed by the process definition execution unit 114 at this time. Here, corresponding to the analysis definition information in the analysis definition file 113 shown in FIG. 9, the process definition file 115 holds the following process definition information: "If the TYPE attribute value of <DOCUMENT> is 'minutes', distribute the document to the attendees listed in <NAME>." The process definition execution unit 114 reads this process definition, starts the mail program 1001 in accordance with the extraction result 901 in FIG. 9, and, as shown at 1002, attaches the created document and mails it to the attendees listed in <NAME>.
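
The distribution rule above can be illustrated with the standard `email.message.EmailMessage` class. This sketch only builds the message and does not send it; the address book, the sender address, the subject line, and the function name `build_distribution_mail` are all assumptions, since the specification does not say how attendee names are mapped to mail addresses or which mail program is used.

```python
from email.message import EmailMessage

# Hypothetical address book; the mapping from names to addresses is assumed.
ADDRESS_BOOK = {"Sato": "sato@example.com", "Suzuki": "suzuki@example.com"}


def build_distribution_mail(result, file_name, content):
    """Return an EmailMessage if the process definition applies, else None."""
    # The rule applies only to documents whose TYPE is "minutes".
    if result.get("type") != "minutes":
        return None
    recipients = [ADDRESS_BOOK[n] for n in result["names"] if n in ADDRESS_BOOK]
    if not recipients:
        return None
    msg = EmailMessage()
    msg["Subject"] = f"Minutes: {file_name}"
    msg["From"] = "editor@example.com"  # assumed sender address
    msg["To"] = ", ".join(recipients)
    msg.set_content("The minutes are attached.")
    # Attach the created document itself.
    msg.add_attachment(content.encode("utf-8"), maintype="application",
                       subtype="xml", filename=file_name)
    return msg


mail = build_distribution_mail(
    {"type": "minutes", "names": ["Sato", "Suzuki"]},
    "docC.xml",
    '<DOCUMENT TYPE="minutes"/>',
)
```

Handing `mail` to an SMTP client such as `smtplib.SMTP.send_message` would correspond to starting the mail program; that step is omitted because it depends on the local mail setup.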

[0028] Thus, according to Embodiment 2, for work such as creating and then distributing minutes, the user need only create and store the minutes; distribution to the persons concerned is automated, which improves work efficiency.

[0029] Two embodiments have been described above as specific examples of processing by the apparatus of the present invention, but the present invention is of course not limited to them. By registering various combinations of analysis definition information and process definition information in advance in the analysis definition file 113 and the process definition file 115, various kinds of routine processing that follow the creation or editing of tagged documents can be automated. In this case, the analysis definition information in the analysis definition file 113 can easily be associated with the process definition information in the process definition file 115 by using the same serial number, heading, or the like. The analysis definition file 113 and the process definition file 115 may also be held on the same physical storage device (a hard disk or the like).
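
One way to picture the pairing by serial number is the small sketch below. The dictionary layout, the serial numbers 1 and 2, and the function name `paired_definitions` are assumptions; the definition texts paraphrase the two embodiments, and the actual file formats are not given in the specification.

```python
# Hypothetical contents of analysis definition file 113 and process
# definition file 115, keyed by a shared serial number.
analysis_definitions = {
    1: "Take the update difference and extract the <Title> tags from it.",
    2: "Extract the TYPE attribute of <DOCUMENT> and the text of each <Name>.",
}
process_definitions = {
    1: "Find HREF attributes equal to the <Title> ID and delete them.",
    2: "If TYPE is 'minutes', distribute the document to each <Name>.",
}


def paired_definitions(analysis, process):
    """Yield (key, analysis definition, process definition) for shared keys."""
    for key in sorted(analysis.keys() & process.keys()):
        yield key, analysis[key], process[key]


pairs = list(paired_definitions(analysis_definitions, process_definitions))
```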

[0030] Furthermore, the processing procedure shown in FIG. 2 can be provided, for example, as a post-editing document processing program recorded on a computer-readable recording medium such as a floppy disk, a memory card, or a CD-ROM. In this case, the document analysis program and the process definition execution program shown in FIG. 1 form part of that post-editing document processing program.

[0031]

EFFECTS OF THE INVENTION As described above, according to the structured document processing apparatus of the present invention, when document editing ends, the necessary information is extracted from the tagged document by focusing on the tag portion, and the routine processing and the like that follows document creation or editing is executed automatically. This reduces the user's workload and greatly improves work efficiency.

[Brief description of the drawings]

FIG. 1 is a block diagram showing one embodiment of the structured document processing apparatus of the present invention.

FIG. 2 shows the overall processing flow of the post-editing document processing apparatus of FIG. 1.

FIG. 3 shows the detailed processing flow of the post-editing document processing apparatus in Embodiment 1 of the present invention.

FIG. 4 shows specific examples of documents in Embodiment 1 of the present invention.

FIG. 5 shows the processing performed by the document analysis unit in Embodiment 1 of the present invention.

FIG. 6 shows the processing performed by the process definition execution unit in Embodiment 1 of the present invention.

FIG. 7 shows the detailed processing flow of the post-editing document processing apparatus in Embodiment 2 of the present invention.

FIG. 8 shows a specific example of a document in Embodiment 2 of the present invention.

FIG. 9 shows the processing performed by the document analysis unit in Embodiment 2 of the present invention.

FIG. 10 shows the processing performed by the process definition execution unit in Embodiment 2 of the present invention.

[Explanation of symbols]

100 Document editing/management apparatus
101 Editor
102 Working memory
103 Document database
110 Post-editing document processing apparatus
111 Management agent unit
112 Document analysis unit
113 Analysis definition file
114 Process definition execution unit
115 Process definition file

Continuation of front page
(72) Inventor: Tomoyuki Umehara, c/o Software Development Division, Hitachi, Ltd., 5030 Totsuka-cho, Totsuka-ku, Yokohama-shi, Kanagawa
F-term (reference): 5B009 QA06 QA11 QB18 SA03 SA14 TA06 VA09 VC02

Claims

[Claims]

1. A structured document processing apparatus having means for editing tagged structured documents and means for managing them, the apparatus comprising post-editing document processing means that, when editing of a document ends, analyzes the tag portion of the edited document or a related document and executes a specific process based on the analysis result.

2. The structured document processing apparatus according to claim 1, wherein the post-editing document processing means comprises: means for analyzing the tag portion of the edited document or a related document and extracting a specific element or attribute value; means for defining the element or attribute value to be extracted and the analysis method; means for executing a specific process based on the extracted specific element or attribute value; and means for defining the process to be executed for the extracted specific element or attribute value.

3. The structured document processing apparatus according to claim 1, wherein, when a change is made to an existing tagged document that is referenced from another document, the changed attribute value is extracted from the tag portion, and descriptions in tagged documents in the document database that use the attribute value of the changed tag portion as a reference target are deleted.

4. The structured document processing apparatus according to claim 1, wherein, when a tagged document to be distributed to persons concerned is created, distribution destination data is extracted from the tag portion and a mail distribution program is started.