[go: up one dir, main page]

AU2005255044B2 - Reproduction of documents into requested forms - Google Patents

Reproduction of documents into requested forms Download PDF

Info

Publication number
AU2005255044B2
AU2005255044B2 AU2005255044A AU2005255044A AU2005255044B2 AU 2005255044 B2 AU2005255044 B2 AU 2005255044B2 AU 2005255044 A AU2005255044 A AU 2005255044A AU 2005255044 A AU2005255044 A AU 2005255044A AU 2005255044 B2 AU2005255044 B2 AU 2005255044B2
Authority
AU
Australia
Prior art keywords
document
requested
documents
available
access
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
AU2005255044A
Other versions
AU2005255044A1 (en
Inventor
Gregory Lyle Duncan
Christopher Colin Stephen
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Objective Systems Pty Ltd
Original Assignee
Objective Systems Pty Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from AU2004903307A external-priority patent/AU2004903307A0/en
Application filed by Objective Systems Pty Ltd filed Critical Objective Systems Pty Ltd
Priority to AU2005255044A priority Critical patent/AU2005255044B2/en
Priority claimed from PCT/AU2005/000832 external-priority patent/WO2005124579A1/en
Publication of AU2005255044A1 publication Critical patent/AU2005255044A1/en
Application granted granted Critical
Publication of AU2005255044B2 publication Critical patent/AU2005255044B2/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Landscapes

  • Document Processing Apparatus (AREA)

Description

WO 2005/124579 PCT/AU2005/000832 Reproduction of Documents into Requested Forms Field of the invention The invention relates to the reproduction of documents into a requested form. The forms can include print, audio, Braille or an electronic file. It also relates to the distribution of 5 such documents over electronic networks, and remote reproduction. The documents can be either large or small in size. Background Print form documents Currently many documents are transmitted in paper, usually via post. One particularly 10 common form of document is invoices. It is expensive for companies to print and post invoices. When they are received, they must be opened, be paid, sorted, and often information from the invoice must be data entered into a computer. This is expensive for customers. Often customers can not read the invoice they are sent because they are blind, the type is too small, the reader has a disability, or it is written in a language they cannot 15 read. This problem extends to several other kinds of document, including bank statements, credit card statements, legal documents and letters. Commercial computer networks, such as the Internet, have been used as a means of facilitating ordering of books and other reading material by consumers. This is typically 20 achieved by presenting a web site-based user interface to consumers to allow them to order reading material such as books. One example of this is the website Amazon.com. However, the reading material that can be purchased by users of these systems are the same as the offering made by a traditional book store. That is, each item of reading material is usually offered in only one format. Further, users must wait whilst the reading 25 material they ordered is retrieved from a warehouse and shipped to them. Electronic form documents The distribution of electronic documents is generally known, and is described, for example, in International Publication No. WO 00/7223 5 Al (Silverbrook Research Pty WO 2005/124579 PCT/AU2005/000832 2 Ltd, 30 November 2000). Silverbrook describes text being formatted in the Extendable Mark-up Language (XML) using the Extensible Stylesheet Language (XSL). Audio form documents Digital talking Books (DTBs) are one type of audio form documents. DTBs known to the 5 extent that there are technical standards that apply. One such standard is ANSI/NISO Z39.86-2002 "Specifications for the Digital Talking Book", published in 2002 by the US National Information Standards Organisation, Bethesda, Maryland 20814 (ISBN: 1 880124-52-1). The Z39.86 Standard deals with many aspects of DTBs, including the DTB package file, content format for text, audio file formats, image file formats, 10 synchronisation of media files, navigation control files, portable bookmarks and highlights, resource file, packaging files for distribution and presentation files. The Z39-86 Standard owes much to the work done by the DAISY Consortium. The DAISY 2.0 specification is based on HTML, and version 2.01, published in February 15 2001 (www.daisy.org/publication/specifications/daisy_202.html) extends the data representation to the XML DTB DTD. The DAISY format is based on the W3C-defined SGML (150 8879) applications XHTML 1.0 and SMILl.0. Using this framework, a talking book format is achieved that allows navigation of a marked-up text with audio. Although DAISY DTBs offer fine granularity and sophisticated navigation tools for user, 20 their implementation requires very high computational power. Braille form documents Braille characters are made up of up to six raised impressions in two columns of three impressions. Braille characters are approximately 28 point and are always the same size and the horizontal space between characters is constant. Letters are mapped to the Braille 25 codes and this form of Braille is called Grade 1 Braille. Grade 2 Braille has contractions applied to words to make the size of Braille documents smaller and quicker to read. In English there are different contraction rules in the US, UK, Australia, and there is now a new standard, Universal English Braille Code, which is a fourth set of rules. Many of the rules are the same. In say German, the mapping of letters to Braille codes and the 30 contractions may be so different that a German Braille reader who can speak both English WO 2005/124579 PCT/AU2005/000832 3 and German may not be able to read English Braille. Images in documents need to be described in words, generally using additional information to be added. In addition, some graphical information can be provided by Brailled images. A map of Australia can be Brailled, so that the outline of Australia can be shown as a series of raised dots on paper, 5 so that a blind person can feel it. A needs exists, however, for the reproduction, and electronic distribution of a wide variety of documents in a chosen one of a number of available forms. Summary 10 The invention generally provides computer programs, methods and computer apparatus/systems for reproducing a requested source document in a requested one of available forms. Additionally, requested documents can be provided in requested formats, and be navigable. 15 For each one of a plurality of documents: at least one access pathway is applied to a marked-up form of the document, the access pathways define discrete parts of the document. A fragment of the marked-up document is generated for each said access pathway for each available form. 20 A requested one or more parts of a source document can be generated in a requested form from the respective stored fragments. Preferably, the access pathways are defined in a configuration file. A document is assigned to a respective class, and there is a configuration file for each class. The source 25 documents are marked-up according to a schema, and there is a separate schema for each class. The configuration file for each class may contain certain variations for each form. The schema describes the document fully. The configuration file indicates which pieces of the full document are significant. 30 WO 2005/124579 PCT/AU2005/000832 4 Advantageously, an index list is created for each request maker, the index list including a set of documents available to each request maker, and lists the access pathways for each fragment of each document. One fragment comprises the entire source document. Description of the drawings 5 Fig. 1 shows an exemplary system for generating a chosen form of document. Fig. 2 shows another exemplary system for generating a chosen form of document. Fig. 3 is a schematic block diagram of document server processes. Fig. 4 shows the build process in greater detail. Fig. 5 is a schematic block diagram of client/reproduction (user) server processes. 10 Fig. 6 shows an XML schema for a 'bank statement' class file. Fig. 7 shows an XML schema for index document and access paths. Fig. 8 shows an XML schema for validating index documents. Fig. 9 is a schematic block diagram of a formatting process. Fig. 10 shows an XML document. 15 Fig. 11 shows an XSL style sheet. Fig. 12 shows an XSL:FO file. Fig. 13 shows a bar chart for which a Braille representation is required. Detailed description Definitions 20 Document - is intended to mean any information contained in hard copy or electronic form, and includes books, pamphlets, brochures, reports, bank statements and other written material, or voice or Braille. Form - means the medium or file type in which information is to be reproduced, such as 25 print, audio, Braille, electronic and visual forms. Format - is used to describe the general presentation of written material. For print and Braille this could include such things as typeface, type size and margins, and for audio could include tone, speech and gender. 30 WO 2005/124579 PCT/AU2005/000832 5 Classes of document - a grouping of documents of similar type. Document classes can include bank statements, technical or academic articles or books, legal contracts, legislation, etc. Statements issued by different banks may have small variations, but if these variations cannot be accommodated in the same schema, then they fall into another 5 class. There is only one schema for each class. Fragment - a fragment is the entire document, or a section of a document that relates to one of the access paths defined for that class of document. A fragment is usually rendered in the form(s) and/or format(s) requested by he customer. 10 Access paths/pathways - the manner in which a document may be accessed and provides the link between class documents and index documents. Access paths are also used to trigger the building and storing of output fragments. All document classes must have at least one access path, being the 'document level' access path (ie. the entire document). 15 Process configuration files - allow a single piece of software to perform a specific part of the process regardless of the document class being processed. They are specific to a document class, and perform a mapping of known actions based on specific elements within the source documents and are loaded at runtime for a process. Configuration files 20 define access pathways and thus how fragments are to be built. Index list - a marked-up document to a known standard (eg. XML) that defines a customer's catalogue of available documents, and defines the ways that these documents may be accessed, ie. either as whole documents and/or fragments. Index lists utilise 25 access pathways. A. Overview Source documents are the subject of a mark-up process according to an appropriate one of a number of schemas. Each such marked-up document is the subject of a build process, in which a document is analysed (according to a schema/set of rules) to 30 determine pieces important as access paths. The access paths are defined for each document. So, for any one document the access paths are then used to create the set of WO 2005/124579 PCT/AU2005/000832 6 fragments for that document. The fragments enable navigation of the document. The fragments are then each rendered into each one of the forms in which the document is to be available to the customer or customers entitled to see the document (for example, the person to whom the bank statement is addressed). Thus, for any one document, a set of 5 fragments exists for each of the chosen forms that are available. The source document can be translated as a preliminary step to the build process, to be available to customer or customers entitled to see the document (e.g. the person to whom a bank statement is addressed) in other languages. Document formatting choices can also 10 be provided. A customer request includes the identity of a document to be reproduced, the required form of the document, and optionally desired formatting information. An output file is produced, and is then subject to a reproduction process that utilises the access paths. The 15 resultant forms supported in the embodiment described are a Braille physical document, a printed document, audio (eg. spoken word or music) or a physical storage medium (eg. CDROM or magnetic disk). Fig. 1 shows an example of a system 20 for reproducing a chosen form of document 20 where distribution across a network is involved. Documents 14 are input to a document server 22. The document server 22 and repository 24 can be a part of or separate to the system generating the documents 14. The document server 22 has a repository 24 in which products of the build process are stored. The document server 22 has connection with a public or private network 26. A customer computer 28 also has communication 25 with the network 26. The customer computer 28 issues a request to the document server 22 for a specified document in a specified form, via the network 26. The document server 22 retrieves the relevant fragments from the repository 24, and then passes the fragments via the network 46 to the customer computer 28. The reproduction processes are performed at the customer computer 28. 30 Fig. 2 shows a further system 30, that is similar to system 20 of Fig. 1 in so far as the document server 22 performs the same function in receiving requests for documents and WO 2005/124579 PCT/AU2005/000832 7 distributing them via the network 26. The difference, however, is that whilst the request for a specific document initiates with a customer 32, the reproduction is performed by a separate reproduction client server 34, connected with the network 26. The output form of the document is separately provided to the customer, in a printed or electronic form. A 5 benefit of the arrangement of this system 30 is that the customer need not buy and configure expensive software and an expensive (fast) computer. In typical arrangements, a reproduction server 34 (eg. a large publisher) would be located in a general geographical proximity to customers 32 (eg. in the same city or state). The document server 22, in fact, may reside in another country to the customers and the reproduction 10 server 34. This arrangement gives efficiencies in terms of the cost of postage and the time it takes for a requested document to be provided to a customer. Generally, it is desirable to use customer's existing computer systems, since it allows interfacing with existing financial records and systems (invoices are one form of 15 document that can be requested), and, in the main, is the least troublesome for the customer. Customers who are visually impaired may prefer to use their existing computers and software, rather than install new software and learn how to use it. For example, presenting invoices in a DAISY format may be more convenient for someone used to a particular DAISY reader than requiring the customer to acquire and learn new 20 software. Some document providers, such as banks, may not easily be able to generate invoices, statements and the like in XML form. In such a situation, a bank would require specific additional software to create and format such documents then forward them to a central 25 repository where the documents can be organised for the user and from which a user can obtain requested documents. B. Build Process Fig. 3, 4 and 5 are schematic diagrams that embody the arrangements of both Figs. 1 and 2.
WO 2005/124579 PCT/AU2005/000832 8 Building index lists and access pathways Turning specifically to Fig. 3, a document server 50 and a reproduction server 60 are shown. Sourced or input documents are subject to a mark-up process 60 to take marked up XML form 70 in accordance with a defined class schema. As shown particularly in 5 Fig. 4, a marked-up document 70 must be validated against a set of rules/schema 71 for the particular class of document. The source document can be input to the mark-up process 68 by any convenient means, including a foreign system integration, manual mark-up or form-based entry. 10 For Braille-form mark-up, character strings that need to be treated differently in Braille should be separately tagged and identified. An example is a foreign word that will be spelled out in Braille 1. Information like phone numbers and web addresses that may be treated differently in the different versions of Braille are likely to be tagged, so that these character strings can be rendered into a standard form more easily (eg. phone numbers 15 with area codes can be written in several different formats and the actual number may or may not have spaces in it). Images and diagrams need to be annotated for the visually impaired, as will be described below. If a document is to be offered in a different language to the original, then the marked-up 20 form of document is actively processed by a translation system 72 in response to a customer request. The translation will depend on the document type and importance. Certain documents like invoices, bank statements and credit card statements consist of a template into which the content of the document is inserted. The content of the documents often contain largely numeric information (which does not need language 25 translation), part or product names (which do not need language translation) or single words or phrases that can often be machine translated. If the documents contain only numerical, part or product names in the content, then simply translating the template will translate the document. If the template constructed so that the information in the template is called from a database, and if the calls to the database include the language, then these 30 documents can be automatically translated at the request of the user. Other information in the invoice can be machine translated or if the information is say standard advertising information, then it can be manually translated and temporarily added to the template.
WO 2005/124579 PCT/AU2005/000832 9 Other documents can be machine translated. More valuable documents (such as legal document or contracts) may be translated manually. The most valuable documents can be manually translated and manually verified by an independent translator. For manual translation of documents a work flow process will be instituted for tracking the manual 5 translation of documents. The marked-up documents 70, 70' are then stored in an XML repository 73. An index and access pathway builder 74 uses an XML configuration document 75, in turn 10 based on an XML schema 76 providing validation rules, to configure an application that will build an XML document specific to each customer containing a list of all the documents available for a particular customer: the Index list 77. The index list 77 provides various ways for the customer to access those documents (ie. the access paths) determined by an XML schema 78. An XML index list 77 allows searching of, and 15 navigation to any fragments defined in the configuration file 75 which generates and defines the granularity of any fragments. Index documents thus generated are stored in an index store 79. Consider the following example XML code for a 'bank statement' class of document 70: 20 <?xml version="1.0" encoding="UTF-8"?> <PBPDoc xmlns="http://tempuri.org/BankStatement.xsd" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://tempuri.org/BankStatement .xsd 25 E: \vsprojects\voicePatent\XML\BankStatement .xsd"> <Reciever> <Name>Mr C Stephen</Name> <address>21 Smith St</address> <City>Blacktown</City> 30 <State>NSW</State> <PostCode>2615</PostCode> </Reciever> <Identification> <AccountNO>14062347</AccountNO> 35 <BSBNO>123 789</BSBNO> <StatemmentNO>17</StatemmentNO> <StatementDate>2004-03-21</StatementDate> <PageNO>1</PageNO> </Identification> 40 <Summary> <AccountNO>14062347</AccountNO> WO 2005/124579 PCT/AU2005/000832 10 <AccontName>Business Account 1</AccontName> <BalanceOpen>250251.89</BalanceOpen> <BalanceClose>240789.92</BalanceClose> <TotalCredit>15893.73</TotalCredit> 5 <TotalDebit>25355.70</TotalDebit> </Summary> <Transactions> <TRX TRXSign="DEBIT"> <TRXDate>2004-03-17</TRXDate> 10 <TRXDesc>Wages</TRXDesc> <TRXAmount>25355.70</TRXAmount> <TRXBalance>224896.19</TRXBalance> </TRX> <TRX TRXSign=1"CREDIT"> 15 <TRXDate>2004-03-19</TRXDate> <TRXDesc>Deposit ARC</TRXDesc> <TRXAmount>15, 893 .73</TRXAmount> <TRXBalance>240789 .92</TRXBalance> </TRX> 20 </Transactions> <Information> <Note>Effective April 30 a $1.00 charge will apply for each business account transaction</Note> </ Inf ormat ion> 25 </PBPDoc> Fig. 6 shows a corresponding XML schema 71 for the 'bank statement' class of document. 30 As a separate example, consider an example marked-up XML file for a 'book' class of document 70: <pbp-book xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:noNamespaceSchemaLocation="E:\schemas\pbpbook_04.xsd.'> 35 <pbp-meta> <pbp-info schema-name="pbp-book04.xsd" schema-rev="4.01" file name="A Brief History of Time.xml" tag-date="2003-08-25" tag operator="canberra" book-title="A Brief History of Time" book type="PBPress Novel" publication-status="NOT FOR PUBLICATION" copyright 40 status="IN COPYRIGHT"/> </pbp-meta> <pbp-front> <cover> <construction-model> 45 <c-title>A Brief History of Time</c-title> <author-list> <c-author>Stephen W Hawking</c-author> </author-list> <c-category>Non-Fiction, Science</c-category> 50 <c-section> <cs-head> WO 2005/124579 PCT/AU2005/000832 11 <title>A Brief Hostory of Time</title> </cs-head> <cs-body> <upara>This book has sold more copies that any non 5 religious book ever printed, unfortunately most of the people who bought it can't understand anything beyond the introduction.</upara> </cs-body> </c-section> <c-image page="front" image-url="e:\images\xx.jpg"> 10 <voice-description> <para>this is a pretty picture of a clock</para> </voice-description> </c-image> <c-ISBN edition-value="35689 78221"/> 15 </construction-model> </cover> <title-block> <book-title>A Brief History of Time</book-title> <author> 20 <first-name>Stephen </first-name> <other-name>W </other-name> <last-name>Hawking </last-name> </author> </title-block> 25 <intro-block> <intro type="foreword"> <intro-title>FOREWARD</intro-title> <body> <upara>I didn't write a foreword to the original edition of 30 A Brief History of Time. That was done by <emphasis type="italics">Carl Sagan.</emphasis> Instead, I wrote a short piece titled "Acknowledgments" in which I was- advised to thank everyone. Some of the foundations that had given me support weren't too pleased to have been mentioned, however, because it led to a great increase in 35 applications.</upara> </body> <sigblock> <sig-name>Stephen W Hawking</sig-name> </sigblock> 40 </intro> </intro-block> </pbp-front> <pbp-body> <section type="chapter"> 45 <head> <section-num>CHAPTER 1</section-num> <section-title>OUR PICTURE OF THE UNIVERSE</section-title> </head> <body> 50 <upara>A well-known scientist (some say it was Bertrand Russell) once gave a public lecture on astronomy. He described how the earth orbits around the sun and how the sun, in turn, orbits around the center of a vast collection of stars called our galaxy. At the end of the lecture, a little old lady at the back of the room got up and said: 55 "What you have told us is rubbish. The world is really a flat plate supported on the back of a giant tortoise." The scientist gave a superior smile before replying, "What is the tortoise standing on." WO 2005/124579 PCT/AU2005/000832 12 "You're very clever, young man, very clever," said the old lady. "But it's turtles all the way down!"</upara> </body> </section> 5 </pbp-body> </pbp-book> Configuration files configure the software applications to provide the necessary functionality. An example XML configuration file 75 for the client index and access 10 pathway builder 74 for a 'bank statement' class file is: <?xml version="1.0" encoding="UTF-8"?> <AccessBuilder xmlns="http://tempuri.org/Builder.xsd" xmlns:xsi="http://www.w3 .org/2001/XMLSchema-instance" 15 xsi:schemaLocation="http://tempuri.org/Builder.xsd E: \vsprojects\VoicePatent\XML\Builder.xsd"> <DocClass>Bank Statement</DocClass> <AccessItem> <PathClass>Address</PathClass> 20 <PathName>Reciever</PathName> </AccessItem> <AccessItem> <PathClass>Account Info</PathClass> <PathName>Identification</PathName> 25 </Accessltem> <AccessItem> <PathClass>TRXSummary</PathClass> <PathName>Summary</PathName> </AccessItem> 30 <AccessItem> <PathClass>ProviderNote</PathClass> <PathName>Information</PathName> </AccessItem> <AccessItem> 35 <PathClass>TRXGroup</PathClass> <PathName>Transactions</PathName> </AccessItem> <AccessItem> <PathClass>TRXBank</PathClass> 40 <PathName>TRX</PathName> </Access Item> </AccessBuilder> The configuration file 75 for the 'book' class of document is: 45 <ixml version="1.0" encoding="UTF-8"?> <pbpVoiceConfig xmlns="http://tempuri.org/VoiceConfig.xsd" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://tempuri.org/VoiceConfig.xsd 50 E:\vsprojects\VoicePatent\XML\VoiceConfig.xsd"> <pbpSchemaID>STDBOOK</pbpSchemaID> WO 2005/124579 PCT/AU2005/000832 13 <pbpProcessID>DAISYBOOK</pbpProcessID> <VCMapItem> <pbpTagName>upara</pbpTagName> <pbpTagDesc>Un-numbered Para</pbpTagDesc> 5 <htmTagMap>p</htmTagMap> <htmJustify/> <pbpActionMap>P</pbpActionMap> <pbpProcessInclude>include</pbpProcessInclude> </VCMapItem> 10 <VCMapItem> <pbpTagName>npara</pbpTagName> <pbpTagDesc>Numbered Para</pbpTagDesc> <htmTagMap>p</htmTagMap> <htmJustify/> 15 <pbpActionMap>P</pbpActionMap> <pbpProcessInclude>include</pbpProcessInclude> </VCMapItem> <VCMapItem> <pbpTagName>author</pbpTagName> 20 <pbpTagDesc>Author Name</pbpTagDesc> <htmTagMap>h2</htmTagMap> <htmJustify>center</htmJustify> <pbpActionMap>O</pbpActionMap> <pbpProcessInclude>include</pbpProcessInclude> 25 </VCMapItem> <VCMapItem> <pbpTagName>section-num</pbpTagName> <pbpTagDesc>Section Number</pbpTagDesc> <htmTagMap>h3</htmTagMap> 30 <htmJustify>center</htmJustify> <pbpActionMap>T</pbpActionMap> <pbpProcessInclude>include</pbpProcessInclude> </VCMapItem> <VCMapItem> 35 <pbpTagName>section-title</pbpTagName> <pbpTagDesc>Section Title</pbpTagDesc> <htmTagMap>h3</htmTagMap> <htmJustify>center</htmJustify> <pbpActionMap>T</pbpActionMap> 40 <pbpProcessInclude>include</pbpProcessInclude> </VCMapItem> <VCMapItem> <pbpTagName>book-title</pbpTagName> <pbpTagDesc>Book Title</pbpTagDesc> 45 <htmTagMap>h2</htmTagMap> <htmJustify>center</htmJustify> <pbpActionMap>T</pbpActionMap> <pbpProcessInclude>include</pbpProcessInclude> </VCMapItem> 50 <VCMapItem> <pbpTagName>intro-title</pbpTagName> <pbpTagDesc>Intro Title</pbpTagDesc> <htmTagMap>h3</htmTagMap> 55 <htmJustify>center</htmJustify> <pbpActionMap>T</pbpActionMap> <pbpProcessInclude>include</pbpProcessInclude> WO 2005/124579 PCT/AU2005/000832 14 </VCMapItem> <VCMapItem> <pbpTagName>defterm</pbpTagName> <pbpTagDesc>Defined Term</pbpTagDesc> 5 <htmTagMap>p</htmTagMap> <htmJustify/> <pbpActionMap>O</pbpActionMap> <pbpProcessInclude>include</pbpProcessInclude> </VCMapItem> 10 <VCMapItem> <pbpTagName>sig-name</pbpTagName> <pbpTagDesc>Signature Name</pbpTagDesc> <htmTagMap>h4</htmTagMap> <htmJustify/> 15 <pbpActionMap>O</pbpActionMap> <pbpProcessInclude>include</pbpProcessInclude> </VCMapItem> <VCMapItem> <pbpTagName>sig-position</pbpTagName> 20 <pbpTagDesc>Signature Position</pbpTagDesc> <htmTagMap>h4</htmTagMap> <htmJustify/> <pbpActionMap>O</pbpActionMap> <pbpProcessInclude>include</pbpProcessInclude> 25 </VCMapItem> <VCMapItem> <pbpTagName>sig-date</pbpTagName> <pbpTagDesc>Signature Date</pbpTagDesc> <htmTagMap>h4</htmTagMap> 30 <htmJustify/> <pbpActionMap>O</pbpActionMap> <pbpProcessInclude>include</pbpProcessInclude> </VCMapItem> <VCMapItem> 35 <pbpTagName>cover</pbpTagName> <pbpTagDesc>Comp1ete Cover</pbpTagDesc> <htmTagMap/> <htmJustify/> <pbpActionMap>N</pbpActionMap> 40 <pbpProcessInclude>exclude</pbpProcessInclude> </VCMapItem> </pbpVoiceConfig> The XML schema 76 providing validation rules for the configuration files is shown in 45 Fig. 7. Access pathways can be applied to any schema, and there is an ability to apply different access paths to the same element (eg. transactions and transaction item). Additionally, it is possible to use only a containing element (ie. a leaf node or one that does not contain 50 lower level elements becomes the container).
WO 2005/124579 PCT/AU2005/000832 15 Consider the following index list 77 for a particular customer: <?xml version="1.0" encoding="UTF-8"?> 5 <PBPIndex xmlns="http://tempuri.org/index.xsd" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://tempuri.org/index.xsd E:\vsprojects\VoicePatent\XML\index.xsd"> <IDXGroup Name="Bank Statement"> 10 <IDXItem> <DocClass>Bank Statement</Docclass> <ITMIdentifier>Al2345</ITMIdentifier> <ITMSender>Westpac Corp</ITMSender> <ITMReceiver>CCStephen</ITMReceiver> 15 <ITMTitle>Statement - Savings Account</ITMTitle> <ITMDate>2004-04-15</ITMDate> <ITMOriginator>Westpac Corp</ITMOriginator> <ITMAccess> <ITMPath> 20 <PathClass>Address</PathClass> <PathName>Reciever</PathName> </ITMPath> <ITMPath> <PathClass>AccountInfo</PathClass> 25 <PathName>Identification</PathName> </ITMPath> <ITMPath> <PathClass>TRXSummary</PathClass> <PathName>Summary</PathName> 30 </ITMPath> <ITMPath> <PathClass>ProviderNote</Pathclass> <PathName>Information</PathName> </ITMPath> 35 <ITMPath> <PathClass>TRXGroup</PathClass> <PathName>Transactions</PathName> </ITMPath> <ITMPath> 40 <PathClass>TRXBank</PathClass> <PathName>TRX</PathName> </ITMPath> </ITMAccess> </IDXItem> 45 <IDXItem> <DocClass>Bank Statement</DocClass> <ITMIdentifier>C567823</ITMIdentifier> <ITMSender>CCStephen</ITMSender> <ITMReceiver>CCStephen</ITMReceiver> 50 <ITMTitle>Statement - Business Account</ITMTitle> <ITMDate>2004-04-17</ITMDate> <ITMOriginator>Commonwealth Bank</ITMOriginator> <ITMAccess> <ITMPath> 55 <PathClass>Address</PathClass> WO 2005/124579 PCT/AU2005/000832 16 <PathName>Reciever</PathName> </ITMPath> <ITMPath> <PathClass>AccountInfo</PathClass> 5 <PathName>Identification</PathName> </ITMPath> <ITMPath> <PathClass>TRXSummary</PathClass> <PathName>Summary</PathName> 10 </ITMPath> <ITMPath> <PathClass>ProviderNote</PathClass> <PathName>Information</PathName> </ITMPath> 15 <ITMPath> <PathClass>TRXGroup</PathClass> <PathName>Transactions</PathName> </ITMPath> <ITMPath> 20 <PathClass>TRXBank</PathClass> <PathName>TRX</PathName> </ITMPath> </ITMAccess> </IDXItem> 25 </IDXGroup> <IDXGroup Name="Telephone Account"> <IDXItem> <DocClass>Telephone Account</DocClass> <ITMIdentifier>T43215</ITMIdentifier> 30 <ITMSender>Telstra Corp</ITMSender> <ITMReceiver>CCStephen</ITMReceiver> <ITMTitle>Telephone 9133 3487</ITMTitle> <ITMDate>2004-03-31</ITMDate> <ITMOriginator>Telstra Corp</ITMOriginator> 35 <ITMAccess> <ITMPath> <PathClass>Document</PathClass> <PathName>PBPDoc</PathName> </ITMPath> 40 </ITMAccess> </IDXItem> </IDXGroup> <IDXGroup Name="Documents April 2004"> <IDXItem> 45 <DocClass>Bank Statement</DocClass> <ITMIdentifier>A12345</ITMIdentifier> <ITMSender>Westpac Corp</ITMSender> <ITMReceiver>CCStephen</ITMReceiver> <ITMTitle>Statement - Savings Account</ITMTitle> 50 <ITMDate>2004-04-15</ITMDate> <ITMOriginator>Westpac Corp</ITMOriginator> <ITMAccess> <ITMPath> <PathClass>Address</PathClass> 55 <PathName>Reciever</PathName> </ITMPath> <ITMPath> WO 2005/124579 PCT/AU2005/000832 17 <PathClass>AccountInfo</PathClass> <PathName>Identification</PathName> </ITMPath> <ITMPath> 5 <PathClass>TRXSummary</PathClass> <PathName>Summary</PathName> </ITMPath> <ITMPath> <PathClass>ProviderNote</PathClass> 10 <PathName>Information</PathName> </ITMPath> <ITMPath> <PathClass>TRXGroup</PathClass> <PathName>Transactions</PathName> 15 </ITMPath> <ITMPath> <PathClass>TRXBank</PathClass> <PathName>TRX</PathName> </ITMPath> 20 </ITMAccess> </IDXItem> <IDXItem> <DocClass>Bank Statement</DocClass> <ITMIdentifier>C567823</ITMIdentifier> 25 <ITMSender>CCStephen</ITMSender> <ITMReceiver>CCStephen</ITMReceiver> <ITMTitle>Statement - Business Account</ITMTitle> <ITMDate>2004-04-17</ITMDate> <ITMOriginator>Commonwealth Bank</ITMOriginator> 30 <ITMAccess> <ITMPath> <PathClass>Address</PathClass> <PathName>Reciever</PathName> </ITMPath> 35 <ITMPath> <PathClass>AccountInfo</PathClass> <PathName>Identification</PathName> </ITMPath> <ITMPath> 40 <PathClass>TRXSummary</PathClass> <PathName>Summary</PathName> </ITMPath> <ITMPath> <PathClass>ProviderNote</PathClass> 45 <PathName>Information</PathName> </ITMPath> <ITMPath> <PathClass>TRXGroup</PathClass> <PathName>Transactions</PathName> 50 </ITMPath> <ITMPath> <PathClass>TRXBank</PathClass> <PathName>TRX</PathName> </ITMPath> 55 </ITMAccess> </IDXItem> <IDXItem> WO 2005/124579 PCT/AU2005/000832 18 <DocClass>Telephone Account</DocClass> <ITMIdentifier>T43215</ITMIdentifier> <ITMSender>Telstra Corp</ITMSender> <ITMReceiver>CCStephen</ITMReceiver> 5 <ITMTitle>Telephone 9133 3487</ITMTitle> <ITMDate>2004-03-31</ITMDate> <ITMOriginator>Telstra Corp</ITMOriginator> <ITMAccess> <ITMPath> 10 <PathClass>Document</PathClass> <PathName>PBPDoc</PathName> </ITMPath> </ITMAccess> </IDXItem> 15 </IDXGroup> </PBPIndex> This index document holds two 'bank statement' records and one 'telephone account' record. Each access path consists of a block of one or more elements contained by a 20 single element; these containing elements are the identifiers in the "ITMPath" elements of the index list. Reference is made to Fig. 8, showing a corresponding XML schema 78 for the index list and the access paths available to each document. 25 Building fragments A fragment builder 80 has knowledge of the fragments for a particular document, and utilises known application programs to convert each fragment into each of the requested supported forms. The fragments can also include formatting options available to customers (as described below). 30 One objective is to provide disabled people with the ability to deal with their documents in an efficient manner in their chosen form. It may not apply for all customer document reports. This is described as a 'navigation' ability, in that a document can be navigated by its fragments. 35 For each class of document, analysis and mapping must be carried out-to clearly identify the significant blocks of data requiring presentation to the user through navigable means.
WO 2005/124579 PCT/AU2005/000832 19 Consider the bank statement document described above. The following significant blocks of information are needed: " Period Information 5 The period covered by the statement " Account Information Account identification information such as the number * Personal Information Name, address, etc information presented on the statement 10 * Transaction Information The block container of all transactions in the period o Individual Transactions - each transaction within the transaction block " Balance information The starting and ending balances 15 * Summary Information The summary of debits and credits e Message Information A special message or advertising material provided on the statement 20 Indeed, these fragments are evident in the 'bank statement' XML index list 77 given above. Relationships and schema relating to fragment production The following example is an audio fragment, but it applies equally to any fragment. Firstly, it is important that the processing systems be able to clearly identify the elements 25 of the schema that contain actual text that needs to be "spoken". A schema may contain hundreds or even thousands of elements, some mandatory, others optional or dependant on higher level elements in the element "tree" a lesser number of the elements will encapsulate actual text. For this example, assume a schema holds 100 elements, 20 of those elements can contain text, the remaining 80 provide the context in which those text 30 elements are used - the ancestry of the text. Thus it is important in using the chosen WO 2005/124579 PCT/AU2005/000832 20 schema for the system to be able to identify which elements contain text and which elements provide the context of the text. This classification of elements is further complicated by the fact that some elements can 5 contain both text and lower level elements which also contain text, called a mixed model element. An example of a mixed model is emphasis within a paragraph 10 Simple Paragraph <upara>The quick brown fox jumps over the lazy dog.</upara> Mixed Paragraph <upara>The quick brown fox <emphasis type='Italic'>jumps over</emphasis> 15 the lazy dog.</upara> It is obvious that the. second model is more complex as we cannot simply speak the 'upara' element and the 'emphasis' element as there would then be two sound blocks, which in all practicality does not work. 20 The approach is to ignore the mixed element tags (emphasis) and speak all the text contained in the upara element, including that enclosed in the emphasis element, but not the actual tag itself (<emphasis type='Italic'>). This entails the need to clearly identify: 25 * Elements that provide context information * Elements that contain text * Elements that are used within mixed model elements Although it is unlikely that the headings would be spoken differently (although it would 30 be possible to use a different voice for each or tone or even volume for the hard of hearing), it is currently unlikely that this would happen.
WO 2005/124579 PCT/AU2005/000832 21 Component identification Analysis of the chosen schema must be performed to clearly identify the elements that encapsulate complete blocks of text. 5 Definitions: complete block of text - blocks of text that need to be read as a single stream, and is the smallest navigable unit within a voice document (eg. in a simple audio book, this could be a chapter, in a Daisy book, more likely a paragraph.). 10 granularity - the process of deciding the size of the block of text to be read as a single unit, coarse granularity may refer to reading the entire document or a chapter.as a single unit, fine granularity may refer to reading the individual paragraphs as a single unit. Finer granularity enables more precise navigation and searching. 15 Complete blocks of text may contain in-line or nested tags, typically these would relate to emphasis or such like, but in reality, all text contained within the root element of the document could be read in a single stream (ie. the complete book). Actual tags within the text block (but not their text content) need to be ignored in the reading process and this applies during recursion of the nesting process. 20 Where in-line tags occur, or structural tags are treated as in-line tags (such as in treating a complete chapter as a single block of text), it is ensured that removal or ignoring of the inline tags preserves white space and does not cause words to be joined. 25 All elements that are not those encapsulating complete blocks of text are either: a. Inline elements or those regarded as inline elements due to the selection of corse granularity (lower level elements than the elements containing complete blocks of text) - these will be ignored. 30 b. Structural elements (higher level elements than the elements containing complete blocks of text) - these will provide context for the text elements WO 2005/124579 PCT/AU2005/000832 22 Element Types & Usage The three element types described above are used in the following manner i. Inline - ignored ii. Structural - provide context for the text elements (via use of ancestors) 5 iii. Text - contains the text to be read Element Ancestry Although ancestry is less important in voice generation that in say the production of printed matter, it still has some significance and the same basic rules apply. Ancestry is 10 important as a heading tag may be used in both the book title and the chapter title, same element - different ancestry (context). The context of the element is used in creation of the navigation component for the DAISY book. The complete ancestry of an element is typically not of interest, rather just whether element X is anywhere in the ancestry. Element X would normally be unique to a single path and sufficient to identify the 15 context. The fragment builder 80 thus generates - using standard software applications - output files 81 of the appropriate type for each form the source document can take: for example, .pdf for print, MP3 for audio, Braille ASCII for Braille and any convenient file type (eg. 20 MS ReaderTM) for E-book. These are stored in the fragment store 82. C. Reproduction Reproduction is under the control of the management and synchronisation system 84. Both complete rendered documents in the chosen form and rendered document fragments of the chosen form for each navigable component defined by the pathway builder 74 can 25 be reproduced. The chosen reproduction form is achieved by an appropriate mapping process. In one embodiment the following set of applications can be used: Voice generation system - generates DAISY, MP3 and CD audio forms. The process is as follows 30 e Way file generation of each navigable fragment: for this process the prototype Microsoft MAPITM and AT&T VoicesTM software products are used.
WO 2005/124579 PCT/AU2005/000832 23 " MP3 conversion of each fragment: for this process, the shareware/freeware LAME (LAME v.3.96 of 11 April 2004, available to download from http://lame.sourceforge.net) is used. " Author the collection of DAISY files: for this process, a tool based on the access 5 path methods and mapping process is used to output a file to the DAISY format is used. Braille production process Braille production is dependant on two principal driving factors. The first is the selected 10 contraction table which is usually based on the language (US English Braille. UK English Braille, German Braille, etc). The second is the selection of the target Braille code which maps the characters of the language to the dot based Braille code. Although typically English words would have English contractions and English codes (also German->German->German) English words could be written with German 15 contractions and German codes so that a German Braille reader who could speak English could read the English words without having to learn English Braille codes. Braille contractions are driven by large translation tables (one for each language supported). These tables contain the word and the Braille contracted word in the target 20 language. There are rules as to where contractions may be applied, for example some words may not have ending contractions applied if immediately followed by punctuation, etc. In this situation the word will be entered several times in the table, with the punctuation mark appended to the word in the additional entries. In the following hypothetical example, the characters "ing" are replaced in the word "running" but not in 25 the word "running." XML and table fragments illustrate this. Running replace <Braille contracted form> running. no replace running! no replace WO 2005/124579 PCT/AU2005/000832 24 <xmlfragment> <para>The boy was running at the beach. The boy left the room running.</para> <./xmlfragment> 5 <xmlfragment> <para>The <Braille contraction=true>boy </Braille>was <Braille contraction=true>running</Braille>at the beach. The boy left the room <Braille contraction=false>running.</Braille></para> </xmlfragment> 10 In reality all words in the <para> will be tagged with either true or false, but in this example for clarity we have tagged only "running" and "boy". Words that are not tagged do not appear in the translation table, and will be written to an exception file for either addition to the table and reprocessing or they may be handled as Braille 1. The final step is processing to Braille output. 15 E-book generation process Any convenient text conversion software application can be used (eg. Acrobat ReaderTM). The document management and synchronisation system 84 manages and tracks the 20 documents, fragments, XML documents and indexes. The management and synchronisation system 84 interacts with three output interfaces: a physical production interface 86, a web interface 88 and a download interface 90. Physical production 25 The physical production centre 86 uses the pre-built output documents and document fragments to produce physical media to delivered by suitable means to a customer 100. The physical production centre 86 produces the chosen form of either a Braille document 94, a printed document 96, or a storage medium such as a CDROM 98. 30 The web interface 88 employs web pages to call server functionality to deliver electronic files to the client in the following forms: - output documents; WO 2005/124579 PCT/AU2005/000832 25 - output fragments; - index functionality; - searching; and - interactive forms. 5 The web interface 88 is accessed by the customer 102 by any convenient browser application 104. The download interface 90 is a simple web-service or other transfer mechanism to move 10 documents to a customer PC for access purposes. This interface 90 is active when a customer chooses to synchronise documents over the internet. The download interface 90 thus communicates with local PC systems 106, under the control of the customer 108. Turning now to Fig. 5, the management and synchronisation system 84 and download 15 interface 90 of the document server processes 50 are shown. The user server processes 60 correspond broadly with the local PC systems 106 and user 108 shown in Fig. 3. A download interface 120, 122 is provided for the simple PC system solution and the full-function PC system solution, respectively. A simple PC system solution has an index 20 application 124, whereas a full-function PC system has a management application 126. In both cases the user's files are copied to the reproduction computer, including index files 128, output documents and fragments 130 and XML documents 132, in a common store 62. 25 The index application 124 has the ability to read and/or search the customer's index list, and search documents using the XML documents store 132 to present complete documents through a reader application. The management application 126 has the ability to handle various forms of input other 30 than a keyboard or helper application.
WO 2005/124579 PCT/AU2005/000832 26 Four forms of output are provided. A Braille application 128, 130 generates a Braille document using any convenient commercial system, to be delivered to the user 108 in paper form by host or electronically for local printing or for use on a reader/keyboard device. 5 A voice application 132, 134 are generated as described above. Voice fragments are navigable using standard DAISY functionality giving limited levels of navigation through these classes of documents. One way to improve the navigability is to concatenate the index and access the pathways to create longer access pathways. 10 Having done this, the information can be mapped into a DAISY form. This approach delivers navigability in a third party product. An E-book application 136, 138 can be achieved through the use of XSL(T) transformations. 15 Finally, a print application 140, 142 generates a PDF output file. For these simple PC systems, a simple keyboard 150 can be interfaced with the index application 124. For the full-function PC system, a Braille input device 152, voice input device 154 and keyboard 156 can interface with an input conversion application 158, in 20 turn inputting to the management application 126. Print formatting Referring now to Fig. 9, a chosen document format is produced by additional processes 200 on the document server 50. A Style Sheet Builder 210 uses an XML file 212 (shown in Fig. 10) defining the format (typically selected by the customer) to create an XSL:FO 25 style sheet 214 (shown in Fig. 11). This style sheet 214 is then applied by the XSLT processor 216 to the XML document or fragment/s which corresponds to the document required by the customer from the repository 82 to produce an XSL:FO file 218 (shown in Fig. 12). The explicit flow information in the XML document captured in the mark-up cannot be modified by this process. When in final form, the XSL:FO file 216 is 30 processed by the XSL:FO processor 220 to produce the document in a form ready for printing, in this case in PDF format 222.
WO 2005/124579 PCT/AU2005/000832 27 D. Searching Searching can be performed on the index 77 or on the whole document. The index is used for navigation to allow rapid retrieval of a document or fragment, and in addition, the index can be searched for content. Not all information need be in the index, and so 5 the document can also be searched for context. In searching for a telephone number on a phone bill, the search could be restricted to the phone number in the transaction listing sections (ie. access pathway) finding a specific number called, because the information is provided in XML as well as in any user-requested format. In the case of presentment in any form, the functionality is available as the XML used to create the presented document 10 is provided as a basis for searching in context, the choice of customer system will define how the result is presented. In the case of a simple storage solution (left-hand side of item 60 in Fig. 5), an indexer application is provided to the customer on the local PC 108. This will only be able to present a complete document as the result of the search (ie. a phone bill, not a line on the bill). The full function system (right-hand side of item 60 in 15 Fig. 5) or the online system 104 will be able to present just the line item fragments in the format required by the user (say a PDF or voice fragment). E. Other embodiments Special Braille mark-up Images can be represented in print and to a lesser extent in Braille. For example, a square 20 can be represented as four lines intersecting lines of closely spaced Braille impressions forming a square. A pie chart can be represented as a circle of Braille impressions which are intersected by radii at appropriate points. A bar chart can similarly be represented as can a graph. 25 A program that can create regular images in print can also be used to create Braille representations at appropriate sizes for the reader. With images represented in Braille, there are usually descriptions in Braille. These descriptions are usually manually created, as are the Braille images. These manual WO 2005/124579 PCT/AU2005/000832 28 descriptions or annotations of the diagram can be used directly in Audio Books as well as Braille documents. A standard text template be formed for regular images such as geometric shapes, pie or 5 bar charts, graphs and other similar images, and variables can be automatically inserted in the mark-up process so that the particulars of that image can be correctly explained to the Braille reader. A customer can create a Braille image representation and annotation simply by selecting 10 the image type and inserting the variables to define the image. If an embossed image is required, the mark-up will generate the embossed image with the appropriate labels and insert the text of the variables in the annotation template text in a suitable format so that the Braille reader can quickly find out what the image refers to. This also can be applied to non English languages. 15 For example, a person wanting to create a Braille representation of a simple bar chart shown in Fig. 13. The Braille annotation may read as follows: <Annotation> 20 This diagram is titled "People x Age Group". The diagram is a bar chart. The vertical axis shows numbers of people. The bars horizontal axis shows age group categories. The bars are vertical. There are three bars in the diagram. Vertical Bar 1 - Less than 20 years old. The number of people in bar 1 is 20. Vertical Bar 2 - Between 20 and 60 years old. The number of people in bar 2 is 60. 25 Vertical Bar 3 - More than 60 years old. The number of people in bar 3 is 20. </Annotation> The variables to be filled in are: Variable 1 =Title 30 Variable 2 =Diagram Type Variable 3 =Vertical Axis name Variable 4 =Horizontal Axis name WO 2005/124579 PCT/AU2005/000832 29 Variable 5 =Direction of bars (vertical or horizontal) Variable 6 =Number of bars Variable 6 - the number of bars - will determine that there are 6 more variables representing the title and number of each of the three bars: 5 Variable 7 =Title of bar 1 Variable 8 =Size of bar 1 Variable 9 =Title of bar 2 Variable 10 =Size of bar 2 Variable 11 =Title of bar 3 10 Variable 12 =Size of bar 3 The Template is <Template> This diagram is titled "<variable 1>". The diagram is a <variable 2>. The vertical 15 axis shows <variable 3>. The horizontal axis shows <variable 4>. The bars are <variable 5>. There are <variable 6> bars in the diagram. Vertical Bar 1 - <variable 7>. The <variable 3> bar is <variable 8>. Vertical Bar 2 - <Variable 9>. The <variable 3> bar is <variable 10>. Vertical Bar 3 - <variable 11>. The <variable 3> bar is <variable 12>. 20 </Template> The template may not include all of the visual information, such as the shading and horizontal lines shown in Fig. 13, as such information may be confusing to visually impaired people. 25 The same variables can be used to generate the Braille and also the typeset image of the diagram of Fig. 13. Storage and retrieval of Braille images and image annotations 30 Sighted people can search for images from image categories and from descriptions of the images, and can locate possible images and then view the images to select the correct image. Using this technique, in addition to the original image being stored, an annotation WO 2005/124579 PCT/AU2005/000832 30 of the image and a Braille representation of the image can be stored. In this way, someone who has created a Braille representation of an image of the map of Australia and annotated it can store the original image, the Braille representation of the image and the annotation, and make it available for other people to locate and use without having to 5 redo this work. Response capability The facility for customers to provide responses to documents is provided. Foi example, one form of document that is reproduced may be a questionnaire, and responses to the 10 questions can be made by the customer in any desired form (supported by the customer computer), and stored on the document server for subsequent attention. Invoice classification A person with normal vision may get the following invoice information sent to him: 15 1. PDF's of the full invoice. These PDF's should be locked so that the user cannot change them. 2. The invoice information in XML so that he can search the XML and find the relevant information. 3. The invoice information in a form that can be input into the user's accounting 20 system. This may require some categorisation of the sender of the invoice or the type of invoice that the sender dispatches, if the sender dispatches more than one invoice. See below. 4. Fragments of the invoice for display to the user - eg. a line in the invoice. This is of lesser importance for a sighted user, but there may be some applications where 25 this is requested. A customer may be permitted to classify invoices into categories so that a phone bill from a Telco will entered correctly into the accounting system. There are two ways to do this: build a table or file using a mapping process that is translated from the XML to some 30 input format for the customer's accounting system, or allow the user to enter his own classification code so that all bills from the Telco will go into chart of accounts entry 23, for example. If the Telco sends accounts for Internet and phones, the customer may be WO 2005/124579 PCT/AU2005/000832 31 permitted to look at the bill and classify it, or to classify the Telco account number on the invoice. These arrangements (ie. response capability and invoice classification) utilise the 5 repository 73 on the document server side.

Claims (30)

1. A method for reproducing a requested source document in a requested one of available forms comprising the steps of: 5 (a) for each one of a plurality of documents: (i) applying at least one access pathway to a marked-up form of the document, said access pathways defining discrete parts of the document; and (ii) generating a fragment of said marked-up document for each said 10 access pathway for each available form; and (b) generating a requested one or more parts of a source document in a requested form from the respective stored fragments.
2. The method of claim 1, wherein said access pathways are defined in a 15 configuration file.
3. The method of claim 2, wherein said documents are assigned to a respective a plurality of classes, and there is a configuration file for each said class. 20
4. The method of clam 3, wherein said configuration file includes requestable information relating to available format that is added to said fragments.
5. The method of claim 3 or 4, comprising the further step of marking-up said source documents according to a schema, and wherein there is a separate schema for each said 25 class.
6. The method of claim 5, further comprising creating an index list for each request maker, said index list including a set of documents available to each request maker, and lists the access pathways for each fragment of each document. 30 WO 2005/124579 PCT/AU2005/000832 33
7. The method of any one of the preceding claims, wherein one said fragment comprises the entire source document.
8. The method of any one of the preceding claims, wherein said marked-up 5 documents and said configuration files are in XML code.
9. The method of any one of the preceding claims, wherein said requested forms include electronic, print, audio and Braille.
10 10. The method of any one of the preceding claims, wherein said generating step includes transmitting an electronic file of said respective fragments from a server computer to a remote computer where reproduction is performed.
11. The method of claim 10, wherein, at said remote computer, a requested document 15 is navigable by said fragments.
12. A method for reproducing a requested source document in a requested one of available forms. 20
13. A method for reproducing a requested source document in a requested one of available forms and formats.
14. A method for reproducing a requested source document in a requested one of available forms comprising the steps of: 25 (a) at a document server, for each one of a plurality of documents: (i) applying at least one access pathway to a marked-up form of the document, said access pathways defining discrete parts of the document; and (ii) generating a fragment of said marked-up document for each said 30 access pathway for each available form; (b) transmitting said fragments for said requested form for a requested document over a communication channel; and WO 2005/124579 PCT/AU2005/000832 34 (c) at a remote computer connected to said communication channel, generating a requested one or more parts of a source document in a requested form from the respective stored fragments. 5
15. The method of claim 14, further comprising the features of any one of claims 2 to 13.
16. A computer system for reproducing a requested source document in a requested *one of available forms comprising a processor programmed to: 10 (a) for each one of a plurality of documents: (i) apply at least one access pathway to a marked-up form of the document, said access pathways defining discrete parts of the document; and (ii) generate a fragment of said marked-up document for each said access 15 pathway for each available form; and (b) generate a requested one or more parts of a source document in a requested form from the respective stored fragments.
17. The computer system of claim 16, wherein said processor is programmed to 20 perform the steps of any one of claims 2 to 13.
18. A computer system for reproducing a requested source document in a requested one of available forms comprising: (a) a document server programmed to, for each one of a plurality of documents: 25 (i) apply at least one access pathway to a marked-up form of the document, said access pathways defining discrete parts of the document; and (ii) generate a fragment of said marked-up document for each said access pathway for each available form; 30 (b) a transmission channel for transmitting said fragments for said requested form for a requested document; and WO 2005/124579 PCT/AU2005/000832 35 (c) a remote computer connected to said communication channel, operable to generate a requested one or more parts of a source document in a requested form from the respective stored fragments. 5
19. The computer system of claim 18, further comprising customer computer means coupled to said communication channel and by which requests for documents and document forms are made to said document server.
20. A computer program product comprising a computer program on a storage 10 medium, said computer program comprising code means for performing the steps of any one of claims I to 15.
21. A computer program comprising code means for performing the steps of any one of claims I to 15. 15
22. A method for converting a requestable source document in a requestable one of available forms to be available for reproduction comprising the steps of: for each one of a plurality of documents: (i) applying at least one access pathway to a marked-up form of the 20 document, said access pathways defining discrete parts of the document; and (ii) generating a fragment of said marked-up document for each said access pathway for each available form. 25
23. The method of claim 22, wherein said access pathways are defined in a configuration file.
24. The method of claim 22, wherein said documents are assigned to a respective a plurality of classes, and there is a configuration file for each said class. 30
25. The method of clam 24, wherein said configuration file includes requestable information relating to available format that is added to said fragments. WO 2005/124579 PCT/AU2005/000832 36
26. The method of claim 24 or 25, comprising the further step of marking-up said source documents according to a schema, and wherein there is a separate schema for each said class. 5
27. The method of claim 26, further comprising creating an index list for each request maker, said index list including a set of documents available to each request maker, and lists the access pathways for each fragment of each document. 10
28. The method of any one of claims 22 to 27, wherein one said fragment comprises the entire source document.
29. The method of any one of claims 22 to 28, wherein said marked-up documents and said configuration files are in XML code. 15
30. The method of any one of claims 22 to 29, wherein said requested forms include electronic, print, audio and Braille.
AU2005255044A 2004-06-17 2005-06-10 Reproduction of documents into requested forms Ceased AU2005255044B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2005255044A AU2005255044B2 (en) 2004-06-17 2005-06-10 Reproduction of documents into requested forms

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
AU2004903307 2004-06-17
AU2004903307A AU2004903307A0 (en) 2004-06-17 Reproduction of Documents into Requested Forms
PCT/AU2005/000832 WO2005124579A1 (en) 2004-06-17 2005-06-10 Reproduction of documents into requested forms
AU2005255044A AU2005255044B2 (en) 2004-06-17 2005-06-10 Reproduction of documents into requested forms

Publications (2)

Publication Number Publication Date
AU2005255044A1 AU2005255044A1 (en) 2005-12-29
AU2005255044B2 true AU2005255044B2 (en) 2010-03-04

Family

ID=37636174

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2005255044A Ceased AU2005255044B2 (en) 2004-06-17 2005-06-10 Reproduction of documents into requested forms

Country Status (1)

Country Link
AU (1) AU2005255044B2 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6029182A (en) * 1996-10-04 2000-02-22 Canon Information Systems, Inc. System for generating a custom formatted hypertext document by using a personal profile to retrieve hierarchical documents
US6397231B1 (en) * 1998-08-31 2002-05-28 Xerox Corporation Virtual documents generated via combined documents or portions of documents retrieved from data repositories
US6591289B1 (en) * 1999-07-27 2003-07-08 The Standard Register Company Method of delivering formatted documents over a communications network

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6029182A (en) * 1996-10-04 2000-02-22 Canon Information Systems, Inc. System for generating a custom formatted hypertext document by using a personal profile to retrieve hierarchical documents
US6397231B1 (en) * 1998-08-31 2002-05-28 Xerox Corporation Virtual documents generated via combined documents or portions of documents retrieved from data repositories
US6591289B1 (en) * 1999-07-27 2003-07-08 The Standard Register Company Method of delivering formatted documents over a communications network

Also Published As

Publication number Publication date
AU2005255044A1 (en) 2005-12-29

Similar Documents

Publication Publication Date Title
Harold XML 1.1 Bible
Nolan et al. XML and web technologies for data sciences with R
US8719696B2 (en) Production of documents
US20060064631A1 (en) Method and apparatus for automated authoring and marketing
US20070255694A1 (en) Document-drafting system using document components
US20070182990A1 (en) Reproduction of documents into requested forms
Perugini et al. Enhancing usability in CITIDEL: multimodal, multilingual, and interactive visualization interfaces
US9854109B2 (en) Document output processing
Lubas et al. The metadata manual: a practical workbook
Yesilada et al. Rendering tables in audio: the interaction of structure and reading styles
US8687210B2 (en) Document data access
Byrne MARC manual: Understanding and using MARC records
US20070055682A1 (en) Rhetorical content management system and methods
KR102126342B1 (en) Electronic document braille translation system and a method therefor
AU2005255044B2 (en) Reproduction of documents into requested forms
GB2458692A (en) A process for generating database-backed, web-based documents
Coleman et al. SGML as a Framework for Digital Preservation and Access.
Hockey Making Technology Work for Scholarship: Investing in the Data.
Greulich Embedded indexing with Word. Part 5–locators other than page numbers
KR101965527B1 (en) A System of Formation of Estimate Sheet with Advertisement URL
Jenstad “Tangled in a Net”: Shakespeare the Adaptor/Shakespeare as Source 1
Hockey The reality of electronic editions
JP3501240B2 (en) Document creation support device
Desmarais ABCs of XML: The Librarian’s Guide to the eXtensible Markup Language
Youngen TEX-based production at AMS

Legal Events

Date Code Title Description
FGA Letters patent sealed or granted (standard patent)
MK14 Patent ceased section 143(a) (annual fees not paid) or expired