US20130128315A1 - Content-aware method for saving paper and ink while printing a pdf document - Google Patents
Content-aware method for saving paper and ink while printing a pdf document Download PDFInfo
- Publication number
- US20130128315A1 US20130128315A1 US12/849,519 US84951910A US2013128315A1 US 20130128315 A1 US20130128315 A1 US 20130128315A1 US 84951910 A US84951910 A US 84951910A US 2013128315 A1 US2013128315 A1 US 2013128315A1
- Authority
- US
- United States
- Prior art keywords
- pdf document
- document
- content
- pages
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/12—Digital output to print unit, e.g. line printer, chain printer
- G06F3/1201—Dedicated interfaces to print systems
- G06F3/1202—Dedicated interfaces to print systems specifically adapted to achieve a particular effect
- G06F3/1203—Improving or facilitating administration, e.g. print management
- G06F3/1208—Improving or facilitating administration, e.g. print management resulting in improved quality of the output result, e.g. print layout, colours, workflows, print preview
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/12—Digital output to print unit, e.g. line printer, chain printer
- G06F3/1201—Dedicated interfaces to print systems
- G06F3/1202—Dedicated interfaces to print systems specifically adapted to achieve a particular effect
- G06F3/1218—Reducing or saving of used resources, e.g. avoiding waste of consumables or improving usage of hardware resources
- G06F3/1219—Reducing or saving of used resources, e.g. avoiding waste of consumables or improving usage of hardware resources with regard to consumables, e.g. ink, toner, paper
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/12—Digital output to print unit, e.g. line printer, chain printer
- G06F3/1201—Dedicated interfaces to print systems
- G06F3/1223—Dedicated interfaces to print systems specifically adapted to use a particular technique
- G06F3/1237—Print job management
- G06F3/125—Page layout or assigning input pages onto output media, e.g. imposition
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/103—Formatting, i.e. changing of presentation of documents
- G06F40/114—Pagination
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/12—Digital output to print unit, e.g. line printer, chain printer
- G06F3/1201—Dedicated interfaces to print systems
- G06F3/1223—Dedicated interfaces to print systems specifically adapted to use a particular technique
- G06F3/1237—Print job management
- G06F3/1244—Job translation or job parsing, e.g. page banding
Definitions
- This disclosure generally relates to printing. More particularly, the disclosure relates to the reduction of paper and/or ink utilized for printing.
- green technologies have been developed for computers, automobiles, household appliances, etc.
- a computer program product includes a computer useable medium having a computer readable program.
- the computer readable program when executed on a computer causes the computer to preprocess a PDF document having a first quantity of pages of content. Further, the computer readable program when executed on a computer causes the computer to perform complexity analysis on the PDF document.
- computer readable program when executed on a computer causes the computer to create a document object model based on the PDF document.
- the computer readable program when executed on a computer also causes the computer to perform document object model analysis on the PDF document with the document object model. Further, computer readable program when executed on a computer causes the computer to perform content repurposing of the PDF document.
- the computer readable program when executed on a computer causes the computer to perform content re-layout of the PDF document based on the content repurposing such that a printer prints the PDF document with the content in a transformed format that results in a second quantity of pages being printed without falling below a predetermined readability threshold, the second quantity of pages being less than the first quantity of pages.
- a process preprocesses, with a processor, a PDF document having a first quantity of pages of content. Further, the process performs, with the processor, complexity analysis on the PDF document. In addition, the process creates, with the processor, a document object model based on the PDF document. The process also performs, with the processor, document object model analysis on the PDF document with the document object model. Further, the process performs, with the processor, content repurposing of the PDF document.
- the process performs, with the processor, content re-layout of the PDF document based on the content repurposing such that a printer prints the PDF document with the content in a transformed format that results in a second quantity of pages being printed without falling below a predetermined readability threshold, the second quantity of pages being less than the first quantity of pages.
- a system in yet another aspect of the disclosure, includes a green print module that receives an indication that a printout of a PDF document from a printer has been requested.
- the PDF document has content with a format that would result in a first quantity of pages being printed.
- the system has a processor that (i) preprocesses the PDF document, (ii) perform complexity analysis on the PDF document, (iii) creates a document object model based on the PDF document, (iv) performs document object model analysis on the PDF document with the document object model, (v) performs content repurposing of the PDF document by utilizing the document object model, and (vi) performs content re-layout of the PDF document based on the content repurposing such that a printer prints the PDF document with the content in a transformed format that results in a second quantity of pages being printed without falling below a predetermined readability threshold.
- the second quantity of pages being less than the first quantity of pages.
- FIG. 1 illustrates a green printing configuration
- FIG. 2 illustrates a green print module that may be utilized with the green printing configuration.
- FIG. 3 illustrates a process that is utilized by the green print module illustrated in FIG. 2 to analyze a PDF document for the purpose of selecting one or more transformations to the PDF document for green printing.
- FIG. 4A illustrates an example of a first page of a PDF document.
- FIG. 4B illustrates an example of a second page of the PDF document.
- FIG. 4C illustrates an example of a third page of the PDF document.
- FIG. 4D illustrates a PDF green document that results from the process illustrated in FIG. 3 being applied to the first page, second page, and third page of the PDF document.
- FIG. 5 illustrates a system configuration that may be utilized for green printing of a PDF document.
- a green printing configuration is provided herein that repurposes a PDF document to save paper and/or ink.
- the green printing configuration is a configuration that reduces the number of pages and/or ink utilizing to print a PDF document through content repurposing, which is an approach that transforms content and the formatting of the content to shrink the size of the content to fit into fewer pages at the same time as retaining the readability and aesthetics of the document.
- the green printing configuration may be a method, system, computing device, computer program stored on a computing device, computer program stored on a printer, computer module that may be downloaded through a network, plug-in, extension, etc.
- a user may print in a single click to a default printer so that the content takes up fewer pages less ink than a typical printed document.
- a PDF document may have text, images, and vector arts as primary building blocks. These objects are placed on pages in a PDF in a static fashion.
- a PDF file is typically difficult to modify as the entire content in the PDF file is placed content with fixed positions and the relationship between content spread across pages does not exist. For example, paragraphs or tables spanning multiple pages are treated as separate objects. The present of content such as footer text makes relating the objects more difficult. Accordingly, in the context of printing, the single biggest redundancy in a PDF document is that content from one page cannot flow into content from another page. As a result, large vacant spaces present in PDF pages remain unutilized on paper.
- redundant white space may also be present in PDF pages. These other forms of redundant white space include margins, gaps between objects, etc.
- the green printing configuration exploits redundant whitespace and large text/Images present in a PDF document to reduce paper and ink for printing the PDF document. Further, the green printing configuration determines a relationship between objects that should be placed together.
- content repurposing is utilized to automatically shrink content in a PDF document.
- the logical structure is extracted form the input PDF file with a heuristic rule model.
- a re-layout is performed on the logical structure after applying green transformations.
- the green transformations may include changing the page orientation, reducing text sizes, scaling images, flowing content from one page into another (e.g., disregarding page boundaries), and/or removal of unimportant content such as a cover page, header/footer, background fills, etc.
- graying of text and images may be utilized to save ink for printing.
- the transformations utilized for content repurposing have the goal of reducing the overall redundancy in a document. Although a group of transformations together may reduce the redundancy, one or more of the transformations may individually increase the redundancy. The redundancy for an individual transformation may be increased to accentuate certain characteristics of the document to add to the aesthetic appeal and/or readability of the document. For example, if and when a document is printed with multiple pages per paper sheet, a green print program may automatically decide to increase the size of the text to make the text more readable.
- a transformation may have a magnitude associated therewith.
- the magnitude may be binary.
- the binary magnitude may indicate whether a transformation is applied is or not applied, e.g., “0” equals transformation is not to be applied and “1” equals transformation is to be applied.
- the magnitude may have a set of predefined discrete values.
- the magnitude may take continuous values.
- Each instance of a transformation type may have a transformation cost associated therewith for a particular document type.
- a particular sizing transformation may have a different transformation cost for a PDF document than for a spreadsheet.
- each transformation may have an associated saved paper quantity. For example, a particular textual transformation may save one tenth of a sheet of paper.
- each transformation may have an associated saved ink quantity. For example, a particular textual transformation may save one half an ounce of ink. The transformation may potentially have both a saved paper quantity and a saved ink quantity if both paper and ink would be saved as a result of the transformation.
- FIG. 1 illustrates a green printing configuration 100 .
- a computing device 102 is illustrated as a PC. Further, as an example, the computing device 102 is operably connected to a printer 104 through a wireline connection.
- the term computing device 102 is herein intended to include a personal computer (“PC”), desktop computer, laptop, notebook, cell phone, smart phone, personal digital assistant (“PDA”), kiosk, etc. Further, the computing device 102 may be a client, server, network device, etc.
- the printer 104 may be a printing device that is separately connected, e.g., through a wireline or wireless connection, to the computing device 102 , built into the computing device 102 , etc.
- a wireless connection may receive and/or send data through a Radio Frequency (“RF”) transmission, an Infrared (“IR”) transmission, or the like.
- RF Radio Frequency
- IR Infrared
- the printer 104 may or may not be part of a network. Further, the printer 104 may utilize any type of printing methodology to print on paper 106 , e.g. laser printing, ink jet printing, or the like.
- FIG. 2 illustrates a green print module 202 that may be utilized with the green printing configuration 100 .
- the green print module 202 may be stored in the computing device 102 or the printer 104 .
- the green print module 202 may analyze the content of multiple pages, sections, etc. of a PDF document to repurpose the content to save paper and/or ink.
- the green print module 202 may analyze a first page 206 and a second page 208 of a PDF document 204 .
- the green print module 202 may then repurpose the content of the PDF document 204 so that a green PDF document 210 may be printed.
- the green PDF document 210 has a repurposed page 210 that has the content from the first page 206 and the second page 208 in a readable format. Lines are provided in a document as illustrated in drawings such as FIG. 2 to represent text, symbols, shapes, images, and/or the like.
- FIG. 3 illustrates a process 300 that is utilized by the green print module 202 illustrated in FIG. 2 to analyze a PDF document for the purpose of selecting one or more transformations to the PDF document for green printing.
- the process 300 preprocesses a PDF document.
- the pre-processing acquires information about the PDF document.
- the information may include identifying text, images, vector objects, dimensions (bounding boxes), text fonts, etc. These elements can be represented in a page as cells of well defined bound boxes. These cells may be grouped together to maintain the reading context of the individual element/cell. For example, overlapping vector graphics cells may be grouped together so that they can be formatted individually and placed in their entirety.
- the individual objects may be placed far apart, which may lead a less readable output.
- original multicolumn text cells may be identified and merged in to a single text cell containing the entire multicolumn text in reading order.
- the preprocessing determines how many pages may be saved in the PDF document to figure out whether or not the number of pages should be reduced. For example, if the PDF document has only one page, then the green printing configuration will only save ink.
- a subprocess may be utilized to identify the cell groups.
- the subprocess may find the set of isolated cells (bounding boxes) B in the PDF page. Further, the subprocess may group the cells in B by utilizing heuristic rules which output the logical elements in the page. For every cell C in B, the subprocess groups the cells in B utilizing heuristic rules, which output the logical elements in the PDF page. For every cell C in B, the set of cells B′ that lies in the proximity of C is found. The cells in B′ can either lie in horizontal proximity or in vertical proximity. A plurality of factors may be utilized to make the decision as to which cells are chosen. The type of individual cells is a factor.
- text cells will be merged with the graphic cells, e.g., image/vector arts, only when the text cells overlap with the graphic cells.
- graphic cells can be merged with text cells even if they don't overlap with text cells, but lie in a small proximity.
- Another factor is that a merged cell should not contain empty areas beyond a certain threshold.
- multi-column text cells should not be merged as is, but rather in a single text cell in which all the text of individual cells should be added in the reading order.
- the subprocess creates a merged cell merged(C). For every region C′ in B, a determination is made to figure out whether or not C′ was merged with merged cell merged(C). If C′ was merged with merged cell merged(C), C′ is removed from B. Further, the subprocess removes C from B. In addition, the subprocess adds the new merged cell merged(C) to B. After the initial finding of isolated cells, the subprocess repeats the remaining portions of the subprocess for every other cell present in the set B.
- the process 300 performs a complexity analysis.
- the grouped cells are utilized to categorize certain pages and some content elements as being too complex for complete transformation. Certain rules are utilized for such categorization, e.g. the number of overlapped images/vectors in a given PDF page, presence of form field, etc. Such pages are typically converted to raster and replaced. This mechanism of complexity analysis also ensures the correctness of the entire green printing of PDF document approach.
- the preprocessing allows the subprocess to find out the source of the document. For example, if the document that was converted to a PDF was a word processing document rather than a spreadsheet program, then one type of green transformations are utilized for a word processing document.
- DOM document object model
- a PDF document is generally not created by keeping any specific document structure in mind.
- a tagged PDF provides some information in terms of logical constructs such as a table of contents, paragraph, tables, drawing, etc.
- the creation of the DOM model first involves tagging the input PDF if the PDF is not already tagged. Subsequently, higher level constructs such as cover page, reference page, background image, etc. will be identified utilizing a heuristic rule based model. The grouped cells identified at the process block 302 will be tagged as high level document constructs utilizing this rule model. These constructs will be considered while applying a transformation.
- This structure creation is helpful because a PDF document inherently does not support any document structure or any relationship between different objects. Therefore, a high level flow able structure is created to be utilized in content re-layout. This structure extraction is not same as object recognition in images, etc. Semantic tags are applied that are helpful in printing. For example, a group of vector paths may not necessarily be a single diagram, but is labeled so, because the label helps in placing all the vector arts together while laying out the output pages.
- the process 300 advances to a process block 308 to perform DOM analysis.
- Certain entities are identified in the PDF document.
- the process 300 may identify page labels and object labels.
- a page label may be a page that is labeled as a cover page, table of contents page, content page, reference/index page, etc.
- An object label may be a PDF object in a page that is labeled as a header/footer, paragraph, main heading, sub heading, figure/background, table, figure/table caption, etc. These provide a way to control the formatting of content in the output. For example, headings and table/figure captions on all output pages need to be have the same font size so that the overall appearance of the output is consistent. In one embodiment, heuristic rules are utilized to identify these entities.
- the heuristic rules are based on characteristics such as sequence of appearance for pages, location on the page, and relative text sizes for objects only containing text.
- the base probabilities for these characteristics for each of the entities may be determined by analyzing large number of real world PDF files to create a training data set.
- a subprocess that labels given pages/objects takes a set of rules R (R 1 . . . R k ) such that each jth rule provides a basic success probability P(R j ). For example, if a rule R j for any construct C is computed as successful, that rule R j will increase the probability of identifying that PDF object as construct C by P(R j ).
- the document construct identification may be performed as follows. The set of rules that identifies an input document construct C is found. For every rule C′ in C, a determination is made as to whether or not the rule is successful for the given page/cells.
- the process 300 then advances to a process block 310 to perform content repurposing.
- Certain green transformations are applied on the document constructs. For example, a paragraph may have some text with large font size, color, and large blank line space. The font size is reduced by analyzing all the font sizes present in the PDF document and normalizing them. Also line spacing may be reduced without affecting the readability of the paragraph.
- CMYK is a subtractive color model that is utilized in color printing and refers to the four inks utilized in color printing: cyan, magenta, yellow, and black.
- the process 300 advances to a process block 312 to perform content re-layout.
- the content re-layout decides the final placement of the content in the output.
- a special re-layout engine may be utilized to convert the high level constructs in to concrete PDF objects and to perform all necessary scaling.
- the re-layout engine may also take placement decisions for the output.
- the content re-layout also reduces the vertical gaps between objects, manages bounding boxes, etc.
- the content re-layout also attempts to fully utilize the pages that the content will occupy eventually, i.e., fit-to-page. As a result of the content re-layout, a green PDF document is created.
- a user may provide an input such as selecting a button to only save ink.
- the user may provide an input such as selecting a button to only save paper.
- the user may provide an input such as selecting a button to save both ink and paper.
- FIG. 4A illustrates an example of a first page 400 of a PDF document.
- FIG. 4B illustrates an example of a second page 420 of the PDF document.
- FIG. 4C illustrates an example of a third page 440 of the PDF document.
- FIG. 4D illustrates a PDF green document 460 that results from the process 300 illustrated in FIG. 3 being applied to the first page 400 , second page 420 , and third page 440 of the PDF document.
- FIG. 5 illustrates a system configuration 500 that may be utilized for green printing of a PDF document.
- the green print module 202 interacts with a memory 502 .
- the green print module 202 generates a variety of potential repurposed documents that meet a readability threshold.
- a first potential repurposed PDF document 504 may have text according to one format that meets the readability threshold
- a second potential repurposed document 506 may have text according to another format that meets the readability threshold
- a third potential repurposed document 508 may have text according to yet another format that meets the readability threshold.
- the green print module 202 may select one final repurposed document from these potential repurposed documents by utilizing a repurposing quantifier.
- the repurposing quantifier may be the result of an equation that is based on the number of pages reduced and the degradation of the readability. Even if all of the potential repurposed documents meet the readability threshold, a higher score will generally be given to a first repurposed document that degrades readability less than a second repurposed document with the same number of reduced pages. Further, even if all of the potential repurposed documents meet the readability threshold, a higher score will generally be given to a first repurposed document that reduces more pages than a second repurposed document with the same readability degradation. In other words, the highest score will be given to the potential repurposed document that as a whole minimizes readability degradation and maximizes page reduction better than the other potential repurposed documents.
- the green print module 202 After the green print module 202 selects a potential repurposed document, the green print module 202 provides the repurposed PDF document to a processor 510 . Further, the processor 510 applies the transformations in the potential repurposed document to the document so that the user may print the final repurposed document.
- the processor 510 interacts with input/output (“I/O”) devices 512 . For example, the processor 1012 receives an input from a user through a keyboard to print the document. The processor 1012 may then print the repurposed document on a printer.
- I/O input/output
- a green score may be indicated for a PDF document.
- the system configuration 100 may inform a user how green the PDF document is by utilizing any of the scoring methodologies described herein.
- a display device may provide a user with an indication of what changes the user can make to the PDF document to save paper and/or ink. The indication may be provided during the user's editing of the PDF document. Alternatively, the indication may be provided to the user when the user is not editing the PDF document.
- the system configuration 500 is suitable for storing and/or executing program code and is implemented using a general purpose computer or any other hardware equivalents.
- the processor 510 is coupled, either directly or indirectly, to the memory 1002 through a system bus.
- the memory 502 can include local memory employed during actual execution of the program code, bulk storage, and/or cache memories which provide temporary storage of at least some program code in order to reduce the number of times code must be retrieved from bulk storage during execution.
- the I/O devices 512 can be coupled directly to the system 1000 or through intervening input/output controllers. Further, the I/O devices 512 can include a keyboard, a keypad, a mouse, a microphone for capturing speech commands, a pointing device, and other user input devices that will be recognized by one of ordinary skill in the art. Further, the I/O devices 512 can include output devices such as a printer, display screen, or the like. Further, the I/O devices 512 can include a receiver, transmitter, speaker, display, image capture sensor, biometric sensor, etc. In addition, the I/O devices 512 can include storage devices such as a tape drive, floppy drive, hard disk drive, compact disk (“CD”) drive, etc.
- CD compact disk
- Network adapters may also be coupled to the system configuration 500 to enable the system configuration 500 to become coupled to other systems, remote printers, or storage devices through intervening private or public networks.
- Modems, cable modems, and Ethernet cards are just a few of the currently available types of network adapters.
- the processes described herein may be implemented in a general, multi-purpose or single purpose processor. Such a processor will execute instructions, either at the assembly, compiled or machine-level, to perform the processes. Those instructions can be written by one of ordinary skill in the art following the description of the figures corresponding to the processes and stored or transmitted on a computer readable medium. The instructions may also be created using source code or any other known computer-aided design tool.
- a computer readable medium may be any medium capable of carrying those instructions and include a CD-ROM, DVD, magnetic or other optical disc, tape, silicon memory (e.g., removable, non-removable, volatile or non-volatile), packetized or non-packetized data through wireline or wireless transmissions locally or remotely through a network.
- a computer is herein intended to include any device that has a general, multi-purpose or single purpose processor as described above.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Quality & Reliability (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Record Information Processing For Printing (AREA)
Abstract
Description
- 1. Field
- This disclosure generally relates to printing. More particularly, the disclosure relates to the reduction of paper and/or ink utilized for printing.
- 2. General Background
- Recent attempts have been made to make technology more environmentally friendly. The resulting technologies are typically called green technologies. For example, green technologies have been developed for computers, automobiles, household appliances, etc.
- With respect to computing technologies, a significant environmental concern stems from the printing of paper. Computer users may use compute printers to print large quantities of various types of documents, which typically leads to the use of large amounts of paper. A large demand for paper may lead to the destruction of large quantities of trees, which may have a negative impact on the environment.
- As a result, computer users are typically encouraged to reduce the amount of paper utilized for printing by scaling multiple pages down to fit on a single page. A problem with this approach is that the readability and aesthetics of the page are severely hampered. A user may have such a difficult time reading text that has been miniaturized so that two or more pages may fit on one side of a sheet of paper (the other side may also have two or more pages) that the user may simply choose not to utilize green printing and may print in the typical manner, which would not be environmentally friendly.
- In one aspect of the disclosure, a computer program product is provided. The computer program product includes a computer useable medium having a computer readable program. The computer readable program when executed on a computer causes the computer to preprocess a PDF document having a first quantity of pages of content. Further, the computer readable program when executed on a computer causes the computer to perform complexity analysis on the PDF document. In addition, computer readable program when executed on a computer causes the computer to create a document object model based on the PDF document. The computer readable program when executed on a computer also causes the computer to perform document object model analysis on the PDF document with the document object model. Further, computer readable program when executed on a computer causes the computer to perform content repurposing of the PDF document. In addition, the computer readable program when executed on a computer causes the computer to perform content re-layout of the PDF document based on the content repurposing such that a printer prints the PDF document with the content in a transformed format that results in a second quantity of pages being printed without falling below a predetermined readability threshold, the second quantity of pages being less than the first quantity of pages.
- In another aspect of the disclosure, a process is provided. The process preprocesses, with a processor, a PDF document having a first quantity of pages of content. Further, the process performs, with the processor, complexity analysis on the PDF document. In addition, the process creates, with the processor, a document object model based on the PDF document. The process also performs, with the processor, document object model analysis on the PDF document with the document object model. Further, the process performs, with the processor, content repurposing of the PDF document. In addition, the process performs, with the processor, content re-layout of the PDF document based on the content repurposing such that a printer prints the PDF document with the content in a transformed format that results in a second quantity of pages being printed without falling below a predetermined readability threshold, the second quantity of pages being less than the first quantity of pages.
- In yet another aspect of the disclosure, a system is provided. The system includes a green print module that receives an indication that a printout of a PDF document from a printer has been requested. The PDF document has content with a format that would result in a first quantity of pages being printed. Further, the system has a processor that (i) preprocesses the PDF document, (ii) perform complexity analysis on the PDF document, (iii) creates a document object model based on the PDF document, (iv) performs document object model analysis on the PDF document with the document object model, (v) performs content repurposing of the PDF document by utilizing the document object model, and (vi) performs content re-layout of the PDF document based on the content repurposing such that a printer prints the PDF document with the content in a transformed format that results in a second quantity of pages being printed without falling below a predetermined readability threshold. The second quantity of pages being less than the first quantity of pages.
- The above-mentioned features of the present disclosure will become more apparent with reference to the following description taken in conjunction with the accompanying drawings wherein like reference numerals denote like elements and in which:
-
FIG. 1 illustrates a green printing configuration. -
FIG. 2 illustrates a green print module that may be utilized with the green printing configuration. -
FIG. 3 illustrates a process that is utilized by the green print module illustrated inFIG. 2 to analyze a PDF document for the purpose of selecting one or more transformations to the PDF document for green printing. -
FIG. 4A illustrates an example of a first page of a PDF document. -
FIG. 4B illustrates an example of a second page of the PDF document. -
FIG. 4C illustrates an example of a third page of the PDF document. -
FIG. 4D illustrates a PDF green document that results from the process illustrated inFIG. 3 being applied to the first page, second page, and third page of the PDF document. -
FIG. 5 illustrates a system configuration that may be utilized for green printing of a PDF document. - A green printing configuration is provided herein that repurposes a PDF document to save paper and/or ink. The green printing configuration is a configuration that reduces the number of pages and/or ink utilizing to print a PDF document through content repurposing, which is an approach that transforms content and the formatting of the content to shrink the size of the content to fit into fewer pages at the same time as retaining the readability and aesthetics of the document. The green printing configuration may be a method, system, computing device, computer program stored on a computing device, computer program stored on a printer, computer module that may be downloaded through a network, plug-in, extension, etc. In one embodiment, a user may print in a single click to a default printer so that the content takes up fewer pages less ink than a typical printed document.
- A PDF document may have text, images, and vector arts as primary building blocks. These objects are placed on pages in a PDF in a static fashion. A PDF file is typically difficult to modify as the entire content in the PDF file is placed content with fixed positions and the relationship between content spread across pages does not exist. For example, paragraphs or tables spanning multiple pages are treated as separate objects. The present of content such as footer text makes relating the objects more difficult. Accordingly, in the context of printing, the single biggest redundancy in a PDF document is that content from one page cannot flow into content from another page. As a result, large vacant spaces present in PDF pages remain unutilized on paper.
- Other forms of redundant white space may also be present in PDF pages. These other forms of redundant white space include margins, gaps between objects, etc.
- The green printing configuration exploits redundant whitespace and large text/Images present in a PDF document to reduce paper and ink for printing the PDF document. Further, the green printing configuration determines a relationship between objects that should be placed together. In one embodiment, content repurposing is utilized to automatically shrink content in a PDF document. The logical structure is extracted form the input PDF file with a heuristic rule model. A re-layout is performed on the logical structure after applying green transformations. The green transformations may include changing the page orientation, reducing text sizes, scaling images, flowing content from one page into another (e.g., disregarding page boundaries), and/or removal of unimportant content such as a cover page, header/footer, background fills, etc. In one embodiment, graying of text and images may be utilized to save ink for printing.
- The transformations utilized for content repurposing have the goal of reducing the overall redundancy in a document. Although a group of transformations together may reduce the redundancy, one or more of the transformations may individually increase the redundancy. The redundancy for an individual transformation may be increased to accentuate certain characteristics of the document to add to the aesthetic appeal and/or readability of the document. For example, if and when a document is printed with multiple pages per paper sheet, a green print program may automatically decide to increase the size of the text to make the text more readable.
- A transformation may have a magnitude associated therewith. In one embodiment, the magnitude may be binary. The binary magnitude may indicate whether a transformation is applied is or not applied, e.g., “0” equals transformation is not to be applied and “1” equals transformation is to be applied. In another embodiment, the magnitude may have a set of predefined discrete values. In another embodiment, the magnitude may take continuous values.
- Each instance of a transformation type may have a transformation cost associated therewith for a particular document type. In other words, a particular sizing transformation may have a different transformation cost for a PDF document than for a spreadsheet.
- Further, each transformation may have an associated saved paper quantity. For example, a particular textual transformation may save one tenth of a sheet of paper. In addition, each transformation may have an associated saved ink quantity. For example, a particular textual transformation may save one half an ounce of ink. The transformation may potentially have both a saved paper quantity and a saved ink quantity if both paper and ink would be saved as a result of the transformation.
-
FIG. 1 illustrates agreen printing configuration 100. As an example, acomputing device 102 is illustrated as a PC. Further, as an example, thecomputing device 102 is operably connected to aprinter 104 through a wireline connection. Theterm computing device 102 is herein intended to include a personal computer (“PC”), desktop computer, laptop, notebook, cell phone, smart phone, personal digital assistant (“PDA”), kiosk, etc. Further, thecomputing device 102 may be a client, server, network device, etc. Theprinter 104 may be a printing device that is separately connected, e.g., through a wireline or wireless connection, to thecomputing device 102, built into thecomputing device 102, etc. A wireless connection may receive and/or send data through a Radio Frequency (“RF”) transmission, an Infrared (“IR”) transmission, or the like. Theprinter 104 may or may not be part of a network. Further, theprinter 104 may utilize any type of printing methodology to print onpaper 106, e.g. laser printing, ink jet printing, or the like. -
FIG. 2 illustrates agreen print module 202 that may be utilized with thegreen printing configuration 100. Thegreen print module 202 may be stored in thecomputing device 102 or theprinter 104. Thegreen print module 202 may analyze the content of multiple pages, sections, etc. of a PDF document to repurpose the content to save paper and/or ink. For example, thegreen print module 202 may analyze afirst page 206 and asecond page 208 of aPDF document 204. Thegreen print module 202 may then repurpose the content of thePDF document 204 so that agreen PDF document 210 may be printed. Thegreen PDF document 210 has a repurposedpage 210 that has the content from thefirst page 206 and thesecond page 208 in a readable format. Lines are provided in a document as illustrated in drawings such asFIG. 2 to represent text, symbols, shapes, images, and/or the like. -
FIG. 3 illustrates aprocess 300 that is utilized by thegreen print module 202 illustrated inFIG. 2 to analyze a PDF document for the purpose of selecting one or more transformations to the PDF document for green printing. At aprocess block 302, theprocess 300 preprocesses a PDF document. The pre-processing acquires information about the PDF document. The information may include identifying text, images, vector objects, dimensions (bounding boxes), text fonts, etc. These elements can be represented in a page as cells of well defined bound boxes. These cells may be grouped together to maintain the reading context of the individual element/cell. For example, overlapping vector graphics cells may be grouped together so that they can be formatted individually and placed in their entirety. Otherwise, in the re-layout phase the individual objects may be placed far apart, which may lead a less readable output. Similarly, original multicolumn text cells may be identified and merged in to a single text cell containing the entire multicolumn text in reading order. The preprocessing determines how many pages may be saved in the PDF document to figure out whether or not the number of pages should be reduced. For example, if the PDF document has only one page, then the green printing configuration will only save ink. - In one embodiment, a subprocess may be utilized to identify the cell groups. The subprocess may find the set of isolated cells (bounding boxes) B in the PDF page. Further, the subprocess may group the cells in B by utilizing heuristic rules which output the logical elements in the page. For every cell C in B, the subprocess groups the cells in B utilizing heuristic rules, which output the logical elements in the PDF page. For every cell C in B, the set of cells B′ that lies in the proximity of C is found. The cells in B′ can either lie in horizontal proximity or in vertical proximity. A plurality of factors may be utilized to make the decision as to which cells are chosen. The type of individual cells is a factor. For example, text cells will be merged with the graphic cells, e.g., image/vector arts, only when the text cells overlap with the graphic cells. Similarly, graphic cells can be merged with text cells even if they don't overlap with text cells, but lie in a small proximity. Another factor is that a merged cell should not contain empty areas beyond a certain threshold. Yet another factor is that multi-column text cells should not be merged as is, but rather in a single text cell in which all the text of individual cells should be added in the reading order.
- The subprocess creates a merged cell merged(C). For every region C′ in B, a determination is made to figure out whether or not C′ was merged with merged cell merged(C). If C′ was merged with merged cell merged(C), C′ is removed from B. Further, the subprocess removes C from B. In addition, the subprocess adds the new merged cell merged(C) to B. After the initial finding of isolated cells, the subprocess repeats the remaining portions of the subprocess for every other cell present in the set B.
- Further, at a
process block 304, theprocess 300 performs a complexity analysis. The grouped cells are utilized to categorize certain pages and some content elements as being too complex for complete transformation. Certain rules are utilized for such categorization, e.g. the number of overlapped images/vectors in a given PDF page, presence of form field, etc. Such pages are typically converted to raster and replaced. This mechanism of complexity analysis also ensures the correctness of the entire green printing of PDF document approach. - The preprocessing allows the subprocess to find out the source of the document. For example, if the document that was converted to a PDF was a word processing document rather than a spreadsheet program, then one type of green transformations are utilized for a word processing document.
- Further, the
process 300 advances to aprocess block 306 to create a document object model (“DOM”). A PDF document is generally not created by keeping any specific document structure in mind. However, a tagged PDF provides some information in terms of logical constructs such as a table of contents, paragraph, tables, drawing, etc. Accordingly, the creation of the DOM model first involves tagging the input PDF if the PDF is not already tagged. Subsequently, higher level constructs such as cover page, reference page, background image, etc. will be identified utilizing a heuristic rule based model. The grouped cells identified at the process block 302 will be tagged as high level document constructs utilizing this rule model. These constructs will be considered while applying a transformation. - This structure creation is helpful because a PDF document inherently does not support any document structure or any relationship between different objects. Therefore, a high level flow able structure is created to be utilized in content re-layout. This structure extraction is not same as object recognition in images, etc. Semantic tags are applied that are helpful in printing. For example, a group of vector paths may not necessarily be a single diagram, but is labeled so, because the label helps in placing all the vector arts together while laying out the output pages.
- In addition, the
process 300 advances to aprocess block 308 to perform DOM analysis. Certain entities are identified in the PDF document. As examples, theprocess 300 may identify page labels and object labels. A page label may be a page that is labeled as a cover page, table of contents page, content page, reference/index page, etc. An object label may be a PDF object in a page that is labeled as a header/footer, paragraph, main heading, sub heading, figure/background, table, figure/table caption, etc. These provide a way to control the formatting of content in the output. For example, headings and table/figure captions on all output pages need to be have the same font size so that the overall appearance of the output is consistent. In one embodiment, heuristic rules are utilized to identify these entities. The heuristic rules are based on characteristics such as sequence of appearance for pages, location on the page, and relative text sizes for objects only containing text. The base probabilities for these characteristics for each of the entities may be determined by analyzing large number of real world PDF files to create a training data set. - In one embodiment, a subprocess that labels given pages/objects takes a set of rules R (R1 . . . Rk) such that each jth rule provides a basic success probability P(Rj). For example, if a rule Rj for any construct C is computed as successful, that rule Rj will increase the probability of identifying that PDF object as construct C by P(Rj). The document construct identification (Page level/Object level) may be performed as follows. The set of rules that identifies an input document construct C is found. For every rule C′ in C, a determination is made as to whether or not the rule is successful for the given page/cells. If the rule is successful for the given page/cells, then Pc (Probability of the given page/cells to be C)+=P(Cs). Further, if Pc>threshold and C is an object level rule, all the individual cells are grouped into a single cell and the grouped cell is marked as C. Otherwise if Pc>threshold and C is an Page level rule, the input page is marked as C. After the subprocess is completed, all the cells identified at the process block 302 will be tagged as high level document constructs, which will be transformed to create a green PDF.
- The
process 300 then advances to aprocess block 310 to perform content repurposing. Certain green transformations are applied on the document constructs. For example, a paragraph may have some text with large font size, color, and large blank line space. The font size is reduced by analyzing all the font sizes present in the PDF document and normalizing them. Also line spacing may be reduced without affecting the readability of the paragraph. - Similarly to save ink, some graying transformations may be performed. Since PDF as a document format supports multiple color spaces, this property may be utilized to save ink in printing already grayed content which contains grey color values in RGB color space as R=G=B color values. When such content is printed on paper, RGB color values are converted in to CMYK color space. The CMYK is a subtractive color model that is utilized in color printing and refers to the four inks utilized in color printing: cyan, magenta, yellow, and black. Printing CMYK colors for gray colors consumes all ink toners present in the printer to represent the correct grey color. This consumption can be avoided in PDF by converting RGB color space to CMYK as a green transformation. All the grey content with R=G=B color representation is then converted into a single K value. After this transformation, grey content will consume only K toner on printers which is already does not support under color removal.
- Finally, the
process 300 advances to aprocess block 312 to perform content re-layout. The content re-layout decides the final placement of the content in the output. In one embodiment, a special re-layout engine may be utilized to convert the high level constructs in to concrete PDF objects and to perform all necessary scaling. The re-layout engine may also take placement decisions for the output. The content re-layout also reduces the vertical gaps between objects, manages bounding boxes, etc. The content re-layout also attempts to fully utilize the pages that the content will occupy eventually, i.e., fit-to-page. As a result of the content re-layout, a green PDF document is created. - In one embodiment, a user may provide an input such as selecting a button to only save ink. In another embodiment, the user may provide an input such as selecting a button to only save paper. In yet another embodiment, the user may provide an input such as selecting a button to save both ink and paper.
-
FIG. 4A illustrates an example of afirst page 400 of a PDF document. Further,FIG. 4B illustrates an example of asecond page 420 of the PDF document.FIG. 4C illustrates an example of athird page 440 of the PDF document. Finally,FIG. 4D illustrates a PDFgreen document 460 that results from theprocess 300 illustrated inFIG. 3 being applied to thefirst page 400,second page 420, andthird page 440 of the PDF document. -
FIG. 5 illustrates asystem configuration 500 that may be utilized for green printing of a PDF document. In one embodiment, thegreen print module 202 interacts with amemory 502. Thegreen print module 202 generates a variety of potential repurposed documents that meet a readability threshold. For example, a first potential repurposedPDF document 504 may have text according to one format that meets the readability threshold, a second potential repurposeddocument 506 may have text according to another format that meets the readability threshold, and a third potential repurposeddocument 508 may have text according to yet another format that meets the readability threshold. Thegreen print module 202 may select one final repurposed document from these potential repurposed documents by utilizing a repurposing quantifier. The repurposing quantifier may be the result of an equation that is based on the number of pages reduced and the degradation of the readability. Even if all of the potential repurposed documents meet the readability threshold, a higher score will generally be given to a first repurposed document that degrades readability less than a second repurposed document with the same number of reduced pages. Further, even if all of the potential repurposed documents meet the readability threshold, a higher score will generally be given to a first repurposed document that reduces more pages than a second repurposed document with the same readability degradation. In other words, the highest score will be given to the potential repurposed document that as a whole minimizes readability degradation and maximizes page reduction better than the other potential repurposed documents. - After the
green print module 202 selects a potential repurposed document, thegreen print module 202 provides the repurposed PDF document to aprocessor 510. Further, theprocessor 510 applies the transformations in the potential repurposed document to the document so that the user may print the final repurposed document. Theprocessor 510 interacts with input/output (“I/O”)devices 512. For example, the processor 1012 receives an input from a user through a keyboard to print the document. The processor 1012 may then print the repurposed document on a printer. - In another embodiment, a green score may be indicated for a PDF document. The
system configuration 100 may inform a user how green the PDF document is by utilizing any of the scoring methodologies described herein. In other words, a display device may provide a user with an indication of what changes the user can make to the PDF document to save paper and/or ink. The indication may be provided during the user's editing of the PDF document. Alternatively, the indication may be provided to the user when the user is not editing the PDF document. - In one embodiment, the
system configuration 500 is suitable for storing and/or executing program code and is implemented using a general purpose computer or any other hardware equivalents. Theprocessor 510 is coupled, either directly or indirectly, to the memory 1002 through a system bus. Thememory 502 can include local memory employed during actual execution of the program code, bulk storage, and/or cache memories which provide temporary storage of at least some program code in order to reduce the number of times code must be retrieved from bulk storage during execution. - The I/
O devices 512 can be coupled directly to the system 1000 or through intervening input/output controllers. Further, the I/O devices 512 can include a keyboard, a keypad, a mouse, a microphone for capturing speech commands, a pointing device, and other user input devices that will be recognized by one of ordinary skill in the art. Further, the I/O devices 512 can include output devices such as a printer, display screen, or the like. Further, the I/O devices 512 can include a receiver, transmitter, speaker, display, image capture sensor, biometric sensor, etc. In addition, the I/O devices 512 can include storage devices such as a tape drive, floppy drive, hard disk drive, compact disk (“CD”) drive, etc. - Network adapters may also be coupled to the
system configuration 500 to enable thesystem configuration 500 to become coupled to other systems, remote printers, or storage devices through intervening private or public networks. Modems, cable modems, and Ethernet cards are just a few of the currently available types of network adapters. - The processes described herein may be implemented in a general, multi-purpose or single purpose processor. Such a processor will execute instructions, either at the assembly, compiled or machine-level, to perform the processes. Those instructions can be written by one of ordinary skill in the art following the description of the figures corresponding to the processes and stored or transmitted on a computer readable medium. The instructions may also be created using source code or any other known computer-aided design tool. A computer readable medium may be any medium capable of carrying those instructions and include a CD-ROM, DVD, magnetic or other optical disc, tape, silicon memory (e.g., removable, non-removable, volatile or non-volatile), packetized or non-packetized data through wireline or wireless transmissions locally or remotely through a network. A computer is herein intended to include any device that has a general, multi-purpose or single purpose processor as described above.
- It should be understood that the processes and systems described herein can take the form of entirely hardware embodiments, entirely software embodiments, or embodiments containing both hardware and software elements. If software is utilized to implement the method or system, the software can include but is not limited to firmware, resident software, microcode, etc.
- It is understood that the processes and systems described herein may also be applied in other types of processes and systems. Those skilled in the art will appreciate that the various adaptations and modifications of the embodiments of the processes and systems described herein may be configured without departing from the scope and spirit of the present processes and systems. Therefore, it is to be understood that, within the scope of the appended claims, the present processes and systems may be practiced other than as specifically described herein.
Claims (20)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US12/849,519 US8451489B1 (en) | 2010-08-03 | 2010-08-03 | Content-aware method for saving paper and ink while printing a PDF document |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US12/849,519 US8451489B1 (en) | 2010-08-03 | 2010-08-03 | Content-aware method for saving paper and ink while printing a PDF document |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20130128315A1 true US20130128315A1 (en) | 2013-05-23 |
| US8451489B1 US8451489B1 (en) | 2013-05-28 |
Family
ID=48426597
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US12/849,519 Expired - Fee Related US8451489B1 (en) | 2010-08-03 | 2010-08-03 | Content-aware method for saving paper and ink while printing a PDF document |
Country Status (1)
| Country | Link |
|---|---|
| US (1) | US8451489B1 (en) |
Cited By (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20130007599A1 (en) * | 2011-06-30 | 2013-01-03 | International Business Machines Corporation | Optimizing the layout of electronic documents |
| US8705092B2 (en) | 2010-08-03 | 2014-04-22 | Adobe Systems Incorporated | Method and system for repurposing E-mail correspondence to save paper and ink |
| US8773712B2 (en) | 2010-08-03 | 2014-07-08 | Adobe Systems Incorporated | Repurposing a word processing document to save paper and ink |
| US8799761B2 (en) | 2010-08-03 | 2014-08-05 | Adobe Systems Incorporated | Method and system for repurposing a spreadsheet to save paper and ink |
| US9032284B2 (en) | 2010-08-03 | 2015-05-12 | Adobe Systems Incorporated | Green printing: re-purposing a document to save ink and paper |
| US20160231966A1 (en) * | 2015-02-06 | 2016-08-11 | Konica Minolta, Inc. | Image forming apparatus, image forming method, non-transitory computer-readable recording medium stored with common blank forming period setting program, and image forming system |
| US9658997B2 (en) | 2010-08-03 | 2017-05-23 | Adobe Systems Incorporated | Portable page template |
| US10346104B2 (en) * | 2015-10-30 | 2019-07-09 | Hewlett-Packard Development Company, L.P. | Object type based image processing |
| CN114201127A (en) * | 2022-02-17 | 2022-03-18 | 北京辰光融信技术有限公司 | Laser printer control method and device, storage medium and electronic equipment |
| US11645017B2 (en) | 2021-06-17 | 2023-05-09 | International Business Machines Corporation | Print governance management |
| CN119883152A (en) * | 2025-03-26 | 2025-04-25 | 南通中远海运川崎船舶工程有限公司 | Method for converting DocuWorks document into PDF document |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20120173332A1 (en) * | 2010-12-30 | 2012-07-05 | Konica Minolta Systems Laboratory, Inc. | Method and system for promoting and marketing more environmental friendly printing |
| WO2015167550A1 (en) | 2014-04-30 | 2015-11-05 | Hewlett-Packard Development Company, L.P. | Providing print dimensions from a printer |
Family Cites Families (36)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6266684B1 (en) | 1997-08-06 | 2001-07-24 | Adobe Systems Incorporated | Creating and saving multi-frame web pages |
| US6128655A (en) | 1998-07-10 | 2000-10-03 | International Business Machines Corporation | Distribution mechanism for filtering, formatting and reuse of web based content |
| US7237188B1 (en) | 2004-02-06 | 2007-06-26 | Microsoft Corporation | Method and system for managing dynamic tables |
| US7617446B2 (en) | 2001-03-26 | 2009-11-10 | International Business Machines Corporation | Method and system for pre-print processing of web-based documents to reduce printing costs |
| US20030163537A1 (en) | 2001-11-27 | 2003-08-28 | International Business Machines Corporation | Method and apparatus for handling conversation threads and message groupings as a single entity |
| CA2407211A1 (en) | 2002-07-26 | 2004-01-26 | Global Mart Int'l (Dalian) Electronics Information Technology Co.,Ltd. | An on-line system for website transforming and method therof |
| US20040044735A1 (en) | 2002-08-30 | 2004-03-04 | International Business Machines Corporation | Method and system for organizing an email thread |
| US7212309B1 (en) | 2002-08-30 | 2007-05-01 | Microsoft Corporation | Best fit printing |
| US7366981B2 (en) * | 2002-10-04 | 2008-04-29 | Fuji Xerox Co., Ltd. | Image forming device and method |
| US20040205607A1 (en) * | 2003-01-03 | 2004-10-14 | Samsung Electronics Co., Ltd. | Printing method using Nup function, and computer readable recording medium storing computer program for executing the printing method |
| US7716593B2 (en) | 2003-07-01 | 2010-05-11 | Microsoft Corporation | Conversation grouping of electronic mail records |
| US20070273895A1 (en) | 2003-12-12 | 2007-11-29 | Canon Information Systems Research Austrialia | Efficient Whole Page Printing |
| WO2005077023A2 (en) | 2004-02-06 | 2005-08-25 | Releaf | Systems and methods relating to paper and printer cartridge usage |
| US20060015804A1 (en) | 2004-07-15 | 2006-01-19 | Microsoft Corporation | Method and system for presenting editable spreadsheet page layout view |
| US7548954B2 (en) | 2004-12-15 | 2009-06-16 | Research In Motion Limited | Compressible display of e-mail message string to facilitate readability |
| US7487143B2 (en) | 2005-11-17 | 2009-02-03 | International Business Machines Corporation | Method for nested categorization using factorization |
| JP4689453B2 (en) * | 2005-12-02 | 2011-05-25 | キヤノン株式会社 | Information processing apparatus, data processing method, and program |
| US7680858B2 (en) | 2006-07-05 | 2010-03-16 | Yahoo! Inc. | Techniques for clustering structurally similar web pages |
| US20080086695A1 (en) | 2006-10-10 | 2008-04-10 | International Business Machines Corporation | Method to color tag e-mail content containing multiple replies to ease reading |
| US7729001B2 (en) | 2006-12-07 | 2010-06-01 | Zerox Corporation | Integration of content-based relevant information into print jobs and applications using same |
| WO2009025788A1 (en) | 2007-08-17 | 2009-02-26 | Dma Ink | Scheduling and budgeting application |
| US9262038B2 (en) | 2007-10-10 | 2016-02-16 | Lumapix | Method and system for referenced region manipulation |
| US8024412B2 (en) | 2008-02-15 | 2011-09-20 | Microsoft Corporation | User interface reading email conversations |
| US8225198B2 (en) | 2008-03-31 | 2012-07-17 | Vistaprint Technologies Limited | Flexible web page template building system and method |
| US8085421B2 (en) | 2008-07-30 | 2011-12-27 | International Business Machines Corporation | Efficient print operations |
| US20100123908A1 (en) | 2008-11-17 | 2010-05-20 | Fuji Xerox Co., Ltd. | Systems and methods for viewing and printing documents including animated content |
| JP2010176184A (en) | 2009-01-27 | 2010-08-12 | Sharp Corp | Print system and information processing apparatus |
| US20100281351A1 (en) | 2009-04-29 | 2010-11-04 | Soiba Mohammed | Web print content control using html |
| US8433997B1 (en) * | 2009-07-31 | 2013-04-30 | Google Inc. | Efficient portable document |
| US8934119B2 (en) | 2009-08-04 | 2015-01-13 | Electronics For Imaging, Inc. | Greenbooks |
| US8891125B2 (en) * | 2009-08-18 | 2014-11-18 | Xerox Corporation | Method and system for automatically reducing page count in a document printing process |
| US8305653B2 (en) * | 2009-09-24 | 2012-11-06 | Xerox Corporation | Printing system with improved scanning functionality |
| US8676666B2 (en) | 2009-12-15 | 2014-03-18 | Xerox Corporation | System for assessing environmental impact of processing print jobs |
| JP2011203826A (en) | 2010-03-24 | 2011-10-13 | Canon Inc | Apparatus and method for processing image, and program |
| US8503016B2 (en) | 2010-05-04 | 2013-08-06 | Xerox Corporation | System and method for providing environmental feedback to users of shared printers |
| US9032284B2 (en) | 2010-08-03 | 2015-05-12 | Adobe Systems Incorporated | Green printing: re-purposing a document to save ink and paper |
-
2010
- 2010-08-03 US US12/849,519 patent/US8451489B1/en not_active Expired - Fee Related
Cited By (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9658997B2 (en) | 2010-08-03 | 2017-05-23 | Adobe Systems Incorporated | Portable page template |
| US9032284B2 (en) | 2010-08-03 | 2015-05-12 | Adobe Systems Incorporated | Green printing: re-purposing a document to save ink and paper |
| US8705092B2 (en) | 2010-08-03 | 2014-04-22 | Adobe Systems Incorporated | Method and system for repurposing E-mail correspondence to save paper and ink |
| US8773712B2 (en) | 2010-08-03 | 2014-07-08 | Adobe Systems Incorporated | Repurposing a word processing document to save paper and ink |
| US8799761B2 (en) | 2010-08-03 | 2014-08-05 | Adobe Systems Incorporated | Method and system for repurposing a spreadsheet to save paper and ink |
| US9229914B2 (en) * | 2011-06-30 | 2016-01-05 | International Business Machines Corporation | Optimizing the layout of electronic documents by reducing presentation size of content within document sections so that when combined a plurality of document sections fit within a page |
| US20130007580A1 (en) * | 2011-06-30 | 2013-01-03 | International Business Machines Corporation | Optimizing the layout of electronic documents |
| US9218327B2 (en) * | 2011-06-30 | 2015-12-22 | International Business Machines Corporation | Optimizing the layout of electronic documents by reducing presentation size of content within document sections so that when combined a plurality of document sections fit within a page |
| US20130007599A1 (en) * | 2011-06-30 | 2013-01-03 | International Business Machines Corporation | Optimizing the layout of electronic documents |
| US9904494B2 (en) * | 2015-02-06 | 2018-02-27 | Konica Minolta, Inc. | Image forming apparatus for determining a common blank space for parallel continuous printing |
| US20160231966A1 (en) * | 2015-02-06 | 2016-08-11 | Konica Minolta, Inc. | Image forming apparatus, image forming method, non-transitory computer-readable recording medium stored with common blank forming period setting program, and image forming system |
| US10346104B2 (en) * | 2015-10-30 | 2019-07-09 | Hewlett-Packard Development Company, L.P. | Object type based image processing |
| US11645017B2 (en) | 2021-06-17 | 2023-05-09 | International Business Machines Corporation | Print governance management |
| CN114201127A (en) * | 2022-02-17 | 2022-03-18 | 北京辰光融信技术有限公司 | Laser printer control method and device, storage medium and electronic equipment |
| CN119883152A (en) * | 2025-03-26 | 2025-04-25 | 南通中远海运川崎船舶工程有限公司 | Method for converting DocuWorks document into PDF document |
Also Published As
| Publication number | Publication date |
|---|---|
| US8451489B1 (en) | 2013-05-28 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US8451489B1 (en) | Content-aware method for saving paper and ink while printing a PDF document | |
| US9032284B2 (en) | Green printing: re-purposing a document to save ink and paper | |
| JP6838209B1 (en) | Document image analyzer, document image analysis method and program | |
| US8804139B1 (en) | Method and system for repurposing a presentation document to save paper and ink | |
| RU2357284C2 (en) | Method of processing digital hand-written notes for recognition, binding and reformatting digital hand-written notes and system to this end | |
| US11238312B2 (en) | Automatically generating labeled synthetic documents | |
| Clausner et al. | Aletheia-an advanced document layout and text ground-truthing system for production environments | |
| US6895552B1 (en) | Method and an apparatus for visual summarization of documents | |
| US8773712B2 (en) | Repurposing a word processing document to save paper and ink | |
| CN102165393B (en) | Editing 2D structures using natural input | |
| CN109858036B (en) | Method and device for dividing documents | |
| US20080225336A1 (en) | Method and System to Allow Printing Compression of Documents | |
| CN102081594A (en) | Equipment and method for extracting enclosing rectangles of characters from portable electronic documents | |
| US20210368064A1 (en) | Utilizing intelligent sectioning and selective document reflow for section-based printing | |
| US8799761B2 (en) | Method and system for repurposing a spreadsheet to save paper and ink | |
| US12254263B2 (en) | Method, device, and system for analyzing unstructured document | |
| US10095677B1 (en) | Detection of layouts in electronic documents | |
| CN109948518B (en) | Neural network-based PDF document content text paragraph aggregation method | |
| US8744171B1 (en) | Text script and orientation recognition | |
| JP2021015604A (en) | Drawing control method and drawing control system | |
| US20150169508A1 (en) | Obfuscating page-description language output to thwart conversion to an editable format | |
| CN102081736B (en) | Equipment and method for extracting enclosing rectangles of characters from portable electronic documents | |
| US11687700B1 (en) | Generating a structure of a PDF-document | |
| US20200265278A1 (en) | Colored region barcode printing | |
| JP2008129793A (en) | Document processing system, apparatus and method, and recording medium recording program |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: ADOBE SYSTEMS INCORPORATED, CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:ARORA, KAPIL;GOEL, NAVEEN;MITTAL, AMIT;AND OTHERS;SIGNING DATES FROM 20100706 TO 20100707;REEL/FRAME:024782/0218 |
|
| STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
| FPAY | Fee payment |
Year of fee payment: 4 |
|
| AS | Assignment |
Owner name: ADOBE INC., CALIFORNIA Free format text: CHANGE OF NAME;ASSIGNOR:ADOBE SYSTEMS INCORPORATED;REEL/FRAME:048867/0882 Effective date: 20181008 |
|
| MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 8 |
|
| FEPP | Fee payment procedure |
Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
| LAPS | Lapse for failure to pay maintenance fees |
Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
| STCH | Information on status: patent discontinuation |
Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
|
| FP | Lapsed due to failure to pay maintenance fee |
Effective date: 20250528 |