[go: up one dir, main page]

TWM655760U - System for processing invoice data - Google Patents

System for processing invoice data Download PDF

Info

Publication number
TWM655760U
TWM655760U TW112212528U TW112212528U TWM655760U TW M655760 U TWM655760 U TW M655760U TW 112212528 U TW112212528 U TW 112212528U TW 112212528 U TW112212528 U TW 112212528U TW M655760 U TWM655760 U TW M655760U
Authority
TW
Taiwan
Prior art keywords
data
module
file module
invoice
file
Prior art date
Application number
TW112212528U
Other languages
Chinese (zh)
Inventor
游上逸
戴昆生
Original Assignee
爍益管理顧問有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 爍益管理顧問有限公司 filed Critical 爍益管理顧問有限公司
Priority to TW112212528U priority Critical patent/TWM655760U/en
Publication of TWM655760U publication Critical patent/TWM655760U/en

Links

Images

Landscapes

  • Character Input (AREA)

Abstract

本新型為一套處理發票數據的系統,包括掃描器、OCR模組、三個檔案、顯示介面及轉換模組。上述三個檔案分別為第一檔案模組、第二檔案模組、和第三檔案模組。掃描器用於掃描發票,OCR模組轉換掃描資訊為數據,並存儲於第一檔案模組。第二檔案模組含修正演算法,用於數據修正,修正後數據存儲於第三檔案模組。顯示介面提供發票圖像和數據的同時顯示,以便手動修正。最後,轉換模組將修正後的數據轉換為會計系統格式。值得注意的是,第三檔案模組不含任何公式或程序。 This new model is a system for processing invoice data, including a scanner, an OCR module, three files, a display interface and a conversion module. The above three files are the first file module, the second file module, and the third file module. The scanner is used to scan invoices, and the OCR module converts the scanned information into data and stores it in the first file module. The second file module contains a correction algorithm for data correction, and the corrected data is stored in the third file module. The display interface provides simultaneous display of invoice images and data for manual correction. Finally, the conversion module converts the corrected data into the accounting system format. It is worth noting that the third file module does not contain any formulas or programs.

Description

用於處理發票資料的系統 System for processing invoice data

本新型涉及資料處理和管理的領域,特別是一種發票數據的處理的方法與系統。 This novel invention relates to the field of data processing and management, in particular to a method and system for processing invoice data.

發票處理涉及處理多種類型和格式的發票。傳統的發票處理方法通常涉及手動資料輸入,這是耗時、容易出錯且低效的程序。此外,手動分類和排序發票可能是一項繁重的任務,尤其是當處理大量發票時。 Invoice processing involves handling multiple invoice types and formats. Traditional invoice processing methods often involve manual data entry, which is a time-consuming, error-prone and inefficient process. In addition, manually categorizing and sorting invoices can be a cumbersome task, especially when processing a large number of invoices.

光學字符識別(OCR)技術已被用於自動化將掃描得到的發票圖像轉換為機器可讀文本的過程。然而,OCR技術並非無懈可擊,經常會出現錯誤。這些錯誤可能是由於多種因素造成的,例如掃描質量差、字體變化、字符間距以及相似外觀字符之間的混淆。因此,從OCR獲得的數據通常需要進一步的修正和驗證以確保其準確性。 Optical character recognition (OCR) technology has been used to automate the process of converting scanned invoice images into machine-readable text. However, OCR technology is not infallible and errors often occur. These errors can be caused by a variety of factors, such as poor scan quality, font variations, character spacing, and confusion between similar-looking characters. Therefore, the data obtained from OCR usually requires further correction and verification to ensure its accuracy.

此外,從發票中提取的數據通常需要進一步處理以與各種會計系統兼容。這可能涉及將數據轉換為會計系統可接受的特定格式。手動將數據轉換為不同格式可能是一項繁瑣且容易出錯的任務。 Additionally, data extracted from invoices often requires further processing to be compatible with various accounting systems. This may involve converting the data into a specific format acceptable to the accounting system. Manually converting data into different formats can be a tedious and error-prone task.

因此,儘管已經整合了OCR技術,傳統的發票處理方法在效率、準確性和與各種會計系統的兼容性方面仍面臨挑戰。 Therefore, despite the integration of OCR technology, traditional invoice processing methods still face challenges in terms of efficiency, accuracy and compatibility with various accounting systems.

本新型提供了一個用於處理發票數據的系統,該系統包括掃描器、OCR模組、三個檔案(第一檔案模組、第二檔案模組及第三檔案模組)、顯示介面以及轉換模組。掃描器用於掃描發票,OCR模組將發票資訊轉換為數據並存儲於第一 檔案模組。第二檔案模組含有修正數據的演算法,修正後的數據存儲於第三檔案模組。顯示介面提供發票圖像和數據的同時顯示,方便手動修正,而轉換模組則將數據轉換為會計系統可接受的格式。本系統特別適用於以Excel格式進行檔案管理,並支持在Microsoft Power Automate平台上運行OCR模組。 The present invention provides a system for processing invoice data, which includes a scanner, an OCR module, three files (a first file module, a second file module, and a third file module), a display interface, and a conversion module. The scanner is used to scan invoices, and the OCR module converts invoice information into data and stores it in the first file module. The second file module contains an algorithm for correcting data, and the corrected data is stored in the third file module. The display interface provides simultaneous display of invoice images and data for easy manual correction, while the conversion module converts the data into a format acceptable to the accounting system. This system is particularly suitable for file management in Excel format and supports running the OCR module on the Microsoft Power Automate platform.

在上述系統中,第二檔案模組中的演算法可以根據在OCR技術中觀察到的常見錯誤模式而定制。發票可以根據發票類型進行分類。這些類別可以包括銷項手開三聯式發票、銷項手開二聯式發票、進項手開三聯式發票、進項機開三聯發票、進項機開二聯式發票和銷貨退回、進貨退出或折讓證明單。 In the above system, the algorithm in the second file module can be customized based on common error patterns observed in OCR technology. Invoices can be classified according to the invoice type. These categories can include sales hand-issued three-part invoice, sales hand-issued two-part invoice, purchase hand-issued three-part invoice, purchase machine-issued three-part invoice, purchase machine-issued two-part invoice and sales returns, purchase returns or allowance certificates.

在上述系中,顯示介面允許手動修正與發票圖像相對應的數據。演算法使用數據驗證技術來修正數據。這些數據驗證技術可以包括規則驗證、模式匹配、範圍檢查、列表驗證、多階段驗證、手動審查以及錯誤記錄和報告。 In the above system, the display interface allows manual correction of the data corresponding to the invoice image. The algorithm uses data validation techniques to correct the data. These data validation techniques may include rule validation, pattern matching, range checking, list validation, multi-stage validation, manual review, and error logging and reporting.

在上述系中,使用到的資料夾和檔案是用特定的符號和命名規則進行命名和管理,以便更容易識別和自動處理。這些符號和命名規則可以基於發票類型而定制。 In the above system, the folders and files used are named and managed with specific symbols and naming rules for easier identification and automatic processing. These symbols and naming rules can be customized based on invoice types.

為讓本新型之上述特徵和優點能更明顯易懂,下文特舉較佳實施例,並配合所附圖式,作詳細說明如下。 In order to make the above features and advantages of the new model more obvious and easy to understand, the following is a detailed description of the preferred embodiment with the attached drawings.

100:發票數據處理系統 100: Invoice data processing system

110:掃描器 110: Scanner

120:OCR模組 120:OCR module

130:第一檔案模組 130: First file module

140:第二檔案模組 140: Second file module

142:數據修正演算法 142:Data correction algorithm

150:第三檔案模組 150: Third file module

160:顯示介面 160: Display interface

170:轉換模組 170:Conversion module

200:會計系統 200:Accounting system

10:發票 10: Invoice

S110~S170:流程圖符號 S110~S170: Flowchart symbols

圖1所繪示為本新型的發票數據處理方法的流程圖。 Figure 1 shows a flow chart of the new invoice data processing method.

圖2所繪示為本新型的發票數據處理系統。 Figure 2 shows the new invoice data processing system.

圖3所繪示為在顯示介面上同時顯示發票圖像和相應的數據。 Figure 3 shows the invoice image and corresponding data displayed simultaneously on the display interface.

請參照圖1,圖1所繪示為本新型的發票數據處理方法的流程圖,此方法涉及多個步驟。首先,請參照步驟S110,對多張發票進行分類和掃描。在本實施例中,發票10的分類基於發票的類型,這取決於所在國家或地區的特定的商業或會計 背景。一旦發票被分類和掃描,便執行步驟S120,將發票的掃描資訊轉換為數據。這一轉換過程利用光學字符識別技術(以下簡稱OCR技術),該OCR技術能將發票掃描後的圖像轉換為電腦可判讀的文本。 Please refer to FIG. 1, which is a flowchart of the novel invoice data processing method, which involves multiple steps. First, please refer to step S110, and multiple invoices are classified and scanned. In this embodiment, the classification of invoices 10 is based on the type of invoice, which depends on the specific business or accounting background of the country or region. Once the invoice is classified and scanned, step S120 is executed to convert the scanned information of the invoice into data. This conversion process uses optical character recognition technology (hereinafter referred to as OCR technology), which can convert the scanned image of the invoice into computer-readable text.

掃描資訊轉換為數據後,執行步驟S130,將轉換後的數據轉寫到第一檔案模組中。這個第一檔案模組作為轉換後數據的儲存處,允許對數據進行後續處理和操作。第一檔案模組通常是一個Excel檔案,提供了一種方便且廣泛使用的儲存和處理數據的格式。 After the scanned information is converted into data, step S130 is executed to transfer the converted data into the first file module. This first file module serves as a storage for the converted data, allowing subsequent processing and manipulation of the data. The first file module is usually an Excel file, which provides a convenient and widely used format for storing and processing data.

轉換後的數據轉寫到第一檔案模組後,執行步驟S140,將數據從第一檔案模組轉移到第二檔案模組以進行數據修正。第二檔案模組包括用於修正第一檔案模組中的數據的演算法。這些演算法根據在OCR技術中觀察到的常見錯誤模式進行定制。通過修正這些常見錯誤,可以顯著提高數據的準確性。 After the converted data is transferred to the first file module, step S140 is executed to transfer the data from the first file module to the second file module for data correction. The second file module includes algorithms for correcting the data in the first file module. These algorithms are customized based on common error patterns observed in OCR technology. By correcting these common errors, the accuracy of the data can be significantly improved.

第二檔案模組中的數據被修正後,執行步驟S150,將修正後的數據從第二檔案模組轉移到第三檔案模組。這個轉移過程旨在維護一個不包含任何公式或程式的檔案,從而提高性能優化。在本實施例中,第一檔案模組、第二檔案模組、和第三檔案模組都是一個Excel檔案。 After the data in the second file module is corrected, step S150 is executed to transfer the corrected data from the second file module to the third file module. This transfer process is intended to maintain a file that does not contain any formulas or programs, thereby improving performance optimization. In this embodiment, the first file module, the second file module, and the third file module are all Excel files.

除了數據修正和轉移過程外,該方法還涉及在顯示介面上同時顯示發票圖像和相應的數據(如步驟S160所示)。這個顯示介面允許手動修正與發票圖像相對應的數據。通過提供發票圖像與相應數據的視覺表示,用戶可以輕鬆識別並修正數據中的任何差異或錯誤。 In addition to the data correction and transfer process, the method also involves simultaneously displaying the invoice image and the corresponding data on a display interface (as shown in step S160). This display interface allows manual correction of the data corresponding to the invoice image. By providing a visual representation of the invoice image and the corresponding data, the user can easily identify and correct any differences or errors in the data.

之後,執行步驟S170,將修正後的數據轉換為會計系統可接受的格式。這個轉換過程確保處理後的發票數據可以無縫地集成到現有的會計系統中,從而促進高效和準確的會計實踐。 Afterwards, step S170 is executed to convert the corrected data into a format acceptable to the accounting system. This conversion process ensures that the processed invoice data can be seamlessly integrated into the existing accounting system, thereby promoting efficient and accurate accounting practices.

以下,將對上述方法的各步驟進行更詳細的介紹。請同時參照圖1與圖2,圖2所繪示為本新型的發票數據處理系統。該發票數據處理系統100包括幾個元件, 每個元件都設計用於在整個過程中執行特定功能。這些元件包括一掃描器110、一光學字符識別模組(以下稱為OCR模組120)、一第一檔案模組130、一第二檔案模組140、一第三檔案模組150、一顯示介面160、和一轉換模組170。 Below, each step of the above method will be described in more detail. Please refer to Figure 1 and Figure 2 at the same time. Figure 2 shows the new invoice data processing system of the present invention. The invoice data processing system 100 includes several components, each of which is designed to perform a specific function in the whole process. These components include a scanner 110, an optical character recognition module (hereinafter referred to as OCR module 120), a first file module 130, a second file module 140, a third file module 150, a display interface 160, and a conversion module 170.

顯示介面作為手動修正與發票圖像相對應的數據的視覺平台。該介面旨在向用戶呈現發票圖像和相應數據的並排視圖,從而提供一種清晰和直觀的方式來比較和驗證數據的準確性。 The display interface serves as a visual platform for manually correcting the data corresponding to the invoice image. The interface is designed to present the user with a side-by-side view of the invoice image and the corresponding data, thereby providing a clear and intuitive way to compare and verify the accuracy of the data.

轉換模組是一個軟體元件,設計用於執行數據轉換任務。它包括一組預定義的規則和演算法,用於指示如何將第三檔案模組中的數據轉換為會計系統可接受的格式。轉換模組可以定制以適應不同的會計系統,從而確保該方法是靈活和多功能的。 A conversion module is a software component designed to perform data conversion tasks. It includes a set of predefined rules and algorithms that indicate how to convert data in a third-party file module into a format acceptable to an accounting system. The conversion module can be customized to suit different accounting systems, thus ensuring that the method is flexible and versatile.

該方法的初始步驟,亦即:步驟S110,是對多張發票10進行分類和掃描。這一步涉及根據發票10的類型將發票10分類到不同的類別中。發票10的分類不是任意的,而是由發票10本身固有的特定特性決定的。這些特性可以包括但不限於,由發票表示的交易的性質、參與交易的各方,以及發票的格式或結構。 The initial step of the method, i.e., step S110, is to classify and scan a plurality of invoices 10. This step involves classifying the invoices 10 into different categories according to the type of invoice 10. The classification of the invoices 10 is not arbitrary, but is determined by specific characteristics inherent in the invoices 10 themselves. These characteristics may include, but are not limited to, the nature of the transaction represented by the invoice, the parties involved in the transaction, and the format or structure of the invoice.

在本實施例中,發票10被分為六個類別。這些類別包括銷項手開三聯式發票、銷項手開二聯式發票、進項手開三聯式發票、進項機開三聯發票、進項機開二聯式發票、和銷貨退回、進貨退出或折讓證明單。每一個類別代表一種在商業和會計實踐中常見的不同類型的發票。 In this embodiment, the invoice 10 is divided into six categories. These categories include sales hand-issued triple invoice, sales hand-issued double invoice, purchase hand-issued triple invoice, purchase machine-issued triple invoice, purchase machine-issued double invoice, and sales return, purchase return or allowance certificate. Each category represents a different type of invoice commonly seen in business and accounting practice.

一旦發票被分類,便進行掃描。掃描是由掃描器110來執行,它負責掃描發票並將其轉換為數位圖像。掃描器110可以是任何能夠捕捉發票的高品質圖像的設備,例如平板掃描器或檔案餵送器掃描器(document feeder scanner)。掃描器旨在捕捉發票中包含的所有相關資訊,包括文本資訊、數值資訊以及發票上可能存在的任何其他符號或標記。 Once the invoice is sorted, it is scanned. The scanning is performed by a scanner 110, which is responsible for scanning the invoice and converting it into a digital image. The scanner 110 can be any device capable of capturing a high-quality image of the invoice, such as a flatbed scanner or a document feeder scanner. The scanner is designed to capture all relevant information contained in the invoice, including text information, numerical information, and any other symbols or markings that may be present on the invoice.

掃描過程涉及捕獲每張發票的圖像,然後可以執行步驟S120,由OCR模組120進行處理和分析。掃描過程旨在捕獲發票中包含的所有相關資訊,包括文本資訊、數字資訊以及發票上可能存在的任何其他符號或標記。掃描過程的結果是發票的數位圖像,這作為後續OCR過程的輸入。 The scanning process involves capturing an image of each invoice, which can then be processed and analyzed by the OCR module 120 in step S120. The scanning process is intended to capture all relevant information contained in the invoice, including text information, numeric information, and any other symbols or marks that may be present on the invoice. The result of the scanning process is a digital image of the invoice, which serves as input to the subsequent OCR process.

也就是說,掃描和分類發票後,該方法繼續將掃描得到的發票資訊轉換為數據。在本實施例中,這一轉換過程是通過使用OCR模組120來實現的。OCR模組120執行OCR技術,而OCR技術是一種能夠將不同類型的文檔(如掃描的紙質文檔、PDF檔案或由數位相機捕獲的圖像)中的文字影像轉換為可編輯和可搜索的數據。OCR模組120負責將掃描的發票圖像轉換為電腦軟體可讀的文本。這涉及分析掃描圖像的結構,並識別和提取圖像內部包含的文本資訊。OCR模組120可以識別各種字符和符號,包括字母、數字和特殊字符。這允許從發票中全面提取所有相關資訊,不僅僅是發票的主要文本,還有可能存在的任何其他資訊或註釋。也就是說,OCR技術用於將掃描得到的發票圖像轉換為機器可讀的文本,然後將提取的文本轉換為數據,該數據可以以數字格式儲存、操作和分析。此外,在步驟S120中,OCR模組120允許從發票10中全面提取所有相關資訊,包括不僅是發票的主要文本,還包括可能存在的任何其他資訊或註釋。 That is, after scanning and classifying the invoice, the method continues to convert the scanned invoice information into data. In the present embodiment, this conversion process is achieved by using the OCR module 120. The OCR module 120 performs OCR technology, which is a technology that can convert text images in different types of documents (such as scanned paper documents, PDF files, or images captured by a digital camera) into editable and searchable data. The OCR module 120 is responsible for converting the scanned invoice image into text that can be read by computer software. This involves analyzing the structure of the scanned image and identifying and extracting the text information contained inside the image. The OCR module 120 can recognize a variety of characters and symbols, including letters, numbers, and special characters. This allows comprehensive extraction of all relevant information from the invoice, not only the main text of the invoice, but also any other information or annotations that may be present. That is, OCR technology is used to convert the scanned invoice image into machine-readable text, and then convert the extracted text into data that can be stored, manipulated and analyzed in a digital format. In addition, in step S120, the OCR module 120 allows comprehensive extraction of all relevant information from the invoice 10, including not only the main text of the invoice, but also any other information or annotations that may be present.

值得注意的是,儘管OCR技術通常是準確和可靠的,但它並不是萬無一失的,有時會出現錯誤。這些錯誤可能是由於多種因素造成的,例如掃描品質差、字體變化、字符間距以及相似外觀字符之間的混淆。因此,從OCR模組120獲得的數據通常需要進一步的修正和驗證以確保其準確性,這一過程由本新型方法的後續步驟來促進。 It is worth noting that although OCR technology is generally accurate and reliable, it is not infallible and sometimes errors occur. These errors may be caused by a variety of factors, such as poor scan quality, font variations, character spacing, and confusion between similar-looking characters. Therefore, the data obtained from the OCR module 120 generally requires further correction and verification to ensure its accuracy, which is facilitated by the subsequent steps of the novel method.

在本實施例中,使用的OCR技術是在Microsoft Power Automate平台上執行的。Microsoft Power Automate是由Microsoft提供的一種基於雲的服務,允許用戶在多個應用程式和服務之間創建和管理自動化工作流。在本新型方法的背景下, Microsoft Power Automate用於自動化OCR過程,從而提高掃描發票資訊轉換為數據的效率和準確性。 In this embodiment, the OCR technology used is executed on the Microsoft Power Automate platform. Microsoft Power Automate is a cloud-based service provided by Microsoft that allows users to create and manage automated workflows between multiple applications and services. In the context of this novel method, Microsoft Power Automate is used to automate the OCR process, thereby improving the efficiency and accuracy of converting scanned invoice information into data.

一旦使用OCR技術將掃描得到的發票資訊轉換為數據,該方法的下一步,亦即:步驟S130,是將這些轉換後的數據轉寫到第一檔案模組130中。這個轉寫過程涉及將數據從其初始格式(由OCR過程產生)轉換為適合進一步處理和操作的格式。第一檔案模組130作為轉換後數據的儲存處,提供了一個結構化和有組織的格式來儲存數據,這有助於後續對數據進行如數據修正、數據驗證和數據轉移等操作。第一檔案模組130通常是一個Excel檔案,這是一種廣泛用於數據儲存和操作的格式。Excel檔案提供了一系列有益於處理發票數據的功能,例如能夠在行和列中組織數據、對數據進行計算,以及應用各種數據格式化和數據驗證規則。 Once the scanned invoice information is converted into data using OCR technology, the next step of the method, i.e., step S130, is to transfer these converted data into the first file module 130. This transcription process involves converting the data from its original format (generated by the OCR process) into a format suitable for further processing and manipulation. The first file module 130, as a storage for the converted data, provides a structured and organized format to store the data, which facilitates subsequent operations such as data correction, data verification and data transfer. The first file module 130 is usually an Excel file, which is a format widely used for data storage and manipulation. Excel files offer a range of features that are useful for working with invoice data, such as the ability to organize data in rows and columns, perform calculations on data, and apply various data formatting and data validation rules.

值得注意的是,將轉換後的數據轉寫到第一檔案模組130不僅僅是一個簡單的複製過程。相反,它涉及將數據從其初始格式(由OCR過程產生)轉換為適合進一步處理和操作的格式。這個轉換過程可能涉及各種操作,如數據格式化、數據清理和數據驗證,這些都是為了確保數據處於適合後續步驟的狀態。 It is worth noting that transferring the converted data to the first file module 130 is not just a simple copy process. Instead, it involves converting the data from its initial format (resulting from the OCR process) to a format suitable for further processing and manipulation. This conversion process may involve various operations such as data formatting, data cleaning, and data validation, all of which are to ensure that the data is in a suitable state for subsequent steps.

在本實施例中,第一檔案模組130作為轉換後數據的初始儲存處,它提供了後續數據修正和數據轉移過程的起點,並作為手動修正過程的參考點。通過將轉換後的數據儲存在第一檔案模組130中,該方法確保數據可隨時用於這些後續過程,從而促進發票數據的高效和準確處理。 In this embodiment, the first file module 130 serves as the initial storage for the converted data, which provides a starting point for subsequent data correction and data transfer processes and serves as a reference point for the manual correction process. By storing the converted data in the first file module 130, the method ensures that the data is available for these subsequent processes at any time, thereby facilitating efficient and accurate processing of invoice data.

在將轉換後的數據轉寫到第一檔案模組130之後,該方法進行下一步,即:步驟S140,將數據從第一檔案模組130轉移到第二檔案模組140以進行數據修正。這個轉移過程旨在促進修正在OCR過程中可能引入的任何錯誤或不準確之處。第二檔案模組140作為應用各種數據修正演算法142的平台,這些演算法旨在識別和修正數據中的常見錯誤模式。 After the converted data is transferred to the first file module 130, the method proceeds to the next step, namely: step S140, transferring the data from the first file module 130 to the second file module 140 for data correction. This transfer process is intended to facilitate the correction of any errors or inaccuracies that may have been introduced during the OCR process. The second file module 140 serves as a platform for applying various data correction algorithms 142 that are intended to identify and correct common error patterns in the data.

第二檔案模組140中的數據修正演算法142是根據在OCR技術中觀察到的常見錯誤模式來定制的。這些錯誤模式可能包括但不限於字符混淆、噪聲干擾、對齊和間距問題、字體和大小變化,以及手寫文本的困難。通過定制演算法以解決這些特定的錯誤模式,該方法可以顯著提高從OCR過程中獲得的數據的準確性。 The data correction algorithm 142 in the second file module 140 is customized based on common error patterns observed in OCR technology. These error patterns may include, but are not limited to, character confusion, noise interference, alignment and spacing problems, font and size variations, and difficulties with handwritten text. By customizing the algorithm to address these specific error patterns, the method can significantly improve the accuracy of the data obtained from the OCR process.

數據修正演算法142通過將第一檔案模組130中的數據與一組預定義的規則或模式進行比較來運作。如果數據匹配已知的錯誤模式,演算法會對數據進行修正。這個修正過程可能涉及各種操作,例如替換不正確的字符、調整字符間距、糾正對齊問題,以及規範字體和大小變化。然後將修正後的數據儲存在第二檔案模組140中,以便進一步處理。 The data correction algorithm 142 operates by comparing the data in the first file module 130 to a set of predefined rules or patterns. If the data matches a known error pattern, the algorithm corrects the data. This correction process may involve various operations, such as replacing incorrect characters, adjusting character spacing, correcting alignment problems, and standardizing font and size variations. The corrected data is then stored in the second file module 140 for further processing.

值得注意的是,步驟S140的數據修正過程不是一種一刀切的解決方案。相反,它是一個靈活和可適應的過程,可以根據數據中普遍存在的特定錯誤模式來定制。這種定制是通過在第二檔案模組140中使用演算法來促進的,這些演算法可以根據觀察到的錯誤模式進行調整和微調。這確保了數據修正過程盡可能有效和準確,從而提高了發票數據的整體品質和可靠性。 It is important to note that the data correction process of step S140 is not a one-size-fits-all solution. Rather, it is a flexible and adaptable process that can be customized based on specific error patterns prevalent in the data. This customization is facilitated by the use of algorithms in the second file module 140 that can be adjusted and fine-tuned based on observed error patterns. This ensures that the data correction process is as effective and accurate as possible, thereby improving the overall quality and reliability of the invoice data.

在第二檔案模組140中完成數據修正過程後,執行步驟S150,修正後的數據隨後被轉移到第三檔案模組150。這個轉移過程是整個方法的一個不可或缺的部分,具有特定的目的,即在數據處理工作流中優化性能。第三檔案模組150旨在成為一個乾淨和優化的檔案,不包含任何公式或程式。這與第二檔案模組140形成對比,後者包含用於數據修正的各種演算法和公式。 After the data correction process is completed in the second file module 140, step S150 is performed and the corrected data is then transferred to the third file module 150. This transfer process is an integral part of the overall method and has a specific purpose, namely to optimize performance in the data processing workflow. The third file module 150 is intended to be a clean and optimized file that does not contain any formulas or programs. This is in contrast to the second file module 140, which contains various algorithms and formulas for data correction.

這個轉移過程的原因是為了維護一個性能優化的檔案。包含公式或程式的檔案(如第二檔案模組)可能會較慢地加載和處理。這在處理大量數據時可能是一個問題,這在發票處理中經常是這樣。通過將修正後的數據轉移到不包含任何公式或程式的第三檔案模組150,該方法確保數據可以快速和高效地被訪問和處理。 The reason for this migration process is to maintain a performance optimized file. Files containing formulas or programs (such as the second file module) may be slow to load and process. This can be a problem when processing large amounts of data, which is often the case in invoice processing. By migrating the corrected data to the third file module 150, which does not contain any formulas or programs, this approach ensures that the data can be accessed and processed quickly and efficiently.

第三檔案模組150,像第一檔案模組130一樣,通常也是一個Excel檔案。然而,不同於第一檔案模組130和第二檔案模組140,第三檔案模組150特別格式化為不包含任何公式或程式(演算法)。這是通過使用特定的檔案格式來實現的,例如XLSX,這是一種由Microsoft Excel使用的不支持巨集的檔案格式。通過使用這種檔案格式,此步驟確保第三檔案模組150是一個乾淨和優化的檔案,適合進一步處理和操作。 The third file module 150, like the first file module 130, is also typically an Excel file. However, unlike the first file module 130 and the second file module 140, the third file module 150 is specifically formatted to not contain any formulas or programs (algorithms). This is achieved by using a specific file format, such as XLSX, which is a file format used by Microsoft Excel that does not support macros. By using this file format, this step ensures that the third file module 150 is a clean and optimized file suitable for further processing and manipulation.

總之,將修正後的數據從第二檔案模組140轉移到第三檔案模組150是該方法中的一個策略性步驟,旨在優化數據處理工作流的性能。通過維護一個乾淨和優化的第三檔案模組150。此步驟確保發票數據可以快速和高效地被處理,從而提高了發票處理方法的整體效率和有效性。 In summary, the transfer of the corrected data from the second file module 140 to the third file module 150 is a strategic step in the method to optimize the performance of the data processing workflow by maintaining a clean and optimized third file module 150. This step ensures that the invoice data can be processed quickly and efficiently, thereby improving the overall efficiency and effectiveness of the invoice processing method.

在將修正後的數據轉移到第三檔案模組150之後,執行步驟S160,在顯示介面160上同時顯示發票圖像和相應的數據(如圖3所示)。這個顯示介面160作為手動修正與發票圖像相對應的數據的視覺平台。顯示介面160旨在向用戶提供發票圖像和相應數據的並排視圖,從而提供了一種清晰和直觀的方式來比較和驗證數據的準確性。 After the corrected data is transferred to the third file module 150, step S160 is executed to display the invoice image and the corresponding data simultaneously on the display interface 160 (as shown in FIG. 3). This display interface 160 serves as a visual platform for manually correcting the data corresponding to the invoice image. The display interface 160 is intended to provide the user with a side-by-side view of the invoice image and the corresponding data, thereby providing a clear and intuitive way to compare and verify the accuracy of the data.

顯示介面160旨在通過提供一種清晰和直觀的方式來比較和驗證數據的準確性,從而促進手動修正過程。通過並排顯示發票圖像和相應的數據,顯示介面160使用戶能夠輕易地識別數據中的任何不一致或錯誤。這種視覺比較過程在OCR過程導致數據中的錯誤或不準確的情況下尤為有用。通過視覺地比較發票圖像與相應的數據,用戶可以輕易地識別並修正這些錯誤,從而提高數據的整體準確性。 Display interface 160 is intended to facilitate the manual correction process by providing a clear and intuitive way to compare and verify the accuracy of the data. By displaying the invoice image and the corresponding data side by side, display interface 160 enables the user to easily identify any inconsistencies or errors in the data. This visual comparison process is particularly useful in situations where the OCR process results in errors or inaccuracies in the data. By visually comparing the invoice image to the corresponding data, the user can easily identify and correct these errors, thereby improving the overall accuracy of the data.

由顯示介面160促進的手動修正過程是整體方法的一個不可或缺的部分。儘管使用了先進的OCR技術和精密的數據修正演算法,仍然可能存在數據中的錯誤或不準確未被這些自動化過程檢測或修正的情況。在這種情況下,手動修正過 程提供了最後一層的品質把關,確保數據在被轉換為會計系統200可接受的格式之前盡可能準確和可靠。 The manual correction process facilitated by the display interface 160 is an integral part of the overall approach. Despite the use of advanced OCR technology and sophisticated data correction algorithms, there may still be instances where errors or inaccuracies in the data are not detected or corrected by these automated processes. In such cases, the manual correction process provides a final layer of quality control, ensuring that the data is as accurate and reliable as possible before being converted into a format acceptable to the accounting system 200.

總之,顯示介面160作為手動修正與發票圖像相對應的數據的工具。通過提供一種清晰和直觀的方式來比較和驗證數據的準確性,顯示介面160提高了發票處理方法的整體效率和有效性。 In summary, the display interface 160 serves as a tool for manually correcting data corresponding to an invoice image. By providing a clear and intuitive way to compare and verify the accuracy of data, the display interface 160 improves the overall efficiency and effectiveness of the invoice processing method.

在由顯示介面160促進的手動修正過程之後,該方法進行到步驟S170,即將修正後的數據轉換為會計系統200可接受的格式。這個轉換過程是整體方法中的一個關鍵步驟,因為它確保了處理後的發票數據可以無縫地整合到現有的會計系統中,從而促進高效和準確的會計實踐。 After the manual correction process facilitated by the display interface 160, the method proceeds to step S170, which converts the corrected data into a format acceptable to the accounting system 200. This conversion process is a key step in the overall method because it ensures that the processed invoice data can be seamlessly integrated into the existing accounting system, thereby promoting efficient and accurate accounting practices.

在此步驟S170中,轉換過程涉及將儲存在第三檔案模組150中的修正後的數據轉換為與會計系統200兼容的格式。這涉及將第三檔案模組中的數據字段映射到會計系統200中的相應字段,並將數據值轉換為會計系統能識別的格式。這可能涉及各種操作,例如數據格式化、數據類型轉換和數據驗證,具體取決於會計系統200的特定要求。 In this step S170, the conversion process involves converting the corrected data stored in the third file module 150 into a format compatible with the accounting system 200. This involves mapping the data fields in the third file module to the corresponding fields in the accounting system 200 and converting the data values into a format recognizable by the accounting system. This may involve various operations such as data formatting, data type conversion, and data validation, depending on the specific requirements of the accounting system 200.

為了促進這個轉換過程,該方法使用了一個轉換模組170。轉換模組170是一個設計用於執行數據轉換任務的軟體元件。它包括一組預定義的規則和演算法,用於指導如何將第三檔案模組150中的數據轉換為會計系統200可接受的格式。轉換模組170可以定制以適應不同的會計系統,從而確保該方法是靈活和多功能的。 To facilitate this conversion process, the method uses a conversion module 170. The conversion module 170 is a software component designed to perform data conversion tasks. It includes a set of predefined rules and algorithms that guide how to convert the data in the third file module 150 into a format acceptable to the accounting system 200. The conversion module 170 can be customized to adapt to different accounting systems, thereby ensuring that the method is flexible and versatile.

轉換模組170通過讀取第三檔案模組150中的數據,應用轉換規則和演算法將數據轉換為所需格式,然後輸出轉換後的數據。轉換模組170的輸出是一組以會計系統可接受的格式呈現的數據。這些數據隨後可以導入到會計系統200中,從而完成發票處理工作流程。 The conversion module 170 converts the data into the required format by reading the data in the third file module 150, applying the conversion rules and algorithms, and then outputting the converted data. The output of the conversion module 170 is a set of data presented in a format acceptable to the accounting system. These data can then be imported into the accounting system 200, thereby completing the invoice processing workflow.

總之,將修正後的數據轉換為會計系統200可接受的格式是該公開方法中的一個策略性步驟,確保了處理後的發票數據可以無縫地整合到現有的會計系統中。通過使用轉換模組來促進這一過程,該方法確保了發票數據可以準確和高效地被處理,從而提高了發票處理方法的整體效率和有效性。 In summary, converting the corrected data into a format acceptable to the accounting system 200 is a strategic step in the disclosed method, ensuring that the processed invoice data can be seamlessly integrated into the existing accounting system. By using the conversion module to facilitate this process, the method ensures that the invoice data can be processed accurately and efficiently, thereby improving the overall efficiency and effectiveness of the invoice processing method.

在本實施例中,採用了一個特定的過程來命名和管理資料夾和檔案。這個過程涉及使用特定的符號和命名規則,旨在促進資料夾和檔案的更容易識別和自動化處理。這些符號和命名規則是基於發票的類型,從而提供了一個邏輯和直觀的系統來組織和管理發票數據。 In this embodiment, a specific process is employed to name and manage folders and files. This process involves the use of specific symbols and naming conventions designed to facilitate easier identification and automated handling of folders and files. These symbols and naming conventions are based on the type of invoice, thereby providing a logical and intuitive system to organize and manage invoice data.

其中,用於命名和管理過程中的符號代表不同類型的發票。例如,符號'▲'代表銷項發票,而符號'▼'代表進項發票。'ʃ'符號的數量表示發票是三聯式還是二聯式。例如,三個'ʃ'符號,亦即:ʃʃʃ,代表一個三聯式發票,而兩個'ʃ'符號,亦即:ʃʃ,代表一個二聯式發票。'III'或'II'符號的存在表示發票是由收銀機發出的,其中'III'代表三聯式的收銀機發票,而'II'代表二聯式的收銀機發票。'▲'和'▼'符號的組合代表銷貨退回、進貨退出或折讓證明單。 Among them, the symbols used in the naming and management process represent different types of invoices. For example, the symbol '▲' represents a sales invoice, while the symbol '▼' represents a purchase invoice. The number of 'ʃ' symbols indicates whether the invoice is a triple or double invoice. For example, three 'ʃ' symbols, that is: ʃʃʃ, represent a triple invoice, while two 'ʃ' symbols, that is: ʃʃ, represent a double invoice. The presence of the 'III' or 'II' symbol indicates that the invoice is issued by a cash register, where 'III' represents a triple cash register invoice and 'II' represents a double cash register invoice. The combination of the '▲' and '▼' symbols represents a sales return, a purchase withdrawal, or a discount certificate.

另外,命名規則涉及將序列號或其他符號附加到代表發票類型的符號上。例如,檔案或資料夾名稱可能是'▲ʃʃ-1-xxx-load',其中'▲ʃʃ'代表手動發出的進項二聯式發票,'-1-'是一個序列號,'xxx'是額外資訊的占位符,而'load'則表示處理工作流程處於下載資料的階段。這些命名規則提供了一種系統化和一致的方式來命名和管理資料夾和檔案,從而促進發票數據的高效和準確處理。 Additionally, naming conventions involve appending a sequence number or other symbol to a symbol representing an invoice type. For example, a file or folder name might be '▲ʃʃ-1-xxx-load', where '▲ʃʃ' represents a manually issued invoice, '-1-' is a sequence number, 'xxx' is a placeholder for additional information, and 'load' indicates that the processing workflow is in the downloading data phase. These naming conventions provide a systematic and consistent way to name and manage folders and files, thereby facilitating efficient and accurate processing of invoice data.

通過使用這些特定的符號和命名規則,本實施例提供了一種高效和有效的方式來管理涉及發票處理工作流程的資料夾和檔案。這不僅提高了該方法的整體效率,而且提高了發票數據處理的準確性和可靠性,從而有助於提高發票處理方法的整體效果。 By using these specific symbols and naming conventions, the present embodiment provides an efficient and effective way to manage folders and files involved in the invoice processing workflow. This not only improves the overall efficiency of the method, but also improves the accuracy and reliability of invoice data processing, thereby helping to improve the overall effectiveness of the invoice processing method.

在上述的實施例中,提到第二檔案模組140中包含的數據修正演算法142使用各種數據驗證技術來識別和修正數據中的常見錯誤模式。這些技術旨在提高從OCR過程獲得的數據的準確性和可靠性。以下,將對這些數據修正演算法舉例進行較詳細的介紹。 In the above-mentioned embodiment, it is mentioned that the data correction algorithm 142 included in the second file module 140 uses various data verification techniques to identify and correct common error patterns in the data. These techniques are intended to improve the accuracy and reliability of the data obtained from the OCR process. Below, these data correction algorithms will be introduced in more detail with examples.

其中一種技術是規則驗證。這涉及將數據與一組預定義的規則或模式進行比較。如果數據匹配已知的錯誤模式,演算法會對數據進行修正。這可能涉及各種操作,例如替換不正確的字符、調整字符間距、糾正對齊問題,以及規範字體和大小的變化。演算法使用的另一種技術是模式匹配。這涉及識別數據中的特定模式或結構,例如日期、電話號碼或發票號碼。通過識別這些模式,演算法可以準確地提取和驗證相應的數據。 One of these techniques is rule validation. This involves comparing the data to a set of predefined rules or patterns. If the data matches a known error pattern, the algorithm makes corrections to the data. This can involve various actions, such as replacing incorrect characters, adjusting character spacing, correcting alignment issues, and regulating font and size variations. Another technique used by the algorithm is pattern matching. This involves identifying specific patterns or structures in the data, such as dates, phone numbers, or invoice numbers. By identifying these patterns, the algorithm can accurately extract and verify the corresponding data.

範圍檢查是演算法使用的另一種技術。這涉及將數值數據與預定義的範圍或限制進行比較。如果數據超出這個範圍,則會被標記為可能有誤,這一技術對於驗證數值數據(如發票金額或稅率)特別有用。列表驗證是演算法採用的另一種技術。這涉及將數據與一個預定義的有效值列表進行比較。如果數據與列表中的任何值都不匹配,則會被標記為可能有誤。這一技術對於驗證分類數據(如產品代碼或供應商名稱)特別有用。 Range checking is another technique used by the algorithm. This involves comparing numeric data to a predefined range or limit. If the data falls outside this range, it is flagged as potentially erroneous. This technique is particularly useful for validating numeric data, such as invoice amounts or tax rates. List validation is another technique employed by the algorithm. This involves comparing the data to a predefined list of valid values. If the data does not match any of the values in the list, it is flagged as potentially erroneous. This technique is particularly useful for validating categorical data, such as product codes or supplier names.

演算法還採用了多階段驗證過程。這涉及按順序應用多種驗證技術。每個驗證過程的階段都旨在捕捉和糾正不同類型的錯誤,從而提高數據的整體準確性。 The algorithm also employs a multi-stage validation process. This involves applying multiple validation techniques in sequence. Each stage of the validation process is designed to catch and correct different types of errors, thereby improving the overall accuracy of the data.

除了這些自動化驗證技術外,該方法還涉及手動審核過程。這涉及人工操作員審核數據並手動糾正任何錯誤或不一致。這一手動審核過程作為最終的品質控制層,確保數據盡可能地準確和可靠。 In addition to these automated validation techniques, the approach also involves a manual review process. This involves a human operator reviewing the data and manually correcting any errors or inconsistencies. This manual review process acts as a final layer of quality control, ensuring that the data is as accurate and reliable as possible.

此外,演算法還包括錯誤記錄和報告功能。這涉及在錯誤日誌中記錄所有識別出的錯誤和相應的修正。這個錯誤日誌可以被審查和分析,以識別常見的錯誤 模式並提高演算法的性能。錯誤日誌還作為數據修正過程的記錄,提供透明度和問責制。 Additionally, the algorithm includes error logging and reporting capabilities. This involves recording all identified errors and corresponding corrections in an error log. This error log can be reviewed and analyzed to identify common error patterns and improve the performance of the algorithm. The error log also serves as a record of the data correction process, providing transparency and accountability.

總之,第二檔案模組140中的數據修正演算法142使用各種數據驗證技術來提高從OCR過程獲得的數據的準確性和可靠性。這些技術,結合手動審核過程和錯誤記錄功能,確保數據盡可能地準確和可靠,從而提高了發票處理方法的整體效率和有效性。 In summary, the data correction algorithm 142 in the second file module 140 uses various data validation techniques to improve the accuracy and reliability of the data obtained from the OCR process. These techniques, combined with the manual review process and error logging capabilities, ensure that the data is as accurate and reliable as possible, thereby improving the overall efficiency and effectiveness of the invoice processing method.

總的來說,這項專利揭示了一種高效且準確的處理發票數據的方法和系統。該方法和系統利用先進的OCR技術和精密的數據修正演算法,以及手動審核過程和錯誤記錄功能,確保數據盡可能地準確和可靠。此外,該方法和系統還包括一個特定的過程來命名和管理資料夾和檔案,以及將修正後的數據轉換為會計系統可接受的格式,從而提高了發票處理方法的整體效率和有效性。這些特性使得該方法和系統對於處理大量發票數據的企業來說,具有很大的實用價值。 In summary, this patent discloses a method and system for processing invoice data efficiently and accurately. The method and system utilize advanced OCR technology and sophisticated data correction algorithms, as well as manual review processes and error recording functions to ensure that the data is as accurate and reliable as possible. In addition, the method and system also include a specific process to name and manage folders and files, and convert the corrected data into a format acceptable to the accounting system, thereby improving the overall efficiency and effectiveness of the invoice processing method. These features make the method and system of great practical value to enterprises that process large amounts of invoice data.

雖然本新型已以較佳實施例揭露如上,然其並非用以限定本新型,任何所屬技術領域中具有通常知識者,在不脫離本新型之精神和範圍內,當可作些許之更動與潤飾,因此本新型之保護範圍當視後附之申請專利範圍所界定者為準。 Although the present invention has been disclosed as above with the preferred embodiment, it is not intended to limit the present invention. Anyone with common knowledge in the relevant technical field can make some changes and modifications without departing from the spirit and scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the scope defined in the attached patent application.

100:發票數據處理系統 100: Invoice data processing system

110:掃描器 110: Scanner

120:OCR模組 120:OCR module

130:第一檔案模組 130: First file module

140:第二檔案模組 140: Second file module

142:數據修正演算法 142:Data correction algorithm

150:第三檔案模組 150: Third file module

160:顯示介面 160: Display interface

170:轉換模組 170:Conversion module

200:會計系統 200:Accounting system

10:發票 10: Invoice

Claims (9)

一個用於處理發票數據的系統,該系統包括:一掃描器,用於掃描多張發票;一OCR模組,電性連接到該掃描器,該OCR模組用於將掃描得到的發票資訊轉換為數據;一第一檔案模組,電性連接到該OCR模組,該第一檔案模組用於接收和存儲轉換後的數據;一第二檔案模組,電性連接到該第一檔案模組,該第二檔案模組用於接收來自該第一檔案模組的數據並進行數據修正,其中該第二檔案模組包括用於修正數據的演算法;一第三檔案模組,電性連接到該第二檔案模組,該第三檔案模組用於接收和存儲來自該第二檔案模組的修正後的數據;一顯示介面,電性連接到該第三檔案模組,該顯示介面用於同時顯示所述發票的圖像和相應的數據,以便手動修正;以及一轉換模組,電性連接到該第三檔案模組,該轉換模組用於將修正後的數據轉換為一會計系統的可接受格式;其中,該第三檔案模組不包含任何公式或程序。 A system for processing invoice data, the system comprising: a scanner for scanning a plurality of invoices; an OCR module electrically connected to the scanner, the OCR module for converting the scanned invoice information into data; a first file module electrically connected to the OCR module, the first file module for receiving and storing the converted data; a second file module electrically connected to the first file module, the second file module for receiving the data from the first file module and performing data modification, wherein the second file module comprises a module for modifying the invoice information; An algorithm for calculating positive data; a third file module electrically connected to the second file module, the third file module is used to receive and store the corrected data from the second file module; a display interface electrically connected to the third file module, the display interface is used to simultaneously display the image of the invoice and the corresponding data for manual correction; and a conversion module electrically connected to the third file module, the conversion module is used to convert the corrected data into an acceptable format for an accounting system; wherein the third file module does not contain any formula or program. 如請求項1所述的系統,其中該第一檔案模組、該第二檔案模組和該第三檔案模組均為Excel檔案。 A system as described in claim 1, wherein the first file module, the second file module and the third file module are all Excel files. 如請求項1或請求項2所述的系統,其中該第二檔案模組中的演算法是基於在所述OCR模組中觀察到的常見錯誤模式而定制的。 A system as claimed in claim 1 or claim 2, wherein the algorithm in the second file module is customized based on common error patterns observed in the OCR module. 如請求項1所述的系統,其中該發票是根據發票類別進行分類。 A system as claimed in claim 1, wherein the invoices are classified according to invoice categories. 如請求項1所述的系統,其中該演算法使用一數據驗證技術來修正數據,且該數據驗證技術包括規則驗證、模式匹配、範圍檢查、列表驗證、或多階段驗證。 A system as described in claim 1, wherein the algorithm uses a data validation technique to modify the data, and the data validation technique includes rule validation, pattern matching, range checking, list validation, or multi-stage validation. 如請求項1所述的系統,其中於該系統中使用到的資料夾和檔案是用特定的符號和命名規則進行命名和管理,以便更容易識別和自動處理。 A system as described in claim 1, wherein the folders and files used in the system are named and managed using specific symbols and naming rules for easier identification and automatic processing. 如請求項6所述的系統,其中所述符號和命名規則是基於該發票類型而定制的。 A system as claimed in claim 6, wherein the symbols and naming conventions are customized based on the invoice type. 如請求項2所述的系統,其中該第一檔案模組和該第三檔案模組是XLSX檔案,該第二檔案模組是XLSM檔案。 A system as described in claim 2, wherein the first file module and the third file module are XLSX files, and the second file module is an XLSM file. 如請求項1所述的系統,其中所述OCR模組是在Microsoft Power Automate平台上執行的。 A system as described in claim 1, wherein the OCR module is executed on the Microsoft Power Automate platform.
TW112212528U 2023-11-17 2023-11-17 System for processing invoice data TWM655760U (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW112212528U TWM655760U (en) 2023-11-17 2023-11-17 System for processing invoice data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW112212528U TWM655760U (en) 2023-11-17 2023-11-17 System for processing invoice data

Publications (1)

Publication Number Publication Date
TWM655760U true TWM655760U (en) 2024-05-21

Family

ID=92075621

Family Applications (1)

Application Number Title Priority Date Filing Date
TW112212528U TWM655760U (en) 2023-11-17 2023-11-17 System for processing invoice data

Country Status (1)

Country Link
TW (1) TWM655760U (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI897103B (en) * 2023-11-17 2025-09-11 爍益管理顧問有限公司 Invoice processing system and method for converting paper invoices into electronic data

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI897103B (en) * 2023-11-17 2025-09-11 爍益管理顧問有限公司 Invoice processing system and method for converting paper invoices into electronic data

Similar Documents

Publication Publication Date Title
US8958605B2 (en) Systems, methods and computer program products for determining document validity
US10366123B1 (en) Template-free extraction of data from documents
US20050289182A1 (en) Document management system with enhanced intelligent document recognition capabilities
US20210366055A1 (en) Systems and methods for generating accurate transaction data and manipulation
JP6357621B1 (en) Accounting processing apparatus, accounting processing system, accounting processing method and program
US11501344B2 (en) Partial perceptual image hashing for invoice deconstruction
JP2015146075A (en) accounting data input support system, method, and program
JP2019086984A (en) Computer and document identification method
US20120203676A1 (en) Software for Automatic Processing of Paper Financial Documents
CN115116068B (en) Archive intelligent archiving system based on OCR
US20140207631A1 (en) Systems and Method for Analyzing and Validating Invoices
TWM655760U (en) System for processing invoice data
CN118628263A (en) Financial audit methods, devices, equipment, media and products
CN111768565B (en) Method for identifying and post-processing invoice codes in value-added tax invoices
KR20240117827A (en) Method for document automation processing and apparatus implementing the same
US20250316107A1 (en) Systems and methods for secure management of document execution
JP6755926B2 (en) Form information recognition device and form information recognition method
TWI897103B (en) Invoice processing system and method for converting paper invoices into electronic data
CN119600639A (en) An OCR drawing recognition system based on large models
CN118863813A (en) A contract verification method, device, computer equipment and storage medium
CN115587098A (en) Method and system for intelligently identifying chart data
JP2007280413A (en) Automatic input device of financial statement
TWI879700B (en) Methods for extracting OCR data of purchase and sales items or other documents
JP2006252575A (en) Financial statement automatic input apparatus and method therefore
Fernando Intelligent Document Processing: A Guide For Building RPA Solutions