CN111815500A - Image processing method, device, equipment and storage medium - Google Patents
Image processing method, device, equipment and storage medium Download PDFInfo
- Publication number
- CN111815500A CN111815500A CN202010611345.6A CN202010611345A CN111815500A CN 111815500 A CN111815500 A CN 111815500A CN 202010611345 A CN202010611345 A CN 202010611345A CN 111815500 A CN111815500 A CN 111815500A
- Authority
- CN
- China
- Prior art keywords
- published
- information
- platform
- picture
- original picture
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
- G06T1/0021—Image watermarking
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2201/00—General purpose image data processing
- G06T2201/005—Image watermarking
- G06T2201/0065—Extraction of an embedded watermark; Reliable detection
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Information Transfer Between Computers (AREA)
Abstract
本申请公开了一种图像处理方法、装置、设备及存储介质,涉及图像处理技术领域。具体实现方案为:获取在已发布平台中的已发布信息;其中,所述已发布信息包括文字内容和水印图片;获取与所述水印图片相关联的原始图片;将所述原始图片和所述文字内容上传至待发布平台,进行信息发布。本申请实施例实现了对已发布信息从已发布平台到待发布平台的自动迁移,无需进行文字素材的收集和水印图片的去水印处理,也无需发布信息的手动编辑处理,从而提高了信息发布效率。
The present application discloses an image processing method, apparatus, device and storage medium, and relates to the technical field of image processing. The specific implementation scheme is: obtaining the published information in the published platform; wherein, the published information includes text content and a watermarked picture; obtaining the original picture associated with the watermarked picture; combining the original picture with the watermarked picture The text content is uploaded to the platform to be released for information release. The embodiment of the present application realizes the automatic migration of the published information from the published platform to the platform to be published, without the need for the collection of text materials, the removal of watermarking of watermarked pictures, and the need for manual editing of the published information, thereby improving information publishing. efficiency.
Description
技术领域technical field
本申请涉及人工智能技术领域,具体涉及图像处理技术。具体地,本申请提供了一种利用图像处理技术进行信息发布的图像处理方法、装置、设备及存储介质。The present application relates to the field of artificial intelligence technology, in particular to image processing technology. Specifically, the present application provides an image processing method, apparatus, device, and storage medium that utilize image processing technology to publish information.
背景技术Background technique
随着互联网技术的发展,各种信息共享平台应运而生。为了提升信息的曝光度,信息发布者通常会选择将同一信息在不同平台中进行发布,从而增加信息的传播量。With the development of Internet technology, various information sharing platforms have emerged. In order to increase the exposure of information, information publishers usually choose to publish the same information on different platforms, thereby increasing the dissemination of information.
当在一个平台中的已发布信息在另一个平台进行二次发布时,需要对已发布信息中的水印图片进行去水印处理,并将处理后的图片及图片关联内容在待发布平台进行发布。由于去水印过程较为繁琐,致使信息发布效率较低。When the published information on one platform is re-published on another platform, the watermarked images in the published information need to be de-watermarked, and the processed images and their associated content are published on the platform to be published. Due to the cumbersome process of de-watermarking, the efficiency of information release is low.
发明内容SUMMARY OF THE INVENTION
本申请提供了一种信息发布效率更高的图像处理方法、装置、设备及存储介质。The present application provides an image processing method, apparatus, device and storage medium with higher information release efficiency.
根据本申请的一方面,提供了一种图像处理方法,包括:According to an aspect of the present application, an image processing method is provided, comprising:
获取在已发布平台中的已发布信息;其中,所述已发布信息包括文字内容和水印图片;Obtain the published information in the published platform; wherein, the published information includes text content and watermark images;
获取与所述水印图片相关联的原始图片;obtaining the original picture associated with the watermark picture;
将所述原始图片和所述文字内容上传至待发布平台,进行信息发布。The original picture and the text content are uploaded to the platform to be released, and information is released.
根据本申请的另一方面,提供了一种图像处理装置,包括:According to another aspect of the present application, an image processing apparatus is provided, comprising:
已发布信息获取模块,用于获取在已发布平台中的已发布信息;其中,所述已发布信息包括文字内容和水印图片;A published information acquisition module, used to acquire the published information in the published platform; wherein, the published information includes text content and watermark pictures;
原始图片获取模块,用于获取与所述水印图片相关联的原始图片;an original image acquisition module, used to acquire the original image associated with the watermark image;
信息发布模块,用于将所述原始图片和所述文字内容上传至待发布平台,进行信息发布。The information release module is used for uploading the original picture and the text content to the platform to be released for information release.
根据本申请的又一方面,提供了一种电子设备,其中,包括:According to yet another aspect of the present application, an electronic device is provided, comprising:
至少一个处理器;以及at least one processor; and
与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein,
所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行本申请实施例提供的任意一种图像处理方法。The memory stores instructions executable by the at least one processor, where the instructions are executed by the at least one processor, so that the at least one processor can perform any image processing provided by the embodiments of the present application method.
根据本申请的再一方面,提供了一种存储有计算机指令的非瞬时计算机可读存储介质,其中,所述计算机指令用于使所述计算机执行本申请实施例提供的任意一种图像处理方法。According to another aspect of the present application, a non-transitory computer-readable storage medium storing computer instructions is provided, wherein the computer instructions are used to cause the computer to execute any image processing method provided by the embodiments of the present application .
根据本申请的技术方案提高了在信息共享平台中的信息发布效率。The technical solution according to the present application improves the information release efficiency in the information sharing platform.
应当理解,本部分所描述的内容并非旨在标识本公开的实施例的关键或重要特征,也不用于限制本公开的范围。本公开的其它特征将通过以下的说明书而变得容易理解。It should be understood that what is described in this section is not intended to identify key or critical features of embodiments of the disclosure, nor is it intended to limit the scope of the disclosure. Other features of the present disclosure will become readily understood from the following description.
附图说明Description of drawings
附图用于更好地理解本方案,不构成对本申请的限定。其中:The accompanying drawings are used for better understanding of the present solution, and do not constitute a limitation to the present application. in:
图1是本申请实施例提供的一种图像处理方法的流程图;1 is a flowchart of an image processing method provided by an embodiment of the present application;
图2是本申请实施例提供的另一种图像处理方法的流程图;2 is a flowchart of another image processing method provided by an embodiment of the present application;
图3是本申请实施例提供的另一种图像处理方法的流程图;3 is a flowchart of another image processing method provided by an embodiment of the present application;
图4是本申请实施例提供的另一种图像处理方法的流程图;4 is a flowchart of another image processing method provided by an embodiment of the present application;
图5是本申请实施例提供的一种图像处理装置的结构图;5 is a structural diagram of an image processing apparatus provided by an embodiment of the present application;
图6是用来实现本申请实施例的一种图像处理方法的电子设备的框图。FIG. 6 is a block diagram of an electronic device for implementing an image processing method according to an embodiment of the present application.
具体实施方式Detailed ways
以下结合附图对本申请的示范性实施例做出说明,其中包括本申请实施例的各种细节以助于理解,应当将它们认为仅仅是示范性的。因此,本领域普通技术人员应当认识到,可以对这里描述的实施例做出各种改变和修改,而不会背离本申请的范围和精神。同样,为了清楚和简明,以下的描述中省略了对公知功能和结构的描述。Exemplary embodiments of the present application are described below with reference to the accompanying drawings, which include various details of the embodiments of the present application to facilitate understanding, and should be considered as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present application. Also, descriptions of well-known functions and constructions are omitted from the following description for clarity and conciseness.
本申请实施例所提供的各图像处理方法和图像处理装置,适用于采用图像处理技术,对在一个信息共享平台中的已发布信息,在另一个信息共享平台中进行信息发布的情况,该方法由图像处理装置执行,该装置采用软件和/或硬件实现,并具体配置于电子设备中。The image processing methods and image processing apparatuses provided in the embodiments of the present application are suitable for the situation that the information published in one information sharing platform is released in another information sharing platform by using image processing technology, the method Executed by an image processing apparatus, the apparatus is implemented by software and/or hardware, and is specifically configured in an electronic device.
图1是本申请实施例提供的一种图像处理方法的流程图,该方法包括:FIG. 1 is a flowchart of an image processing method provided by an embodiment of the present application, and the method includes:
S101、获取在已发布平台中的已发布信息;其中,已发布信息包括文字内容和水印图片。S101. Acquire published information on a published platform; wherein, the published information includes text content and watermarked pictures.
其中,水印图片可以理解为添加有与已发布平台相关联的水印标记的图片。The watermark picture can be understood as a picture added with a watermark associated with the published platform.
在本申请实施例的一种可选实施方式中,已发布信息可以预先存储在电子设备或与电子设备关联的存储设备中,相应的,在需要时,在电子设备或与电子设备所关联的存储设备中,进行已发布信息的获取。In an optional implementation of this embodiment of the present application, the published information may be pre-stored in the electronic device or a storage device associated with the electronic device. In the storage device, the published information is obtained.
为了减少数据存储或所存储数据的维护成本,在本申请实施例的另一可选实施方式中,还可以通过登录已发布平台,在已发布平台中的已发布网址,进行已发布信息的爬取。In order to reduce the cost of data storage or the maintenance of the stored data, in another optional implementation of the embodiment of the present application, it is also possible to log in to the published platform, and to crawl the published information on the published website of the published platform. Pick.
为了实现对已发布平台中已发布信息的自动爬取,提高数据获取效率,还可以在已发布平台实时或定时针对设定信息发布者进行监测,并在监测到信息发布者在该已发布平台进行信息发布时,根据已发布信息的网址,从已发布平台中进行已发布信息的爬取。In order to realize the automatic crawling of the published information on the published platform and improve the efficiency of data acquisition, it is also possible to monitor the set information publishers in real time or regularly on the published platform, and monitor the information publishers on the published platform after monitoring. When publishing information, crawl the published information from the published platform according to the URL of the published information.
需要说明的是,为了避免版权问题带来的纷争,通常对已发布信息的爬取和后续处理,为经信息发布者允许或授权后才会执行。It should be noted that, in order to avoid disputes caused by copyright issues, the crawling and subsequent processing of published information is usually performed only after the permission or authorization of the information publisher.
S102、获取与水印图片相关联的原始图片。S102. Obtain the original picture associated with the watermarked picture.
其中,原始图片可以理解为在已发布平台中进行信息发布时,未添加有平台关联水印的图片。需要说明的是,该原始图片可以是在该已发布平台中进行信息发布时,信息发布者所采用的原始图片,还可以是在其他已发布平台中进行信息发布时,信息发布者所采用的原始图片。Among them, the original picture can be understood as the picture without the platform-related watermark added when the information is published on the published platform. It should be noted that the original picture can be the original picture used by the information publisher when the information is published on the published platform, or the original image used by the information publisher when the information is published on other published platforms. Original Image.
需要说明的是,为了避免所获取的原始图片经多次编辑后出现清晰度下降或图片失真的情况,还可以会在进行已发布信息关联的首次发布信息的发布时所采用的未添加水印的图片,作为水印图片相关联的原始图片。It should be noted that, in order to avoid the loss of sharpness or distortion of the original image after multiple editing, the unwatermarked image may also be used when publishing the information associated with the published information for the first time. Picture, as the original picture associated with the watermark picture.
可选的,在电子设备本地或与电子设备所关联的其他存储设备中,预先存储有与水印图片所关联的原始图片,相应的,可以通过水印图片进行原始图片的查找,从而通过原始图片查找的方式替代对水印图片进行去水印处理,为后续进行信息发布提供了信息素材,同时减少了数据处理成本,提高了图片处理效率。Optionally, in the local electronic device or other storage devices associated with the electronic device, the original picture associated with the watermarked picture is pre-stored. Correspondingly, the original picture can be searched through the watermarked picture, so that the original picture can be searched through the original picture. Instead of de-watermarking the watermarked pictures, it provides information material for subsequent information release, reduces the data processing cost, and improves the picture processing efficiency.
示例性地,获取与水印图片相关联的原始图片,可以通过水印图片的附图说明,从电子设备本地或电子设备所关联的其他存储设备中,进行水印图片所关联的原始图片的查找获取。Exemplarily, to obtain the original picture associated with the watermarked picture, the original picture associated with the watermarked picture can be searched and obtained from the local electronic device or other storage devices associated with the electronic device through the description of the watermarked picture.
由于不同图片所对应的附图说明可能相同,因此通过附图说明进行水印图片所关联的原始图片的查找,可能存在查找结果不准确的情况,为了提高所查找结果的准确度,在本申请实施例的一种可选实施方式中,还可以在原始图片库中查找与水印图片相关性较高的原始图片,并将查找到的原始图片作为水印图片所关联的原始图片。Since the descriptions of the drawings corresponding to different pictures may be the same, the search results of the original pictures associated with the watermarked pictures may be inaccurate in the search for the original pictures associated with the watermarked pictures through the descriptions of the drawings. In an optional implementation manner of the example, an original picture with high correlation with the watermarked picture may also be searched in the original picture library, and the found original picture is used as the original picture associated with the watermarked picture.
示例性地,确定原始图片和水印图片自身的相似度,并根据相似度确定结果,从各原始图片中选取与水印图片关联的原始图片。具体的们可以选取相似度大于设定相似度阈值的其中一个原始图片作为水印图片关联的原始图片。其中,设定相似度阈值可以由技术人员根据需要或经验值进行确定。Exemplarily, the similarity between the original picture and the watermarked picture itself is determined, and according to the similarity determination result, the original picture associated with the watermarked picture is selected from each original picture. Specifically, one of the original pictures whose similarity is greater than the set similarity threshold may be selected as the original picture associated with the watermarked picture. Wherein, the setting of the similarity threshold can be determined by technical personnel according to needs or empirical values.
示例性地,在原始图片库中查找与水印图片相关性较高的原始图片,可以是:提取水印图片的特征数据,得到水印图片特征;将水印图片特征与原始图片库中各原始图片的原始图片特征进行比对;根据比对结果确定与水印图片所关联的原始图片。可以理解的是,通过图像处理技术对水印图片进行特征数据的提取,为原始图片的查找确定提供了数据支撑,也避免了通过原始图片与水印图片自身比对带来的数据量的增加。Exemplarily, in the original picture library, searching for the original picture with high correlation with the watermark picture may be: extracting the feature data of the watermark picture to obtain the watermark picture feature; The image features are compared; the original image associated with the watermarked image is determined according to the comparison result. It can be understood that the feature data extraction of the watermark image by image processing technology provides data support for the search and determination of the original image, and also avoids the increase in the amount of data caused by the comparison between the original image and the watermark image itself.
具体的,根据比对结果,确定与水印图片所关联的原始图片,可以是:确定与水印图片特征比对相似度大于设定相似度阈值的原始图片,并将所确定的原始图片作为水印图片所关联的原始图片。和/或确定与水印图片特征比对相似度最高的设定数量个原始图片作为候选原始图片;选择候选原始图片中的其中一个,作为水印图片所关联的原始图片。其中,设定相似度阈值由技术人员根据需要或经验值进行确定,还可以根据实际比对结果,对设定相似度阈值在一定误差范围内进行微调;其中,设定数量由技术人员根据需要或经验值进行确定,例如确定为固定值,或通过固定百分比,根据原始图片库中所包含原始图片的数量与固定百分比的乘积,设定为可变值。Specifically, determining the original image associated with the watermarked image according to the comparison result may be: determining the original image whose similarity with the watermarked image feature is greater than the set similarity threshold, and using the determined original image as the watermarked image The associated original image. And/or determine a set number of original pictures with the highest similarity with the watermarked picture feature comparison as candidate original pictures; select one of the candidate original pictures as the original picture associated with the watermarked picture. Among them, the set similarity threshold is determined by the technical personnel according to the needs or experience, and the set similarity threshold can be fine-tuned within a certain error range according to the actual comparison results; wherein, the set number is determined by the technical personnel according to the needs. It is determined by an empirical value, for example, a fixed value, or a fixed percentage, which is set as a variable value according to the product of the number of original pictures contained in the original picture library and the fixed percentage.
S103、将原始图片和文字内容上传至待发布平台,进行信息发布。S103, upload the original image and text content to the platform to be released, and release the information.
将原始图片和文字内容上传至待发布平台,从而使得在待发布平台的信息发布页面,根据原始图片和文字内容生成新的发布信息。The original image and text content are uploaded to the platform to be released, so that on the information release page of the platform to be released, new release information is generated according to the original image and text content.
需要说明的是,一般的,为了避免版权纠纷,在进行发布过程中,通过会对所发布的原始图片中添加与待发布平台相关联的水印。It should be noted that, generally, in order to avoid copyright disputes, during the publishing process, a watermark associated with the platform to be published is added to the published original image.
本申请实施例通过获取在已发布平台中的已发布信息;其中,已发布信息包括文字内容和水印图片;获取与水印图片相关联的原始图片;将原始图片和文字内容上传至待发布平台,进行信息发布。本申请实施例通过进行已发布信息的获取,实现了对已发布信息中文字内容的复用,为后续在待发布平台的信息发布提供信息素材,减少了文字素材获取时间;通过进行水印图片的原始图片的查找代替对水印图片的去水印处理,为后续在待发布平台的信息发布提供信息素材,减少了图片素材获取时间;通过采用文字内容和水印图片关联的原始图片在待发布平台进行信息发布,从而实现了对已发布信息从已发布平台到待发布平台的自动迁移,无需发布信息的手动编辑处理,从而提高了信息发布效率。The embodiment of the present application obtains the published information in the published platform; wherein, the published information includes the text content and the watermarked picture; obtains the original picture associated with the watermarked picture; uploads the original picture and the textual content to the platform to be published, Release information. The embodiment of the present application realizes the multiplexing of the text content in the published information by acquiring the published information, provides information materials for the subsequent information release on the platform to be released, and reduces the acquisition time of the text materials; The search of the original image replaces the watermarking process of the watermarked image, providing information material for the subsequent information release on the platform to be released, reducing the acquisition time of the image material; by using the original image associated with the text content and the watermarked image on the platform to be released. Publishing, thereby realizing the automatic migration of the published information from the published platform to the platform to be published, without manual editing of the published information, thereby improving the information publishing efficiency.
图2是本申请实施例提供的另一种图像处理方法的流程图,该方法在是上述各实施例的技术方案的基础上,进行了优化改进。FIG. 2 is a flowchart of another image processing method provided by an embodiment of the present application, which is optimized and improved on the basis of the technical solutions of the foregoing embodiments.
进一步地,将操作“获取与水印图片相关联的原始图片”,细化为“对水印图片进行特征提取,得到水印图片特征;根据水印图片特征,在原始图片特征库中查找与水印图片相关联的原始图片”,以完善原始图片的获取机制。Further, the operation "obtain the original picture associated with the watermarked picture" is refined into "perform feature extraction on the watermarked picture to obtain the watermarked picture feature; according to the watermarked picture feature, search for the original picture feature library associated with the watermarked picture. "Original Picture" to improve the acquisition mechanism of the original picture.
如图2所示的一种图像处理方法,包括:An image processing method as shown in Figure 2, comprising:
S201、获取在已发布平台中的已发布信息;其中,已发布信息包括文字内容和水印图片。S201. Acquire published information on a published platform; wherein, the published information includes text content and watermarked pictures.
S202、对水印图片进行特征提取,得到水印图片特征。S202. Perform feature extraction on the watermarked picture to obtain features of the watermarked picture.
S203、根据所述水印图片特征,在原始图片特征库中确定与所述水印图片相关联的原始图片特征。S203. According to the watermarked picture feature, determine the original picture feature associated with the watermarked picture in the original picture feature library.
S204、获取所确定的原始图片特征所关联的原始图片作为所述水印图片相关联的原始图片。S204: Obtain the original picture associated with the determined original picture feature as the original picture associated with the watermarked picture.
示例性地,依次遍历原始图片特征库,确定原始图片特征库中各原始图片特征与水印图片特征之间的相似度;根据相似度从原始图片特征库中确定与水印图片所关联的原始图片特征;根据所确定的原始图片特征,获取相应的原始图片作为水印图片所关联的原始图片。Exemplarily, the original image feature library is traversed in turn to determine the similarity between each original image feature in the original image feature library and the watermarked image feature; the original image feature associated with the watermarked image is determined from the original image feature library according to the similarity. ; According to the determined original picture feature, obtain the corresponding original picture as the original picture associated with the watermarked picture.
可以理解的是,通过预先对各原始图片的特征数据进行提取,得到原始图片特征,并添加至预先构建的原始图片特征库中。从而在进行水印图片所关联的原始图片的确定过程中,直接在原始图片特征库中进行水印图片特征的查找比对,无需再针对各原始图片进行特征提取,从而提高了原始图片的确定效率。It can be understood that, by pre-extracting the feature data of each original image, the original image features are obtained, and added to the pre-built original image feature library. Therefore, in the process of determining the original image associated with the watermarked image, the feature of the watermarked image is directly searched and compared in the original image feature library, and there is no need to perform feature extraction for each original image, thereby improving the determination efficiency of the original image.
需要说明的是,随着同部门信息发布者数量的增加,以及信息发布者工作年限的增加,可能导致存在原始图片数量过大的情况。那么,相应的,原始图片特征库中的原始图片特征的数量也会较大,使得采用依次遍历查找水印图片相关联的原始图片特征的方式,会占用大量的数据运算资源,同时还会消耗较高的时间成本。It should be noted that, with the increase in the number of information publishers in the same department and the increase in the working years of the information publishers, there may be a situation where the number of original pictures is too large. Correspondingly, the number of original image features in the original image feature library will also be larger, so that the method of searching for the original image features associated with the watermarked image in turn will take up a lot of data computing resources, and will also consume a relatively large amount of data. high time cost.
为了避免在原始图片特征库遍历库遍历过程中,出现遍历中断,需要重新进行原始图片特征库遍历,导致的数据重复处理的情况,还可以在进行原始图片特征库遍历过程中,实时或定时进行进度保存,从而在出现遍历中断时,能够基于最新存储的进度位置,继续进行原始图片特征库的遍历,减少不必要的重复操作。In order to avoid traversal interruption during the traversal process of the original image feature library, it is necessary to re-traverse the original image feature library, resulting in repeated data processing. During the original image feature library traversal process, real-time or timing can also be performed. The progress is saved, so that when the traversal is interrupted, the traversal of the original image feature library can be continued based on the latest stored progress position, thereby reducing unnecessary repeated operations.
为了减少数据运算资源占用量,同时提高数据查询效率,在本申请实施例的一个可选实施方式中,可以将原始图片特征库中的各原始图片特征,根据已发布信息的连载属性和/或信息发布者的发布者标识,分批次存储,得到多个特征查找库;相应的,根据所述水印图片特征,在原始图片特征库中确定与所述水印图片相关联的原始图片特征,可以是根据已发布信息的连载属性和/或信息发布者,从原始图片特征库中确定待查找特征库;根据水印图片特征,在待查找特征库中,查找与水印图片相关联的原始图片特征。In order to reduce the occupancy of data computing resources and improve the efficiency of data query, in an optional implementation of the embodiment of the present application, each original image feature in the original image feature library may be The publisher identifier of the information publisher is stored in batches, and multiple feature search libraries are obtained; correspondingly, according to the watermark picture feature, the original picture feature associated with the watermark picture is determined in the original picture feature library, which can be The feature library to be searched is determined from the original picture feature library according to the serialization attribute of the published information and/or the information publisher; the original picture feature associated with the watermark picture is searched in the feature library to be searched according to the watermark picture feature.
其中,连载属性,可以理解为已发布信息存在与之具备时间顺序关联和内容前后关联的其他发布信息的属性。例如,在平台中发布的连载小说即为具备连载属性的已发布信息。Among them, the serialization attribute can be understood as an attribute of other published information that is associated with time sequence and content before and after the published information. For example, serialized novels published on the platform are published information with serialization properties.
其中,发布者标识可以是具备唯一性的发布者名称或编号等。Wherein, the publisher identifier may be a unique publisher name or serial number or the like.
例如,当已发布平台中所发布的信息为某连载小说时,针对该连载小说单独创建一原始图片特征库,用于存储与连载小说中各已发布信息中的原始图片的原始图片特征,从而在已发布平台的已发布网址进行连载小说爬取后,通过在连载小说的原始图片特征库的查找匹配,确定与爬取的连载小说中的水印图片关联的原始图片特征,进而将该原始图片特征所关联的原始图片作为水印图片所关联的原始图片。由于无需全部原始图片特征库的查找,因此减少了数据查询时间和数据查找时的资源占用量。For example, when the information published on the published platform is a serialized novel, an original image feature library is created separately for the serialized novel to store the original image features of the original images in the published information in the serialized novel, so as to After the serialized novel is crawled on the published website of the published platform, the original image feature associated with the watermarked image in the crawled serialized novel is determined by searching and matching the original image feature library of the serialized novel, and then the original image The original picture associated with the feature is the original picture associated with the watermark picture. Since there is no need to search all original image feature databases, data query time and resource occupancy during data search are reduced.
又如,某部门有多个信息发布者,不同信息发布者负责不同类型内容的发布。这种情况下,可以针对不同信息发布者单独创建原始图片特征库,用于存储该信息发布者所发布的信息中的原始图片的原始图片特征,从而能够通过水印图片特征查找匹配的方式,在该原始图片特征库中进行水印图片关联的原始图片特征的确定,进而将该原始图片特征所关联的原始图片,作为爬取的数据中的水印图片所关联的原始图片。由于无需全部原始图片特征库的查找,因此减少了数据查询时间和数据查找时的资源占用量。For another example, a department has multiple information publishers, and different information publishers are responsible for publishing different types of content. In this case, the original image feature library can be created separately for different information publishers to store the original image features of the original images in the information published by the information publishers, so that the matching method can be searched through the watermark image features. The original image feature associated with the watermarked image is determined in the original image feature library, and then the original image associated with the original image feature is used as the original image associated with the watermarked image in the crawled data. Since there is no need to search all original image feature databases, data query time and resource occupancy during data search are reduced.
再如,当一个连载小说有多个信息发布者交替发布时,还可以分别针对各信息发布者所发布的连载小说部分,构建连载小说发布时的原始图片的原始图片特征库,进而在所发布网页进行连载小说爬取后,根据信息发布者和连载小说进行原始图片特征库的查找,确定与水印图片关联的原始图片特征,进而将该原始图片特征所关联的原始图片,作为爬取的数据中的水印图片所关联的原始图片。由于无需全部原始图片特征库的查找,因此减少了数据查询时间和数据查找时的资源占用量。For another example, when a serialized novel is published alternately by multiple information publishers, it is also possible to construct an original image feature library of the original images of the serialized novel when the serialized novel is published for the parts of the serialized novel published by the information publishers, and then publish the original images in the published novels. After the serialized novel is crawled on the webpage, the original image feature library is searched according to the information publisher and the serialized novel to determine the original image feature associated with the watermarked image, and then the original image associated with the original image feature is used as the crawling data. The original image associated with the watermarked image in . Since there is no need to search all original image feature databases, data query time and resource occupancy during data search are reduced.
在上述各实施例的技术方案的基础上,为了减少原始图片确定时的资源占用量,同时提高原始图片的确定效率,在本申请实施例的另一可选实施方式中,原始图片特征库中的各原始图片特征存储有再发布标识,用于区分各原始图片特征所关联原始图片的再发布情况。示例性地,若原始图片特征包含有在某平台的再发布标识,则表明该原始图片特征所关联的原始图片对应的发布信息,已在该平台进行发布;若原始图片特征未含有在某平台的再发布标识,则表明该原始图片特征所关联的原始图片对应的发布信息,未在该平台进行发布。On the basis of the technical solutions of the above embodiments, in order to reduce the resource occupancy when determining the original image and improve the efficiency of determining the original image, in another optional implementation of the embodiments of the present application, the original image feature library is Each of the original picture features of the . Exemplarily, if the original image feature contains a re-release logo on a certain platform, it means that the release information corresponding to the original image associated with the original image feature has been released on the platform; The re-publishing logo indicates that the publishing information corresponding to the original image associated with the original image feature has not been published on this platform.
相应的,根据再发布标识,从原始图片特征库中筛选未再发布的原始图片特征;根据水印图片特征和各未再发布的原始图片特征,确定与水印图片相关联的原始图片特征。Correspondingly, unrepublished original image features are selected from the original image feature library according to the republished identifier; original image features associated with the watermarked image are determined according to the watermarked image features and the unrepublished original image features.
具体的,确定水印图片特征与各未再发布的原始图片特征之间的相似度;确定相似度大于设定相似度阈值的至少一个原始图片特征作为与水印图片相关联的原始图片特征。其中,设定相似度阈值由人员根据需要或经验值进行确定。Specifically, the similarity between the watermarked picture feature and each unpublished original picture feature is determined; at least one original picture feature whose similarity is greater than the set similarity threshold is determined as the original picture feature associated with the watermarked picture. Wherein, the set similarity threshold is determined by personnel according to needs or experience.
S205、将原始图片和文字内容上传至待发布平台,进行信息发布。S205, upload the original image and text content to the platform to be released, and release the information.
本申请实施例通过对水印图片进行特征提取,得到水印图片特征;根据水印图片特征,在原始图片特征库中确定与水印图片相关联的原始图片特征;获取所确定的原始图片特征所关联的原始图片作为水印图片相关联的原始图片,避免了在进行原始图片确定过程中,针对各原始图片进行原始图片特征的确定,提高了原始图片确定效率,从而为提高在待发布平台进行信息发布的发布效率,奠定了基础。In the embodiment of the present application, the feature of the watermarked picture is obtained by extracting the features of the watermarked picture; according to the feature of the watermarked picture, the original picture feature associated with the watermarked picture is determined in the original picture feature library; the original picture feature associated with the determined original picture feature is obtained. The picture is used as the original picture associated with the watermarked picture, which avoids determining the original picture features for each original picture in the process of determining the original picture, which improves the efficiency of determining the original picture, thereby improving the release of information release on the platform to be released. Efficiency laid the foundation.
图3是本申请实施例提供的另一种图像处理方法的流程图,该方法在上述各实施例的技术方案的基础上,进行了优化改进。FIG. 3 is a flowchart of another image processing method provided by an embodiment of the present application, which is optimized and improved on the basis of the technical solutions of the foregoing embodiments.
进一步地,将操作“获取在已发布平台中的已发布信息”,细化为“根据已发布平台的网页结构,从已发布平台的已发布网页中,爬取已发布信息”;相应的,将操作“将原始图片和文字内容上传至待发布平台,进行信息发布”,细化为“将原始图片和文字内容上传至待发布平台,以根据待发布平台的网页结构,进行信息发布”,以完善已发布信息的获取机制。Further, the operation "obtain the published information in the published platform" is refined into "according to the web page structure of the published platform, crawl the published information from the published web pages of the published platform"; correspondingly, The operation "Upload the original image and text content to the platform to be released for information release" is refined into "Upload the original image and text content to the platform to be released to release information according to the web page structure of the platform to be released", To improve the access mechanism of published information.
如图3所示的一种图像处理方法,包括:An image processing method as shown in Figure 3, comprising:
S301、根据已发布平台的网页结构,从已发布平台的已发布网页中,爬取已发布信息;其中,已发布信息包括文字内容和水印图片。S301. According to the web page structure of the published platform, crawl published information from the published web pages of the published platform; wherein, the published information includes text content and watermark images.
示例性地,可以根据已发布平台的网页结构,确定至少一个内容标签;根据各内容标签,依次从已发布平台的已发布网页中,爬取包括文字内容和水印图片的已发布信息。Exemplarily, at least one content tag may be determined according to the web page structure of the published platform; according to each content tag, the published information including text content and watermark images is sequentially crawled from the published web pages of the published platform.
其中,各平台的网页结构可以预先存储在电子设备本地或与电子设备所关联的存储设备中,相应的,在已发布平台进行信息爬取时,进行已发布平台的网页结果的查找获取。The web page structure of each platform may be pre-stored locally on the electronic device or in a storage device associated with the electronic device. Correspondingly, when the published platform performs information crawling, the web page results of the published platform are searched and obtained.
可以理解的是,为了避免网络拥堵或被平台认定为恶意操作从而进行抓取屏蔽,在已发布平台进行已发布信息爬取时,会预先针对该已发布平台设置抓取频率,进而依照抓取频率在已发布平台的已发布网页中进行已发布信息的抓取。其中,抓取频率可以根据已发布平台的屏蔽抓取频率进行确定。It is understandable that, in order to avoid network congestion or be identified as malicious operations by the platform for crawling and blocking, when a published platform crawls published information, the crawling frequency will be set for the published platform in advance, and then the crawling frequency will be set according to the crawling frequency. Frequency of crawling of published information in published pages of published platforms. The crawling frequency may be determined according to the blocked crawling frequency of the published platform.
S302、获取与水印图片相关联的原始图片。S302. Obtain the original picture associated with the watermarked picture.
S303、将原始图片和文字内容上传至待发布平台,以根据待发布平台的网页结构,进行信息发布。S303 , uploading the original image and text content to the platform to be released, so as to release information according to the web page structure of the platform to be released.
示例性地,将原始图片和文字内容上传至待发布平台,从而对原始图片添加与待发布平台关联的水印,得到新的水印图片;根据待发布平台的网页结构,对文字内容和新的水印图片进行排版,并发布。Exemplarily, the original picture and text content are uploaded to the platform to be released, so that a watermark associated with the platform to be released is added to the original picture to obtain a new watermarked picture; according to the webpage structure of the platform to be released, the text content and the new watermark are Images are typeset and published.
示例性地,若已发布平台的网页结构中包含有待发布平台的网页结构中未包含的冗余网页标签时,需要将所爬取的文字内容中与冗余网页标签对应的冗余内容进行剔除,从而基于剔除后的文字内容和新的水印图片,在待发布平台进行信息发布。Exemplarily, if the web page structure of the published platform contains redundant web page tags that are not included in the web page structure of the platform to be published, the redundant content corresponding to the redundant web page tags in the crawled text content needs to be eliminated. , so that information is released on the platform to be released based on the deleted text content and the new watermark image.
可选的,可以将原始图片和文字内容上传至待发布平台;对原始图片添加与待发布平台关联的水印,得到新的水印图片;将已发布平台的网页结构和待发布平台的网页结构进行比对;根据比对结果,确定网页结构中的冗余网页标签;将所爬取的文字内容中与冗余网页标签对应的冗余内容进行剔除;根据待发布平台的网页结构中各网页标签的属性信息,对提出后的文字内容和新的水印图片进行排版,并发布。Optionally, the original picture and text content can be uploaded to the platform to be released; a watermark associated with the platform to be released is added to the original picture to obtain a new watermarked picture; the webpage structure of the published platform and the webpage structure of the platform to be released are compared. Comparison; according to the comparison result, determine the redundant webpage tags in the webpage structure; remove the redundant content corresponding to the redundant webpage tags in the crawled text content; according to the webpage tags in the webpage structure of the platform to be published Attribute information of the proposed text content and new watermark images are typeset and published.
为了减少数据上传量,可选的,还可以预先根据已发布平台的网页结构和待发布平台的网页结构,确定冗余网页标签;将所爬取的文字内容中与冗余网页标签对应的冗余内容进行剔除;对原始图片添加与待发布平台关联的水印,得到新的水印图片;将新的水印图片和剔除后的文字内容上传至待发布平台;根据待发布平台的网页结构中各网页标签的属性信息,对所上传的信息进行排版,并发布。In order to reduce the amount of data uploaded, optionally, redundant webpage tags can be determined in advance according to the webpage structure of the published platform and the webpage structure of the platform to be released; Remove the remaining content; add a watermark associated with the platform to be published to the original image to obtain a new watermarked image; upload the new watermarked image and the removed text content to the platform to be published; The attribute information of the tag, typeset the uploaded information, and publish it.
需要说明的是,本申请对新的水印图片的生成操作和文字内容的处理操作两者的先后顺序不做任何限定。It should be noted that the present application does not make any limitation on the sequence of the generation operation of the new watermark image and the processing operation of the text content.
示例性地,若待发布平台的网页结构中包含已发布平台的网页结构中不存在的额外网页标签,则根据所述待发布平台的网页结构,进行信息发布,可以是:根据文字内容,生成与额外网页标签对应的描述内容;根据待发布平台的网页结构,将原始图片、文字内容和描述内容在待发布平台进行信息发布。Exemplarily, if the web page structure of the platform to be published includes additional web page tags that do not exist in the web page structure of the published platform, then according to the web page structure of the platform to be published, information is published, which may be: generating according to the text content. The description content corresponding to the extra webpage label; according to the webpage structure of the platform to be released, the original image, text content and description content are released on the platform to be released.
示例性地,额外网页标签可以是摘要、关键字和副标题等信息中的至少一个;相应的,可以通过人工智能技术中的自然语义处理技术,对文字内容进行语义分析,生成与额外网页标签对应的描述内容。Exemplarily, the extra webpage tag can be at least one of information such as abstract, keyword, and subtitle; correspondingly, the text content can be semantically analyzed through the natural semantic processing technology in the artificial intelligence technology, and the corresponding extra webpage tag can be generated. description content.
可以理解的是,通过基于文字内容进行额外网页标签的描述内容的生成,从而完善了待发布平台所发布信息所包含的内容,为在待发布平台进行信息发布提供了数据支撑。It can be understood that, by generating the description content of the additional webpage label based on the text content, the content included in the information published on the platform to be published is improved, and data support is provided for information publishing on the platform to be published.
本申请实施例在已发布平台进行已发布信息爬取时,通过根据已发布平台的网页结构从已发布平台的已发布网页中,爬取已发布信息;相应的,在待发布平台进行信息发布时,将原始图片和文字内容上传至待发布平台,已根据待发布平台的网页结构,进行信息发布。采用上述技术方案,能够使得在平台进行信息发布时,能够使所发布信息适配该平台的排版要求,从而实现了已发布信息在不同排版要求的平台之间的迁移。In the embodiment of the present application, when the published platform crawls the published information, the published information is crawled from the published web pages of the published platform according to the web page structure of the published platform; correspondingly, the information is published on the platform to be published. When the original image and text content are uploaded to the platform to be released, information has been released according to the web page structure of the platform to be released. By adopting the above technical solution, when the platform publishes information, the published information can be adapted to the typesetting requirements of the platform, thereby realizing the migration of published information between platforms with different typesetting requirements.
在上述各技术方案的基础上,在根据已发布平台的网页结构,从已发布平台的已发布网页中,爬取已发布信息时,还可以根据已发布平台的已发布网页,确定已发布网页是否关联有并行网页;若已发布网页关联有并行网页,则根据已发布平台的网页结构,从已发布网页中爬取发布信息,并从已发布网页的并行网页中,爬取关联发布信息,用于进行关联发布信息在其他待发布平台的发布。On the basis of the above technical solutions, when crawling the published information from the published web pages of the published platform according to the web page structure of the published platform, it is also possible to determine the published web page according to the published web pages of the published platform Whether there is a parallel web page associated with it; if the published web page is associated with a parallel web page, the published information will be crawled from the published web page according to the web page structure of the published platform, and the associated published information will be crawled from the parallel web page of the published web page. It is used to publish related publishing information on other platforms to be published.
可以理解的是,通过在已发布平台的一个已发布网页中进行已发布信息爬取的同时,从已发布网页关联的并行网页中进行关联发布信息的爬取,从而在对并行网页中的关联发布信息在其他待发布平台的发布时,无需再进行关联发布信息的网页收集和关联发布信息的爬取,直接进行关联发布信息的使用即可,提高了后续进行关联发布信息,在其他待发布平台进行信息发布时的发布效率。It can be understood that by crawling the published information from a published web page of the published platform, the associated published information is crawled from the parallel web page associated with the published web page, so that the association in the parallel web page is crawled. When publishing information on other platforms to be published, it is no longer necessary to collect and crawl the related publishing information on the webpage, and just use the related publishing information directly, which improves the subsequent related publishing information. The publishing efficiency of the platform when publishing information.
示例性地,为了避免对已爬取的已发布信息的遗漏处理,还可以设置一消息队列,用于对同步爬取的关联发布信息进行存储;相应的,后续从该消息队列中依次进行关联发布信息的消费处理。Exemplarily, in order to avoid missing processing of the crawled published information, a message queue may also be set up to store the synchronously crawled associated published information; correspondingly, the subsequent associations are sequentially performed from the message queue Consumption processing of published information.
图4是本申请实施例提供的一种图像处理方法的流程图,该方法在上述各实施例的技术方案的基础上,提供了一种优选实施方式。FIG. 4 is a flowchart of an image processing method provided by an embodiment of the present application, and the method provides a preferred implementation based on the technical solutions of the foregoing embodiments.
如图4所示的一种图像处理方法,包括:信息爬取阶段410、信息匹配阶段420和信息发布阶段430。As shown in FIG. 4 , an image processing method includes: an
示例性地,信息爬取阶段410,包括:爬虫爬取411、数据清洗412和水印图片提取413。其中,Exemplarily, the
爬虫爬取411,用于依照设定频率从已发布平台的已发布网页中,根据网页结构爬取已发布信息;其中已发布信息包括文字内容和水印图片。The
具体的,根据已发布信息提取时的网页标签,对已发布信息中的文字内容和水印图片加以区分。Specifically, according to the webpage label when the published information is extracted, the text content and the watermark image in the published information are distinguished.
其中,水印图片为在已发布平台发布的原始图片添加与已发布平台关联的水印后形成的图片。The watermark picture is a picture formed by adding a watermark associated with the published platform to the original picture published by the published platform.
数据清洗412,用于根据已发布网页的网页结构,对所提取的文字内容和水印图片进行整理,形成初始文档,存储至内容库中。The data cleaning 412 is used to organize the extracted text content and watermark pictures according to the web page structure of the published web page to form an initial document and store it in the content library.
水印图片提取413,用于对水印图片进行特征提取,得到水印图片特征。The
示例性地,信息匹配阶段420,包括:信息过滤421和原始图片确定422。其中,Exemplarily, the
信息过滤421,用于对原始图片关联的非图片内容进行过滤,并将过滤后的各原始图片添加至本地特征库中,并关联存储各原始图片的原始图片特征。
具体的,若原始图片存储在Word文档中,则可以对文档中的文字内容进行过滤。Specifically, if the original picture is stored in the Word document, the text content in the document can be filtered.
原始图片确定422,用于基于特征相似度,根据水印图片特征在图搜特征库中查询与水印图片关联的原始图片特征;并在本地特征库中,查找与该原始图片特征对应的原始图片作为水印图片关联的原始图片。The
其中,图搜特征库中存储有根据连载属性和信息发布者分批构建的原始图片特征子库;根据爬取的已发布信息中的信息发布者和所爬取的已发布信息的连载属性,确定图搜特征库中的原始图片特征子库为待查询特征库;确定待查询特征库中各原始图片特征与水印图片特征的特征相似度;选取特征相似度大于设定相似度阈值的其中一个原始图片特征作为与水印图片关联的原始图片特征。Among them, the image search feature library stores original image feature sub-bases constructed in batches according to serialization attributes and information publishers; according to the information publishers in the crawled published information and the serialization attributes of the crawled published information, Determine the original image feature sub-base in the image search feature library as the feature library to be queried; determine the feature similarity of each original image feature and the watermark image feature in the feature library to be queried; select one of the feature similarity greater than the set similarity threshold The original picture feature is used as the original picture feature associated with the watermarked picture.
示例性地,信息发布阶段430,包括:阈值调整431和信息发布432。Exemplarily, the
其中,in,
阈值调整431,用于从内容库中获取初始文档;将初始文档中的水印图片替换为水印图片所关联的原始图片,得到目标文档;将目标文档和初始文档进行比对;若目标文档中的原始图片与初始文档中的水印图片未添加水印前的图片不同时,将设定相似度阈值调大;若目标文档中不存在与初始文档中的水印图片对应的原始图片,则将设定相似度阈值调小。
信息发布432,用于将验证通过的与水印图片关联的原始图片和爬取的文字内容上传至待发布平台;进行信息发布。
待发布平台对原始图片添加与待发布平台关联的水印,生成新的水印图片;将新的水印图片和文字内容,根据待发布平台的网页结构,进行排版并发布。The platform to be published adds a watermark associated with the platform to be published to the original image to generate a new watermarked image; the new watermarked image and text content are typeset and published according to the web page structure of the platform to be published.
需要说明的是,为了便于后续进行原始图片和水印图片的查找比对,还可以在确定原始图片和水印图片之间的对应关系之后,将具备对应关系的原始图片和水印图片关联上传至云端存储服务器;后续在需要获取水印图片关联的原始图片时,从云端存储服务器中进行原始图片的查找。为了减少上传时的数据量,还可以将图片进行压缩,并将压缩后的图片上传至云端存储服务器。相应的,在进行图片获取时,将获取后的图片进行解压缩后进行使用。It should be noted that, in order to facilitate the subsequent search and comparison of the original image and the watermarked image, after determining the corresponding relationship between the original image and the watermarked image, the original image with the corresponding relationship and the watermarked image can be associated and uploaded to the cloud storage. server; when the original image associated with the watermark image needs to be obtained later, the original image is searched from the cloud storage server. In order to reduce the amount of data when uploading, you can also compress the image, and upload the compressed image to the cloud storage server. Correspondingly, when acquiring an image, the acquired image is decompressed and used.
图5是本申请实施例提供的一种图像处理装置的结构图,该图像处理装置500,包括:已发布信息获取模块501、原始图片获取模块502和信息发布模块503。其中,5 is a structural diagram of an image processing apparatus provided by an embodiment of the present application. The
已发布信息获取模块501,用于获取在已发布平台中的已发布信息;其中,已发布信息包括文字内容和水印图片;The published
原始图片获取模块502,用于获取与水印图片相关联的原始图片;The original
信息发布模块503,用于将原始图片和文字内容上传至待发布平台,进行信息发布。The
本申请实施例通过已发布信息获取模块获取在已发布平台中的已发布信息;其中,已发布信息包括文字内容和水印图片;通过原始图片获取模块获取与水印图片相关联的原始图片;通过信息发布模块将原始图片和文字内容上传至待发布平台,进行信息发布。本申请实施例通过进行已发布信息的获取,实现了对已发布信息中文字内容的复用,为后续在待发布平台的信息发布提供信息素材,减少了文字素材获取时间;通过进行水印图片的原始图片的查找代替对水印图片的去水印处理,为后续在待发布平台的信息发布提供信息素材,减少了图片素材获取时间;通过采用文字内容和水印图片关联的原始图片在待发布平台进行信息发布,从而实现了对已发布信息从已发布平台到待发布平台的自动迁移,无需发布信息的手动编辑处理,从而提高了信息发布效率。In the embodiment of the present application, the published information in the published platform is obtained through the published information acquisition module; wherein, the published information includes text content and watermark pictures; the original picture acquisition module obtains the original pictures associated with the watermark pictures; through the information The publishing module uploads the original image and text content to the platform to be published for information publishing. The embodiment of the present application realizes the multiplexing of the text content in the published information by acquiring the published information, provides information materials for the subsequent information release on the platform to be released, and reduces the acquisition time of the text materials; The search of the original image replaces the watermarking process of the watermarked image, providing information material for the subsequent information release on the platform to be released, reducing the acquisition time of the image material; by using the original image associated with the text content and the watermarked image on the platform to be released. Publishing, thereby realizing the automatic migration of the published information from the published platform to the platform to be published, without manual editing of the published information, thereby improving the information publishing efficiency.
进一步地,原始图片获取模块502,包括:Further, the original
特征提取单元,用于对水印图片进行特征提取,得到水印图片特征;The feature extraction unit is used to perform feature extraction on the watermark image to obtain the feature of the watermark image;
原始图片特征确定单元,用于根据所述水印图片特征,在原始图片特征库中确定与所述水印图片相关联的原始图片特征;an original picture feature determining unit, configured to determine the original picture feature associated with the watermark picture in the original picture feature library according to the watermark picture feature;
原始图片获取单元,用于获取所确定的原始图片特征所关联的原始图片作为所述水印图片相关联的原始图片。An original picture obtaining unit, configured to obtain the original picture associated with the determined original picture feature as the original picture associated with the watermark picture.
进一步地,原始图片特征确定单元,包括:Further, the original picture feature determination unit includes:
待查找特征库确定子单元,用于根据所述已发布信息的连载属性和/或信息发布者,从原始图片特征库中确定待查找特征库;a feature library to be searched determining subunit, configured to determine the feature library to be searched from the original image feature library according to the serialization attribute of the published information and/or the information publisher;
原始图片特征确定子单元,用于根据所述水印图片特征,在所述待查找特征库中,查找与所述水印图片相关联的原始图片特征。The original picture feature determination subunit is configured to search for the original picture feature associated with the watermark picture in the feature library to be searched according to the watermark picture feature.
进一步地,原始图片特征库中的各原始图片特征存储有再发布标识,用于区分各原始图片特征所关联原始图片的再发布情况;Further, each original picture feature in the original picture feature library is stored with a re-release identifier, which is used to distinguish the re-release situation of the original picture associated with each original picture feature;
相应的,原始图片特征确定单元,包括:Correspondingly, the original image feature determination unit includes:
原始图片特征筛选子单元,用于根据再发布标识,从原始图片特征库中筛选未再发布的原始图片特征;The original image feature screening subunit is used to filter the unrepublished original image features from the original image feature library according to the reissue identifier;
原始图片特征确定子单元,用于根据水印图片特征和各未再发布的原始图片特征,确定与水印图片相关联的原始图片特征。The original picture feature determination subunit is used to determine the original picture feature associated with the watermarked picture according to the watermarked picture feature and each unpublished original picture feature.
进一步地,已发布信息获取模块501,包括:Further, the published
已发布信息爬取单元,用于根据已发布平台的网页结构,从已发布平台的已发布网页中,爬取已发布信息;The published information crawling unit is used to crawl the published information from the published web pages of the published platform according to the web page structure of the published platform;
相应的,信息发布模块503,包括:Correspondingly, the
信息发布单元,用于将原始图片和文字内容上传至待发布平台,以根据待发布平台的网页结构,进行信息发布。The information release unit is used to upload the original pictures and text content to the platform to be released, so as to release information according to the web page structure of the platform to be released.
进一步地,已发布信息爬取单元,包括:Further, the published information crawling unit includes:
并行网页确定子单元,用于根据已发布平台的已发布网页,确定已发布网页是否关联有并行网页;The parallel webpage determination subunit is used to determine whether the published webpage is associated with a parallel webpage according to the published webpage of the published platform;
关联发布信息爬取子单元,用于若已发布网页包含关联有并行网页,则根据已发布平台的网页结构,从已发布网页中爬取已发布信息,并从已发布网页的并行网页中,爬取关联发布信息,用于进行关联发布信息在其他待发布平台的发布。The associated published information crawling sub-unit is used to crawl published information from the published web page according to the web page structure of the published platform if the published web page contains associated parallel web pages, and from the parallel web pages of the published web page, Crawl the associated release information to publish the associated release information on other platforms to be released.
进一步地,若待发布平台的网页结构中包含已发布平台的网页结构中不存在的额外网页标签,则信息发布单元,包括:Further, if the web page structure of the platform to be published includes additional web page tags that do not exist in the web page structure of the published platform, the information publishing unit includes:
描述内容生成子单元,用于根据文字内容,生成与额外网页标签对应的描述内容;The description content generation subunit is used to generate description content corresponding to the extra webpage label according to the text content;
信息发布子单元,用于根据待发布平台的网页结构,将原始图片、文字内容和描述内容在待发布平台进行信息发布。The information release subunit is used to release the original image, text content and description content on the platform to be released according to the web page structure of the platform to be released.
上述图像处理装置可执行本申请任意实施例所提供的图像处理方法,具备执行图像处理方法相应的功能模块和有益效果。The above-mentioned image processing apparatus can execute the image processing method provided by any embodiment of the present application, and has corresponding functional modules and beneficial effects for executing the image processing method.
根据本申请的实施例,本申请还提供了一种电子设备和一种可读存储介质。According to the embodiments of the present application, the present application further provides an electronic device and a readable storage medium.
如图6所示,是实现本申请实施例的图像处理方法的电子设备的框图。电子设备旨在表示各种形式的数字计算机,诸如,膝上型计算机、台式计算机、工作台、个人数字助理、服务器、刀片式服务器、大型计算机、和其它适合的计算机。电子设备还可以表示各种形式的移动装置,诸如,个人数字处理、蜂窝电话、智能电话、可穿戴设备和其它类似的计算装置。本文所示的部件、它们的连接和关系、以及它们的功能仅仅作为示例,并且不意在限制本文中描述的和/或者要求的本申请的实现。As shown in FIG. 6 , it is a block diagram of an electronic device implementing the image processing method of the embodiment of the present application. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframe computers, and other suitable computers. Electronic devices may also represent various forms of mobile devices, such as personal digital processors, cellular phones, smart phones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions are by way of example only, and are not intended to limit implementations of the application described and/or claimed herein.
如图6所示,该电子设备包括:一个或多个处理器601、存储器602,以及用于连接各部件的接口,包括高速接口和低速接口。各个部件利用不同的总线互相连接,并且可以被安装在公共主板上或者根据需要以其它方式安装。处理器可以对在电子设备内执行的指令进行处理,包括存储在存储器中或者存储器上以在外部输入/输出装置(诸如,耦合至接口的显示设备)上显示GUI的图形信息的指令。在其它实施方式中,若需要,可以将多个处理器和/或多条总线与多个存储器和多个存储器一起使用。同样,可以连接多个电子设备,各个设备提供部分必要的操作(例如,作为服务器阵列、一组刀片式服务器、或者多处理器系统)。图6中以一个处理器601为例。As shown in FIG. 6, the electronic device includes: one or
存储器602即为本申请所提供的非瞬时计算机可读存储介质。其中,存储器存储有可由至少一个处理器执行的指令,以使至少一个处理器执行本申请所提供的图像处理方法。本申请的非瞬时计算机可读存储介质存储计算机指令,该计算机指令用于使计算机执行本申请所提供的图像处理方法。The
存储器602作为一种非瞬时计算机可读存储介质,可用于存储非瞬时软件程序、非瞬时计算机可执行程序以及模块,如本申请实施例中的图像处理方法对应的程序指令/模块(例如,附图5所示的已发布信息获取模块501、原始图片获取模块502和信息发布模块503)。处理器601通过运行存储在存储器602中的非瞬时软件程序、指令以及模块,从而执行服务器的各种功能应用以及数据处理,即实现上述方法实施例中的图像处理方法。As a non-transitory computer-readable storage medium, the
存储器602可以包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需要的应用程序;存储数据区可存储实现图像处理方法的电子设备的使用所创建的数据等。此外,存储器602可以包括高速随机存取存储器,还可以包括非瞬时存储器,例如至少一个磁盘存储器件、闪存器件、或其他非瞬时固态存储器件。在一些实施例中,存储器602可选包括相对于处理器601远程设置的存储器,这些远程存储器可以通过网络连接至实现图像处理方法的电子设备。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The
实现图像处理方法的电子设备还可以包括:输入装置603和输出装置604。处理器601、存储器602、输入装置603和输出装置604可以通过总线或者其他方式连接,图6中以通过总线连接为例。The electronic device implementing the image processing method may further include: an
输入装置603可接收输入的数字或字符信息,以及产生与实现图像处理方法的电子设备的用户设置以及功能控制有关的键信号输入,例如触摸屏、小键盘、鼠标、轨迹板、触摸板、指示杆、一个或者多个鼠标按钮、轨迹球、操纵杆等输入装置。输出装置604可以包括显示设备、辅助照明装置(例如,LED)和触觉反馈装置(例如,振动电机)等。该显示设备可以包括但不限于,液晶显示器(LCD)、发光二极管(LED)显示器和等离子体显示器。在一些实施方式中,显示设备可以是触摸屏。The
此处描述的系统和技术的各种实施方式可以在数字电子电路系统、集成电路系统、专用ASIC(专用集成电路)、计算机硬件、固件、软件、和/或它们的组合中实现。这些各种实施方式可以包括:实施在一个或者多个计算机程序中,该一个或者多个计算机程序可在包括至少一个可编程处理器的可编程系统上执行和/或解释,该可编程处理器可以是专用或者通用可编程处理器,可以从存储系统、至少一个输入装置、和至少一个输出装置接收数据和指令,并且将数据和指令传输至该存储系统、该至少一个输入装置、和该至少一个输出装置。Various implementations of the systems and techniques described herein can be implemented in digital electronic circuitry, integrated circuit systems, application specific ASICs (application specific integrated circuits), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include being implemented in one or more computer programs executable and/or interpretable on a programmable system including at least one programmable processor that The processor, which may be a special purpose or general-purpose programmable processor, may receive data and instructions from a storage system, at least one input device, and at least one output device, and transmit data and instructions to the storage system, the at least one input device, and the at least one output device an output device.
这些计算程序(也称作程序、软件、软件应用、或者代码)包括可编程处理器的机器指令,并且可以利用高级过程和/或面向对象的编程语言、和/或汇编/机器语言来实施这些计算程序。如本文使用的,术语“机器可读介质”和“计算机可读介质”指的是用于将机器指令和/或数据提供给可编程处理器的任何计算机程序产品、设备、和/或装置(例如,磁盘、光盘、存储器、可编程逻辑装置(PLD)),包括,接收作为机器可读信号的机器指令的机器可读介质。术语“机器可读信号”指的是用于将机器指令和/或数据提供给可编程处理器的任何信号。These computational programs (also referred to as programs, software, software applications, or codes) include machine instructions for programmable processors, and may be implemented using high-level procedural and/or object-oriented programming languages, and/or assembly/machine languages calculation program. As used herein, the terms "machine-readable medium" and "computer-readable medium" refer to any computer program product, apparatus, and/or apparatus for providing machine instructions and/or data to a programmable processor ( For example, magnetic disks, optical disks, memories, programmable logic devices (PLDs), including machine-readable media that receive machine instructions as machine-readable signals. The term "machine-readable signal" refers to any signal used to provide machine instructions and/or data to a programmable processor.
为了提供与用户的交互,可以在计算机上实施此处描述的系统和技术,该计算机具有:用于向用户显示信息的显示装置(例如,CRT(阴极射线管)或者LCD(液晶显示器)监视器);以及键盘和指向装置(例如,鼠标或者轨迹球),用户可以通过该键盘和该指向装置来将输入提供给计算机。其它种类的装置还可以用于提供与用户的交互;例如,提供给用户的反馈可以是任何形式的传感反馈(例如,视觉反馈、听觉反馈、或者触觉反馈);并且可以用任何形式(包括声输入、语音输入或者、触觉输入)来接收来自用户的输入。To provide interaction with a user, the systems and techniques described herein may be implemented on a computer having a display device (eg, a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to the user ); and a keyboard and pointing device (eg, a mouse or trackball) through which a user can provide input to the computer. Other kinds of devices can also be used to provide interaction with the user; for example, the feedback provided to the user can be any form of sensory feedback (eg, visual feedback, auditory feedback, or tactile feedback); and can be in any form (including acoustic input, voice input, or tactile input) to receive input from the user.
可以将此处描述的系统和技术实施在包括后台部件的计算系统(例如,作为数据服务器)、或者包括中间件部件的计算系统(例如,应用服务器)、或者包括前端部件的计算系统(例如,具有图形用户界面或者网络浏览器的用户计算机,用户可以通过该图形用户界面或者该网络浏览器来与此处描述的系统和技术的实施方式交互)、或者包括这种后台部件、中间件部件、或者前端部件的任何组合的计算系统中。可以通过任何形式或者介质的数字数据通信(例如,通信网络)来将系统的部件相互连接。通信网络的示例包括:局域网(LAN)、广域网(WAN)和互联网。The systems and techniques described herein may be implemented on a computing system that includes back-end components (eg, as a data server), or a computing system that includes middleware components (eg, an application server), or a computing system that includes front-end components (eg, a user's computer having a graphical user interface or web browser through which a user may interact with implementations of the systems and techniques described herein), or including such backend components, middleware components, Or any combination of front-end components in a computing system. The components of the system may be interconnected by any form or medium of digital data communication (eg, a communication network). Examples of communication networks include: Local Area Networks (LANs), Wide Area Networks (WANs), and the Internet.
计算机系统可以包括客户端和服务器。客户端和服务器一般远离彼此并且通常通过通信网络进行交互。通过在相应的计算机上运行并且彼此具有客户端-服务器关系的计算机程序来产生客户端和服务器的关系。A computer system can include clients and servers. Clients and servers are generally remote from each other and usually interact through a communication network. The relationship of client and server arises by computer programs running on the respective computers and having a client-server relationship to each other.
根据本申请实施例的技术方案,通过进行已发布信息的获取,实现了对已发布信息中文字内容的复用,为后续在待发布平台的信息发布提供信息素材,减少了文字素材获取时间;通过进行水印图片的原始图片的查找代替对水印图片的去水印处理,为后续在待发布平台的信息发布提供信息素材,减少了图片素材获取时间;通过采用文字内容和水印图片关联的原始图片在待发布平台进行信息发布,从而实现了对已发布信息从已发布平台到待发布平台的自动迁移,无需发布信息的手动编辑处理,从而提高了信息发布效率。According to the technical solutions of the embodiments of the present application, by acquiring the published information, multiplexing of the text content in the published information is realized, information materials are provided for subsequent information release on the platform to be released, and the acquisition time of the text materials is reduced; By searching for the original image of the watermarked image instead of removing the watermarking process of the watermarked image, information materials are provided for subsequent information release on the platform to be released, and the acquisition time of image materials is reduced; by using the text content and the original image associated with the watermarked image in Information is released on the platform to be released, thereby realizing automatic migration of released information from the platform to be released to the platform to be released, without manual editing of the released information, thereby improving the efficiency of information release.
应该理解,可以使用上面所示的各种形式的流程,重新排序、增加或删除步骤。例如,本申请中记载的各步骤可以并行地执行也可以顺序地执行也可以不同的次序执行,只要能够实现本申请公开的技术方案所期望的结果,本文在此不进行限制。It should be understood that steps may be reordered, added or deleted using the various forms of flow shown above. For example, the steps described in the present application can be executed in parallel, sequentially or in different orders, as long as the desired results of the technical solutions disclosed in the present application can be achieved, no limitation is imposed herein.
上述具体实施方式,并不构成对本申请保护范围的限制。本领域技术人员应该明白的是,根据设计要求和其他因素,可以进行各种修改、组合、子组合和替代。任何在本申请的精神和原则之内所作的修改、等同替换和改进等,均应包含在本申请保护范围之内。The above-mentioned specific embodiments do not constitute a limitation on the protection scope of the present application. It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions may occur depending on design requirements and other factors. Any modifications, equivalent replacements and improvements made within the spirit and principles of this application shall be included within the protection scope of this application.
Claims (16)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010611345.6A CN111815500B (en) | 2020-06-29 | 2020-06-29 | Image processing method, device, equipment and storage medium |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010611345.6A CN111815500B (en) | 2020-06-29 | 2020-06-29 | Image processing method, device, equipment and storage medium |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN111815500A true CN111815500A (en) | 2020-10-23 |
| CN111815500B CN111815500B (en) | 2023-08-11 |
Family
ID=72855614
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202010611345.6A Active CN111815500B (en) | 2020-06-29 | 2020-06-29 | Image processing method, device, equipment and storage medium |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN111815500B (en) |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20160085994A1 (en) * | 2014-09-24 | 2016-03-24 | Kevin Pereira | Endorsement of unmodified photographs using watermarks |
| CN107464206A (en) * | 2017-07-26 | 2017-12-12 | 维沃移动通信有限公司 | A kind of watermark adding method and mobile terminal |
| CN107784023A (en) * | 2016-08-31 | 2018-03-09 | 北京国双科技有限公司 | The generation method and device of a kind of graph text information |
| CN108259318A (en) * | 2017-12-22 | 2018-07-06 | 北京智慧星光信息技术有限公司 | A method and device for distributing information |
| CN108648132A (en) * | 2018-04-16 | 2018-10-12 | 深圳市联软科技股份有限公司 | According to the method for graphic hotsopt watermark, system, terminal and medium |
-
2020
- 2020-06-29 CN CN202010611345.6A patent/CN111815500B/en active Active
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20160085994A1 (en) * | 2014-09-24 | 2016-03-24 | Kevin Pereira | Endorsement of unmodified photographs using watermarks |
| CN107784023A (en) * | 2016-08-31 | 2018-03-09 | 北京国双科技有限公司 | The generation method and device of a kind of graph text information |
| CN107464206A (en) * | 2017-07-26 | 2017-12-12 | 维沃移动通信有限公司 | A kind of watermark adding method and mobile terminal |
| CN108259318A (en) * | 2017-12-22 | 2018-07-06 | 北京智慧星光信息技术有限公司 | A method and device for distributing information |
| CN108648132A (en) * | 2018-04-16 | 2018-10-12 | 深圳市联软科技股份有限公司 | According to the method for graphic hotsopt watermark, system, terminal and medium |
Non-Patent Citations (2)
| Title |
|---|
| RUSTAM LATYPOV; EVGENI STOLOV: "Ternary Picture as Watermark for Audio Files", 《2020 3RD INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS & INFORMATION SECURITY (ICCAIS)》 * |
| 王君;: "基于asp.net的图像管理系统的水印技术研究", 电子技术与软件工程, no. 07 * |
Also Published As
| Publication number | Publication date |
|---|---|
| CN111815500B (en) | 2023-08-11 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP6922538B2 (en) | API learning | |
| US9008433B2 (en) | Object tag metadata and image search | |
| CN112136123B (en) | Characterizing files for similarity searching | |
| KR20210038467A (en) | Method and apparatus for generating an event theme, device and storage medium | |
| JP2010073114A (en) | Image information search device, image information search method, computer program for the same | |
| JP6932360B2 (en) | Object search method, device and server | |
| CN104933056A (en) | Uniform resource locator (URL) de-duplication method and device | |
| CN102768683B (en) | A kind of searching method of pictorial information and searcher | |
| CN111708938A (en) | Method, apparatus, electronic device and storage medium for information processing | |
| CN110851136A (en) | Data acquisition method, device, electronic device and storage medium | |
| CN103678704A (en) | Picture recognition method, system, equipment and device based on picture information | |
| CN111984825A (en) | Method and apparatus for searching video | |
| CN112115313A (en) | Regular expression generation, data extraction method, apparatus, equipment and medium | |
| CN110990057A (en) | Extraction method, device, equipment and medium of small program sub-chain information | |
| CN112559913B (en) | Data processing method, device, computing equipment and readable storage medium | |
| JP2022091686A (en) | Data annotation methods, devices, electronic devices and storage media | |
| AU2022228142B2 (en) | Intelligent change summarization for designers | |
| CN111752960A (en) | Data processing method and device | |
| CN111984883A (en) | Tag mining method, device, device and storage medium | |
| CN110532404A (en) | One provenance multimedia determines method, apparatus, equipment and storage medium | |
| CN111666771A (en) | Semantic label extraction device, electronic equipment and readable storage medium of document | |
| CN112052347B (en) | Image storage method, device and electronic device | |
| CN112148279B (en) | Log information processing method, device, electronic equipment and storage medium | |
| CN117251471B (en) | Data query methods, devices, electronic equipment and storage media | |
| CN111815500A (en) | Image processing method, device, equipment and storage medium |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |