[go: up one dir, main page]

CN114238772B - Content-adaptive and aware web map intelligent recommendation system - Google Patents

Content-adaptive and aware web map intelligent recommendation system Download PDF

Info

Publication number
CN114238772B
CN114238772B CN202111603724.1A CN202111603724A CN114238772B CN 114238772 B CN114238772 B CN 114238772B CN 202111603724 A CN202111603724 A CN 202111603724A CN 114238772 B CN114238772 B CN 114238772B
Authority
CN
China
Prior art keywords
map
service
image
color
recommendation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202111603724.1A
Other languages
Chinese (zh)
Other versions
CN114238772A (en
Inventor
韩效遥
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN202111603724.1A priority Critical patent/CN114238772B/en
Publication of CN114238772A publication Critical patent/CN114238772A/en
Application granted granted Critical
Publication of CN114238772B publication Critical patent/CN114238772B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/906Clustering; Classification
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/907Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/909Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using geographical or spatial information, e.g. location
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Remote Sensing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Processing Or Creating Images (AREA)

Abstract

The application integrates image content-based clustering and image recommendation into network map service retrieval and recommendation, greatly improves map service search experience in the prior art, improves service discovery efficiency, provides a whole set of technical processes of data collection processing, thumbnail generation, image clustering and platform construction, realizes network map service automatic sampling based on information quantity and approximation degree, color and texture-driven image clustering map recommendation, designs a map service retrieval recommendation platform for realizing a Web end on the basis, has accurate grasp and identification of map content, can conduct network map intelligent recommendation based on content adaptive perception, develops a map service retrieval platform based on keyword search and image content recommendation, has high matching degree of retrieval map content and actual requirements, verifies rationality and effectiveness of the method, and has higher practical value.

Description

Network map intelligent recommendation system with self-adaptive sensing content
Technical Field
The application relates to an intelligent network map recommendation system, in particular to an intelligent network map recommendation system with self-adaptive sensing of content, and belongs to the technical field of intelligent network map content recommendation.
Background
The network map service is a novel map organization mode in the mobile Internet era, integrates the characteristics of multi-scale coupling, high-level interoperability and the like on the basis of basic map visualization, enters a big data era, and becomes the optimal knowledge expression and display service in a lower resource demand, visual visualization form and seamless cross-platform connection. In recent years, the number and quality of map services are continuously improved, and the demands of the public for map services are gradually improved, so that how to effectively manage the map services and quickly match the demands of users becomes a research and development and application hotspot.
Currently, a large amount of geographic information data, geographic information networks, geographic information portals, geographic information documents and the like are scattered on heterogeneous network nodes, and form massive network geographic information resources. Meanwhile, the units and individuals can carry out auxiliary decision making, application management, auxiliary scientific research and the like by acquiring various geographic information resources. Among them, the network map service represented by OGC WMS is widely applied in various industries, and how to quickly search for a map service meeting the requirements is a current urgent problem to be solved.
Compared to traditional keyword text-based searches, image content-based searches have a better visual experience and more accurate search results. Hundred degrees, google, etc., have proposed a search function based on image content. If the two internet ideas are introduced into the map service, a map service recommendation function based on image content is developed, a keyword-based search function can be effectively assisted, the map service search experience in the prior art is improved, and the service discovery efficiency is improved.
The content-aware map recommendation system is an important and effective supplement to the text-based service search in the prior art, and the image content-based recommendation function makes up for a single geographic information service discovery mode based on keyword search in the prior art, and has important effects and great application value in enhancing user experience and improving map service discovery efficiency.
Image retrieval is initially mainly to retrieve images through text information, such as shooting dates of images, authors, categories and sizes of images, and the like, and such image retrieval modes are text-based image retrieval technologies. Over ten years of development, image retrieval has evolved to content-based retrieval, primarily retrieving images by their color, texture, shape, etc. features, with the objective of avoiding the use of textual descriptions, but instead retrieving similar images by user-provided query images or user-specified image features based on visual similarity.
Text-based image retrieval is now well established, but has its drawbacks and disadvantages: firstly, labeling texts and keywords of images is completed manually; secondly, the manual annotation has the problems of different observation angles, subjective idea difference and the like, and annotation ambiguity can be generated, namely, the manual annotation is difficult to fully express an image containing a plurality of targets and even covering a plurality of emotion colors. The text-based image retrieval still has the defects, and as market demands and applications are improved deeply, image retrieval applications based on image contents such as hundred-degree image recognition, google image recognition and the like are sequentially presented, and although accuracy is still to be improved, image features can be automatically extracted, so that ambiguity is avoided.
In summary, the network map recommendation system in the prior art has obvious defects, and main defects and design difficulties of the network map recommendation system include:
Firstly, in recent years, the number and quality of map services are continuously improved, the demands of masses on the map services are gradually improved, and the prior art cannot effectively manage the map services and quickly match the demands of users; at present, a large amount of geographic information data, geographic information networks, geographic information portals, geographic information documents and the like are scattered on heterogeneous network nodes, so that massive network geographic information resources are formed, units and individuals can carry out auxiliary decision making, application management, auxiliary scientific research and the like by acquiring various geographic information resources, map services meeting requirements cannot be quickly searched and found in the prior art, and a more intelligent map searching recommendation method based on content is lacked;
Secondly, compared with the traditional keyword text-based search, the image content-based search has better visual experience and more accurate search results, the internet thought is not introduced into the map service in the prior art, and the map service recommendation function based on the image content is lacked, so that the map service search experience in the prior art is poor, and the service discovery efficiency is low; because the content-aware map recommendation is an effective supplement to the text-based service search in the prior art, the image content-based recommendation can make up for the geographic information service discovery mode of the single keyword search in the prior art, has an important effect and great application value in enhancing the user experience and improving the map service discovery efficiency, but the network map intelligent recommendation based on the image content in the prior art is difficult to realize;
Third, prior art text-based map image retrieval is well established, but has its drawbacks and shortcomings: firstly, labeling texts and keywords of images is completed manually; secondly, the manual annotation has the problems of different observation angles, subjective idea difference and the like, and annotation ambiguity can be generated, namely the manual annotation is difficult to fully express an image containing a plurality of targets and even covering some emotion colors, text-based image retrieval still has defects, and the image retrieval application based on image content is sequentially carried out along with the improvement of market requirements and application, but cannot be applied to a map, or the problems of low accuracy, incapability of automatically extracting image features, excessive ambiguity and the like exist;
Fourthly, a plurality of problems exist in the realization of network map recommendation based on content perception, namely, a feasible network map service automatic sampling method based on information quantity and approximation degree is lacked, a map service image sampling method under the constraint of the information quantity and approximation degree is lacked, a plurality of sampling images cannot be generated for each image layer, and an image library cannot be established by using the sampling images; secondly, a feasible image clustering map recommending method is lacked, a method for classifying the image layers based on a clustering method of an image group is lacked according to the characteristic design of a plurality of sampling images of each image layer, and a similar image layer recommending method considering the characteristics of the image group of the image layers is lacked; and thirdly, a feasible network map service retrieval recommendation platform design and development method is lacked, and a high-quality and high-efficiency network map intelligent retrieval recommendation platform is lacked.
Disclosure of Invention
Aiming at the defects of the prior art, the application integrates the clustering and image recommendation based on image content into the retrieval and recommendation of the network map service, greatly improves the experience of map service search in the prior art, improves the service discovery efficiency, provides a whole set of technical processes from data collection processing, thumbnail generation, image clustering and platform construction, realizes the automatic sampling of the network map service based on information quantity and approximation, realizes the image clustering map recommendation based on color and texture, designs a map service retrieval recommendation platform of a Web end on the basis, has accurate grasp and identification of the map content, can carry out the intelligent network map recommendation based on content self-adaptive perception, develops a map service retrieval platform based on keyword search and image content recommendation, has high matching degree between the retrieval map content and actual requirements, verifies the rationality and the effectiveness of the method, and has higher practical value.
In order to achieve the technical effects, the technical scheme adopted by the application is as follows:
the intelligent network map recommending system with self-adaptive content sensing integrates image content based clustering and image recommending into network map service searching and recommending, builds one set of technological process from data collecting process, thumbnail generating process, image clustering and platform, and designs Web terminal map service searching recommending platform based on the technological process, and includes mainly:
First, a network map service automatic sampling method based on information quantity and proximity, comprising: firstly, calculating and evaluating map tile information, and secondly, calculating the approximation degree of the map tiles with cross scales; positioning the rich-content area in the map by adopting an information quantity combined quadtree structure, wherein the information quantity is approximately expressed by the complexity of the image; the approximation degree is calculated through a color quantization unified histogram and is used for screening scaling factors with mutation, and the scaling factors are respectively used for realizing orderly traversal of map services from a plane position dimension and a scaling factor dimension, so that effective data is provided for clustering recommendation based on image content;
second, color and texture driven image cluster map recommendation, comprising: the map service recommendation method comprises the steps of firstly, collaborative representation of map features integrating color and texture features, secondly, clustering of map content images, thirdly, a clustering method based on an image group, and thirdly, recommending map services based on the image group; comprehensively representing image features by adopting color features and texture features, wherein the color features are represented by adopting a color quantization unified histogram, the texture features are represented by adopting a gray level co-occurrence matrix, and the similarity of the inclusion relation and the colors of image groups is considered, and the image group recommendation method reduces the similarity difference caused by image layer positioning by giving different weight values to the different approximation groups, so as to provide a multi-feature representation method for comprehensively representing the colors and the textures, and realize the map service recommendation based on the image groups;
thirdly, building an intelligent retrieval recommendation platform of the network map, which comprises the following steps: firstly, a map intelligent retrieval recommendation platform architecture, secondly, network map data acquisition and processing, thirdly, a map keyword retrieval library, fourthly, map image preprocessing and image retrieval, and thirdly, network map intelligent retrieval recommendation platform building access; a large number of map services are crawled by compiling a web crawler, metadata of the map services are analyzed and stored in a database, a keyword retrieval library is built based on Apache Solr services, keyword retrieval services are issued, thumbnail images are generated for each service and stored, color and texture characteristics of each thumbnail image are calculated and stored in a characteristic library, and clustering results are stored in a category library. When recommending similar images, firstly finding the category of the similar images and then finding the most similar images under the category to recommend the most similar images to the user.
The intelligent network map recommendation system based on the content adaptive perception further comprises a map service preview adaptive sampling method based on the content: in the tile map mode, map service concise expression is defined as screening a plurality of tiles which can represent map service contents from a large number of map tiles, and the map service concise expression is further decomposed into two sub-modules: tile selection in the planar position dimension and tile selection in the scaling dimension;
(1) Tile selection for the planar position dimension: quantizing the representativeness of the map tiles based on a specific scaling, and approximating the information quantity by adopting the complexity factors of the map tiles and taking the complexity factors as quantization indexes of the representativeness of the tiles;
(2) Tile selection for the scaling dimension: the key is to find the tile with mutation, quantitatively evaluate the similarity between the two scale tiles by adopting the image approximation degree, and judge the scaling with obvious change based on the similarity;
based on two quantization indexes of complexity and approximation, two rounds of screening are used for respectively realizing full coverage of plane position dimensions and automatic selection of tiles with multiple scales of scaling dimensions:
The 1 st round of screening, from the minimum scaling, each stage only selects the most complex tile, and uses the geographical range of the tile as constraint to process the next scaling, the first round of screening filters out most tiles;
round 2 screening: comparing the approximation degree of tiles between the scaling ratios, further removing redundancy, defining a sampling set as R, and specifically comprising the following steps:
Step one: starting from level L 0, adding the only map tile in level L 0 to R and using the map tile as a seed tile;
Step two: requesting 4 map tiles of the same geographic range as the seed tiles at a higher level, and taking the tile Ringo with the most abundant information content as a new seed tile;
Step three: splicing 4 tiles to obtain Rpvda, if the approximation degree of Rpvda and the seed tiles is larger than a critical value, adding Rpvda to R, otherwise, not adding;
repeating the second step and the third step until all the scaling ratios are traversed, and obtaining a final sampling graph set;
a plurality of variables are flexibly adjusted: the number of tiles of adjacent scaling does not necessarily meet the 4-time relation, the number of tiles spliced is affected, the first r (r is more than or equal to 1) tiles with the most abundant information are used as seed tiles, and the probability of obtaining representative tiles is increased.
The content self-adaptive perception network map intelligent recommendation system further comprises the following steps of calculating and evaluating map tile information: the method comprises the steps of 1 st-round screening based on map information quantity, selecting map tiles with the most abundant information quantity under each scaling, using a geographical space range covered by the seed tiles as constraint heuristic type traversing next-stage scaling, and ensuring that the seed tiles of other stages except L 0 and L 1 are not guaranteed to have the maximum information quantity, but obtaining higher hit rate with lower calculation cost, wherein the number of map tiles which participate in comparison at each stage after rough selection is not more than 4;
In the aspect of calculation and evaluation of map information, the data quantity of the map tiles after compression is used as a representation index of the information quantity of the map tiles, and the data quantity of the map tiles under the PNG format is used as an information quantity index.
The network map intelligent recommendation system with the content self-adaptive perception further calculates the approximation degree of the map tiles with the cross scale: comparing the approximations of map tiles between two adjacent scaling scales, and judging mutation or slow change according to the approximations, wherein only the map tiles with obvious changes are reserved; analyzing the similarity degree of map tiles from the color features, and quantitatively expressing the color features of the map by adopting a color quantitative unified histogram, and measuring the similarity of the map tiles based on cosine distance weights on the basis;
the application combines map expression characteristics to carry out two-point improvement on the histogram generation method:
Improvement 1, for color quantization: firstly, carrying out color quantization on map tiles, firstly splicing two tiles to be compared together, uniformly managing all colors of the two tiles, and obtaining a remarkable and unique color set through quantization;
improvement 2, calculating the frequency of occurrence of color: firstly, dividing a map into a plurality of continuous color blocks, and calculating the occurrence frequency of corresponding colors by using the circumferences of the color blocks, wherein the dividing standard of the continuous color blocks is that pixels with the same color are adjacent in a four-neighborhood, and a molar neighborhood boundary tracking method is adopted in circumference calculation;
Under RGB color space, color quantization adopts a minimum variance method, the number of reserved colors is specified, the occurrence frequency of each color in two tiles is counted based on quantized color sets, a uniform histogram of color quantization is obtained, and finally the similarity degree between the histograms is calculated by adopting cosine distance; the color weight value with the largest duty ratio in the histogram is set to 0, and the influence on the approximation degree is restrained.
The intelligent recommendation system for the network map with the self-adaptive sensing content further comprises the step of recommending the image cluster map driven by colors and textures: classifying map images according to image features, clustering the map layers to narrow a similar map layer searching range before recommending the map layers, respectively using a color quantization unified histogram and a gray level co-occurrence matrix to represent the color and texture features of the images, and clustering and recommending image groups aiming at network map service features on the basis of generating 3 sampling images for each map layer;
Map feature collaborative representation of integrated color and texture features: for a color map, the color features and texture features of the color map are respectively marked as G s and G r, for two given images D and E, the color feature difference between the two images is marked as A s, the texture feature difference is marked as A r,k1 and k 2, and the weight values of the distances between the two features are respectively marked, so that the comprehensive distance between the D and E is:
A(D,E)=k1*As(GSD,GSE)+k2*Ar(GrD,GrE)
This is used as a similarity comparison method between the map image and the search.
The network map intelligent recommendation system with the content self-adaptive perception comprises the following steps of: all objects in the map data set are used as cluster centers at the beginning of clustering, each cluster center is updated through a message transmission mode in each iteration, one cluster center is selected as the center of the object which fails in competition after each iteration, and a plurality of high-quality clustering results are finally obtained through continuous iteration and selection;
Clustering is carried out according to the approximation degree among M map data points, the approximation degree forms an M multiplied by M similarity matrix, all map data points are used as potential clustering centers Z, the larger the value C (w, w) on the diagonal line of the matrix C is, the larger the probability that the point w becomes the clustering center is, the value of the point w is the reference degree q, a transmission mechanism mainly comprises an attraction factor and a attribution factor, the attraction factor and the attribution factor matrix are updated, attenuation coefficients k are introduced for avoiding oscillation of results, and when information is updated, each information is set to be k times of the last iteration updated value plus 1-k times of the updated value of the information, wherein the attenuation coefficient k is a real number between 0 and 1. I.e. the iteration value of t (j, w) and d (j, w) at the (r+1) th time:
dr+1(j,w)←(1-k)dr+1(j,w)+kdr(j,w)
the specific steps of the map content image clustering are as follows:
step 1: initializing parameters and reading map content data;
Step 2: calculating an approximation matrix, solving a median value of the approximation matrix and giving the median value to the parameter illuminance q;
Step 3: updating the attraction factor matrix and updating the attribution factor matrix;
Step 4: judging whether the maximum iteration times are reached or the termination condition is met, if so, jumping to the step 2, otherwise, performing the step 5;
step 5: and obtaining a final cluster center Z, and dividing each data object into corresponding class clusters.
The network map intelligent recommendation system with the content self-adaptive perception is further based on a clustering method of image groups: generating 3 preview images for each image layer by adopting a content-based map service preview self-adaptive sampling method, wherein the 3 preview images are expressions of information changes of different scaling proportions of the image layer, and the 3 preview images of each image layer need to be synthesized no matter clustering or image layer recommendation;
Taking 3 preview images of each layer as an image group, clustering the image groups of each layer, firstly analyzing the characteristics of the 3 preview images in the image group, wherein the 3 images come from different zoom levels of the same layer, the layers of different zoom levels express the same content of the layers under different scales, and the color expression of the same object under different scales cannot be changed;
When the image groups are clustered according to the color characteristics, the color characteristics of the image groups are synthesized, the color quantization unified histograms of the 3 images A, B, C are set as R A,RB,RC respectively, the color characteristics of the image groups are R=R A+RB+RC due to the similarity of the color characteristics of the images in the image groups, the characteristics of the images in the image groups are calculated and considered, and the color characteristics of the image groups are comprehensively considered.
The content self-adaptive perception network map intelligent recommendation system further comprises map service recommendation based on image groups: recommending 3 preview images of the comprehensive consideration layers based on the similar layers of the image content, and providing a similar service recommending method based on the map service sampling images on the basis;
The method comprises the steps of setting a layer D and a layer E, wherein each layer is provided with 3 preview images, namely D 1、D2、D3 and E 1、E2、E3 respectively, and calculating the similarity between a sampling image group of the layer D and a sampling image group of the layer E; considering that the similarity of two images is low due to the fact that the 3 images of each image layer are reduced due to the fact that similar services are positioned to different levels or different areas, the following image group similarity calculation method is proposed:
The first step: calculating the image approximations of the image groups D 1、D2、D3 and E 1、E2、E3 respectively, wherein the approximations are C ji respectively by adopting a color-plus-texture feature calculation method, j= {1,2,3} is the subscript of D j, i= {1,2,3} is the subscript of E i, and in the 9 groups of comparison, assuming that the maximum value in C ji is C 11, D 1 is the most similar to E 1, and marking C 1=C11, and excluding all groups containing D i and E i, namely C 22、C23、C32、C33;
And a second step of: similarly, find the maximum value in four groups C 22、C23、C32、C33, set as C 32, note C 2=C32, exclude image D 3 and image E 2, at this time, leave a group of images D 2 and E 3, note C 3=C23;
And a third step of: after the first step and the second step, the most similar three-group image combination mode is found between the two image groups, in order to reduce the influence of different zoom levels and different positions on the similarity, different weight values k are set for each group of similarity, and the similarity between the final image groups is as follows:
C=k1×C1+k2×C2+k3×C3
wherein 1 is greater than or equal to k 1>k2>k3 is greater than or equal to 0.
The network map intelligent recommendation system with the content self-adaptive perception is further provided with a network map intelligent retrieval recommendation platform: the method comprises the steps of collecting geographic information service data, processing the data, designing and constructing a keyword search library and image feature search, constructing a Web-end network map service keyword search and similar service recommendation platform, crawling a large number of map services through a Web crawler, analyzing metadata of the map services, storing the map services into a database, constructing a keyword search library, sampling representative images for each image layer based on Apache Solr, calculating color and texture features of each image to construct a feature library, storing clustering results into a category library through clustering, and when similar images need to be recommended, finding out the category of the system, finding out the layer to which the most similar images belong under the category, and recommending the layer to a user.
Map intelligent searching recommendation platform architecture
The method comprises the steps of constructing network map services scattered on all nodes of the Internet by web crawlers, analyzing the collected structured network map service data, storing the data in a database to serve as metadata for user display, and in addition, generating thumbnails for each layer by combining an automatic generation method of network map service thumbnails through a layer acquisition interface provided by the service, wherein the thumbnails serve as service content representatives and are data bases for image feature extraction and image clustering, and the map intelligent retrieval recommendation platform framework is divided into 3 layers:
layer 1: network map service acquisition and preprocessing layer: capturing scattered map services on the Internet by compiling a web map service theme crawler, then downloading and analyzing metadata according to service versions and specifications of the map services, establishing a database table structure according to each parameter information of the map services based on a structured organization form, and storing processed data into a MySQL database;
Layer 2: image preprocessing and keyword retrieval library construction: the layer is divided into two parts, namely keyword retrieval and similar service recommendation aiming at the platform; establishing a direction index library from keywords to services aiming at keyword retrieval; generating a thumbnail for each service for similar service recommendations; the color and texture characteristics of the reserved service are calculated in advance in the background and stored in the characteristic library, so that convenience is provided for subsequent clustering;
Layer 3: platform query and recommendation logic implementation: background logic for realizing a query recommendation platform mainly comprises: and inquiring and displaying the map service according to the keywords, and recommending similar services according to the thumbnail content.
(II) network map data acquisition and processing
OGC WMSs and ARCGIS MAPSERVICE on a collection network are collected by adopting a developed geographic information service theme crawler, and ideas and logic of the geographic information service theme crawler comprise:
(1) Acquiring a seed point URL by using keywords through a search engine;
(2) Reading a seed point URL webpage by compiling a crawler program, and extracting other URL links obtained from the webpage;
(3) Analyzing the webpage content, and if the webpage contains the WMS, the web MAP SERVICE and the ARCGIS REST SERVICES keywords, reserving the webpage as a URL to be processed;
(4) If the URL link contains a service=wms & request= getcappbilities keyword and the page contains WMS-capbilities, determining that the URL link is OGC WMS service; if the link contains/arcgis/rest/services and the page content contains ARCGIS REST SERVICES directors, the service is determined to be ARCGIS SERVICE, and the service link is stored in the database.
The fetched service is stored in a database in a linked mode, metadata of the service is obtained, the request and metadata analysis are carried out on the webpage according to the service release specification type and the service version of the service, and only important information in the service release specification type and the service version of the service release specification type is selected to be stored and put in storage.
The network map intelligent recommendation system with the content self-adaptive perception comprises the following steps: establishing an inverted index library of the keywords to the services, establishing a mapping of the keywords to the service layers, and selecting two fields of description information and title of each service layer as fields matched with the keywords;
(1) The information of each layer of WMS and ARCGIS MAP SERVICE service is regarded as a Document, title and Description in the information of each layer are selected as fields of words to be segmented, the information to be indexed is handed to Tokenizer component, and the Token information of each Document is obtained through punctuation removal and pause removal operations;
(2) Then, the words are delivered to a language analysis tool, and words Term is obtained after the words obtained before are processed, including capitalization and lowercase, and words are processed into root forms;
(3) The lements Term are transmitted to an indexer Indexer, the indexed lements are established into word dictionaries and ordered according to alphabetical order, and the same lements Term are combined into a document inverted chain table Posting List;
The application aims at OGC WMS and ARCGIS MAPSERVICE map services, and finds proper service description information in metadata of the map services by carrying out keyword index on service layer information;
Establishing a search service, namely establishing a word-based inverted index table, separating English by using spaces among words and punctuation marks, and displaying information of search results, wherein the information comprises: layer header, layer description information, layer name, sample graph connection, corresponding fields are mapRiRle, mapAesc, mapMame, images respectively.
Keyword retrieval service publication: keyword retrieval is issued as a single service, the keyword retrieval service is decoupled from the whole system, tomcat is selected as an issuing server of the Solr service, after successful service issuing, information in Solr is checked through a visualized web page, a retrieval result is queried through page configuration query conditions, the service is integrated into a platform, and SolrJ tools are adopted to call the Solr search service.
Compared with the prior art, the application has the innovation points and advantages that:
firstly, clustering and image recommendation based on image content are integrated into retrieval and recommendation of network map service, so that the experience of map service search in the prior art is greatly improved, service discovery efficiency is improved, a whole set of technical processes are provided from data collection processing, thumbnail generation, image clustering and platform construction, a map service retrieval recommendation platform of a Web end is designed on the basis, the map content is accurately grasped and identified by the provided map service recommendation method based on the image content, network map intelligent recommendation can be carried out based on content adaptive perception, a map service retrieval platform based on keyword search and image content recommendation is developed, a recommendation layer has higher similarity with a comparison layer, the matching degree of the retrieval map content and actual requirements is high, the rationality and the effectiveness of the method are verified, and the method has good practical value;
Secondly, the application introduces the searching based on the image content into the network map service, develops the map service recommending function based on the image, assists the searching based on keywords, improves the searching experience of the network map service in the prior art, and provides an automatic network map service sampling method based on information quantity and approximation degree, wherein the information quantity is combined with a quadtree structure to position the region with rich content in the map, and the information quantity is approximately expressed by the complexity of the image; the approximation degree is calculated through a color quantization unified histogram and is used for screening the scaling ratio with mutation, the two scaling ratios respectively realize orderly traversal of map service from a plane position dimension and a scaling ratio dimension, effective data is provided for clustering recommendation based on image content, experiments prove that the image complexity is very effective in positioning the key position of a map, the map tile with the most abundant content under the same scaling ratio can be accurately selected, good effect is obtained by screening the key dimension by using the approximation degree, and the map service discovery efficiency is improved;
thirdly, the application provides a color feature and texture feature comprehensive representation image feature, the color feature adopts a color quantization unified histogram representation, the texture feature selects gray level co-occurrence matrix representation, and the similarity of the inclusion relation and color of an image group is considered, the image group recommendation method reduces the similarity difference caused by image layer positioning by giving different weight values to different groups of approximations, the color and texture-driven image cluster map recommendation method considers the similarity of the inclusion relation and color of the image group, and the image group recommendation method reduces the similarity difference caused by image layer positioning by giving different weight values to different groups of approximations, and experiments show that the application comprehensively considers the color and texture feature of the image group, considers the characteristics of the image group and has better recommendation results for different ground object type image layers;
Fourth, the application builds the network map intelligent retrieval recommendation platform, crawls a large number of map services through compiling a web crawler, analyzes metadata thereof and stores the map services in a database, builds a keyword retrieval library based on Apache Solr service and issues keyword retrieval service, generates and stores a thumbnail for each service, calculates the color and texture characteristics of each thumbnail and stores the thumbnail in a feature library, stores a clustering result in a category library, and finds the category of the thumbnail and then finds out the most similar image to recommend to a user under the category when recommending similar images, thereby being an important and effective supplement to the text-based service search in the prior art, making up a single geographic information service discovery mode based on the keyword search in the prior art, and having important effects and huge application value in enhancing user experience and improving the map service discovery efficiency.
Drawings
Fig. 1 is a schematic diagram of data amounts of map symbol output PNG images of different complexity.
Fig. 2 is an example diagram of locating a map complex area based on PNG data volume.
Fig. 3 is a conceptual partitioning diagram of multiple levels of a multi-source vector road network real-time fusion model.
FIG. 4 is a representation of similar color features in an image set of the present application.
Fig. 5 is a schematic diagram of a first step of screening of a map service recommendation based on an image group.
FIG. 6 is a schematic diagram of a second step of screening of image group based map service recommendations.
Fig. 7 is a schematic diagram of an input image set for experiment one of the present application.
Fig. 8 is a schematic diagram of a recommended image group for experiment one of the present application.
Fig. 9 is a schematic diagram of an input image set for experiment two of the present application.
FIG. 10 is a schematic diagram of a recommended image set for experiment two of the present application.
Fig. 11 is a schematic diagram of an input image set for experiment three of the present application.
FIG. 12 is a schematic view of a recommended image set for experiment three of the present application.
FIG. 13 is a diagram of a network map intelligent recommendation platform framework for content adaptive awareness.
FIG. 14 is a schematic diagram showing specific steps of the key index of the present application.
Fig. 15 is a schematic diagram of the map image preprocessing and image retrieval process steps.
Detailed description of the preferred embodiments
The technical scheme of the network map intelligent recommendation system with the content self-adaptive perception, which is provided by the application, is further described below with reference to the accompanying drawings, so that the application can be better understood and implemented by those skilled in the art.
The user experience of map service search and the efficiency of service discovery are of paramount importance, and image content-based searches have better visual experience and better image search efficiency than keyword text-based searches. The search function based on the image content is introduced by hundred degrees, google and the like, if the idea is introduced into the network map service, the map service recommendation function based on the image is developed, the search based on keywords can be effectively assisted, the network map service search experience in the prior art is improved, and the map service discovery efficiency is improved.
The application integrates image content-based clustering and image recommendation into network map service retrieval and recommendation to improve the map service searching experience in the prior art and improve service discovery efficiency, proposes a whole set of technical processes of data collection processing, thumbnail generation, image clustering and platform construction, and designs a map service retrieval recommendation platform for realizing a Web end on the basis, and mainly comprises the following steps:
First, a network map service automatic sampling method based on information quantity and proximity, comprising: calculating and evaluating map tile information, and calculating the approximation degree of the map tiles with the cross scales;
The method for generating the map service thumbnail based on the information quantity and the approximation degree adopts the information quantity and the quadtree structure to locate the rich-content area in the map, and the information quantity is approximately expressed by the complexity of the image; the approximation degree is calculated through a color quantization unified histogram and is used for screening scaling factors with mutation, and the scaling factors are used for orderly traversing the map service from a plane position dimension and a scaling factor dimension respectively, so that effective data is provided for clustering recommendation based on image content.
Second, color and texture driven image cluster map recommendation, comprising: map feature collaborative representation of comprehensive color and texture features, map content image clustering, an image group-based clustering method and image group-based map service recommendation;
Comprehensively representing image features by adopting color features and texture features, wherein the color features are represented by adopting a color quantization unified histogram, the texture features are represented by adopting a gray level co-occurrence matrix, and the similarity of the inclusion relation and the colors of image groups is considered, and the image group recommendation method reduces the similarity difference caused by image layer positioning by giving different weight values to the different approximation groups, so as to provide a multi-feature representation method for comprehensively representing the colors and the textures, and realize the map service recommendation based on the image groups;
Thirdly, building an intelligent retrieval recommendation platform of the network map, which comprises the following steps: the system comprises a map intelligent retrieval recommendation platform architecture, a network map data acquisition process, a map keyword retrieval library, map image preprocessing and image retrieval, and a network map intelligent retrieval recommendation platform architecture access;
A map service retrieval platform based on keyword search and image content recommendation: a large number of map services are crawled by compiling a web crawler, metadata of the map services are analyzed and stored in a database, a keyword retrieval library is built based on Apache Solr services, keyword retrieval services are issued, thumbnail images are generated for each service and stored, color and texture characteristics of each thumbnail image are calculated and stored in a characteristic library, and clustering results are stored in a category library. When recommending similar images, firstly finding the category of the similar images and then finding the most similar images under the category to recommend the most similar images to the user. The platform adopts a B/C structure, and can stably run through web pages after being released.
The application provides a map service recommending method based on image content, and develops a map service searching platform based on keyword searching and image content recommending, and the recommending layer has higher similarity with the comparing layer, thereby verifying the rationality and the effectiveness of the method.
1. Content-based map service preview self-adaptive sampling method
The map service in the prior art mostly exists in the form of map tiles, and the map tiles are small-size map images which are pre-rendered by a server side and are segmented according to a quadtree rule. For a network map service containing n levels of scaling, the minimum level of scaling L 0 only contains 1 tile covering the whole map range, the maximum level of scaling L n-1 contains 4n-1 map tiles, the size of the map tiles is close to the requirement of the thumbnail, compression is not needed, and flexible splicing can be performed, so that part of network map tiles are used as preview pictures, in the tile map mode, the map service concise expression is defined as that a plurality of tiles which can represent map service contents are screened out from a large number of map tiles, and the map service concise expression is further decomposed into two sub-modules: tile selection in the planar position dimension and tile selection in the scaling dimension.
(1) Tile selection for the planar position dimension: quantifying the representativeness of map tiles based on a specific scaling, for a static map, people tend to pay more attention to more complex parts such as areas with dense symbol distribution or compact mark placement, because such areas can provide more abundant information, and from this thought, the application adopts the complexity factor of the map tiles to approximately represent the information and serve as a quantification index of the representativeness of the tiles;
(2) Tile selection for the scaling dimension: the key is to find the tiles with mutation, and the map tiles with different scales have common points and differences; the common points are more, and the change among the scales is slight; the number of different points is large, and the change between the representation scales is obvious; subtle and significant correspond to subtle and abrupt changes in the map scale space, whereas abrupt changes are more noticeable to the user and should also be more incorporated into the map service thumbnail. The application adopts the image approximation degree to quantitatively evaluate the similarity degree between the two scale tiles, and judges the scaling ratio which is obviously changed based on the similarity degree.
Based on two quantization indexes of complexity and approximation, two rounds of screening are used for respectively realizing full coverage of plane position dimensions and automatic selection of tiles with multiple scales of scaling dimensions:
The 1 st round of screening, from the minimum scaling, each stage only selects the most complex tile, and uses the geographical range of the tile as constraint to process the next scaling, the first round of screening filters out most tiles;
round 2 screening: comparing the approximation degree of tiles between the scaling ratios, further removing redundancy, defining a sampling set as R, and specifically comprising the following steps:
Step one: starting from level L 0, adding the only map tile in level L 0 to R and using the map tile as a seed tile;
Step two: requesting 4 map tiles of the same geographic range as the seed tiles at a higher level, and taking the tile Ringo with the most abundant information content as a new seed tile;
Step three: splicing 4 tiles to obtain Rpvda, if the approximation degree of Rpvda and the seed tiles is larger than a critical value, adding Rpvda to R, otherwise, not adding;
repeating the second step and the third step until all the scaling ratios are traversed, and obtaining a final sampling graph set;
A plurality of variables are flexibly adjusted: the number of tiles of adjacent scaling does not necessarily meet the 4-time relation, the number of tiles which affect the splicing is increased, the first r (r is more than or equal to 1) tiles with the most abundant information are used as seed tiles, and the probability of obtaining representative tiles is increased.
Map tile information amount calculation evaluation
From coarse to detailed, the number of map tiles per level of scaling grows exponentially, which is computationally intensive if all tiles are compared. According to the application, the 1 st round of screening is carried out based on the information quantity of the map, the map tiles with the most abundant information quantity are selected as seed tiles under each scaling, the geographical space range covered by the seed tiles is used as constraint heuristic type to traverse the next scaling, the maximum information quantity of the seed tiles of other levels except the L 0 and the L 1 levels cannot be ensured, but the higher hit rate can be obtained with lower calculation cost, the number of map tiles which participate in comparison of each level after rough selection is not more than 4, and the calculation quantity is greatly reduced;
In terms of map information amount calculation evaluation, the prior art is mostly based on statistical analysis of vector targets and is not suitable for information amount calculation of map tiles (raster images). Compared with the natural scene image, the map contains regular characters and map symbols, the number of colors is small, the regional boundary is sharp, the homogeneity of color blocks is high, and a high compression ratio can be obtained by adopting a lossless compression method. The application adopts the data quantity of the compressed map tiles as the characterization index of the information quantity. The map tiles are in PNG format, are very suitable for processing homogeneous areas in images, have higher compression ratio and can better meet the compression requirement of maps. The data volume of the PNG image is output by the map symbols with different complexity, and the data volumes of the square symbols, the round dot symbols, the annular symbols and the color symbols are sequentially increased, so that the PNG is very sensitive to image changes generated by visual variables such as shapes, textures, colors and the like, and the complexity of the map content can be represented. Fig. 2 shows an example of locating a complex area of a map based on PNG data size, each image is formed by splicing four tiles, the broken line box indicates the tile with the largest data size and requests the next scaling in this range, and it can be obtained from the three-time zooming process that the data size of the PNG image can better reflect the complexity factor of the map content. The application therefore uses the data volume of map tiles in PNG format as an information volume indicator.
(II) calculating the approximation degree of the map tiles of the cross-scale
The filtering is performed internally to each scale depending on the amount of information, and does not take into account the relationship between the different scales. After screening according to the information quantity, each scaling ratio obtains a map tile, and a map tile set is formed. The collection may contain several very similar tiles, with expression redundancy when used as a preview. The application compares the approximation degree of the map tiles between two adjacent scaling scales, judges the abrupt change or gradual change according to the approximation degree, only keeps the map tiles with obvious change, and although the maps with different scaling scales have difference in detail degree and symbol style, the overall ground object layout and distribution mode are similar, the calculation difficulty of the approximation degree of the tiles is reduced to a certain extent, the color features play a decisive role in the human map perception and cognition process as important visual variables of map symbols, so the similarity degree of the map tiles is analyzed from the color features.
The application combines map expression characteristics to carry out two-point improvement on the histogram generation method:
Improvement 1, for color quantization: in order to consider readability and list, the types of symbols contained in the map are limited, the corresponding color quantity is also extremely lower than that of a natural scene image, however, due to the application of technologies such as anti-aliasing and the like, the edge pixels of the map symbols introduce color gradual change, the total color quantity is indirectly improved, as shown in fig. 3, map tiles in A contain 8238 colors in total, therefore, firstly, the map tiles are subjected to color quantization, and fig. 3, B, C and D are quantization results of the A with different color quantities, the visible tiles can be accurately expressed by only 16 colors, if two tiles are respectively quantized, the obtained color sets may not be consistent, the similarity of the colors per se needs to be additionally considered in the subsequent calculation of the histogram distance, and the calculation complexity is high. Firstly, splicing two tiles to be compared together, uniformly managing all colors of the two tiles, and quantifying to obtain a remarkable and unique color set;
Improvement 2, calculating the frequency of occurrence of color: the conventional method is to count the number of pixels belonging to a certain color, and the method cannot reasonably reflect the map content composition, because map symbols, especially linear symbols and punctuation symbols, usually express important information in a small number of pixels, and a histogram based on the number of pixels can greatly weaken the positions of the two types of symbols. Aiming at the problem, the map is firstly divided into a plurality of continuous color blocks, the circumferences of the color blocks are used for calculating the occurrence frequency of corresponding colors, the division standard of the continuous color blocks is that pixels with the same color are adjacent in a four-neighborhood, and a molar neighborhood boundary tracking method is adopted in circumference calculation.
In the RGB color space, the color quantization adopts a minimum variance method, and the reserved color quantity is specified. Based on the quantized color set, the occurrence frequency of each color in two tiles is counted respectively to obtain a color quantization unified histogram, and finally the similarity between the histograms is calculated by adopting cosine distance. According to the histogram of the two tiles when the color quantity is 16, the approximation degree is 0.172. The color quantity influences the value of the approximation degree, and after the experiment is tried many times, the approximation degree gradually becomes stable when the color quantity is more than the colors actually appearing in the map. Thus, subsequent experiments set a relatively high color quantity. In addition, the thematic map often contains large-area blank areas or ground color filling, and the areas with excessive weight values weaken the positions of other map symbols, so that the approximation index is interfered. The application adjusts during approximation calculation, sets the color weight value with the largest proportion in the histogram as 0, and suppresses the influence on approximation.
(III) map service preview adaptive sampling experiment analysis
The experiment selects three map services to verify the method, including a public map service and two thematic map services (medical facility distribution map and river level map). Firstly, three map tiles of the service under the minimum scaling (namely L 0 level) are obtained as initial seed tiles, and from the representative map tiles of the automatically obtained 10 level scaling, the public map service is finally positioned in a building dense area with rich content, and the content change of the map service is better reflected in the whole amplifying process. The two thematic map services have less map tile content at maximum scale, but from the previous stages of scale, the positioning of the complex area is accurate.
The 32 colors are used for color quantization, then approximation calculation of tiles among scales is performed, and in general, two map services show different rules in the amplifying process: the mutation of the public map service occurs in the later stage, and the content of the map is obviously changed due to the introduction of elements such as blocks, buildings, greenbelts and the like; mutations in the thematic map service occur earlier because of mitigation of symbol collisions, selection of element targets, conversion of expression models, replacement of symbol patterns, and the like. Repeated research and judgment are carried out through multiple experiments, the application sets 0.1 as a critical value for thumbnail screening, and the similarity is larger than 0.1, namely, mutation is considered to occur and is brought into a thumbnail set; otherwise, not as a thumbnail.
2. Color and texture driven image cluster map recommendation
The map image is classified according to image characteristics, the image layers are clustered to reduce the searching range of similar image layers before recommending the image layers, the searching efficiency is improved, the color quantization unified histogram and the gray level co-occurrence matrix are respectively used for representing the color and texture characteristics of the image, and the image group clustering recommending method aiming at the network map service characteristics is provided on the basis of generating 3 sampling images for each image layer.
Map feature collaborative representation of integrated color and texture features
For a color map, the color features and texture features of the color map are respectively marked as G s and G r, for two given images D and E, the color feature difference between the two images is marked as A s, the texture feature difference is marked as A r,k1 and k 2, and the weight values of the distances between the two features are respectively marked, so that the comprehensive distance between the D and E is:
A(D,E)=k1*As(GSD,GSE)+k2*Ar(GrD,GrE)
This is used as a similarity comparison method between the map image and the search.
(II) clustering of map content images
All objects in the map data set are used as cluster centers at the beginning of clustering, each cluster center is updated through a message transmission mode in each iteration, one cluster center is selected as a center for the object which fails in competition after each iteration, a plurality of high-quality clustering results are finally obtained through continuous iteration and selection, the influence of the initial random cluster center on the final result in the partitioning method is avoided, and the result is very stable.
Clustering is carried out according to the approximation degree among M map data points, the approximation degree forms an M multiplied by M similarity matrix, all map data points are used as potential clustering centers Z, the larger the value C (w, w) on the diagonal line of the matrix C is, the larger the probability that the point w becomes the clustering center is, the value of the point w is the reference degree q, a transmission mechanism mainly comprises an attraction factor and a attribution factor, the attraction factor and the attribution factor matrix are updated, attenuation coefficients k are introduced for avoiding oscillation of results, and when information is updated, each information is set to be k times of the last iteration updated value plus 1-k times of the updated value of the information, wherein the attenuation coefficient k is a real number between 0 and 1. I.e. the iteration value of t (j, w) and d (j, w) at the (r+1) th time:
dr+1(j,w)←(1-k)dr+1(j,w)+kdr(j,w)
the specific steps of the map content image clustering are as follows:
step 1: initializing parameters and reading map content data;
Step 2: calculating an approximation matrix, solving a median value of the approximation matrix and giving the median value to the parameter illuminance q;
Step 3: updating the attraction factor matrix and updating the attribution factor matrix;
Step 4: judging whether the maximum iteration times are reached or the termination condition is met, if so, jumping to the step 2, otherwise, performing the step 5;
step 5: and obtaining a final cluster center Z, and dividing each data object into corresponding class clusters.
(III) clustering method based on image group
According to the method, 2355 WMS layers are selected, a content-based map service preview self-adaptive sampling method is adopted, 3 preview images are generated for each layer, the 3 preview images are representations of information changes of different scaling proportions of the layer, and the 3 preview images of each layer are required to be synthesized no matter clustering or layer recommendation.
The 3 preview images of each layer are used as an image group, the image group of each layer is clustered, the characteristics of the 3 preview images in the image group are analyzed firstly, the 3 images come from different zoom levels of the same layer, the layers with different zoom levels express the same content of the layer under different scales, the color expression of the same object under different scales cannot be changed, and most of the 3 images in the image group have similar color characteristics, as shown in fig. 4.
When the image groups are clustered according to the color characteristics, the color characteristics of the image groups are synthesized, the color quantization unified histograms of the 3 images A, B, C are set as R A,RB,RC respectively, the color characteristics of the image groups are R=R A+RB+RC due to the similarity of the color characteristics of the images in the image groups, the characteristics of the images in the image groups are calculated and considered, and the color characteristics of the image groups are comprehensively considered.
(IV) image group-based map service recommendation
Based on the 3 preview images of the image layers of the similar image layer recommendation comprehensive consideration image content, the application provides a similar service recommendation method based on a map service sampling image.
There are 3 preview images for each of the layers D and E, D 1、D2、D3 and E 1、E2、E3, respectively, to calculate the similarity between the sample image sets of the layers D and E.
Considering that the 3 images of each layer come from different zoom levels, even though layer D is similar to layer E (layers of each zoom level are very similar), during the image sampling screening process, there is a high probability that a single level of local information differences will be present for the 3 images of each layer to be sampled in different regions or at different zoom levels, which will result in a low similarity between the set of images.
Considering that the similarity of two images is low due to the fact that the 3 images of each image layer are reduced due to the fact that similar services are positioned to different levels or different areas, the following image group similarity calculation method is proposed:
The first step: calculating the image approximations of the image groups D 1、D2、D3 and E 1、E2、E3 respectively, wherein the approximations are C ji respectively by adopting a color-plus-texture feature calculation method, j= {1,2,3} is the subscript of D j, i= {1,2,3} is the subscript of E i, and assuming that the maximum value in C ji is C 11, in the 9-group comparison, D 1 is most similar to E 1 and marked with C 1=C11, and at this time, all groups containing D i and E i are excluded, namely C 22、C23、C32、C33 is left;
And a second step of: similarly, the maximum value found in the four groups C 22、C23、C32、C33 is set as C 32 and denoted as C 2=C32, as shown in fig. 6, excluding the images D 3 and E 2, and leaving the group of images D 2 and E 3 and denoted as C 3=C23;
And a third step of: after the first step and the second step, the most similar three-group image combination mode is found between the two image groups, in order to reduce the influence of different zoom levels and different positions on the similarity, different weight values k are set for each group of similarity, and the similarity between the final image groups is as follows:
C=k1×C1+k2×C2+k3×C3
wherein 1 is greater than or equal to k 1>k2>k3 is greater than or equal to 0.
(V) image clustering map recommendation experiment analysis
Based on the clustering result of the map content images, the similarity recommendation experiment is carried out by adopting the image group similarity measurement method provided by the application, and the experimental result is as follows:
experiment one: the recommendation result of the image group fig. 7 is shown in fig. 8, and the input data is map base map data and the recommendation results are map layers according to experiments, so that the image group similarity calculation method provided by the application has a good recommendation result for map planar layers. The recommended result gives consideration to both the color features and the texture features of the image group.
Experiment II: the recommended result of fig. 9 is shown in fig. 10, the input image of experiment two is a complex image, the image has complex texture and complex color, the recommended result is an image with similar texture, and the colors are similar and are all arranged in front. The image group similarity measurement method provided by the application has a good effect on complex images.
Experiment III: the recommended results of the image group in fig. 11 are shown in fig. 12, the three inputs of the experiment are layers of simple linear symbols, the recommended results are linear layers, the linear image groups with similar colors have higher similarity, and the linear image groups with large color differences are arranged in the follow-up. The image group similarity measurement method provided by the application is better recommended for the image group with linear textures.
The three groups of experiments show that the image group recommendation method aiming at the network map service characteristics comprehensively considers the color and texture characteristics of the image group and has good recommendation results for different ground object types.
3. Intelligent retrieval recommendation platform for constructing network map
The method comprises the steps of collecting geographic information service data, processing the data, designing and constructing a keyword search library and image feature search, constructing a Web-end network map service keyword search and similar service recommendation platform, crawling a large number of map services through a Web crawler, analyzing metadata of the map services, storing the map services into a database, constructing a keyword search library, sampling representative images for each image layer based on Apache Solr, calculating color and texture features of each image to construct a feature library, storing clustering results into a category library through clustering, and when similar images need to be recommended, finding out the category of the system, finding out the layer to which the most similar images belong under the category, and recommending the layer to a user.
Map intelligent searching recommendation platform architecture
The network map service scattered on each node of the Internet is collected by constructing a web crawler, the collected structured network map service data is analyzed and stored in a database to serve as metadata for user display, in addition, the thumbnail is generated for each layer by combining an automatic generation method of the network map service thumbnail through a layer acquisition interface provided by the service, the thumbnail serves as a service content representative, and the thumbnail is a data base for subsequent image feature extraction and image clustering, and the quick retrieval of the network map service and the acquisition of related map service are facilitated by constructing a retrieval recommendation platform, so that the information acquisition capability is improved. Based on this, the application proposes a map intelligent retrieval recommendation platform framework as shown in fig. 13.
The map intelligent retrieval recommendation platform framework is divided into 3 layers:
layer 1: network map service acquisition and preprocessing layer: capturing scattered map services on the Internet by compiling a web map service theme crawler, then downloading and analyzing metadata according to service versions and specifications of the map services, establishing a database table structure according to each parameter information of the map services based on a structured organization form, and storing processed data into a MySQL database;
Layer 2: image preprocessing and keyword retrieval library construction: the layer is divided into two parts, namely keyword retrieval and similar service recommendation aiming at the platform; establishing a direction index library from keywords to services aiming at keyword retrieval; generating a thumbnail for each service for similar service recommendations; the application has reserved the color and texture characteristics of the service, and because the characteristic extraction and calculation are complex, the characteristics are calculated in advance in the background and stored in the characteristic library, thereby providing convenience for the subsequent clustering;
Layer 3: platform query and recommendation logic implementation: background logic for realizing a query recommendation platform mainly comprises: and inquiring and displaying the map service according to the keywords, and recommending similar services according to the thumbnail content.
(II) network map data acquisition and processing
OGC WMSs and ARCGIS MAPSERVICE on the network are collected by adopting a developed geographic information service theme crawler, and ideas and logics of the geographic information service theme crawler are as follows:
(1) Acquiring a seed point URL by using keywords through a search engine;
(2) Reading a seed point URL webpage by compiling a crawler program, and extracting other URL links obtained from the webpage;
(3) Analyzing the webpage content, and if the webpage contains the WMS, the web MAP SERVICE and the ARCGIS REST SERVICES keywords, reserving the webpage as a URL to be processed;
(4) If the URL link contains a service=wms & request= getcappbilities keyword and the page contains WMS-capbilities, determining that the URL link is OGC WMS service; if the link contains/arcgis/rest/services and the page content contains ARCGIS REST SERVICES directors, the service is determined to be ARCGIS SERVICE, and the service link is stored in the database.
The fetched service is stored in a database in a linked form, and requests and metadata analysis are carried out on the web page according to the service release specification type and the service version of the service in order to obtain the metadata of the service.
(III) map keyword search library
If the query of the key words to the service is simply matched from the database by using SQL sentences, the speed is very low, and the jam is easy to occur under the condition of high concurrency, so that the efficiency is low. Therefore, to solve these problems, an inverted index library of keywords to services is created, a mapping of keywords to service layers is created, and two fields of description information and title of each service layer are selected as fields for keyword matching. The specific steps for creating the key index are shown in fig. 14.
(1) The information of each layer of WMS and ARCGIS MAP SERVICE service is regarded as a Document, title and Description in the information of each layer are selected as fields of words to be segmented, the information to be indexed is handed to Tokenizer component, and the Token information of each Document is obtained through punctuation removal and pause removal operations;
(2) Then, the words are delivered to a language analysis tool, and words Term is obtained after the words obtained before are processed, including capitalization and lowercase, and words are processed into root forms;
(3) The lements Term are transmitted to an indexer Indexer, the indexed lements are established into word dictionaries and ordered according to alphabetical order, and the same lements Term are combined into a document inverted chain table Posting List;
The application aims at two map services, namely OGC WMS and ARCGIS MAPSERVICE, and in order to index keywords of service layer information, proper service description information is found in metadata of the service layer information.
Establishing a search service, namely establishing a word-based inverted index table, separating English by using spaces among words and punctuation marks, and displaying information of search results, wherein the information comprises: layer header, layer description information, layer name, sample graph connection, corresponding fields are mapRiRle, mapAesc, mapMame, images respectively.
Keyword retrieval service publication: in order to relieve the high concurrency server pressure and improve the system stability, the keyword search is issued as a single service, so that the keyword search service is decoupled from the whole system, the system stability can be improved, the later system maintenance is facilitated, tomcat is selected as the issuing server of the Solr service, the information in the Solr is checked through a visual web page after the service is successfully issued, the search result is queried through page configuration query conditions, the service is integrated into a platform, a SolrJ tool is adopted, and the Solr search service is called.
(IV) map image preprocessing and image retrieval
A sampling image is generated for the OGC WMS, a respective image acquisition interface is referred, then a content-based map service previewing self-adaptive sampling method is adopted to generate a thumbnail for each service image layer, an important index for evaluating the performance of a recommendation system is the retrieval speed, the image resource data amount is large, the images in an image library are more, the sequential retrieval efficiency is low, if similar images are gathered into one type, the type of the image group of the image layer is firstly searched when the similar image layer is recommended, and then the retrieval is carried out only in the type, so that the image layer searching range can be greatly reduced.
After the image feature database is extracted, clustering operation is carried out on the color and texture features of the images, an image classification table is established, the images in each class are similar image sets, in the process of carrying out keyword query, the thumbnail of the image layer selected by a user is automatically used as an input image, the class to which the input image belongs is directly searched in a clustering result library, and then searching is carried out in the class, so that the image which is most similar to the input image or meets the specified condition is found and recommended to the user.
The map image preprocessing and image retrieval process includes the flow of fig. 15.
(V) building access of network map intelligent retrieval recommendation platform
The platform is based on a B/S architecture, adopts a Java technical stack as an implementation technology of the B/S architecture, integrates network map service management, keyword-based service retrieval and image content-based service recommendation, provides a technical scheme for integrating all services, builds a display page by using a Bootstrap technology at the front end of the platform, develops a background service by using a SPRINGMVC +spring+MyBatis integration frame of Java at the background, publishes retrieval services by using Solr and Tomcat servers as a background server, provides a keyword query interface for the background to call, stores map service metadata information by using a MySQL database, and caches service metadata information by using Redis as a cache database.
(1) Database and Redis cache
Each map service contains a plurality of map layer information, and the number of map layer information varies from a few map layers to thousands map layer information. These layer information are fixed. If the service information is displayed every time, the layer under the same service is retrieved from MySQL for display, the efficiency is necessarily low. Generating its layer information cache for each service is a solution of the present application.
The specific buffer generation time and method are as follows: when the service is accessed for the first time, searching description information of the service and layer information contained in the description information from MySQL and storing the information into Redis in json format; when the second access is performed, the service information corresponding to the service is directly obtained from Redis, so that the page response speed is improved.
(2) Map service publishing and accessing
The website background server adopts Tomcat, and the main functions are demonstrated as follows: and (5) a website entry page is searched by inputting keywords.

Claims (10)

1.内容自适应感知的网络地图智能推荐系统,其特征在于,把基于图像内容的聚类与图像推荐融入网络地图服务的检索和推荐,从数据收集处理、缩略图生成、图像聚类、平台搭建一整套技术流程,再此基础上设计实现Web端的地图服务检索推荐平台,包括:1. Content-adaptive web map intelligent recommendation system, characterized by integrating image content-based clustering and image recommendation into the retrieval and recommendation of web map services, from data collection and processing, thumbnail generation, image clustering, platform construction, a complete set of technical processes, and then designing and implementing a web-based map service retrieval and recommendation platform on this basis, including: 第一,基于信息量和近似度的网络地图服务自动采样方法,包括:一是地图瓦片信息量计算评估,二是计算跨尺度地图瓦片近似度;采用信息量联合四叉树结构定位地图中内容丰富的区域,信息量由图像复杂度近似表达;近似度通过色彩量化统一直方图计算,用来筛选发生突变的缩放比例,二者分别从平面位置维和缩放比例维实现地图服务的有序遍历,为基于图像内容的聚类推荐提供有效数据;First, an automatic sampling method for web map services based on information volume and similarity includes: first, calculating and evaluating the information volume of map tiles, and second, calculating the similarity of cross-scale map tiles; using information volume combined with a quadtree structure to locate content-rich areas in the map, and the information volume is approximately expressed by image complexity; the similarity is calculated by a color quantized unified histogram to screen the scale ratio that has undergone mutations. The two methods realize the orderly traversal of map services from the plane position dimension and the scale ratio dimension respectively, providing effective data for clustering recommendations based on image content; 第二,色彩与纹理驱动的图像聚类地图推荐,包括:一是综合色彩与纹理特征的地图特征协同表示,二是地图内容图像聚类,三是基于图像组的聚类方法,四是基于图像组的地图服务推荐;采用色彩特征和纹理特征综合表示图像特征,色彩特征采用色彩量化统一直方图表示,纹理特征选择灰度共生矩阵表示,并考虑图像组的包含关系和色彩的相似性,图像组推荐方法通过对近似度不同组赋予不同权重值减少图层定位引起的近似度差异,提出综合色彩与纹理的多特征表示方法,实现基于图像组的地图服务推荐;Second, color and texture driven image clustering map recommendation, including: first, map feature collaborative representation of integrated color and texture features, second, map content image clustering, third, clustering method based on image group, fourth, map service recommendation based on image group; color features and texture features are used to comprehensively represent image features. Color features are represented by color quantization unified histogram, and texture features are represented by grayscale co-occurrence matrix. The inclusion relationship and color similarity of image groups are considered. The image group recommendation method reduces the proximity difference caused by layer positioning by assigning different weight values to different proximity groups. A multi-feature representation method of integrated color and texture is proposed to realize map service recommendation based on image groups. 第三,搭建网络地图智能检索推荐平台,包括:一是地图智能检索推荐平台架构,二是是网络地图数据采集处理,三是地图关键字检索库,四是地图图像预处理与图像检索,五是网络地图智能检索推荐平台搭建访问;通过编译网络爬虫爬取大量地图服务,解析其元数据存储到数据库,基于Apache Solr服务建立关键字检索库并发布关键字检索服务,为每个服务生成缩略图并存储,计算每幅缩略图的色彩与纹理特征存储到特征库,把聚类结果存储到类别库中,推荐相似图像时,首先找到其所在类别然后在此类别下找出最相似图像推荐给用户。Third, build an intelligent retrieval and recommendation platform for online maps, including: 1. The map intelligent retrieval and recommendation platform architecture, 2. The network map data collection and processing, 3. The map keyword retrieval library, 4. The map image preprocessing and image retrieval, 5. The network map intelligent retrieval and recommendation platform construction and access; crawl a large number of map services by compiling web crawlers, parse their metadata and store them in the database, establish a keyword retrieval library based on the Apache Solr service and publish the keyword retrieval service, generate thumbnails for each service and store them, calculate the color and texture features of each thumbnail and store them in the feature library, store the clustering results in the category library, and when recommending similar images, first find the category to which it belongs and then find the most similar image under this category and recommend it to the user. 2.根据权利要求1所述内容自适应感知的网络地图智能推荐系统,其特征在于,基于内容的地图服务预览自适应采样方法:采用部分网络地图瓦片作为预览图,在瓦片地图模式下,地图服务简明表达定义为从大量地图瓦片中筛选出若干可代表地图服务内容的瓦片,进一步分解为两个子模块:平面位置维的瓦片选择与缩放比例维的瓦片选择;2. According to the content-adaptive-aware network map intelligent recommendation system of claim 1, it is characterized by a content-based map service preview adaptive sampling method: using part of the network map tiles as preview images, in the tile map mode, the map service is concisely expressed as screening out a number of tiles that can represent the map service content from a large number of map tiles, and further decomposed into two sub-modules: tile selection in the plane position dimension and tile selection in the scaling dimension; (1)平面位置维的瓦片选择:基于特定缩放比例,量化地图瓦片的代表性,采用地图瓦片的复杂度因子近似表示其信息量并作为瓦片代表性的量化指标;(1) Tile selection in the plane position dimension: Based on a specific scaling ratio, the representativeness of the map tile is quantified. The complexity factor of the map tile is used to approximate its information content and is used as a quantitative indicator of tile representativeness. (2)缩放比例维的的瓦片选择:关键是找到发生突变的瓦片,采用图像近似度来定量评价两个比例尺瓦片之间的相似程度,并以此为依据来判定发生显著变化的缩放比例;(2) Tile selection in the scaling dimension: The key is to find the tile where mutation occurs. Image approximation is used to quantitatively evaluate the similarity between two scale tiles, and based on this, the scaling ratio where significant changes occur is determined. 以复杂度和近似度两个量化指标为基础,用两轮筛选分别实现平面位置维全覆盖和缩放比例维多尺度的瓦片自动选择:Based on the two quantitative indicators of complexity and approximation, two rounds of screening are used to achieve full coverage of the plane position dimension and automatic selection of multi-scale tiles in the scaling dimension: 第1轮筛选,从最小的缩放比例开始,每一级只选择最复杂的瓦片,并以该瓦片的地理范围作为约束处理下一个缩放比例,第一轮筛选滤除绝大多数瓦片;In the first round of screening, starting from the smallest scale, only the most complex tiles are selected at each level, and the geographical range of the tile is used as a constraint to process the next scale. The first round of screening filters out most tiles; 第2轮筛选:比较缩放比例间瓦片的近似度,进一步去冗余,将采样集合定义为R,具体步骤如下:Second round of screening: Compare the similarity of tiles between scaling ratios, further remove redundancy, and define the sampling set as R. The specific steps are as follows: 步骤一:从L0级开始,将L0级中唯一的地图瓦片加入R,并作为种子瓦片;Step 1: Starting from level L 0 , add the only map tile in level L 0 to R and use it as the seed tile; 步骤二:请求更高一级的、与种子瓦片相同地理范围的4张地图瓦片,将其中信息量最丰富的瓦片Ringo作为新的种子瓦片;Step 2: Request four higher-level map tiles with the same geographical range as the seed tile, and use the tile Ringo with the richest information as the new seed tile; 步骤三:拼合4张瓦片得到Rpvda,若Rpvda和种子瓦片的近似度大于临界值,将Rpvda加入R,反之则不加入;Step 3: Combine the four tiles to get Rpvda. If the similarity between Rpvda and the seed tile is greater than the critical value, add Rpvda to R, otherwise do not add it. 重复步骤二和步骤三直至遍历所有缩放比例,得到最终的采样图集合;Repeat steps 2 and 3 until all scaling ratios are traversed to obtain the final set of sampling images; 多个变量灵活调节:相邻缩放比例的瓦片数量并不一定满足4倍关系,影响拼合的瓦片个数,将信息量最丰富的前r(r≥1)个瓦片作为种子瓦片,增大得到代表性瓦片的概率。Flexible adjustment of multiple variables: The number of tiles with adjacent scaling ratios does not necessarily satisfy the 4-fold relationship, which affects the number of tiles to be stitched together. The first r (r≥1) tiles with the most information are used as seed tiles to increase the probability of obtaining representative tiles. 3.根据权利要求2所述内容自适应感知的网络地图智能推荐系统,其特征在于,地图瓦片信息量计算评估:基于地图信息量进行第1轮筛选,在每个缩放比例下选择信息量最丰富的地图瓦片作为种子瓦片,并以种子瓦片覆盖的地理空间范围为约束启发式遍历下一级缩放比例,除L0和L1级之外,并不保证其它级别的种子瓦片具有最大信息量,但以较小计算代价得到较高命中率,粗选后每一级参与比较的地图瓦片均不超过4个;3. The content-adaptive-aware network map intelligent recommendation system according to claim 2 is characterized in that the map tile information volume calculation and evaluation: the first round of screening is performed based on the map information volume, and the map tile with the richest information volume is selected as the seed tile at each zoom ratio, and the geographic space range covered by the seed tile is used as a constraint to heuristically traverse the next zoom ratio. Except for L0 and L1 levels, it is not guaranteed that the seed tiles of other levels have the maximum information volume, but a higher hit rate is obtained at a lower calculation cost. After rough selection, the number of map tiles involved in the comparison at each level does not exceed 4; 在地图信息量计算评估方面,采用地图瓦片在压缩后的数据量作为其信息量的表征指标,采用地图瓦片在PNG格式下的数据量作为信息量指标。In terms of the calculation and evaluation of map information volume, the data volume of the compressed map tiles is used as the representation indicator of its information volume, and the data volume of the map tiles in PNG format is used as the information volume indicator. 4.根据权利要求1所述内容自适应感知的网络地图智能推荐系统,其特征在于,计算跨尺度地图瓦片近似度:比对相邻两个缩放比例间地图瓦片的近似度,并依此判断突变或缓变,只保留发生显著变化的地图瓦片;从色彩特征入手分析地图瓦片的相似程度,采用色彩量化统一直方图来量化表达地图的色彩特征,在此基础上基于余弦距离权重度量地图瓦片的近似度;4. The content-adaptive-aware network map intelligent recommendation system according to claim 1 is characterized in that the similarity of cross-scale map tiles is calculated: the similarity of map tiles between two adjacent zoom ratios is compared, and sudden or slow changes are judged accordingly, and only map tiles with significant changes are retained; the similarity of map tiles is analyzed starting from color features, and a color quantization unified histogram is used to quantify the color features of the map, and on this basis, the similarity of map tiles is measured based on the cosine distance weight; 结合地图表达特征对直方图生成方法做两点改进:Combining the map expression features, we make two improvements to the histogram generation method: 改进1,针对色彩量化:首先对地图瓦片进行色彩量化,先将待比较的两张瓦片拼合到一起,统一管理二者的所有颜色,量化得到显著且唯一的色彩集合;Improvement 1, for color quantization: First, quantize the colors of the map tiles. First, combine the two tiles to be compared together, unify all the colors of the two tiles, and quantize to obtain a significant and unique color set; 改进2,计算颜色的出现频率:首先将地图剖分为若干个连续色块,并用这些色块的周长来计算相应色彩的出现频率,连续色块的划分标准是相同色彩的像素在四邻域相邻接,周长计算采用摩尔邻域边界跟踪方法;Improvement 2, calculate the frequency of color occurrence: first divide the map into several continuous color blocks, and use the perimeter of these color blocks to calculate the frequency of occurrence of the corresponding color. The division standard of continuous color blocks is that pixels of the same color are adjacent in four neighborhoods. The perimeter calculation adopts Moore neighborhood boundary tracking method; RGB色彩空间下,色彩量化采用最小方差法,指定保留的色彩数量,基于量化后的色彩集合,分别统计每种色彩在两张瓦片中的出现频率,得到色彩量化统一直方图,最后采用余弦距离计算直方图之间的相似程度;将直方图中占比最大的色彩权重值设置为0,抑制其对近似度的影响。In the RGB color space, color quantization uses the minimum variance method to specify the number of retained colors. Based on the quantized color set, the frequency of occurrence of each color in the two tiles is counted to obtain a unified color quantization histogram. Finally, the cosine distance is used to calculate the similarity between the histograms. The weight value of the color with the largest proportion in the histogram is set to 0 to suppress its influence on the approximation. 5.根据权利要求1所述内容自适应感知的网络地图智能推荐系统,其特征在于,色彩与纹理驱动的图像聚类地图推荐:根据图像特征对地图图像分类,图层推荐前先对图层进行聚类缩小相似图层查找范围,分别使用色彩量化统一直方图和灰度共生矩阵表示图像的色彩与纹理特征,在为每个图层生成3张采样图的基础上,针对网络地图服务特征的图像组聚类推荐;5. According to the content-adaptive-aware network map intelligent recommendation system of claim 1, it is characterized by color and texture-driven image clustering map recommendation: map images are classified according to image features, layers are clustered before layer recommendation to narrow the search range of similar layers, color quantization unified histogram and gray-level co-occurrence matrix are used to represent the color and texture features of the image, and on the basis of generating 3 sampling images for each layer, image group clustering recommendation is performed based on network map service features; 综合色彩与纹理特征的地图特征协同表示:对一幅彩色地图,其色彩特征和纹理特征分别记为Gs和Gr,对给定的两幅图像D和E,记它们之间的色彩特征差异为As,纹理特征差异为Ar,k1和k2分别为两种特征间距离的权重值,则D与E之间的综合距离为:Map feature collaborative representation of integrated color and texture features: For a color map, its color feature and texture feature are recorded as Gs and Gr , respectively. For two given images D and E, the color feature difference between them is recorded as As , and the texture feature difference is recorded as Ar . k1 and k2 are the weight values of the distance between the two features, respectively. Then the comprehensive distance between D and E is: A(D,E)=k1*As(GSD,GSE)+k2*Ar(GrD,GrE)A (D, E) = k 1 *A s (G SD , G SE ) + k 2 *A r (G rD , G rE ) 以此作为地图图像和检索的相似性比较方法。This is used as a similarity comparison method for map images and retrieval. 6.根据权利要求1所述内容自适应感知的网络地图智能推荐系统,其特征在于,地图内容图像聚类:在聚类之初把地图数据集中的所有对象都作为聚类中心,每次迭代中,各个聚类中心通过消息传递模式更新,每一次迭代之后在竞争中失败的对象选择一个聚类中心作为其中心,通过不断迭代和选择,最终得到多个优质的聚类结果;6. The content-adaptive-aware network map intelligent recommendation system according to claim 1 is characterized in that, in the clustering of map content images, at the beginning of clustering, all objects in the map data set are used as cluster centers, and in each iteration, each cluster center is updated through a message passing mode, and after each iteration, the object that fails in the competition selects a cluster center as its center, and through continuous iteration and selection, multiple high-quality clustering results are finally obtained; 根据M个地图数据点之间的近似度进行聚类,这近似度构成了M×M相似性矩阵,把所有地图数据点当做潜在的聚类中心Z,C矩阵对角线上的值c(w,w)越大,点w成为聚类中心的几率越大,其值为参照度q,传递机制包括吸引因子和归属因子,更新吸引因子和归属因子矩阵,为避免结果振荡,引入衰减系数k,在更新信息时,每个信息被设置为上次迭代更新值的k倍加上该次信息更新值的1-k倍,其中衰减系数k是0和1之间的实数,即第r+1次t(j,w)与d(j,w)的迭代值:Clustering is performed based on the approximation between M map data points. This approximation constitutes an M×M similarity matrix. All map data points are regarded as potential cluster centers Z. The larger the value c(w, w) on the diagonal of the C matrix, the greater the probability that point w will become a cluster center. Its value is the reference degree q. The transmission mechanism includes attraction factor and attribution factor. The attraction factor and attribution factor matrices are updated. To avoid result oscillation, an attenuation coefficient k is introduced. When updating information, each information is set to k times the last iterative update value plus 1-k times the information update value. The attenuation coefficient k is a real number between 0 and 1, that is, the iterative value of t(j, w) and d(j, w) for the r+1th time: dr+1(j,w)←(1-k)dr+1(j,w)+kdr(j,w)d r+1 (j, w) ← (1-k) d r+1 (j, w) + kd r (j, w) 地图内容图像聚类的具体步骤:Specific steps for map content image clustering: 步骤1:初始化参数,并读取地图内容数据;Step 1: Initialize parameters and read map content data; 步骤2:计算近似度矩阵,并求出近似度矩阵的中位值并赋给参照度q;Step 2: Calculate the proximity matrix, find the median value of the proximity matrix and assign it to the reference degree q; 步骤3:更新吸引因子矩阵、更新归属因子矩阵;Step 3: Update the attraction factor matrix and the attribution factor matrix; 步骤4:判断是否达到最大迭代次数或满足终止条件,若是则跳转步骤2,否则进行步骤5;Step 4: Determine whether the maximum number of iterations has been reached or the termination condition has been met. If so, go to step 2, otherwise go to step 5; 步骤5:得到最终聚类中心Z,并将各数据对象划分到对应的类簇中。Step 5: Get the final cluster center Z and divide each data object into the corresponding cluster. 7.根据权利要求1所述内容自适应感知的网络地图智能推荐系统,其特征在于,基于图像组的聚类方法:采用基于内容的地图服务预览自适应采样方法,为每个图层生成3张预览图,这3张预览图是此图层不同缩放比例信息变化的表达,不管是聚类还是图层推荐,都需要综合每个图层的3张预览图;7. According to claim 1, the content-adaptive-aware network map intelligent recommendation system is characterized by: the clustering method based on the image group: adopting the content-based map service preview adaptive sampling method, generating 3 preview images for each layer, the 3 preview images are expressions of the changes in the information of different zoom ratios of this layer, whether it is clustering or layer recommendation, it is necessary to integrate the 3 preview images of each layer; 把每个图层的3张预览图作为一个图像组,对每个图层的图像组进行聚类,首先解析图像组中3张预览图的特征,3张图像来自于同一个图层不同缩放级别,不同缩放级别的图层是在不同比例尺下对图层相同内容的表达,而同一对象在不同比例尺下的色彩表达不会发生变化;The three preview images of each layer are taken as an image group. The image groups of each layer are clustered. First, the features of the three preview images in the image group are analyzed. The three images come from different zoom levels of the same layer. Layers at different zoom levels express the same content of the layer at different scales, and the color expression of the same object at different scales will not change. 根据色彩特征对图像组进行聚类时,综合图像组的色彩特征,设3张图像A、B、C的色彩量化统一直方图分别为RA,RB,RC,由于图像组中图像色彩特征的相似性,图像组的色彩特征为R=RA+RB+RC,计算考虑到图像组中图像的特征,综合考虑图像组的色彩特征。When clustering image groups according to color features, the color features of the image group are integrated. Suppose the color quantization unified histograms of the three images A, B, and C are RA , RB , and RC respectively. Due to the similarity of the color features of the images in the image group, the color feature of the image group is R= RA + RB + RC . The calculation takes into account the features of the images in the image group and comprehensively considers the color features of the image group. 8.根据权利要求1所述内容自适应感知的网络地图智能推荐系统,其特征在于,基于图像组的地图服务推荐:基于图像内容的相似图层推荐综合考虑图层的3张预览图,在此基础上,提出基于地图服务采样图的相似服务推荐方法;8. According to the content-adaptive-aware network map intelligent recommendation system of claim 1, it is characterized by the map service recommendation based on the image group: similar layer recommendation based on the image content comprehensively considers the three preview images of the layer, and on this basis, proposes a similar service recommendation method based on the map service sampling map; 设有图层D与图层E,每个图层各有3张预览图,分别为D1、D2、D3和E1、E2、E3,计算图层D与图层E的采样图像组之间的相似性;考虑到每个图层的3张图像减少由于相似服务定位到不同级别或不同区域引起的两张图像近似度低,提出以下图像组近似度计算方法:Suppose there are layers D and E, each layer has three preview images, namely D 1 , D 2 , D 3 and E 1 , E 2 , E 3 . The similarity between the sampled image groups of layer D and layer E is calculated. Considering that the three images of each layer reduce the low similarity of the two images caused by similar services being located at different levels or different areas, the following image group similarity calculation method is proposed: 第一步:分别计算图像组D1、D2、D3与图像组E1、E2、E3两两之间的图像近似度,采用色彩加纹理特征计算方法,近似度分别为Cji,其中j={1,2,3}为Dj的下标,i={1,2,3}为Ei的下标,假设Cji中的最大值为C11,则在这9组比较中,D1与E1最相似,记C1=C11,排除含有Di和Ei的所有组,即剩下C22、C23、C32、C33Step 1: Calculate the image similarity between image groups D 1 , D 2 , D 3 and image groups E 1 , E 2 , E 3 respectively, using the color plus texture feature calculation method, the similarity is C ji , where j = {1, 2, 3} is the subscript of D j , i = {1, 2, 3} is the subscript of E i , assuming that the maximum value in C ji is C 11 , then in these 9 groups of comparisons, D 1 is the most similar to E 1 , record C 1 = C 11 , exclude all groups containing D i and E i , that is, C 22 , C 23 , C 32 , C 33 are left; 第二步:同理,C22、C23、C32、C33四组中找出最大值,设为C32,记C2=C32,排除图像D3与图像E2,此时剩下D2与E3这一组图像,记C3=C23Step 2: Similarly, find the maximum value among the four groups C 22 , C 23 , C 32 , and C 33 , set it as C 32 , record C 2 = C 32 , exclude images D 3 and E 2 , and now only the group of images D 2 and E 3 is left, record C 3 = C 23 ; 第三步:经过第一步与第二步之后,两个图像组之间就找到了最相似的三组图像组合方式,为减少由于不同缩放级别和不同位置对相似性的影响,为每一组近似度设置不同的权重值k,最后的图像组之间的近似度为:Step 3: After the first and second steps, the three most similar image combinations are found between the two image groups. To reduce the impact of different zoom levels and different positions on the similarity, a different weight value k is set for each group of similarities. The final similarity between the image groups is: C=k1×C1+k2×C2+k3×C3 C=k 1 ×C 1 +k 2 ×C 2 +k 3 ×C 3 其中1≥k1>k2>k3≥0。Where 1≥k 1 >k 2 >k 3 ≥0. 9.根据权利要求1所述内容自适应感知的网络地图智能推荐系统,其特征在于,搭建网络地图智能检索推荐平台:包括地理信息服务数据采集、数据处理,关键字检索库与图像特征检索的设计和搭建,搭建一个Web端的网络地图服务关键字检索与相似服务推荐平台,通过网络爬虫爬取大量地图服务解析其元数据存储到数据库并建立关键字检索库,基于Apache Solr发布关键字检索服务,为每个图层采样有代表性的图像,然后计算每幅图像的色彩与纹理特征构建特征库,并通过聚类把聚类结果存储到类别库中,当需要推荐相似图像时,系统首先找到其所在类别,然后在此类别下找出最相似图像所属图层推荐给用户;9. According to the content-adaptive-aware web map intelligent recommendation system described in claim 1, it is characterized by building a web map intelligent retrieval recommendation platform: including geographic information service data collection, data processing, design and construction of keyword retrieval library and image feature retrieval, building a web-based web map service keyword retrieval and similar service recommendation platform, crawling a large number of map services through web crawlers, parsing their metadata, storing them in a database and establishing a keyword retrieval library, publishing keyword retrieval services based on Apache Solr, sampling representative images for each layer, and then calculating the color and texture features of each image to build a feature library, and storing the clustering results in a category library through clustering. When it is necessary to recommend similar images, the system first finds the category to which it belongs, and then finds the layer to which the most similar image belongs under this category and recommends it to the user; (一)地图智能检索推荐平台架构(I) Map intelligent retrieval and recommendation platform architecture 通过构建网络爬虫采集散落在互联网各个结点的网络地图服务,解析收集到的结构化的网络地图服务数据,存储到数据库中,作为为用户展示的元数据,此外,通过服务提供的图层获取接口,结合网络地图服务缩略图自动生成方法,为每个图层生成缩略图,这些缩略图作为服务内容代表,是后续进行图像特征提取和图像聚类的数据基础,地图智能检索推荐平台框架分为3层:By building a web crawler to collect web map services scattered on various nodes of the Internet, the collected structured web map service data is parsed and stored in the database as metadata for display to users. In addition, through the layer acquisition interface provided by the service, combined with the automatic generation method of web map service thumbnails, thumbnails are generated for each layer. These thumbnails represent the service content and are the data basis for subsequent image feature extraction and image clustering. The map intelligent retrieval recommendation platform framework is divided into three layers: 第1层:网络地图服务采集与预处理层:通过编译网络地图服务主题爬虫,对互联网上散落的地图服务进行抓取,然后依据其服务版本和其规范,对其元数据进行下载解析,基于结构化的组织形式,依据其各个参数信息建立数据库表结构,把处理好的数据存储进MySQL数据库中;Layer 1: Web map service collection and preprocessing layer: By compiling web map service theme crawlers, we can crawl scattered map services on the Internet, and then download and parse their metadata according to their service versions and specifications. Based on the structured organization form, we can establish a database table structure according to its various parameter information, and store the processed data in the MySQL database. 第2层:图像预处理与关键字检索库搭建:本层分为两部分,分别针对平台的关键字检索和相似服务推荐;针对关键字检索,建立从关键字到服务的方向索引库;针对相似服务推荐,为每一个服务生成缩略图;已预定服务的色彩与纹理特征,在后台提前计算出其特征并存储到特征库中,为后续的聚类提供便利;Layer 2: Image preprocessing and keyword search library construction: This layer is divided into two parts, one for the platform's keyword search and the other for similar service recommendations. For keyword search, a directional index library from keywords to services is established. For similar service recommendations, thumbnails are generated for each service. The color and texture features of the booked services are calculated in advance in the background and stored in the feature library to facilitate subsequent clustering. 第3层:平台查询和推荐逻辑实现:实现查询推荐平台的后台逻辑,包括:根据关键字的地图服务查询与展示,根据缩略图内容的相似服务推荐;Layer 3: Platform query and recommendation logic implementation: Implement the backend logic of the query recommendation platform, including: map service query and display based on keywords, and similar service recommendations based on thumbnail content; (二)网络地图数据采集处理(II) Network map data collection and processing 采用开发地理信息服务主题爬虫收集网络上的OGC WMS和ArcGIS MapService,地理信息服务主题爬虫思路和逻辑包括:The geographic information service theme crawler is developed to collect OGC WMS and ArcGIS MapService on the Internet. The ideas and logic of the geographic information service theme crawler include: (1)通过搜索引擎使用关键字获取种子点URL;(1) Obtain the seed point URL using keywords through a search engine; (2)通过编译爬虫程序读取种子点URL网页,从网页中提取获取到的其它URL链接;(2) Read the seed URL webpage by compiling a crawler program and extract other URL links obtained from the webpage; (3)解析网页内容,若包含WMS、web map service、ARCGIS REST Services关键字的页面则保留下来作为待处理URL;(3) Parse the web page content. If the page contains the keywords WMS, web map service, or ARCGIS REST Services, it will be retained as a URL to be processed. (4)若URL链接中包含有service=wms&request=getcappbilities关键字,且页面内中含有wms-capbilities则确定为OGC WMS服务;若链接中含有/arcgis/rest/services,且页面内容中包含有ArcGIS REST Services Directory,则确定为ArcGIS Service服务,把服务链接存储到数据库中;(4) If the URL link contains the keyword service=wms&request=getcapbilities and the page contains wms-capbilities, it is determined to be an OGC WMS service; if the link contains /arcgis/rest/services and the page content contains ArcGIS REST Services Directory, it is determined to be an ArcGIS Service and the service link is stored in the database; 抓取的服务已链接的形式存储到数据库中,获得服务的元数据,依照服务发布规范类型和其服务版本,对网页进行请求和元数据解析,只选择其中较重要的信息存储入库。The captured services are stored in the database in linked form, and the metadata of the services are obtained. According to the service publishing specification type and its service version, web pages are requested and metadata parsed, and only the more important information is selected and stored in the database. 10.根据权利要求9所述内容自适应感知的网络地图智能推荐系统,其特征在于,(三)地图关键字检索库:建立关键字对服务的反向索引库,建立关键字对服务图层的映射,选择每个服务图层的描述信息和标题两个字段作为关键字匹配的字段;10. The content-adaptive-aware network map intelligent recommendation system according to claim 9 is characterized by: (iii) map keyword retrieval library: establishing a reverse index library of keywords to services, establishing a mapping of keywords to service layers, and selecting the description information and title fields of each service layer as the keyword matching fields; (1)把WMS和ArcGIS Map Service服务每一个图层的信息看做一个Document,选择每个图层信息中的Title和Description作为待分词的字段,把待索引信息交给Tokenizer组件,通过去标点符号、去停顿次操作得到每一个Document的词元Token信息;(1) Treat each layer of WMS and ArcGIS Map Service as a document, select the Title and Description in each layer as the fields to be segmented, and pass the information to be indexed to the Tokenizer component to obtain the word-unit Token information of each document by removing punctuation and pauses; (2)之后交给语言分析工具,把之前得到的词元进行处理,包括大写变小写、单词变为词根形式处理后得到词Term;(2) The language analysis tool is then used to process the previously obtained lemmas, including converting uppercase letters to lowercase letters and converting single words to root forms, to obtain the word Term. (3)把词元Term传给索引器Indexer,将索引后的词元建立单词字典并按字母顺序排序,合并相同的词元Term成为文档倒排链表Posting List;(3) Pass the term to the indexer, create a word dictionary with the indexed terms and sort them in alphabetical order, merge the same terms into a posting list; 对象为OGC WMS与ArcGIS MapService两种地图服务,为对服务图层信息进行关键字索引,在其元数据中找到合适的服务描述信息;The objects are OGC WMS and ArcGIS MapService map services. The purpose is to perform keyword indexing on the service layer information and find the appropriate service description information in its metadata. 建立检索服务通过建立基于词的倒排索引表,英文使用词之间的空格和标点符号进行分隔,检索结果展示信息包括:图层标题、图层描述信息、图层名字、取样图连接,对应的字段分别为mapRiRle、mapAesc、mapMame、images;The search service is established by establishing a word-based inverted index table. In English, words are separated by spaces and punctuation marks. The search results display information including: layer title, layer description information, layer name, and sampling map connection. The corresponding fields are mapRiRle, mapAesc, mapMame, and images. 关键字检索服务发布:把关键字检索作为一个单独的服务进行发布,使关键字检索服务从整个系统中解耦,选择Tomcat作为solr服务的发布服务器,服务发布成功后通过可视化的web页面查看Solr中的信息,通过在页面配置查询条件查询到检索结果,把服务整合到平台中采用SolrJ工具,调用Solr搜索服务。Keyword search service publishing: Publish keyword search as a separate service to decouple the keyword search service from the entire system. Select Tomcat as the publishing server for the Solr service. After the service is successfully published, view the information in Solr through a visual web page. Query the search results by configuring query conditions on the page. Integrate the service into the platform using the SolrJ tool to call the Solr search service.
CN202111603724.1A 2021-12-24 2021-12-24 Content-adaptive and aware web map intelligent recommendation system Active CN114238772B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111603724.1A CN114238772B (en) 2021-12-24 2021-12-24 Content-adaptive and aware web map intelligent recommendation system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111603724.1A CN114238772B (en) 2021-12-24 2021-12-24 Content-adaptive and aware web map intelligent recommendation system

Publications (2)

Publication Number Publication Date
CN114238772A CN114238772A (en) 2022-03-25
CN114238772B true CN114238772B (en) 2024-11-12

Family

ID=80762983

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111603724.1A Active CN114238772B (en) 2021-12-24 2021-12-24 Content-adaptive and aware web map intelligent recommendation system

Country Status (1)

Country Link
CN (1) CN114238772B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114494082B (en) * 2022-04-02 2022-07-08 成都理工大学工程技术学院 A method of image map tile image processing based on image recognition

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102915348A (en) * 2012-09-27 2013-02-06 北京百度网讯科技有限公司 Method and system for recommending search information to user in map
CA2957798A1 (en) * 2016-02-12 2017-08-12 Intellicheck Mobilisa, Inc. System and methods for analyzing information from identification documents

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080177640A1 (en) * 2005-05-09 2008-07-24 Salih Burak Gokturk System and method for using image analysis and search in e-commerce
US7685133B2 (en) * 2006-05-24 2010-03-23 The United States Of America As Represented By The Secretary Of The Navy System and method for automated discovery, binding, and integration of non-registered geospatial web services
JP2010218373A (en) * 2009-03-18 2010-09-30 Olympus Corp Server system, terminal apparatus, program, information storage medium, and image retrieving method
CN102968465B (en) * 2012-11-09 2015-07-29 同济大学 Network information service platform and the search service method based on this platform thereof
US20140278786A1 (en) * 2013-03-14 2014-09-18 Twain Liu-Qiu-Yan System and method to survey and evaluate items according to people's perceptions and to generate recommendations based on people's perceptions
CN103914536B (en) * 2014-03-31 2017-11-07 北京百度网讯科技有限公司 A kind of point of interest for electronic map recommends method and system
CN105022724B (en) * 2015-06-25 2018-01-16 中国人民解放军信息工程大学 A kind of statistical symbol automatic selecting method based on statistics with drawing demand
WO2017168472A1 (en) * 2016-03-30 2017-10-05 パナソニックIpマネジメント株式会社 Position estimation device, position estimation method, and control program
CN106055594A (en) * 2016-05-23 2016-10-26 成都陌云科技有限公司 Information providing method based on user interests
US20180285447A1 (en) * 2017-03-31 2018-10-04 Nec Personal Computers, Ltd. Content recommendation apparatus, content recommendation system, content recommendation method, and program
WO2019059435A1 (en) * 2017-09-22 2019-03-28 라인 가부시키가이샤 Method and system for making recommendations using content including location data

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102915348A (en) * 2012-09-27 2013-02-06 北京百度网讯科技有限公司 Method and system for recommending search information to user in map
CA2957798A1 (en) * 2016-02-12 2017-08-12 Intellicheck Mobilisa, Inc. System and methods for analyzing information from identification documents

Also Published As

Publication number Publication date
CN114238772A (en) 2022-03-25

Similar Documents

Publication Publication Date Title
Wang et al. Richpedia: a large-scale, comprehensive multi-modal knowledge graph
CN109271477B (en) Method and system for constructing classified corpus by means of Internet
CN102902826B (en) A kind of image method for quickly retrieving based on reference picture index
CN105843850B (en) Search optimization method and device
CN109885632B (en) Space science and application data retrieval method, system, medium and equipment
CN107844493B (en) File association method and system
CN116629258B (en) Structured analysis method and system for judicial document based on complex information item data
CN109308324A (en) An image retrieval method and system based on hand-drawn style recommendation
CN119046493B (en) Image retrieval method based on fusion of multi-scale features and spatial attention mechanism
CN117556118B (en) Visual recommendation system and method based on scientific research big data prediction
CN111538903A (en) Method and device for determining search recommended word, electronic equipment and computer readable medium
CN114238772B (en) Content-adaptive and aware web map intelligent recommendation system
CN117218668A (en) Intelligent small-scale method and system based on deep learning and rules
KR100706389B1 (en) Image retrieval method and device considering similarity between images
CN111967238B (en) Medical record template knowledge base construction method, medical system and construction device thereof
CN113609842A (en) A method for obtaining scenic review data and travel experience evaluation
CN118643242A (en) A method, device, equipment and storage medium for obtaining hotspot data
CN118277578A (en) General construction method for standard literature knowledge graph
CN116450776A (en) Oil and Gas Pipeline Network Laws, Regulations and Technical Standards Retrieval System Based on Knowledge Graph
CN115099241A (en) Massive tourism network text semantic analysis method based on model fusion
CN116226494A (en) Crawler system and method for information search
CN114791978A (en) News recommendation method, device, equipment and storage medium
CN112989060A (en) GCN-based major event trend prediction method
KR102785954B1 (en) System for providing legal technology based case searching service
CN111143694A (en) Information pushing method and device, storage equipment and program

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant