CN107004167A - Recruit through open public examination standardization and data de-duplication - Google Patents
Recruit through open public examination standardization and data de-duplication Download PDFInfo
- Publication number
- CN107004167A CN107004167A CN201580064463.7A CN201580064463A CN107004167A CN 107004167 A CN107004167 A CN 107004167A CN 201580064463 A CN201580064463 A CN 201580064463A CN 107004167 A CN107004167 A CN 107004167A
- Authority
- CN
- China
- Prior art keywords
- title
- standardization
- open public
- job
- open
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation; Time management
- G06Q10/105—Human resources
-
- G06Q10/40—
Landscapes
- Business, Economics & Management (AREA)
- Human Resources & Organizations (AREA)
- Engineering & Computer Science (AREA)
- Strategic Management (AREA)
- Entrepreneurship & Innovation (AREA)
- Operations Research (AREA)
- Economics (AREA)
- Marketing (AREA)
- Data Mining & Analysis (AREA)
- Quality & Reliability (AREA)
- Tourism & Hospitality (AREA)
- Physics & Mathematics (AREA)
- General Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
优先权要求priority claim
本PCT申请案要求2014年9月30日提交的发明名称为“公开招聘标准化和重复数据删除(JOB POSTING STANDARDIZATION AND DEDUPLICATION)”的第14/502,224号美国专利申请案的优先权权益,并且要求2014年9月30日提交的发明名称为“公开招聘标准化和重复数据删除(JOB POSTING STANDARDIZATION AND DEDUPLICATION)”的第14/502,261号美国专利申请案的优先权权益,两个申请案全部以引用方式并入本文中。This PCT application claims the benefit of priority to U.S. Patent Application No. 14/502,224, filed September 30, 2014, entitled "JOB POSTING STANDARDIZATION AND DEDUPLICATION," and claims the 2014 Priority benefit of U.S. Patent Application No. 14/502,261, filed September 30, 2009, entitled "JOB POSTING STANDARDIZATION AND DEDUPLICATION," both applications incorporated by reference in their entirety into this article.
技术领域technical field
本发明大体上涉及用于托管公开招聘的数据处理系统,并且在一些实施例中,涉及用于对存在于不同第三方系统上的公开招聘进行标准化和重复数据删除的技术。The present invention generally relates to data processing systems for hosting open jobs and, in some embodiments, to techniques for normalizing and deduplicating data for open jobs residing on disparate third-party systems.
背景技术Background technique
在典型的工作托管服务中,公司代表将公开招聘发布到工作托管服务,使得工作托管服务的用户可以搜索、浏览以及在一些情况下,申请与具体公开招聘相关联的工作。作为可向工作托管服务的用户呈现公开招聘的交换,发布公开招聘的公司通常将支付一些费用。In a typical job hosting service, a company representative posts an opening to the job hosting service so that users of the job hosting service can search, browse, and in some cases, apply for jobs associated with the particular opening. In exchange for being able to present the open job to users of the job hosting service, the company posting the open job will typically pay some fee.
附图说明Description of drawings
在附图的各图中借助于实例而非限制性地说明一些实施例。Some embodiments are illustrated by way of example and not limitation in the various figures of the drawings.
图1是说明根据一些实例实施例的适用于社交网络服务的网络环境的网络图。Figure 1 is a network diagram illustrating a network environment suitable for social networking services, according to some example embodiments.
图2是说明根据一些实例实施例的社交网络系统的组件的框图。Figure 2 is a block diagram illustrating components of a social networking system according to some example embodiments.
图3A是说明根据一些实例实施例的在执行用于标准化从第三方系统获得的公开招聘的方法时工作捕获模块和工作标准化模块的操作的流程图。3A is a flowchart illustrating the operation of a job capture module and a job normalization module in performing a method for normalizing open job offers obtained from third-party systems, according to some example embodiments.
图3B是说明根据一些实例实施例的在执行用于标准化从第三方系统获得的公开招聘的方法时工作标准化模块的任选操作的流程图。3B is a flowchart illustrating optional operations of a job normalization module in performing a method for normalizing open job offers obtained from a third-party system, according to some example embodiments.
图4A是说明根据一些实例实施例的在执行用于对从第三方系统获得的公开招聘进行重复数据删除的方法时工作重复数据删除模块,以及任选地工作捕获模块和/或工作标准化模块的操作的流程图。4A is a diagram illustrating a job deduplication module, and optionally a job capture module and/or a job normalization module, when performing a method for deduplicating open job offers obtained from a third-party system, according to some example embodiments. A flowchart of the operation.
图4B是说明根据一些实例实施例的在执行用于对从第三方系统获得的公开招聘进行重复数据删除的方法时工作重复数据删除模块的任选操作的流程图。4B is a flowchart illustrating optional operation of a job deduplication module in performing a method for deduplicating open job offers obtained from a third-party system, according to some example embodiments.
图5是说明其上可以实施一个或多个实施例的机器的实例的框图。Figure 5 is a block diagram illustrating an example of a machine on which one or more embodiments may be implemented.
具体实施方式detailed description
本发明描述单独地提供工作托管服务的方法、系统和计算机程序产品,所述工作托管服务向付费和未付费公开招聘(有时称为工作公告)提供不同水平的服务。在以下详细描述中,出于解释的目的,阐述众多特定细节以便提供对本发明所揭示的主题各个方面的透彻理解。然而,本领域的技术人员将明白,可以在没有这些特定细节的情况下实践本发明所揭示的主题。在其它情况下,尚未详细描述众所周知的方法、程序和组件,以免混淆本发明所揭示的主题。The present invention describes methods, systems, and computer program products that separately provide a job hosting service that provides varying levels of service to paid and unpaid open job offers (sometimes referred to as job postings). In the following detailed description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of various aspects of the presently disclosed subject matter. It will be apparent, however, to one skilled in the art that the presently disclosed subject matter may be practiced without these specific details. In other instances, well-known methods, procedures, and components have not been described in detail so as not to obscure the presently disclosed subject matter.
根据一些实施例,工作托管服务(例如,与社交网络系统相关联)托管付费和未付费公开招聘两者。例如,通过工作托管服务的公开招聘模块,工作托管服务的用户可以提供关于具体职位空缺的信息并且生成付费公开招聘。公开招聘通常由可获得职位空缺的公司或组织的名称、职位空缺的职位名称、工作职能的描述、所需或建议技能、教育程度以及证书和/或专长等组成。作为支付一些费用的交换,付费公开招聘将有资格呈现给用户(例如,工作托管服务与其集成的社交网络系统的人员)。According to some embodiments, the job hosting service (eg, associated with the social networking system) hosts both paid and unpaid open jobs. For example, through the job hosting service's job openings module, users of the job hosting service can provide information about specific job openings and generate paid job openings. An open job offer typically consists of the name of the company or organization where the job opening is available, the job title of the job opening, a description of the job function, required or suggested skills, education, and certifications and/or specializations. In exchange for paying some fees, Paid Open Jobs will be eligible to be presented to users (for example, persons with social networking systems with which the job hosting service integrates).
在一些实施例中,工作托管服务可以托管付费公开招聘和未付费公开招聘。在一些情况下,付费公开招聘可以直接列在工作托管服务上,而未付费公开招聘可以从第三方系统接收。然而,从第三方系统接收的公开招聘的数据格式可能与工作托管服务对于其公开招聘使用的数据格式不匹配。此外,从第三方系统接收的公开招聘可以表示已由工作托管服务列出的公开招聘。In some embodiments, the job hosting service may host paid open jobs and unpaid open jobs. In some cases, paid job openings may be listed directly on the job hosting service, while unpaid job openings may be received from third-party systems. However, the data format of the open positions received from the third-party system may not match the data format used by the job hosting service for its open positions. Additionally, an open job received from a third-party system may represent an open job that has been listed by the job hosting service.
除了付费公开招聘之外,工作托管服务可以从不同的外部托管的第三方招聘网站摄取公开招聘。在一些实施例中,自动计算机程序(例如,“bot”或“spider”)自动“抓取”相关互联网网站并且发现供摄入的公开招聘。在一些实施例中,从由一个或多个第三方合作伙伴保持的数据馈送中获得公开招聘。工作托管服务存储付费公开招聘和未付费公开招聘两者或使另一实体代表其存储付费公开招聘和未付费公开招聘两者,即,已通过公开招聘模块生成并且已向社交网络系统支付费用的公开招聘以及从第三方网站获得且尚未向社交网络系统支付费用的公开招聘。In addition to paid open jobs, job hosting services can ingest open jobs from various externally hosted third-party job sites. In some embodiments, an automated computer program (eg, a "bot" or "spider") automatically "crawls" relevant Internet sites and finds open jobs for ingestion. In some embodiments, open jobs are obtained from data feeds maintained by one or more third-party partners. The job hosting service stores or causes another entity to store both paid and unpaid job openings on its behalf, i.e., job openings that have been generated through the job opening module and for which a fee has been paid to the social networking system Open jobs and open jobs obtained from third-party websites for which no payment has been made to the social networking system.
在一些实施例中,未付费公开招聘仅有资格通过工作搜索接口向社交网络服务的人员呈现。因此,未付费或免费公开招聘通常将仅呈现给可以称为“积极的求职候选人”或“积极的求职者”的社交网络服务人员。这些积极的求职者是通常积极地参与寻找新就业机会的人员。付费公开招聘还有资格通过搜索接口呈现给社交网络服务的人员,但是还通过各种不同其它信道呈现给这些人员。例如,工作推荐引擎可以将人员简档与公开招聘匹配,目标是基于人员的简档数据而将相关公开招聘(即,可能是人员所关注的公开招聘)呈现给社交网络服务的人员。In some embodiments, unpaid open jobs are only eligible for presentation to personnel of the social networking service through the job search interface. As such, unpaid or free open jobs will generally only be presented to social networking service personnel who may be referred to as "active candidates" or "aggressive job seekers." These active job seekers are people who are usually actively involved in finding new employment opportunities. Paid open jobs are also eligible to be presented to personnel of the social networking service through the search interface, but also through a variety of other channels. For example, a job recommendation engine may match a person's profile to an open job with the goal of presenting relevant job openings (ie, those that may be of interest to the person) to the person of the social networking service based on the person's profile data.
在一些实施例中,从第三方系统接收的公开招聘的数据格式可能与社交网络系统的工作托管服务对于其公开招聘使用的数据格式不匹配。在此类实施例中,工作托管服务标准化从第三方系统接收的公开招聘,使得公开招聘可以集成到工作托管服务中。In some embodiments, the data format of an open job received from a third-party system may not match the data format used by the social networking system's job hosting service for its open job. In such embodiments, the job hosting service normalizes open offers received from third-party systems such that the open offers can be integrated into the job hosting service.
在一些实施例中,从第三方系统接收的公开招聘表示已集成到工作托管服务中的公开招聘。在此类实施例中,工作托管服务执行公开招聘重复数据删除,并且如果确定新的公开招聘优于(例如,更具权威性)已集成的公开招聘,那么用新的公开招聘代替已集成的公开招聘。In some embodiments, the open job received from the third party system represents an open job that has been integrated into the job hosting service. In such embodiments, the job hosting service performs open job deduplication and, if it determines that the new open job is superior (e.g., more authoritative) to the integrated open job, replaces the integrated open job with the new open job open recruitment.
可以至少部分由一个或多个处理器执行本文所描述的实例方法的不同操作,所述处理器暂时性地被配置成(例如,通过软件指令)或永久性被配置成执行相关操作。无论暂时性地配置还是永久性地配置,此类处理器可以构成用于执行一个或多个操作或功能的处理器实施的模块或物体。在一些实例实施例中,本文提及的模块和物体可以包括处理器实施的模块和/或物体。The various operations of the example methods described herein may be performed at least in part by one or more processors that are either temporarily configured (eg, by software instructions) or permanently configured to perform the associated operations. Whether temporarily or permanently configured, such a processor may constitute a processor-implemented module or object for performing one or more operations or functions. In some example embodiments, the modules and objects referred to herein may include processor-implemented modules and/or objects.
类似地,本文所描述的方法可以至少部分是处理器实施的。例如,方法的操作中的至少一些可以由一个或多个处理器或处理器实施的模块执行。某些操作的执行可以分布在一个或多个处理器之中,不仅驻留在单个机器或计算机内,而且跨越多个机器或计算机部署。在一些实例实施例中,一个或多个处理器可以位于单个位置中(例如,在家庭环境、办公室环境内,在服务器群处等),而在其它实施例中,处理器可以跨越多个位置分布。Similarly, the methods described herein may be at least partially processor-implemented. For example, at least some of the operations of a method may be performed by one or more processors or processor-implemented modules. The execution of certain operations may be distributed among one or more processors, not only residing within a single machine or computer, but deployed across multiple machines or computers. In some example embodiments, one or more processors may be located in a single location (e.g., within a home environment, office environment, at a server farm, etc.), while in other embodiments, the processors may span multiple locations distributed.
一个或多个处理器还可以用于支持“云计算”环境中或软件即服务(“SaaS”)的上下文内的相关操作的性能。例如,操作中的至少一些操作可以由计算机的(例如,包含处理器的机器的)群组执行,这些操作可通过网络(例如,因特网)以及通过一个或多个合适接口(例如,应用程序接口(API))访问。One or more processors may also be used to support the performance of related operations in a "cloud computing" environment or within the context of software as a service ("SaaS"). For example, at least some of the operations may be performed by a group of computers (e.g., a machine including a processor) over a network (e.g., the Internet) and through one or more suitable interfaces (e.g., an application program interface (API)) access.
图1是说明根据一些实例实施例的适用于社交网络服务的网络环境100的网络图。网络环境100包含服务器机器110、数据库115和用于用户152的装置150,所有都通过网络190以通信方式耦合到彼此。服务器机器110可以形成基于网络的系统105的全部或一部分(例如,基于云的服务器系统被配置成将一个或多个服务提供到装置130和150)。数据库115可以存储用于社交网络服务的公开招聘。服务器机器110、第一装置130和第二装置150可以各自完全或部分实施于计算机系统中,如下文相对于图5所描述。Figure 1 is a network diagram illustrating a network environment 100 suitable for social networking services, according to some example embodiments. Network environment 100 includes server machine 110 , database 115 , and devices 150 for users 152 , all communicatively coupled to each other through network 190 . Server machine 110 may form all or part of network-based system 105 (eg, a cloud-based server system configured to provide one or more services to devices 130 and 150 ). Database 115 may store open jobs for social networking services. The server machine 110, the first device 130 and the second device 150 may each be fully or partially implemented in a computer system, as described below with respect to FIG. 5 .
图1中还示出用户152。用户152可以是人类用户(例如,人类)、机器用户(例如,通过软件程序配置以与装置150交互的计算机)或其任何合适的组合(例如,由机器辅助的人类或由人类监督的机器)。用户152不是网络环境100的一部分,但是与装置150相关联。在一些实施例中,装置150是桌上型电脑、车辆计算机、平板电脑、导航装置、便携式媒体装置、智能电话或由用户152操作的可穿戴式装置(例如,智能手表或智能眼镜)。Also shown in FIG. 1 is user 152 . User 152 may be a human user (e.g., a human), a machine user (e.g., a computer configured by a software program to interact with device 150), or any suitable combination thereof (e.g., a human assisted by a machine or a machine supervised by a human) . User 152 is not part of network environment 100 , but is associated with device 150 . In some embodiments, device 150 is a desktop computer, vehicle computer, tablet computer, navigation device, portable media device, smartphone, or wearable device (eg, smart watch or smart glasses) operated by user 152 .
图1中所示的机器、数据库或装置中的任一个可以实施于由软件(例如,一个或多个软件模块)修改(例如,配置或编程)为专用计算机的通用计算机中,以执行用于所述机器、数据库或装置的本文所描述的功能中的一个或多个。例如,下文相对于图5论述能够实施本文中所描述的方法中的任一个或多个的计算机系统。如本文所使用,“数据库”是数据存储资源并且可以存储构造为文本文件、表格、电子表格、关系型数据库(例如,对象关系型数据库)、三元组仓库、分层数据存储装置或其任何合适的组合的数据。此外,图1中所说明的机器、数据库或装置中的任两个或更多个可以组合成单个机器,并且用于任何单个机器、数据库或装置的本文中所描述的功能可以在多个机器、数据库或装置中细分。Any of the machines, databases, or devices shown in FIG. 1 can be implemented in a general-purpose computer modified (e.g., configured or programmed) by software (e.g., one or more software modules) as a special-purpose computer to execute the One or more of the functions described herein of said machine, database or device. For example, a computer system capable of implementing any one or more of the methods described herein is discussed below with respect to FIG. 5 . As used herein, a "database" is a data storage resource and may store data structured as text files, tables, spreadsheets, relational databases (e.g., object-relational databases), triple stores, hierarchical data stores, or any suitable combination of data. Furthermore, any two or more of the machines, databases, or devices illustrated in Figure 1 may be combined into a single machine, and the functionality described herein for any single machine, database, or device may be implemented on multiple machines , database or device segment.
网络190可以是实现机器、数据库和装置(例如,服务器机器110和装置130)之间或之中的通信的任何网络。因此,网络190可以是有线网络、无线网络(例如,移动或蜂窝网络)或其任何合适的组合。网络190可以包含构成专用网络、公用网络(例如,因特网)或其任何合适组合的一个或多个部分。因此,网络190可以包含并入有局域网(LAN)、广域网(WAN)、因特网、移动电话网络(例如,蜂窝网络)、有线电话网络(例如,简易老式电话系统(POTS)网络)、无线数据网络(例如,WiFi®或WiMax®网络)或其任何合适组合的一个或多个部分。网络190的任何一个或多个部分可以通过传输媒体传送信息。如本文所使用,“传输媒体”是指能够传送(例如,传输)指令以供机器(例如,供此机器的一个或多个处理器)执行的任何无形(例如,暂时性)媒体,并且包含数字或模拟通信信号或其它无形媒体以促进此软件的通信。Network 190 may be any network that enables communication between or among machines, databases, and devices (eg, server machine 110 and device 130 ). Accordingly, network 190 may be a wired network, a wireless network (eg, a mobile or cellular network), or any suitable combination thereof. Network 190 may comprise one or more portions that constitute a private network, a public network (eg, the Internet), or any suitable combination thereof. Thus, network 190 may comprise a local area network (LAN), a wide area network (WAN), the Internet, a mobile telephone network (e.g., a cellular network), a wired telephone network (e.g., a Plain Old Telephone System (POTS) network), a wireless data network (eg, WiFi® or WiMax® networks) or any suitable combination thereof. Any one or more portions of network 190 may communicate information over transmission media. As used herein, "transmission medium" means any intangible (e.g., transitory) medium capable of conveying (e.g., transporting) instructions for execution by a machine (e.g., by one or more processors of the machine), and includes digital or analog communication signals or other intangible media to facilitate communications with this software.
图2是说明根据一些实例实施例的社交网络系统210的组件的框图。社交网络系统210是图1的基于网络系统105的实例。在一些实施例中,社交网络系统210包含工作捕获模块202、应用程序服务器模块204、工作标准化模块206和工作重复数据删除模块208,所有模块都被配置成彼此通信(例如,通过互连件、总线、共享存储器、开关等)。Figure 2 is a block diagram illustrating components of a social networking system 210, according to some example embodiments. Social networking system 210 is an example of web-based system 105 of FIG. 1 . In some embodiments, social networking system 210 includes job capture module 202, application server module 204, job normalization module 206, and job deduplication module 208, all of which are configured to communicate with each other (e.g., via an interconnect, bus, shared memory, switches, etc.).
尽管图2将公开招聘数据库220说明为单个数据库,但是公开招聘数据库220可以包含多个数据库,所述数据库可以位于一个位置或多个位置中。类似地,尽管图2将公开招聘数据库220说明为不同于社交网络系统210,但在一些实施例中,公开招聘数据库220并入社交网络系统210内。Although FIG. 2 illustrates open job database 220 as a single database, open job database 220 may comprise multiple databases, which may be located in one location or in multiple locations. Similarly, although FIG. 2 illustrates the open job database 220 as distinct from the social networking system 210 , in some embodiments the open job database 220 is incorporated within the social networking system 210 .
在一些实施例中,工作捕获模块202从第三方系统170捕获、接收或以其它方式获取公开招聘。如图3A和3B中所描述,在一些实施例中,在将公开招聘集成到公开招聘数据库220中之前,工作标准化模块206标准化公开招聘。如在图4A和4B中所描述,如果此集成不会产生替代优良公开招聘的较差公开招聘,那么工作重复数据删除模块208将公开招聘集成到公开招聘数据库220中。In some embodiments, the job capture module 202 captures, receives or otherwise obtains open job offers from the third party system 170 . As depicted in FIGS. 3A and 3B , in some embodiments, the job normalization module 206 normalizes open jobs prior to integrating the open jobs into the open job database 220 . As depicted in FIGS. 4A and 4B , the job deduplication module 208 integrates the open offers into the open offer database 220 if this integration does not result in a poor open offer to replace a good open offer.
在一些情况下,工作捕获模块202、工作标准化模块206和/或工作重复数据删除模块208被配置成脱机和/或周期性地处理数据。例如,工作捕获模块202可以包含服务器,所述服务器周期性地从相关第三方因特网网站获取公开招聘。对第三方公开招聘进行标准化和重复数据删除可以是计算密集型的;因此,可以脱机完成工作标准化和/或重复数据删除。In some cases, job capture module 202 , job normalization module 206 , and/or job deduplication module 208 are configured to process data offline and/or periodically. For example, the job capture module 202 may comprise a server that periodically retrieves open jobs from relevant third-party Internet sites. Normalizing and deduplicating third-party open jobs can be computationally intensive; therefore, work normalizing and/or deduplicating can be done offline.
如将相对于图3A至3B进一步描述,工作捕获模块202结合工作标准化模块206可以获得和标准化未付费公开招聘以集成到公开招聘数据库220中。As will be further described with respect to FIGS. 3A-3B , job capture module 202 in conjunction with job normalization module 206 may obtain and normalize unpaid open jobs for integration into open job database 220 .
可以使用硬件(例如,机器的一个或多个处理器)或硬件和软件的组合来实施本文中描述的模块中的任一个或多个。例如,本文所描述的任何模块可以配置处理器(例如,在机器的一个或多个处理器之中),以执行用于所述模块的本文中所描述操作。此外,这些模块中的任何两个或更多个可以组合成单个模块,并且用于单个模块的本文中所描述功能可以在多个模块之中细分。此外,根据不同实例实施例,本文中描述为在单个机器、数据库或装置内实施例的模块可以跨越多个机器、数据库或装置分布。Any one or more of the modules described herein may be implemented using hardware (eg, one or more processors of a machine) or a combination of hardware and software. For example, any module described herein may configure a processor (eg, within one or more processors of a machine) to perform the operations described herein for that module. Furthermore, any two or more of these modules may be combined into a single module, and functionality described herein for a single module may be subdivided among multiple modules. Furthermore, modules described herein as being implemented within a single machine, database or device may be distributed across multiple machines, databases or devices, according to various example embodiments.
在一些实施例中,公开招聘数据库220含有由工作托管服务识别的一组预定义职位名称。例如,所述组预定义职位名称可以包含例如“客户经理”、“系统工程师”、“销售经理”等的职位名称。在一些实施例中,公开招聘数据库220含有由工作托管服务识别的一组预定义工作资历水平。例如,所述组预定义工作资历水平可以包含例如“实习生”、“初级”、“中级”、“高级”、“管理”、“经理”等的资历水平。In some embodiments, open job posting database 220 contains a set of predefined job titles identified by job hosting services. For example, the set of predefined job titles may include job titles such as "customer manager", "system engineer", "sales manager" and so on. In some embodiments, open job posting database 220 contains a set of predefined job seniority levels identified by job hosting services. For example, the set of predefined job seniority levels may include seniority levels such as "intern," "junior," "intermediate," "senior," "management," "manager," and the like.
图3A是说明根据一些实例实施例的在执行用于标准化从第三方系统获得的公开招聘的方法300时工作捕获模块202和工作标准化模块206的操作的流程图。可以使用上文相对于图2描述的模块由基于网络的系统105执行方法300中的操作。如图3A中所示,方法300包含操作302、304、306、308和310。3A is a flowchart illustrating the operation of the job capture module 202 and the job normalization module 206 in performing a method 300 for normalizing open job offers obtained from third-party systems, according to some example embodiments. The operations in method 300 may be performed by network-based system 105 using the modules described above with respect to FIG. 2 . As shown in FIG. 3A , method 300 includes operations 302 , 304 , 306 , 308 , and 310 .
通过获得和标准化来自第三方系统的公开招聘,除了社交网络系统付费向其用户呈现的公开招聘之外,社交网络系统210的工作托管服务还可以向其用户呈现来自其它工作来源的公开招聘。By obtaining and normalizing job openings from third-party systems, the job hosting service of social networking system 210 may present job openings to its users from other job sources in addition to the job openings that the social networking system pays to present to its users.
在操作302处,第一实体(例如,社交网络系统210的工作托管服务)获得(例如,使用工作捕获模块202)表示第三方系统170上的公开招聘的数据。公开招聘包含职位名称和职位描述。在一些实施例中,公开招聘还包含以下项中的至少一个:用人单位名称、招聘行业、工作的地理位置和所需技能。At operation 302 , a first entity (eg, the job hosting service of the social networking system 210 ) obtains (eg, using the job capture module 202 ) data representing open job offers on the third party system 170 . Open jobs include job titles and job descriptions. In some embodiments, the open job offer also includes at least one of the following: employer name, hiring industry, geographic location of the job, and required skills.
在操作304处,标准化公开招聘的职位名称(例如,使用工作标准化模块206)以匹配由第一实体识别的预定义职位名称。在一些实施例中,图3B中所说明的方法350的一个或多个操作352至362执行为职位名称标准化过程的一部分。At operation 304, the open job titles are normalized (eg, using the job normalization module 206) to match predefined job titles identified by the first entity. In some embodiments, one or more operations 352 through 362 of method 350 illustrated in FIG. 3B are performed as part of a job title normalization process.
在操作306处,标准化职位描述以符合由第一实体识别的数据格式。在一些实施例中,标准化职位描述包含对职位描述执行拼写检查/校正和/或语法检查/校正。At operation 306, the job description is standardized to conform to the data format identified by the first entity. In some embodiments, normalizing the job description includes performing spell checking/correction and/or grammar checking/correction on the job description.
在操作308处,标准化职位名称和标准化职位描述组合到标准化公开招聘中。在一些实施例中,例如元数据的额外信息还包含在标准化公开招聘中。At operation 308, the standardized job title and the standardized job description are combined into a standardized open job offer. In some embodiments, additional information such as metadata is also included in the standardized open job offer.
在操作310处,标准化公开招聘集成到第一实体(例如,社交网络系统210)的招聘系统(例如,工作托管服务)中。在一些实施例中,在标准化公开招聘的集成之前,对标准化公开招聘执行工作重复数据删除过程(例如,图4A的方法400)。At operation 310, the standardized open job offer is integrated into a recruiting system (eg, a job hosting service) of a first entity (eg, social networking system 210). In some embodiments, a job deduplication process (eg, method 400 of FIG. 4A ) is performed on the standardized open offers prior to integration of the standardized open offers.
图3B是说明根据一些实例实施例的在执行用于标准化从第三方系统获得的公开招聘的方法350时工作标准化模块206的任选操作的流程图。可以使用上文相对于图2描述的模块由基于网络的系统105执行方法350中的操作。如图3B中所示,方法350包含操作352、354、356、358、360、362、364和366。3B is a flowchart illustrating optional operations of the job normalization module 206 in performing a method 350 for normalizing open job offers obtained from a third-party system, according to some example embodiments. The operations in method 350 may be performed by network-based system 105 using the modules described above with respect to FIG. 2 . As shown in FIG. 3B , method 350 includes operations 352 , 354 , 356 , 358 , 360 , 362 , 364 , and 366 .
在操作352处,清除在职位名称中出现的不合需要的字符。例如,在一些实施例中,句点在职位名称中是不合需要的。如果公开招聘中的职位名称是“S.E.in San Francisco,C.A.”,则清除句点将产生“SE in San Francisco,CA”的修改后职位名称。在一些实施例中,通过应用于职位名称的正规表达式来清除不合需要的字符。At operation 352, undesirable characters appearing in the job title are cleaned. For example, in some embodiments, periods are undesirable in job titles. If the job title in an open job posting is "S.E. in San Francisco, C.A.", removing the period will result in a modified job title of "SE in San Francisco, CA." In some embodiments, undesired characters are cleaned by a regular expression applied to the job title.
在操作354处,地理位置确定在职位名称内并且从职位名称清除。例如,如果输入此操作的职位名称是“SE in San Francisco,CA”,那么输出的职位名称将是“SE”。At operation 354, the geographic location is determined within and cleared from the job title. For example, if the job title input to this operation is "SE in San Francisco,CA", the output job title will be "SE".
在操作356处,在表示缩写时,用第一实体识别的单词或词组替代职位名称内的缩写。例如,如果输入此操作的职位名称是“SE”,那么职位名称将是“Systems Engineer”。At operation 356, the word or phrase recognized by the first entity is substituted for the abbreviation within the job title when representing the abbreviation. For example, if the job title entered into this action is "SE", the job title would be "Systems Engineer".
在一些实施例中,使用职位名称内的上下文和/或职位描述内的上下文消除缩写的歧义。在一些实施例中,通过参考职位描述内多次出现的单词来消除缩写的歧义。例如,缩写“SE”可以表示例如“系统工程师”、“销售工程师”、“体育新闻编辑”、“清洁工”、“结构工程师”、“高级工程师”等的预定义职位名称。在一些实施例中,如果在职位描述内出现与预定义职位名称的潜在匹配,那么这会增加此潜在匹配时正确匹配的概率。In some embodiments, abbreviations are disambiguated using context within the job title and/or context within the job description. In some embodiments, abbreviations are disambiguated by referring to multiple occurrences of words within the job description. For example, the abbreviation "SE" may represent predefined job titles such as "systems engineer", "sales engineer", "sports news editor", "cleaner", "structural engineer", "senior engineer", etc. In some embodiments, if a potential match to a predefined job title occurs within the job description, this increases the probability that this potential match is a correct match.
在操作358处,职位名称的单词分成单词的列表。例如,如果输入此操作的职位名称是“系统工程师”,那么此操作的输出将是单词“系统”和“工程师”的列表。At operation 358, the words of the job title are separated into lists of words. For example, if the job title input to this operation is "system engineer", then the output of this operation will be a list of the words "system" and "engineer".
在操作360处,产生单词列表中的单词的所有可能排列。例如,如果单词的列表是“系统”和“工程师”,那么可能排列将是“系统工程师”和“工程师系统”。At operation 360, all possible permutations of the words in the word list are generated. For example, if the list of words is "system" and "engineer", then the possible permutations would be "system engineer" and "system engineer".
在操作362处,选择单词的排列作为与由第一实体识别的至少一个预定义职位名称最紧密匹配的标准化职位名称。例如,如果可能排列是“系统工程师”和“工程师系统”,那么“系统工程师”将选择作为标准化职位名称。At operation 362, the permutation of words is selected as the standardized job title that most closely matches the at least one predefined job title identified by the first entity. For example, if the possible permutations are "systems engineer" and "engineer systems," then "systems engineer" would be chosen as the standardized job title.
在操作364处,确定对应于标准化职位名称的职位名称编号。例如,如果标准化职位名称是“系统工程师”,那么在具体工作托管服务内的对应职位名称编号可以是525。At operation 364, a job title number corresponding to the standardized job title is determined. For example, if the standardized job title is "systems engineer," then the corresponding job title number within the specific job hosting service may be 525.
此外,在操作364处,确定对应于职位名称编号的工作资历水平。例如,对应于“系统工程师”的职位名称编号525的工作资历水平可以是“中级”。Additionally, at operation 364, a job seniority level corresponding to the job title number is determined. For example, the job seniority level corresponding to the job title number 525 of "system engineer" may be "intermediate".
在操作366处,职位名称编号和工作资历水平包含在标准化公开招聘中。在一些实施例中,在将标准化公开招聘集成到公开招聘数据库220中之前,职位名称编号和工作资历水平包含在标准化公开招聘中。At operation 366, the job title number and job seniority level are included in the standardized open job offer. In some embodiments, the job title number and job seniority level are included in the standardized open jobs before they are integrated into the open job database 220 .
图4A是说明根据一些实例实施例的在执行用于对从第三方系统获得的公开招聘进行重复数据删除的方法400时工作重复数据删除模块208,以及任选地工作捕获模块202和/或工作标准化模块206的操作的流程图。可以使用上文相对于图2描述的模块由基于网络的系统105执行方法400中的操作。如图4A所示,方法400包含操作402、404、406、408、410和412。4A is a diagram illustrating the job deduplication module 208, and optionally the job capture module 202 and/or job A flowchart of the operation of the normalization module 206 . The operations in method 400 may be performed by network-based system 105 using the modules described above with respect to FIG. 2 . As shown in FIG. 4A , method 400 includes operations 402 , 404 , 406 , 408 , 410 , and 412 .
通过对来自第三方系统的公开招聘进行重复数据删除,社交网络系统210的工作托管服务可以组织向其用户呈现相同工作的重复公开招聘。By de-duplicating postings from third-party systems, the job hosting service of social networking system 210 can organize duplicate postings from presenting the same job to its users.
在操作402处,任选地,第一实体(例如,社交网络系统210的工作托管服务)获得(例如,使用工作捕获模块202)表示第三方系统170上的公开招聘的数据。在一些实施例中,公开招聘包含以下项中的至少一个:职位名称、职位描述、用人单位名称、招聘行业、工作的地理位置和所需技能。在一些实施例中,方法400的操作402基本上类似于方法300的操作302。At operation 402 , optionally, a first entity (eg, the job hosting service of the social networking system 210 ) obtains (eg, using the job capture module 202 ) data representing open job offers on the third party system 170 . In some embodiments, the open job offer includes at least one of the following: job title, job description, employer name, hiring industry, geographic location of the job, and required skills. In some embodiments, operation 402 of method 400 is substantially similar to operation 302 of method 300 .
在操作404处,任选地,标准化公开招聘的职位名称(例如,使用工作标准化模块206)以匹配由第一实体识别的预定义职位名称。在一些实施例中,图3B中所说明的方法350的一个或多个操作352至362执行为职位名称标准化过程的一部分。At operation 404, the openly advertised job titles are optionally normalized (eg, using the job normalization module 206) to match predefined job titles identified by the first entity. In some embodiments, one or more operations 352 through 362 of method 350 illustrated in FIG. 3B are performed as part of a job title normalization process.
在操作406处,将第一源值分配到标准化公开招聘。在一些实施例中,至少部分由第三方系统的源类型确定第一源值。例如,在一些实施例中,识别三个第三方源类型:工作用人单位的网站、电子求职者跟踪系统(ATS)和电子招聘网站。ATS的实例包含Taleo®、ADP®等。电子招聘网站的实例包含Monster.com®、Indeed®、Craigslist®等。At operation 406, a first source value is assigned to the standardized open job offer. In some embodiments, the first source value is determined at least in part by the source type of the third party system. For example, in some embodiments, three third-party source types are identified: a job employer's website, an electronic applicant tracking system (ATS), and an e-recruitment website. Examples of ATS include Taleo®, ADP®, and the like. Examples of e-recruitment sites include Monster.com®, Indeed®, Craigslist®, and others.
在一些实施例中,存在源类型的层级。例如,工作用人单位的网站在源类型层级中视为最高,电子ATS在源类型层级中视为第二高,并且电子招聘网站在源类型层级中视为最低。因此,从用人单位自有网站获得的公开招聘具有比从电子ATS获得的公开招聘高的源值,从电子ATS获得的公开招聘进而具有比从电子招聘网站获得的公开招聘高的源值。In some embodiments, there is a hierarchy of source types. For example, a job employer's website is considered the highest in the source type hierarchy, electronic ATS is considered the second highest in the source type hierarchy, and an e-recruitment website is considered the lowest in the source type hierarchy. Therefore, an open job offer obtained from an employer's own website has a higher source value than an open job offer obtained from an electronic ATS, which in turn has a higher source value than an open job obtained from an electronic recruitment website.
此外,对于从相同源类型内的源获得的公开招聘,源值可以不同。例如,从dice.com获得的公开招聘可以具有比从Craigslist®获得的公开招聘高的源值。在一些实施例中,工作托管服务的管理员能够将源值分配到不同类型的源(例如,通过用户接口)。Additionally, the source value can be different for open jobs obtained from sources within the same source type. For example, an open job obtained from dice.com may have a higher source value than an open job obtained from Craigslist®. In some embodiments, an administrator of the job hosting service can assign source values to different types of sources (eg, through a user interface).
在操作408处,产生标准化公开招聘的散列值并且将所述散列值分配到标准化公开招聘。在一些实施例中,基于标准化职位名称、地理位置和用人单位名称而产生散列值。At operation 408, a hash value of the standardized job opening is generated and assigned to the standardized job opening. In some embodiments, hash values are generated based on standardized job titles, geographic locations, and employer names.
在一些实施例中,使用除了散列之外的比较数据的方法,例如,校验和、统计分析方法和机器学习方法,例如神经网络或其它监督学习方法。In some embodiments, methods of comparing data other than hashing are used, such as checksums, statistical analysis methods, and machine learning methods, such as neural networks or other supervised learning methods.
在操作410处,确定与标准化公开招聘基本上类似的公开招聘是否存在于社交网络系统210的工作托管服务中。在使用散列的一些实施例中,通过将散列值与社交网络系统210的工作托管服务内的公开招聘的多个散列值相比较来作出此确定,在操作408处针对标准化公开招聘产生所述散列值。At operation 410 , it is determined whether an open job substantially similar to the standardized open job exists in the job hosting service of the social networking system 210 . In some embodiments using hashing, this determination is made by comparing the hash value to a plurality of hash values for open jobs within the job hosting service of the social networking system 210, generated at operation 408 for a standardized open job. The hash value.
在使用散列的一些实施例中,如果标准化公开招聘的散列值与已集成到工作托管服务中的公开招聘的散列值足够匹配,那么标准化公开招聘和已集成的公开招聘被认为基本上类似。在一些实施例中,如果标准化公开招聘的散列值与已集成到工作托管服务中的公开招聘的散列值足够匹配,那么执行两个公开招聘的职位描述的正文的比较。在一些实施例中,比较涉及计算或比较两个公开招聘的已计算出的相似性测量。例如,杰卡德相似系数可以用于比较两个公开招聘之间的相似性。In some embodiments using hashing, a standardized open job and an integrated open job are considered to be substantially similar. In some embodiments, a comparison of the text of the job descriptions of the two open offers is performed if the hash values of the standardized open offers sufficiently match the hash values of the open offers that have been integrated into the job hosting service. In some embodiments, comparing involves calculating or comparing calculated similarity measures of two open jobs. For example, the Jaccard similarity coefficient can be used to compare the similarity between two open job offers.
在使用除了散列之外的比较方法的一些实施例中,不同比较技术可以用于确定公开招聘之间的大量相似性。例如,可以执行公开招聘的类似属性和/或公开招聘内的关键词的比较以确定大量相似性。In some embodiments that use comparison methods other than hashing, different comparison techniques may be used to determine substantial similarities between open job offers. For example, a comparison of similar attributes of job openings and/or keywords within job openings may be performed to determine a number of similarities.
在一些实施例中,如果标准化公开招聘和已集成的公开招聘基本上相似,那么具有最高源值的公开招聘存储于工作托管服务中,而具有较低源值的公开招聘被舍弃。在两个公开招聘具有相同源值的情况下,将保存最早的公开招聘。In some embodiments, if the standardized open job and the integrated open job are substantially similar, the open job with the highest source value is stored in the job hosting service, and the open job with the lower source value is discarded. In the case of two open jobs with the same source value, the oldest open job will be saved.
在一些实施例中,如果标准化公开招聘和已集成的公开招聘基本上相似,那么保存两个公开招聘并且在显示公开招聘时或仅在显示公开招聘之前确定向用户显示的公开招聘。例如,如果在显示具体工作的公开招聘时,工作的付费公开招聘已到期并且先前尚未向用户显示付费公开招聘,那么实际上将显示标准化公开招聘。如果到期的付费公开招聘先前已向用户显示,那么到期的付费公开招聘作为具体工作的公开招聘向用户显示。In some embodiments, if the standardized open job and the integrated open job are substantially similar, both open jobs are saved and the open job to display to the user is determined when the open job is displayed or just before the open job is displayed. For example, if when showing an opening for a specific job, the paid opening for the job has expired and the paid opening has not been previously shown to the user, then the standardized opening will actually be shown. If an due paid job opening was previously displayed to the user, then the due paid job opening is displayed to the user as a job-specific job opening.
在一些实施例中,如果确定工作托管服务中不存在基本上类似的公开招聘,那么标准化公开招聘集成到工作托管服务中。In some embodiments, the standardized opening is integrated into the job hosting service if it is determined that no substantially similar openings exist in the job hosting service.
在操作412处,在工作托管服务内,用标准化公开招聘替代基本上类似的公开招聘。在一些实施例中,响应于基本上类似的公开招聘已识别为不是付费公开招聘并且标准化公开招聘的源值大于工作托管服务中的基本上类似的公开招聘的源值而执行替代。在符合这些条件时的替代防止未付费公开招聘替代工作托管服务内的付费公开招聘,并且防止较低权威性的未付费公开招聘替代工作托管服务内的更具权威性的未付费公开招聘。At operation 412, a substantially similar open job is replaced with a standardized open job within the job hosting service. In some embodiments, the substitution is performed in response to the substantially similar open offer being identified as not being a paid open offer and the source value of the normalized open offer being greater than the source value of the substantially similar open offer in the job hosting service. Substitution when these conditions are met prevents unpaid job openings from replacing paid job openings within the job hosting service, and prevents less authoritative unpaid job openings from replacing more authoritative unpaid job openings within the job hosting service.
图4B是说明根据一些实例实施例的在执行用于对从第三方系统获得的公开招聘进行重复数据删除的方法450时工作重复数据删除模块208的任选操作的流程图。方法450中的操作可以使用上文相对于图2描述的模块由基于网络的系统105执行。如图4B所示,方法450包含操作452和454。FIG. 4B is a flowchart illustrating optional operations of the job deduplication module 208 in performing a method 450 for deduplicating open job offers obtained from a third-party system, according to some example embodiments. The operations in method 450 may be performed by network-based system 105 using the modules described above with respect to FIG. 2 . As shown in FIG. 4B , method 450 includes operations 452 and 454 .
在操作452处,确定基本上类似的公开招聘是付费公开招聘。在一些实施例中,至少部分基于社交网络系统210是否收取报酬而由社交网络系统210的至少一个客户作出此确定,以向社交网络系统210的工作托管服务的至少一个用户152呈现基本上类似的公开招聘。在一些实施例中,作出此确定以防止未付费公开招聘替代社交网络系统210的工作托管服务内的付费公开招聘。At operation 452, it is determined that the substantially similar job opening is a paid job opening. In some embodiments, this determination is made by at least one customer of the social-networking system 210 based at least in part on whether the social-networking system 210 receives compensation to present at least one user 152 of the job-hosting service of the social-networking system 210 with a substantially similar open recruitment. In some embodiments, this determination is made to prevent unpaid open jobs from replacing paid open jobs within the job hosting service of the social networking system 210 .
在操作454处,在接收用户152提交的相关工作搜索之后,向社交网络系统210的用户152呈现标准化公开招聘。在一些实施例中,社交网络系统210的用户152在社交网络系统210内提交工作搜索。在此类系统中,社交网络系统210向用户152呈现与所提交工作搜索相关的一组公开招聘。在一些实施例中,呈现的公开招聘可以包含付费公开招聘、未付费公开招聘,或其某一组合。At operation 454 , the standardized open job openings are presented to the user 152 of the social networking system 210 after receiving the relevant job search submitted by the user 152 . In some embodiments, a user 152 of the social networking system 210 submits a job search within the social networking system 210 . In such a system, social-networking system 210 presents user 152 with a set of open jobs related to a submitted job search. In some embodiments, the open jobs presented may include paid open jobs, unpaid open jobs, or some combination thereof.
图5说明其上可以执行本文所论述的技术(例如,方法)中的任一个或多个的实例机器500的框图。在替代实施例中,机器500可以充当独立装置或可以连接(例如,联网)到其它机器。在联网部署中,机器500可以在服务器机器、客户端机器或服务器-客户端网络环境两者的能力中操作。在实例中,机器500可以充当同级间(P2P)(或其它分布式)网络环境中的同级机器。机器500可以是个人计算机(PC)、平板PC、机顶盒(STB)、个人数字助理(PDA)、移动电话、网络器具、网络路由器、开关或桥接器,或能够执行(依序或以其它方式)指定将由所述机器采取的动作的指令的任何机器。此外,虽然仅说明单一机器,但术语“机器”也将被视为包含个别地或共同地执行指令集(或多个集合)以执行本文中所论述的方法中的任何一个或多和(例如,云计算、软体即服务(SaaS)、其它计算机集群配置)的任何机器集合。5 illustrates a block diagram of an example machine 500 on which any one or more of the techniques (eg, methods) discussed herein may be performed. In alternative embodiments, machine 500 may act as a standalone device or may be connected (eg, networked) to other machines. In a networked deployment, the machine 500 can operate in the capacity of a server machine, a client machine, or both in a server-client network environment. In an example, machine 500 may act as a peer machine in a peer-to-peer (P2P) (or other distributed) network environment. Machine 500 may be a personal computer (PC), tablet PC, set-top box (STB), personal digital assistant (PDA), mobile phone, network appliance, network router, switch or bridge, or be capable of executing (in sequence or otherwise) Any machine that specifies instructions for actions to be taken by said machine. Furthermore, while a single machine is illustrated, the term "machine" shall also be taken to include a set (or multiple sets) of instructions which individually or jointly execute to perform any one or more of the methodologies discussed herein (e.g. , cloud computing, software as a service (SaaS), other computer cluster configurations) any collection of machines.
如本文中所描述,实例可以包含逻辑或多个组件或机构,或可以由逻辑或多个组件或机构操作。电路组是在包含硬件(例如,简单电路、门、逻辑等)的有形实体中实施的电路集合。电路组成员可以随时间和基础硬件可变性为灵活的。电路组包含在操作时可以单独或组合地执行指定操作的成员。在实例中,电路组的硬件可以被永恒地设计成执行特定操作(例如,硬接线)。在实例中,电路组的硬体可以包含可变地连接的物理组件(例如,执行单元、晶体管、简单电路等),所述物理组件包含以物理方式修改(例如,不变集中式粒子的磁性、电气可移动放置等)以编码特定操作的指令。在连接物理组件时,硬件构成的基础电特性例如从绝缘体改变成导体或反之亦然。指令使嵌入式硬件(例如,执行单元或负载机构)能够经由可变连接而产生硬件中的电路组的成员,以当在操作中时执行特定操作的部分。因此,计算机可读媒体在装置正在操作时以通信方式耦合到电路组成员的其它组件。在实例中,物理组件中的任一个可以用于多于一个电路组中的多于一个成员中。例如,在操作中,执行单元在一个时间点处可以用于第一电路组的第一电路中,且由第一电路组中的第二电路或由第二电路组中的第三电路在不同时间重新使用。As described herein, an instance may contain logic or multiple components or mechanisms, or may be operated by logic or multiple components or mechanisms. A circuit group is a collection of circuits implemented in a tangible entity comprising hardware (eg, simple circuits, gates, logic, etc.). Circuit group membership can be flexible over time and underlying hardware variability. A circuit group contains members that, when operated, can individually or in combination perform specified operations. In an example, the hardware of the circuit pack may be permanently designed (eg, hardwired) to perform a particular operation. In an example, the hardware of a circuit pack may consist of variably connected physical components (e.g., execution units, transistors, simple circuits, etc.) that contain physically modified (e.g., magnetic , electrically removable placement, etc.) to encode instructions for specific operations. When physical components are connected, the underlying electrical properties of the hardware component change, for example, from an insulator to a conductor or vice versa. Instructions enable embedded hardware (eg, an execution unit or load mechanism) to create, via variable connections, members of a circuit group in the hardware to, when in operation, perform portions of a particular operation. Accordingly, the computer-readable medium is communicatively coupled to other components of the circuit pack member when the device is in operation. In an example, any of the physical components may be used in more than one member of more than one circuit group. For example, in operation, an execution unit may at one point in time be used in a first circuit of a first circuit group, and be executed in a different manner by a second circuit in the first circuit group or by a third circuit in the second circuit group. Time to reuse.
机器(例如,计算机系统)500可以包含硬件处理器502(例如,中央处理单元(CPU)、图形处理单元(GPU)、硬件处理器核心,或其任何组合)、主存储器504和静态存储器506,所述元件中的一些或全部元件可以经由互连件(例如,总线)508彼此通信。机器500可以进一步包含显示单元510、字母数字输入装置512(例如,键盘)和用户接口(UI)导航装置514(例如,鼠标)。在实例中,显示单元510、输入装置512和UI导航装置514可以是触摸屏显示器。机器500可以另外包含存储装置(例如,驱动单元)516、信号产生装置518(例如,扬声器)、网络接口装置520和一个或多个传感器521,例如,全球定位系统(GPS)传感器、指南针、加速计或其它传感器。机器500可以包含输出控制器528,例如,串行(例如,通用串行总线(USB))、并行或其它有线或无线(例如,红外(IR)、近场通信(NFC)等)连接以与一个或多个外围装置(例如,打印机、读卡器等)通信或控制所述一个或多个外围装置。A machine (e.g., a computer system) 500 may include a hardware processor 502 (e.g., a central processing unit (CPU), a graphics processing unit (GPU), a hardware processor core, or any combination thereof), a main memory 504, and a static memory 506, Some or all of the elements may communicate with each other via an interconnect (eg, bus) 508 . The machine 500 may further include a display unit 510, an alphanumeric input device 512 (eg, a keyboard), and a user interface (UI) navigation device 514 (eg, a mouse). In an example, the display unit 510, the input device 512 and the UI navigation device 514 may be a touch screen display. The machine 500 may additionally include a storage device (e.g., a drive unit) 516, a signal generating device 518 (e.g., a speaker), a network interface device 520, and one or more sensors 521, such as a global positioning system (GPS) sensor, compass, accelerometer, meter or other sensor. Machine 500 may include an output controller 528, e.g., serial (e.g., Universal Serial Bus (USB)), parallel, or other wired or wireless (e.g., infrared (IR), near field communication (NFC), etc.) One or more peripheral devices (eg, printer, card reader, etc.) communicate with or control the one or more peripheral devices.
存储装置516可以包含其上存储体现本文中所描述的技术或功能中的任何一个或多个或由本文中所描述的技术或功能中的任何一个或多个利用的一组或多组数据结构或指令524(例如,软件)的机器可读媒体522。指令524还可以在其由机器500执行期间完全或至少部分地驻留于主存储器504内、静态存储器506内或硬件处理器502内。在实例中,硬件处理器502、主存储器504、静态存储器506或存储装置516中的一个或任何组合可以构成机器可读媒体。Storage 516 may include stored thereon one or more sets of data structures embodying or utilized by any one or more of the techniques or functions described herein or machine-readable medium 522 of instructions 524 (eg, software). Instructions 524 may also reside, completely or at least partially, within main memory 504 , within static memory 506 , or within hardware processor 502 during execution by machine 500 . In an example, one or any combination of hardware processor 502, main memory 504, static memory 506, or storage device 516 may constitute a machine-readable medium.
虽然将机器可读媒体522说明为单一媒体,但术语“机器可读媒体”可以包含被配置成存储一个或多个指令524的单一媒体或多个媒体(例如,集中式或分布式数据库,和/或相关联的高速缓存器和服务器)。Although machine-readable medium 522 is illustrated as being a single medium, the term "machine-readable medium" may encompass a single medium or multiple media configured to store one or more instructions 524 (e.g., a centralized or distributed database, and and/or associated caches and servers).
术语“机器可读媒体”可以包含能够存储、编码或载送用于由机器500执行且使机器500执行本发明的技术中的任何一个或多个的指令,或能够存储、编码或载送由此类指令使用或与此类指令相关联的数据结构的任何媒体。非限制性机器可读媒体实例可以包含固态存储器以及光学和磁性媒体。在实例中,集中式机器可读媒体包括具有多个粒子的机器可读媒体,所述粒子具有不变(例如,静止)质量。因此,集中式机器可读媒体是非暂时性传播信号。集中式机器可读媒体的具体实例可以包含:非易失性存储器,例如半导体存储器装置(例如,电可编程只读存储器(EPROM)或电可擦除可编程只读存储器(EEPROM))和闪存存储器装置;磁盘,例如内部硬盘和可移动盘;磁光碟;以及CD-ROM和DVD-ROM盘。The term "machine-readable medium" may include instructions capable of storing, encoding, or carrying instructions for execution by machine 500 that cause machine 500 to perform any one or more of the techniques of the present invention, or any medium of data structures used by or associated with such instructions. Non-limiting examples of machine-readable media can include solid-state memory and optical and magnetic media. In an example, a centralized machine-readable medium includes a machine-readable medium having a plurality of particles having a constant (eg, rest) mass. Accordingly, the centralized machine-readable medium is a non-transitory propagated signal. Specific examples of centralized machine-readable media may include: non-volatile memory, such as semiconductor memory devices (e.g., Electrically Programmable Read-Only Memory (EPROM) or Electrically Erasable Programmable Read-Only Memory (EEPROM)) and flash memory memory devices; magnetic disks, such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM and DVD-ROM disks.
指令524可以进一步利用多个传输协议中的任一个(例如,帧中继、因特网协议(IP)、传输控制协议(TCP)、用户数据报协议(UDP)、超文本传输协议(HTTP)等)经由网络接口装置520使用传输媒体在通信网络526上传输或接收。实例通信网络可以包含局域网(LAN)、广域网(WAN)、分组数据网络(例如,因特网)、移动电话网络(例如,蜂窝网络)、简易老式电话(POTS)网络和无线数据网络(例如,称为Wi-Fi®的电气电子工程师学会(IEEE)802.11系列标准、称为WiMax®的IEEE 802.16系列标准)、IEEE 802.15.4系列标准、同级间(P2P)网络,以及其它网络。在实例中,网络接口装置520可以包含一个或多个物理插口(例如,以太网、同轴或电话插口)或一个或多个天线以连接到通信网络526。在实例中,网络接口装置520可以包含多个天线以使用单输入多输出(SIMO)、多输入多输出(MIMO)或多输入单输出(MISO)技术中的至少一个无线地通信。术语“传输媒体”应被视为包含能够存储、编码或载送指令以由机器500执行的任何无形媒体,且包含数字或模拟通信信号或用以促进此软件的通信的其它无形媒体。Instructions 524 may further utilize any of a number of transport protocols (e.g., Frame Relay, Internet Protocol (IP), Transmission Control Protocol (TCP), User Datagram Protocol (UDP), Hypertext Transfer Protocol (HTTP), etc.) The transmission medium is used to transmit or receive over the communication network 526 via the network interface device 520 . Example communication networks may include local area networks (LANs), wide area networks (WANs), packet data networks (e.g., the Internet), mobile telephone networks (e.g., cellular networks), plain old telephone (POTS) networks, and wireless data networks (e.g., known as The Institute of Electrical and Electronics Engineers (IEEE) 802.11 series of standards for Wi-Fi®, the IEEE 802.16 series of standards known as WiMax®), the IEEE 802.15.4 series of standards, peer-to-peer (P2P) networks, and others. In an example, network interface device 520 may include one or more physical jacks (eg, Ethernet, coaxial, or telephone jacks) or one or more antennas to connect to communication network 526 . In an example, network interface device 520 may include multiple antennas to communicate wirelessly using at least one of single-input multiple-output (SIMO), multiple-input multiple-output (MIMO), or multiple-input single-output (MISO) techniques. The term "transmission medium" shall be taken to include any intangible medium capable of storing, encoding or carrying instructions for execution by machine 500, and including digital or analog communication signals or other intangible media used to facilitate communication of such software.
附加注释和实例实施例:Additional notes and example embodiments:
实例1包含包括以下项的主题(例如,方法、用于执行动作的构件,或包含指令的机器可读媒体,所述指令在由机器执行时使机器执行动作):通过第一实体获得表示第三方招聘系统上的公开招聘的数据,所述数据包含职位名称和职位描述;标准化职位名称以匹配由第一实体识别的多个预定义职位名称中的至少一个;标准化职位描述以符合由第一实体识别的数据格式;将标准化职位名称和标准化职位描述组合成标准化公开招聘;以及将标准化公开招聘集成到第一实体的招聘系统中。Example 1 encompasses subject matter (e.g., a method, means for performing an action, or a machine-readable medium comprising instructions that when executed by a machine cause the machine to perform an action) that includes obtaining, by a first entity, a representation Open recruitment data on a third-party recruitment system, the data including job titles and job descriptions; standardizing job titles to match at least one of a plurality of predefined job titles identified by the first entity; standardizing job descriptions to match job descriptions identified by the first entity A data format for entity identification; combining the standardized job title and the standardized job description into a standardized open recruitment; and integrating the standardized open recruitment into the recruitment system of the first entity.
实例2可以包含实例1的主题,或可以任选地与所述主题组合以包含,其中标准化职位名称包含清除出现的不合需要的字符,使用至少一个正规表达式来执行所述清除。Example 2 may incorporate the subject matter of Example 1, or may optionally be combined with said subject matter to comprise, wherein normalizing job titles includes cleanup of occurrences of undesirable characters, said cleanup being performed using at least one regular expression.
实例3可以包含实例1至2中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中标准化职位名称包含以下项中的至少一个:确定职位名称内的地理位置并且从职位名称中清除所确定的地理位置;或确定职位名称内的用人单位名称并且从职位名称中清除所确定的用人单位名称。Example 3 may comprise the subject matter of one or any combination of Examples 1 to 2, or may optionally be combined with said subject matter to comprise, wherein the standardized job title comprises at least one of the following: determining the geographic location within the job title and Clear the determined geographic location from the job title; or determine the employer name within the job title and clear the determined employer name from the job title.
实例4可以包含实例1至3中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中标准化职位名称包含在表示缩写时用第一实体识别的单词或词组替代职位名称内的缩写。Example 4 may comprise the subject matter of one or any combination of Examples 1 to 3, or may optionally be combined with said subject matter to comprise, wherein the standardized position title comprises replacing the position with the word or phrase recognized by the first entity when denoting the abbreviation Abbreviation within the name.
实例5可以包含实例1至4中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中替代包含使用职位名称内的上下文和职位描述内的上下文中的至少一个消除缩写的歧义。Example 5 may comprise the subject matter of one or any combination of Examples 1 to 4, or may optionally be combined with said subject matter to comprise, wherein the substitution comprises using at least one of the context within the job title and the context within the job description to eliminate Abbreviation ambiguity.
实例6可以包含实例1至5中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中标准化职位名称包含:将包括有序的多个单词的职位名称分成单词的列表;根据单词的列表产生单词的多个排列;以及从单词的多个排列中选择最紧密匹配由第一实体识别的多个预定义职位名称中的至少一个的单词的排列。Example 6 may comprise the subject matter of one or any combination of Examples 1 to 5, or may optionally be combined with said subject matter to comprise wherein normalizing a job title comprises: dividing a job title comprising an ordered plurality of words into words a list; generating a plurality of permutations of words from the list of words; and selecting, from the plurality of permutations of words, a permutation of words that most closely matches at least one of the plurality of predefined job titles identified by the first entity.
实例7可以包含实例1至6中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中标准化职位名称进一步包含确定对应于标准化职位名称的职位名称编号和工作资历水平,并且其中职位名称编号和工作资历水平包含在标准化公开招聘中。Example 7 may comprise the subject matter of one or any combination of Examples 1 to 6, or may optionally be combined with said subject matter to comprise wherein standardizing the job title further comprises determining the job title number and job seniority level corresponding to the standardized job title , and where the job title number and job seniority level are included in the standardized open recruitment.
实例8可以包含实例1至7中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中标准化公开招聘包含地理位置、用人单位名称、招聘行业和工作技能的识别中的至少一个。Example 8 may comprise the subject matter of one or any combination of Examples 1 to 7, or may optionally be combined with said subject matter to comprise wherein standardized open recruitment includes identification of geographic location, employer name, hiring industry, and job skills at least one of the .
实例9可以包含实例1至8中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,主题(例如,设备、装置或系统)包括:包含存储器和至少一个处理器的机器;可由机器执行的工作捕获模块,其被配置成通过第一实体获得表示第三方招聘系统上的公开招聘的数据,所述数据包含职位名称和职位描述;以及可由机器执行的工作标准化模块,其被配置成:标准化职位名称以匹配由第一实体识别的多个预定义职位名称中的至少一个;标准化职位描述以符合由第一实体识别的数据格式;将标准化职位名称和标准化职位描述组合成标准化公开招聘;以及将标准化公开招聘集成到第一实体的招聘系统中。Example 9 may comprise the subject matter of one or any combination of Examples 1 to 8, or may optionally be combined with said subject matter to comprise that the subject matter (e.g., apparatus, apparatus, or system) comprises: a machine; a machine-executable job capture module configured to obtain, by the first entity, data representing an open job offer on a third-party recruitment system, the data comprising a job title and a job description; and a machine-executable job standardization module, It is configured to: standardize the job title to match at least one of a plurality of predefined job titles identified by the first entity; standardize the job description to conform to the data format identified by the first entity; combine the standardized job title and the standardized job description into a standardized open recruitment; and integrating the standardized open recruitment into the recruitment system of the first entity.
实例10可以包含实例9的主题,或可以任选地与所述主题组合以包含,其中标准化职位名称包含清除出现的不合需要的字符,使用至少一个正规表达式来执行所述清除。Example 10 may comprise, or may optionally be combined with, the subject matter of Example 9 to comprise, wherein normalizing job titles comprises cleaning occurrences of undesirable characters, said cleaning being performed using at least one regular expression.
实例11可以包含实例9至10中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中标准化职位名称包含以下项中的至少一个:确定职位名称内的地理位置并且从职位名称中清除所确定的地理位置;或确定职位名称内的用人单位名称并且从职位名称中清除所确定的用人单位名称。Example 11 may comprise the subject matter of one or any combination of Examples 9 to 10, or may optionally be combined with said subject matter to comprise wherein normalizing job titles comprises at least one of the following: determining a geographic location within a job title and Clear the determined geographic location from the job title; or determine the employer name within the job title and clear the determined employer name from the job title.
实例12可以包含实例9至11中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中标准化职位名称包含在表示缩写时用第一实体识别的单词或词组替代职位名称内的缩写。Example 12 may comprise the subject matter of one or any combination of Examples 9 to 11, or may optionally be combined with said subject matter to comprise, wherein the standardized position title comprises replacing the position with the word or phrase recognized by the first entity when denoting the abbreviation Abbreviation within the name.
实例13可以包含实例9至12中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中替代包含使用职位名称内的上下文和职位描述内的上下文中的至少一个消除缩写的歧义。Example 13 may comprise the subject matter of one or any combination of Examples 9 to 12, or may optionally be combined with said subject matter to comprise, wherein the substitution comprises using at least one of the context within the job title and the context within the job description to eliminate Abbreviation ambiguity.
实例14可以包含实例9至13中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中标准化职位名称包含:将包括有序的多个单词的职位名称分成单词的列表;根据单词的列表产生单词的多个排列;以及从单词的多个排列中选择最紧密匹配由第一实体识别的多个预定义职位名称中的至少一个的单词的排列。Example 14 may comprise the subject matter of one or any combination of Examples 9 to 13, or may optionally be combined with said subject matter to comprise wherein normalizing job titles comprises: dividing a job title comprising an ordered plurality of words into words a list; generating a plurality of permutations of words from the list of words; and selecting, from the plurality of permutations of words, a permutation of words that most closely matches at least one of the plurality of predefined job titles identified by the first entity.
实例15可以包含实例9至14中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中标准化职位名称进一步包含确定对应于标准化职位名称的职位名称编号和工作资历水平,并且其中职位名称编号和工作资历水平包含在标准化公开招聘中。Example 15 may comprise the subject matter of one or any combination of Examples 9 to 14, or may optionally be combined with said subject matter to comprise wherein standardizing job titles further comprises determining a job title number and job seniority level corresponding to the standardized job title , and where the job title number and job seniority level are included in the standardized open recruitment.
实例16可以包含实例9至15中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中标准化公开招聘包含地理位置、用人单位名称、招聘行业和工作技能的识别中的至少一个。Example 16 may comprise the subject matter of one or any combination of Examples 9 to 15, or may optionally be combined with said subject matter to comprise, wherein standardized open recruitment includes identification of geographic location, employer name, hiring industry and job skills at least one of the .
实例17可以包含实例1至16中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,主题(例如,方法、用于执行动作的构件,或包含指令的机器可读媒体,所述指令在由机器执行时使机器执行动作)包括:通过第一实体获得表示第三方系统上的公开招聘的数据;标准化数据以产生标准化公开招聘;将第一源值分配到标准化公开招聘,所述第一源值至少部分由第三方系统的源类型确定;产生标准化公开招聘的第一散列值并且将所述第一散列值分配给标准化公开招聘;确定具有第二源值和第二散列值的基本上类似的公开招聘存在于第一实体的招聘系统中;以及在第一实体的招聘系统内用标准化公开招聘替代基本上类似的公开招聘,所述替代响应于以下项执行:基本上类似的公开招聘已识别为不是付费公开招聘并且第一源值大于第二源值。Example 17 may comprise the subject matter of one or any combination of Examples 1 to 16, or may optionally be combined with said subject matter to comprise, subject matter (e.g., a method, means for performing an action, or machine-readable medium, the instructions, when executed by the machine, cause the machine to perform actions) comprising: obtaining, by a first entity, data representing open job offers on a third-party system; normalizing the data to produce a standardized open job offer; assigning the first source value to the standardized open job posting recruiting, the first source value determined at least in part by a source type of a third-party system; generating a first hash value of a standardized open job offer and assigning the first hash value to the standardized open job posting; determining a second source value having A substantially similar open offer to the second hash value exists in the first entity's recruiting system; and the substantially similar open offer is replaced within the first entity's recruiting system by a standardized open offer responsive to Item Execution: A substantially similar job opening has been identified as not being a paid job opening and the first source value is greater than the second source value.
实例18可以包含实例17的主题,或可以任选地与所述主题组合以包含,其中表示第三方系统上的公开招聘的数据包含职位名称、地理位置和用人单位名称,其中标准化公开招聘包含标准化职位名称,并且其中基于标准化职位名称、地理位置和用人单位名称而产生标准化公开招聘的第一散列值。Example 18 may comprise the subject matter of Example 17, or may optionally be combined with said subject matter to comprise, wherein the data representing open offers on a third party system comprises job title, geographic location and employer name, wherein normalizing open offers comprises normalizing A job title, wherein a first hash value of the standardized open recruitment is generated based on the standardized job title, the geographic location, and the name of the employer.
实例19可以包含实例17至18中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中第三方系统的源类型是用人单位的网站、电子求职者跟踪系统和电子招聘网站中的至少一个。Example 19 may comprise, or may optionally be combined with, the subject matter of Examples 17 to 18, or any combination thereof, wherein the source type of the third party system is an employer's website, an electronic job applicant tracking system, and an electronic At least one of the job sites.
实例20可以包含实例17至19中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中用人单位的网站的源值大于电子求职者跟踪系统的源值,并且其中电子求职者跟踪系统的源值大于电子招聘网站的源值。Example 20 may comprise the subject matter of one or any combination of Examples 17 to 19, or may optionally be combined with said subject matter to comprise, wherein the source value of the employer's website is greater than the source value of the electronic job applicant tracking system, and wherein The source value of the electronic applicant tracking system is greater than the source value of the electronic recruitment website.
实例21可以包含实例17至20中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中确定基本上类似的公开招聘存在于第一实体的招聘系统中包含将第一散列值与第一实体的招聘系统内的公开招聘的多个散列值相比较,所述多个散列值包含第二散列值。Example 21 may comprise, or may optionally be combined with, the subject matter of any one or any combination of Examples 17 to 20 to comprise wherein determining that a substantially similar open recruitment exists in the recruitment system of the first entity comprises adding A hash value is compared to a plurality of hash values of open job offers within the recruitment system of the first entity, the plurality of hash values including the second hash value.
实例22可以包含实例17至21中的一个或任何组合的主题,或可以任选地与所述主题组合以包含基于第一实体是否收取报酬而由第一实体的至少一个客户确定基本上类似的公开招聘是付费公开招聘,以向第一实体的招聘系统的至少一个用户呈现基本上类似的公开招聘。Example 22 may comprise the subject matter of one or any combination of Examples 17 to 21, or may optionally be combined with said subject matter to comprise a substantially similar The open job is a paid open job to present a substantially similar open job to at least one user of the first entity's recruiting system.
实例23可以包含实例17至22中的一个或任何组合的主题,或可以任选地与所述主题组合以包含在接收由第一实体的招聘系统的用户提交的相关工作搜索之后,向用户呈现标准化公开招聘。Example 23 may comprise the subject matter of one or any combination of Examples 17 to 22, or may optionally be combined with said subject matter to comprise presenting Standardized open recruitment.
实例24可以包含实例1至23中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,主题(例如,设备、装置或系统)包括:包含存储器和至少一个处理器的机器;可由机器执行的工作捕获模块,其被配置成通过第一实体获得表示第三方招聘系统上的公开招聘的数据;可由机器执行的工作标准化模块,其被配置成标准化公开招聘;以及可由机器执行的工作重复数据删除模块,其被配置成:将第一源值分配到标准化公开招聘,所述第一源值至少部分由第三方系统的源类型确定;产生标准化公开招聘的第一散列值并且将所述第一散列值分配到标准化公开招聘;确定具有第二源值和第二散列值的基本上类似的公开招聘存在于第一实体的招聘系统中;以及在第一实体的招聘系统内用标准化公开招聘替代基本上类似的公开招聘,所述替代响应于以下项执行:基本上类似的公开招聘已识别为不是付费公开招聘并且第一源值大于第二源值。Example 24 may comprise the subject matter of one or any combination of Examples 1 to 23, or may optionally be combined with said subject matter to comprise that the subject matter (e.g., apparatus, apparatus, or system) comprises: a machine; a machine-executable job capture module configured to obtain, by the first entity, data representing open job offers on a third-party recruitment system; a machine-executable job standardization module configured to standardize open job offers; and a machine-executable job capture module configured to standardize open job offers; Executing a job deduplication module configured to: assign a first source value to the standardized open job offer, the first source value determined at least in part by the source type of the third-party system; generate a first hash of the standardized open job posting value and assigning said first hash value to a standardized open job offer; determining that a substantially similar open job posting with a second source value and a second hash value exists in the first entity's recruitment system; and Substituting a standardized open job within the recruitment system of the substantially similar job opening, the replacement being performed in response to the substantially similar job opening being identified as not being a paid job opening and the first source value being greater than the second source value.
实例25可以包含实例24的主题,或可以任选地与所述主题组合以包含,其中表示第三方系统上的公开招聘的数据包含职位名称、地理位置和用人单位名称,其中标准化公开招聘包含标准化职位名称,并且其中基于标准化职位名称、地理位置和用人单位名称而产生标准化公开招聘的第一散列值。Example 25 may comprise the subject matter of Example 24, or may optionally be combined with said subject matter to comprise, wherein the data representing an open offer on a third-party system comprises job title, geographic location, and employer name, wherein normalizing the open offer comprises normalizing A job title, wherein a first hash value of the standardized open recruitment is generated based on the standardized job title, the geographic location, and the name of the employer.
实例26可以包含实例24至25中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中第三方系统的源类型是用人单位的网站、电子求职者跟踪系统和电子招聘网站中的至少一个。Example 26 may incorporate the subject matter of one or any combination of Examples 24 to 25, or may optionally be combined with said subject matter to comprise, wherein the source type of the third party system is an employer's website, an electronic job applicant tracking system, and an electronic At least one of the job sites.
实例27可以包含实例24至26中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中用人单位的网站的源值大于电子求职者跟踪系统的源值,并且其中电子求职者跟踪系统的源值大于电子招聘网站的源值。Example 27 may comprise, or may optionally be combined with, the subject matter of any one or any combination of Examples 24 to 26 to comprise, wherein the source value of the employer's website is greater than the source value of the electronic job applicant tracking system, and wherein The source value of the electronic applicant tracking system is greater than the source value of the electronic recruitment website.
实例28可以包含实例24至27中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中工作重复数据删除模块被配置成通过将第一散列值与第一实体的招聘系统内的公开招聘的多个散列值相比较而至少部分确定基本上类似的公开招聘存在于第一实体的招聘系统中,所述多个散列值包含第二散列值。Example 28 may comprise, or may optionally be combined with, the subject matter of any one or any combination of Examples 24 to 27 to comprise, wherein the working deduplication module is configured to combine the first hash value with the first entity A plurality of hash values of open jobs within the recruitment system of the first entity is compared to determine at least in part that a substantially similar open job exists in the first entity's recruitment system, the plurality of hash values including the second hash value.
实例29可以包含实例24至28中的一个或任何组合的主题,或可以任选地与所述主题组合以包含,其中工作重复数据删除模块被配置成至少部分基于第一实体是否收取报酬而由第一实体的至少一个客户确定基本上类似的公开招聘是付费公开招聘,以向第一实体的招聘系统的至少一个用户呈现基本上类似的公开招聘。Example 29 may comprise, or may optionally be combined with, the subject matter of any one or any combination of Examples 24 to 28 to comprise, wherein the job deduplication module is configured to be based at least in part on whether the first entity is paid by At least one customer of the first entity determines that the substantially similar job opening is a paid job opening to present the substantially similar job opening to at least one user of the first entity's recruiting system.
实例30可以包含实例24至29中的一个或组合的主题,或可以任选地与所述主题组合以包含,呈现模块被配置成在接收用户提交的相关工作搜索之后向第一实体的招聘系统的用户呈现标准化公开招聘。Example 30 may comprise, or may optionally be combined with, the subject matter of one or a combination of Examples 24 to 29 to comprise, a presentation module configured to present to the recruiting system of the first entity after receiving a relevant job search submitted by a user of users present a standardized open job offer.
这些非限制性实例中的每一个可以独立存在,或可以各种排列或组合与其它实例中的一个或多个组合。Each of these non-limiting examples can stand alone or be combined with one or more of the other examples in various permutations or combinations.
本文中使用在计算机网络和计算机系统的领域中的常规术语。所述术语在本领域中已知并且为方便起见,仅提供为非限制性实例。因此,除非另外说明,否则权利要求书中的对应术语的解释不限于任何特定定义。因此,权利要求书中使用的术语应给予最广泛的合理解释。Conventional terms in the fields of computer networks and computer systems are used herein. Such terms are known in the art and are provided for convenience only as non-limiting examples. Accordingly, the interpretation of corresponding terms in the claims is not limited to any specific definition unless otherwise stated. Accordingly, terms used in the claims are to be given the broadest reasonable interpretation.
尽管本文中已说明且描述具体实施例,但本领域普通技术人员将了解,计算出实现相同目的的任何布置可以取代所示的具体实施例。本领域普通技术人员将明白许多修改。因此,本申请案预期涵盖任何修改或变化。Although specific embodiments have been illustrated and described herein, those of ordinary skill in the art will appreciate that any arrangement calculated to achieve the same purpose may be substituted for the specific embodiments shown. Many modifications will be apparent to those of ordinary skill in the art. Accordingly, this application is intended to cover any adaptations or variations.
以上详细描述包含对附图的参考,所述附图形成详细描述的一部分。附图借助于说明示出可以实践的具体实施例。这些实施例在本文中还称为“实例”。此类实例可以包含除了所示出或所描述的那些元件之外的元件。然而,本发明人还预期其中仅提供所示出或所描述的那些元件的实例。此外,本发明人还预期使用相对于具体实例(或其一个或多于一个方面)或相对于本文所示出或所描述的其它实例(或其一个或多于一个方面)而示出或描述的那些元件的任何组合或排列的实例(或其一个或多于一个方面)。The above detailed description contains references to the accompanying drawings, which form a part of the detailed description. The drawings show, by way of illustration, specific embodiments that may be practiced. These embodiments are also referred to herein as "Examples." Such examples may contain elements in addition to those shown or described. However, the inventors also contemplate examples where only those elements shown or described are provided. In addition, the inventors also contemplate the use of any language shown or described with respect to a particular example (or one or more aspects thereof) or with respect to other examples (or one or more aspects thereof) shown or described herein. Examples of any combination or permutation of those elements (or one or more aspects thereof).
在此文献中参考的所有公开案、专利和专利文献以全文引用的方式并入本文中,就如同以引用的方式个别地并入一般。在此文献与以引用方式并入的那些文献之间发生用法不一致的情况下,所并入的参考文献中的用法应被视为补充此文献的用法;对于不可调和的不一致,此文献中的用法起主导作用。All publications, patents, and patent documents referenced in this document are herein incorporated by reference in their entirety as if individually incorporated by reference. In the event of usage inconsistencies between this document and those documents incorporated by reference, the usage in the incorporated reference shall be deemed to supplement that of this document; for irreconcilable inconsistencies, the usage in this document shall Usage plays a leading role.
在此文献中,如专利文献中所常见而使用术语“一”以包含一个或多于一个,这与“至少一个”或“一个或多个”的任何其它例子或使用无关。在此文献中,术语“或”用于指代非排它性或,使得除非另有指示,否则“A或B”包含“A而非B”、“B而非A”以及“A和B”。在此文献中,术语“包含”和“其中(in which)”用作相应术语“包括”和“其中(wherein)”的通俗等效术语。此外,在所附权利要求书中,术语“包含”和“包括”为开放式的,也就是说,包含除权利要求书中在此术语之后列出的那些元件之外的元件的系统、装置、物品或过程仍被视为在权利要求书的范围内。此外,在所附权利要求书中,术语“第一”、“第二”和“第三”等仅用作标记,且并不意图对其对象施加数字要求。In this document, the term "a" is used as is common in patent documents to include one or more than one, regardless of any other instance or use of "at least one" or "one or more". In this document, the term "or" is used to denote a non-exclusive or such that "A or B" includes "A and not B", "B and not A" and "A and B" unless otherwise indicated ". In this document, the terms "comprising" and "in which" are used as colloquial equivalents of the corresponding terms "including" and "wherein". Furthermore, in the appended claims, the terms "comprises" and "comprising" are open-ended, that is, systems, devices that contain elements other than those listed after this term in the claims , an article, or a process are still considered within the scope of the claims. Furthermore, in the appended claims, the terms "first", "second", and "third", etc. are used as labels only and are not intended to impose numerical requirements on their objects.
本文中所描述的方法实例可以至少部分地由机器或计算机实施。一些实例可以包含编码有可用于配置电子装置以执行如在以上实例中描述的方法的指令的计算机可读媒体或机器可读媒体。此类方法的实施方案可以包含代码,例如微码、汇编语言代码、高级语言代码等等。此代码可以包含用于执行各种方法的计算机可读指令。所述代码可以形成计算机程序产品的部分。此外,在实例中,代码可以例如在执行期间或在其它时间有形地存储在一个或多个易失性、非暂时性或非易失性有形计算机可读媒体上。这些有形计算机可读媒体的实例可以包含,但不限于,硬盘、可移动磁盘、可移动光盘(例如,光盘和数字视频磁盘)、盒式磁带、存储卡或棒、随机存取存储器(RAM)、只读存储器(ROM)等。Method examples described herein may be at least partially implemented by a machine or computer. Some examples can include a computer-readable medium or a machine-readable medium encoded with instructions that can be used to configure an electronic device to perform methods as described in the above examples. An implementation of such methods may comprise code, such as microcode, assembly language code, high level language code, and the like. This code may include computer readable instructions for performing various methods. The code may form part of a computer program product. Furthermore, in examples, the code may be tangibly stored on one or more volatile, non-transitory, or non-volatile tangible computer-readable media, eg, during execution or at other times. Examples of such tangible computer readable media may include, but are not limited to, hard disks, removable disks, removable optical disks (such as compact disks and digital video disks), magnetic tape cartridges, memory cards or sticks, random access memory (RAM) , read-only memory (ROM), etc.
以上描述预期为说明性的而非限制性的。例如,上述实例(或其一个或多个方面)可以彼此组合使用。例如一般技术人员在查阅以上描述后可以使用其它实施例。提供符合37C.F.R.§1.72(b)的摘要以允许读者快速确定技术公开内容的本质并且通过摘要不用于解释或限制权利要求书的范围或意义的理解来提交所述摘要。此外,在以上具体实施方式中,可以将各种特征分组在一起以简化本发明。不应将此情况解释为期望未主张的揭示特征对任何权利要求来说是必需的。相反地,本发明主题可以在于比特定所揭示实施例的所有特征要少。因此,特此将所附权利要求书并入到具体实施方式中,其中每一权利要求作为单独实施例而独立存在,且预期此类实施例可以各种组合或排列彼此组合。实施例的范围应该通过参考所附权利要求书以及此权利要求书所授予的等效物的完整范围来确定。The above description is intended to be illustrative rather than limiting. For example, the above examples (or one or more aspects thereof) may be used in combination with each other. For example, other embodiments may be utilized by one of ordinary skill after reviewing the above description. The abstract is provided in compliance with 37 C.F.R. §1.72(b) to allow the reader to quickly ascertain the nature of the technical disclosure and is presented with the understanding that the abstract is not intended to interpret or limit the scope or meaning of the claims. Furthermore, in the foregoing Detailed Description, various features may be grouped together in order to simplify the present disclosure. This should not be interpreted as intending that an unclaimed disclosed feature is essential to any claim. Rather, inventive subject matter may lie in less than all features of a particular disclosed embodiment. Thus the following claims are hereby incorporated into the Detailed Description, with each claim standing on its own as a separate embodiment, and it is contemplated that such embodiments may be combined with each other in various combinations and permutations. The scope of the embodiments should be determined with reference to the appended claims, along with the full range of equivalents to which such claims are entitled.
Claims (45)
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US14/502,224 US20160092838A1 (en) | 2014-09-30 | 2014-09-30 | Job posting standardization and deduplication |
| US14/502261 | 2014-09-30 | ||
| US14/502224 | 2014-09-30 | ||
| US14/502,261 US10043157B2 (en) | 2014-09-30 | 2014-09-30 | Job posting standardization and deduplication |
| PCT/US2015/022480 WO2016053382A1 (en) | 2014-09-30 | 2015-03-25 | Job posting standardization and deduplication |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN107004167A true CN107004167A (en) | 2017-08-01 |
| CN107004167B CN107004167B (en) | 2022-04-19 |
Family
ID=55631209
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201580064463.7A Active CN107004167B (en) | 2014-09-30 | 2015-03-25 | Publication recruitment normalization and deduplication |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN107004167B (en) |
| WO (1) | WO2016053382A1 (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10043157B2 (en) | 2014-09-30 | 2018-08-07 | Microsoft Technology Licensing, Llc | Job posting standardization and deduplication |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10997560B2 (en) | 2016-12-23 | 2021-05-04 | Google Llc | Systems and methods to improve job posting structure and presentation |
| US10607273B2 (en) | 2016-12-28 | 2020-03-31 | Google Llc | System for determining and displaying relevant explanations for recommended content |
| US9996523B1 (en) | 2016-12-28 | 2018-06-12 | Google Llc | System for real-time autosuggestion of related objects |
Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20060229899A1 (en) * | 2005-03-11 | 2006-10-12 | Adam Hyder | Job seeking system and method for managing job listings |
| US20080065633A1 (en) * | 2006-09-11 | 2008-03-13 | Simply Hired, Inc. | Job Search Engine and Methods of Use |
| US20080065630A1 (en) * | 2006-09-08 | 2008-03-13 | Tong Luo | Method and Apparatus for Assessing Similarity Between Online Job Listings |
| CN101512594A (en) * | 2006-05-16 | 2009-08-19 | 鲍恩蒂乔布斯有限公司 | Method to facilitate engagement and communication between a company and a recruiter |
| CN101520867A (en) * | 2009-04-03 | 2009-09-02 | 汤溪蔚 | Method and system for convenient network job hunting and recruitment |
| US7720791B2 (en) * | 2005-05-23 | 2010-05-18 | Yahoo! Inc. | Intelligent job matching system and method including preference ranking |
| CN102378973A (en) * | 2009-03-30 | 2012-03-14 | 爱萨有限公司 | System and method for data deduplication |
| US20130290205A1 (en) * | 2012-04-30 | 2013-10-31 | Gild, Inc. | Recruiting service graphical user interface |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8375026B1 (en) * | 2006-01-13 | 2013-02-12 | CareerBuilder, LLC | Method and system for matching data sets of non-standard formats |
| US8271473B2 (en) * | 2007-06-25 | 2012-09-18 | Jobs2Web, Inc. | System and method for career website optimization |
| US8473503B2 (en) * | 2011-07-13 | 2013-06-25 | Linkedin Corporation | Method and system for semantic search against a document collection |
| US20140149206A1 (en) * | 2012-11-29 | 2014-05-29 | Linkedin Corporation | Combined sponsored and unsponsored content group |
-
2015
- 2015-03-25 WO PCT/US2015/022480 patent/WO2016053382A1/en not_active Ceased
- 2015-03-25 CN CN201580064463.7A patent/CN107004167B/en active Active
Patent Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20060229899A1 (en) * | 2005-03-11 | 2006-10-12 | Adam Hyder | Job seeking system and method for managing job listings |
| US7720791B2 (en) * | 2005-05-23 | 2010-05-18 | Yahoo! Inc. | Intelligent job matching system and method including preference ranking |
| CN101512594A (en) * | 2006-05-16 | 2009-08-19 | 鲍恩蒂乔布斯有限公司 | Method to facilitate engagement and communication between a company and a recruiter |
| US20080065630A1 (en) * | 2006-09-08 | 2008-03-13 | Tong Luo | Method and Apparatus for Assessing Similarity Between Online Job Listings |
| US8099415B2 (en) * | 2006-09-08 | 2012-01-17 | Simply Hired, Inc. | Method and apparatus for assessing similarity between online job listings |
| US20080065633A1 (en) * | 2006-09-11 | 2008-03-13 | Simply Hired, Inc. | Job Search Engine and Methods of Use |
| CN102378973A (en) * | 2009-03-30 | 2012-03-14 | 爱萨有限公司 | System and method for data deduplication |
| CN101520867A (en) * | 2009-04-03 | 2009-09-02 | 汤溪蔚 | Method and system for convenient network job hunting and recruitment |
| US20130290205A1 (en) * | 2012-04-30 | 2013-10-31 | Gild, Inc. | Recruiting service graphical user interface |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10043157B2 (en) | 2014-09-30 | 2018-08-07 | Microsoft Technology Licensing, Llc | Job posting standardization and deduplication |
Also Published As
| Publication number | Publication date |
|---|---|
| CN107004167B (en) | 2022-04-19 |
| WO2016053382A1 (en) | 2016-04-07 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20230252324A1 (en) | Machine learning techniques for internet protocol address to domain name resolution systems | |
| CN104601438B (en) | A kind of friend recommendation method and apparatus | |
| US20200210466A1 (en) | Hybrid entity matching to drive program execution | |
| US20160034853A1 (en) | Determining a user's latent preference | |
| US20140122472A1 (en) | Sorting Social Profile Search Results Based on Computing Personal Similarity Scores | |
| US10073870B2 (en) | Method and apparatus for providing data correction and management | |
| US10009440B2 (en) | Crowdsourcing entity information | |
| US20160132832A1 (en) | Generating company profiles based on member data | |
| US11410463B1 (en) | Image based user identification across multiple online systems | |
| US20180336529A1 (en) | Job posting standardization and deduplication | |
| CN105900117A (en) | Method and system for acquisition, normalization, matching, and enrichment of data | |
| US20160092838A1 (en) | Job posting standardization and deduplication | |
| US20170124472A1 (en) | Activity sensing online preference assay | |
| US10445386B2 (en) | Search result refinement | |
| KR20200102500A (en) | Method, apparatus and selection engine for classification matching of videos | |
| CN107004167B (en) | Publication recruitment normalization and deduplication | |
| JP2015060432A (en) | Program, computer, and training data generation support method | |
| US20170004531A1 (en) | Advertisement selection using information retrieval systems | |
| US10409830B2 (en) | System for facet expansion | |
| WO2017132499A1 (en) | Timely propagation of network content | |
| US11080605B1 (en) | Interest matched interaction initialization | |
| US20160063441A1 (en) | Job poster identification | |
| US20170318073A1 (en) | Course ingestion and recommendation | |
| US10412076B2 (en) | Identifying users based on federated user identifiers | |
| EP3293696A1 (en) | Similarity search using polysemous codes |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| TA01 | Transfer of patent application right |
Effective date of registration: 20180503 Address after: Washington State Applicant after: Micro soft technique license Co., Ltd Address before: American California Applicant before: LINKEDIN CORPORATION |
|
| TA01 | Transfer of patent application right | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |