Disclosure of Invention
The invention overcomes the defects of the technology and provides a data annotation platform.
In order to achieve the purpose, the invention adopts the following technical scheme:
a data annotation platform comprises a user management module, a data annotation management module, a data center module, a data acquisition unit and a user interface module, wherein the user management module is used for user authentication and authority management, the data annotation management module is used for data annotation management, the data center module is used for data storage, the data acquisition unit is used for acquiring input data which are respectively in data communication with the data annotation management module, the data center module and the user interface module, the data annotation management module is in data communication with the user management module and the data center module, the user interface module is in data communication with the user management module, the user interface module comprises an annotation workbench interface, a project task interface, a data source management interface, a team management interface and a basic setting interface, the data annotation management module comprises a task team member management unit and a role authority management unit which are used for displaying in the team management interface, the role authority management unit has the authority of setting whether a team member has a final draft submission to a marked task, the data annotation management module comprises a task storage function key and a final draft submission function key which are used for displaying in the annotation workbench interface, a project setting unit and a marking task management unit which are used for displaying in the project task interface, the final draft submission function key of a user without the final draft submission authority is gray or hidden, the project setting unit is provided with the functions of creating projects, building team projects and selecting team members, each project can be provided with a plurality of tasks, and the marking task management unit is provided with a task management list and a task submission function key for submitting tasks, the task submission function keys of users without task submission authority are grayed out or hidden.
Preferably, a data set is arranged in the data center module, and the data labeling management module includes a data set management unit corresponding to the data set and used for operating and displaying in the data source management interface.
Preferably, a corpus and a graph database are further arranged in the data center module, the data labeling management module comprises a data acquisition unit for operating and displaying in the data source management interface, and the data acquisition unit is provided with an acquisition source list and a function key for adding an acquisition source.
Preferably, the data labeling management module comprises a labeling scheme template management unit and an auxiliary tool setting unit, wherein the labeling scheme template management unit and the auxiliary tool setting unit are used for operating and displaying in the basic setting interface, and a labeling scheme template list which is used after being opened and a function key with a user-defined labeling scheme are arranged in the labeling scheme template management unit.
Preferably, the data annotation management module further comprises a former reference list, an entity label list, a shortcut key list, a relationship label list and an annotation status display unit, which are used for operating and displaying in the annotation workbench interface, and an annotation guide setting unit and an annotation scheme setting unit, which are used for operating and displaying in the project task interface, wherein the annotation status display unit displays the former reference list, the entity label list, the shortcut key list, the relationship label list and the annotation data in the annotation workbench interface.
Preferably, the auxiliary tool setting unit comprises an entity extraction tool setting and an automatic word segmentation tool setting, the entity extraction tool setting is a setting for judging whether to perform magnetic labeling on an entity in the extracted text or not when the data is labeled, and the automatic word segmentation tool setting is a setting for judging whether to automatically perform word segmentation on the labeled text or not when the data is labeled.
Compared with the prior art, the invention has the beneficial effects that:
the user interface module of the data annotation platform is provided with a team management interface, so that team management is facilitated, annotation tasks are completed by teams, and the practicability is good; the role authority management unit is convenient for setting whether the team member has the authority of submitting the final draft for the marked task, the members without the authority of submitting the final draft can store the completed task through the task storage function key, and the members with the authority can submit the task through the function key of submitting the final draft, so that the practicability is good; the setting of the project setting unit is convenient for creating projects, establishing team projects and selecting team members, so that task assignment is supported, standard project team management is provided, a team is helped to complete annotation task cooperation more easily, and the practicability is good; the setting of the task submitting function key in the labeling task management unit is convenient for setting whether a team member has a submitting function to the labeled task or not, and the member with authority can finally submit the task, so that better management control is facilitated.
Detailed Description
The features of the present invention and other related features are further described in detail below by way of examples to facilitate understanding by those skilled in the art:
as shown in fig. 1 to 2, a data annotation platform comprises a user management module, a data annotation management module, a data center module, a data acquisition unit and a user interface module, wherein the user management module is used for user authentication and authority management, the data annotation management module is used for data annotation management, the data center module is used for data storage, the data acquisition unit is used for acquiring input data, and the data acquisition unit is respectively in data communication with the data annotation management module, the data center module and the user interface module, the data annotation management module is in data communication with the user management module and the data center module, and the user interface module is in data communication with the user management module, wherein the user interface module comprises an annotation workbench interface, a project task interface, a data source management interface, a team management interface, And a basic setting interface, wherein the data annotation management module comprises a task team member management unit and a role authority management unit which are used for operating and displaying in the team management interface, the role authority management unit has the authority of setting whether a team member has a final draft submission to a marked task, the data annotation management module comprises a task saving function key and a final draft submission function key which are used for operating and displaying in the annotation workbench interface, a project setting unit and a marking task management unit which are used for operating and displaying in the project task interface, the final draft submission function key of a user without the final draft submission authority is gray or hidden, the project setting unit is provided with the functions of creating projects, creating team projects and selecting team members, each project can be provided with a plurality of tasks, and the marking task management unit is provided with a task management list and a task submission function key for submitting tasks, the task submission function keys of users without task submission authority are grayed out or hidden.
As described above, the user interface module of the annotation platform of the present disclosure is provided with a team management interface, which is convenient for team management, and the annotation task is completed by a team, so that the practicability is good; the role authority management unit is convenient for setting whether the team member has the authority of submitting the final draft for the marked task, the members without the authority of submitting the final draft can store the completed task through the task storage function key, and the members with the authority can submit the task through the function key of submitting the final draft, so that the practicability is good; the setting of the project setting unit is convenient for creating projects, establishing team projects and selecting team members, so that task assignment is supported, standard project team management is provided, a team is helped to complete annotation task cooperation more easily, and the practicability is good; the setting of the task submitting function key in the labeling task management unit is convenient for setting whether a team member has a submitting function to the labeled task or not, and the member with authority can finally submit the task, so that better management control is facilitated.
As described above, the data center module is provided with the data set, and the data annotation management module includes the data set management unit corresponding to the data set and used for operating and displaying in the data source management interface.
As described above, in the specific implementation, the data center module is further provided with a corpus and a graph database, the data labeling management module includes a data acquisition unit for operating and displaying in the data source management interface, and the data acquisition unit is provided with an acquisition source list and a function key for adding an acquisition source.
As described above, in specific implementation, the data labeling management module includes a labeling scheme template management unit and an auxiliary tool setting unit for operating and displaying in the basic setting interface, and the labeling scheme template management unit is provided with a list of labeling scheme templates ready for use when opening a box and a function key with a customized labeling scheme, so that the data labeling management module is more convenient to use.
As described above, in specific implementation, the data annotation management module further includes a precedent reference list, an entity tag list, a shortcut key list, a relationship tag list, and an annotation status display unit for performing operation and display in the annotation workbench interface, and an annotation guide setting unit and an annotation scheme setting unit for performing operation and display in the project task interface, where the annotation status display unit displays the precedent reference list, the entity tag list, the shortcut key list, the relationship tag list, and the annotation data in the annotation workbench interface.
As mentioned above, before data labeling, a team firstly makes a labeling scheme, can add a labeling scheme template, set a label set and define each label shortcut key, which is beneficial to better labeling work.
As above, during concrete implementation, appurtenance setting unit is including entity extraction tool setting and automatic word segmentation tool setting, entity extraction tool sets up the setting of whether carrying out the magnetism mark to the entity in extracting the text when the data mark, automatic word segmentation tool sets up the setting of whether carrying out the word segmentation to the mark text voluntarily when the data mark, so, be favorable to carrying out automatic word segmentation of AI and the automatic mark of AI at the during operation to reduce some work load, later carry out artifical quality control and correction again, accomplish man-machine cooperation mark, the practicality is good.
As described above, in the specific implementation, high-quality labeling is performed on the data of the content operation platform, so that the public service level and efficiency can be improved. Data annotation is carried out on commodity data, searched commodity contents, sentence contexts and the like of the e-commerce industry, and an accurate user portrait can be established through an intelligent recommendation system, so that commodities which are more in line with interests of the user are recommended for the user, and the conversion rate is effectively improved. In the adverse drug reaction early warning system, the text content of the drug indications is labeled, so that a high-quality data set is provided for the training of an adverse reaction prediction algorithm, and the accuracy of the algorithm is improved.
As described above, the present disclosure is directed to a data annotation platform, and all technical solutions that are the same as or similar to the present disclosure should be considered as falling within the scope of the present disclosure.