Drug Extraction

Drug name recognition, with normalisation (grounding) to DrugBank IDs and standard names.

The package provides two taggers:
1. DrugTagger - CRF-based, with a DrugBank presence feature (see the feature set for details).
2. DrugnameGazetteer - gazetteer/dictionary-based, with a dictionary built from the DrugBank.ca database.
Both taggers ground (normalise) recognised names to DrugBank IDs and standard names.
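The dictionary-based approach can be illustrated with a minimal sketch. It is not the package's DrugnameGazetteer implementation: the class name GazetteerSketch, its methods, and the placeholder id are hypothetical, and it only matches single tokens case-insensitively, whereas a real gazetteer would also need to handle multi-word names.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch of dictionary lookup with grounding; not the package's API.
class GazetteerSketch {
    // One grounded mention: surface text, token position, id and standard name.
    static final class Mention {
        final String text;
        final int tokenIndex;
        final String id;
        final String standardName;

        Mention(String text, int tokenIndex, String id, String standardName) {
            this.text = text;
            this.tokenIndex = tokenIndex;
            this.id = id;
            this.standardName = standardName;
        }
    }

    // Lower-cased drug name -> {id, standard name}.
    private final Map<String, String[]> dictionary = new HashMap<>();

    void add(String name, String id, String standardName) {
        dictionary.put(name.toLowerCase(Locale.ROOT), new String[] {id, standardName});
    }

    // Tags every token that matches a dictionary entry exactly (ignoring case).
    List<Mention> tag(String[] tokens) {
        List<Mention> mentions = new ArrayList<>();
        for (int i = 0; i < tokens.length; i++) {
            String[] entry = dictionary.get(tokens[i].toLowerCase(Locale.ROOT));
            if (entry != null) {
                mentions.add(new Mention(tokens[i], i, entry[0], entry[1]));
            }
        }
        return mentions;
    }

    public static void main(String[] args) {
        GazetteerSketch gazetteer = new GazetteerSketch();
        // The id is a placeholder, not a real DrugBank id.
        gazetteer.add("aspirin", "ID-PLACEHOLDER-1", "Acetylsalicylic acid");
        for (Mention m : gazetteer.tag(new String[] {"Patient", "took", "Aspirin", "daily"})) {
            System.out.println(m.text + " -> " + m.id + " (" + m.standardName + ")");
        }
    }
}
```

In this sketch grounding is just the value stored alongside each dictionary key, so recognition and normalisation happen in the same lookup.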

Feature set:
Word, Word-1, Word+1, Word-1_Word, Word_Word+1, DrugBankPresence, POS
The DrugBankPresence feature indicates whether the drug name is present in DrugBank.
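As a rough illustration of the feature set, the sketch below builds the seven features for one token. The class FeatureSketch, the sentence boundary markers "<S>" and "</S>", and the lower-cased set lookup used for DrugBankPresence are assumptions made for this example; the package's actual feature extraction may differ.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch of per-token features for a CRF tagger; not the package's code.
class FeatureSketch {
    // words and pos are parallel arrays for one sentence; i is the token position.
    // drugNames holds lower-cased names standing in for the DrugBank dictionary.
    static Map<String, String> features(String[] words, String[] pos, int i, Set<String> drugNames) {
        // Boundary markers are an assumption of this sketch.
        String prev = i > 0 ? words[i - 1] : "<S>";
        String next = i < words.length - 1 ? words[i + 1] : "</S>";

        Map<String, String> f = new LinkedHashMap<>();
        f.put("Word", words[i]);
        f.put("Word-1", prev);
        f.put("Word+1", next);
        f.put("Word-1_Word", prev + "_" + words[i]);
        f.put("Word_Word+1", words[i] + "_" + next);
        f.put("DrugBankPresence",
                String.valueOf(drugNames.contains(words[i].toLowerCase(Locale.ROOT))));
        f.put("POS", pos[i]);
        return f;
    }
}
```

Each map would become one observation row for the CRF, with the gold label attached during training.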

Using CONLL-Evaluation:
processed 32065 tokens with 3656 phrases; found: 3251 phrases; correct: 2786.
accuracy: 95.25%; precision: 85.70%; recall: 76.20%; FB1: 80.67
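The phrase-level precision, recall and FB1 above follow directly from the reported counts, as the short check below shows. The class name ConllScores is hypothetical; the accuracy figure is computed per token and cannot be rederived from the phrase counts alone, so it is left out.

```java
import java.util.Locale;

// Recomputes the phrase-level scores from the counts reported above.
class ConllScores {
    // Percentage of found phrases that are correct.
    static double precision(int correct, int found) {
        return 100.0 * correct / found;
    }

    // Percentage of gold phrases that were found correctly.
    static double recall(int correct, int gold) {
        return 100.0 * correct / gold;
    }

    // Harmonic mean of precision and recall.
    static double f1(double precision, double recall) {
        return 2 * precision * recall / (precision + recall);
    }

    public static void main(String[] args) {
        int gold = 3656, found = 3251, correct = 2786;
        double p = precision(correct, found);
        double r = recall(correct, gold);
        // Prints: precision: 85.70%; recall: 76.20%; FB1: 80.67
        System.out.println(String.format(Locale.ROOT,
                "precision: %.2f%%; recall: %.2f%%; FB1: %.2f", p, r, f1(p, r)));
    }
}
```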

Using GATE Corpus Benchmark:
Strict: P: 0.65 R: 0.73 F1: 0.69
Lenient: P: 0.74 R: 0.84 F1: 0.78

For details on how to reproduce the evaluation, see the README.

To use the standalone version for tagging, download DrugExtractionStandalone.tar.gz from Files.

Additional Project Details

User Interface

Console/Terminal

Programming Language

Java

Related Categories

Java Bio-Informatics Software, Java Linguistics Software, Java Machine Learning Software

Registered

2015-06-10
