medaCy

🏥 Medical Text Mining and Information Extraction with spaCy 🏥

MedaCy is a text processing and learning framework built over spaCy to support the lightning fast prototyping, training, and application of highly predictive medical NLP models. It is designed to streamline researcher workflow by providing utilities for model training, prediction and organization while insuring the replicability of systems.

🌟 Features

Highly predictive, shared-task dominating out-of-the-box trained models for medical named entity recognition.
Customizable pipelines with detailed development instructions and documentation.
Allows the designing of replicable NLP systems for reproducing results and encouraging the distribution of models whilst still allowing for privacy.
Active community development spearheaded and maintained by NLP@VCU.
Detailed API.

💭 Where to ask questions

MedaCy is actively maintained by @AndriyMulyar and @CoreySutphin. The best way to receive immediate responses to any questions is to raise an issue. Make sure to first consult the API. See how to formulate a good issue or feature request in the Contribution Guide.

💻 Installation Instructions

Medacy can be installed for general use or for pipeline development / research purposes.

Application	Run
Prediction and Model Training (stable)	`pip install git+https://github.com/NLPatVCU/medaCy.git`
Prediction and Model Training (latest)	`pip install git+https://github.com/NLPatVCU/medaCy.git@development`
Pipeline Development and Contribution	See Contribution Instructions

📚 Power of medaCy

After installing medaCy and medaCy's clinical model, simply run:

from medacy.model import Model

model = Model.load_external('medacy_model_clinical_notes')
annotation = model.predict("The patient was prescribed 1 capsule of Advil for 5 days.")
print(annotation)

and receive instant predictions:

{
    'entities': {
        'T3': ('Drug', 40, 45, 'Advil'),
        'T1': ('Dosage', 27, 28, '1'), 
        'T2': ('Form', 29, 36, 'capsule'),
        'T4': ('Duration', 46, 56, 'for 5 days')
     },
     'relations': []
}

To explore medaCy's other models or train your own, visit the examples section.

Reference

@ARTICLE {
    author  = "Andriy Mulyar, Natassja Lewinski and Bridget McInnes",
    title   = "TAC SRIE 2018: Extracting Systematic Review Information with MedaCy",
    journal = "National Institute of Standards and Technology (NIST) 2018 Systematic Review Information Extraction (SRIE) > Text Analysis Conference",
    year    = "2018",
    month   = "nov"
}

License

This package is licensed under the GNU General Public License.

Authors

Andriy Mulyar, Corey Sutphin, Bobby Best, Steele Farnsworth, and Bridget T McInnes

Name		Name	Last commit message	Last commit date
Latest commit History 411 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
docs		docs
examples		examples
medacy		medacy
.coveragerc		.coveragerc
.gitattributes		.gitattributes
.gitignore		.gitignore
.travis.yml		.travis.yml
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
Vagrantfile		Vagrantfile
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

medaCy

🌟 Features

💭 Where to ask questions

💻 Installation Instructions

📚 Power of medaCy

Reference

License

Authors

Acknowledgments

About

Releases

Packages

Languages

License

SamMahen/medaCy

Folders and files

Latest commit

History

Repository files navigation

medaCy

🌟 Features

💭 Where to ask questions

💻 Installation Instructions

📚 Power of medaCy

Reference

License

Authors

Acknowledgments

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages