A full spaCy pipeline and models for scientific/biomedical documents
This repository contains custom pipes and models related to using spaCy for scientific documents. In particular, there is a custom tokenizer that adds tokenization rules on top of spaCy’s rule-based tokenizer, a POS tagger and syntactic parser trained on biomedical data, and an entity span detection model. Separately, there are also NER models for more specific tasks.

Just looking to test out the models on your data? Check out our demo.

Installation

Installing scispacy requires two steps: installing the library […]