We use this repository to maintain our releases of pre-trained pipelines and models for the Spark NLP library. For more information, see our releases.
For user documentation and examples, see the official Spark NLP page: http://nlp.johnsnowlabs.com/
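To use a pre-trained model, call the `pretrained(name, lang)` function on the annotator listed under Model, passing the value from the Name column and the language code shown in the table header.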
Model | Name | en |
---|---|---|
LemmatizerModel (Lemmatizer) | lemma_antbnc | Download |
PerceptronModel (POS) | pos_anc | Download |
NerCrfModel (NER with GloVe) | ner_crf | Download |
NerDLModel (NER with GloVe) | ner_dl | Download |
NerDLModel (NER with GloVe) | ner_dl_contrib | Download |
NerDLModel (NER with BERT) | ner_dl_bert_base_cased | Download |
NerDLModel (OntoNotes with GloVe 100d) | onto_100 | Download |
NerDLModel (OntoNotes with GloVe 300d) | onto_300 | Download |
WordEmbeddings (GloVe) | glove_100d | Download |
BertEmbeddings (base_uncased) | bert_base_uncased | Download |
BertEmbeddings (base_cased) | bert_base_cased | Download |
BertEmbeddings (large_uncased) | bert_large_uncased | Download |
BertEmbeddings (large_cased) | bert_large_cased | Download |
DeepSentenceDetector | ner_dl_sentence | Download |
ContextSpellCheckerModel (Spell Checker) | spellcheck_dl | Download |
SymmetricDeleteModel (Spell Checker) | spellcheck_sd | Download |
NorvigSweetingModel (Spell Checker) | spellcheck_norvig | Download |
ViveknSentimentModel (Sentiment) | sentiment_vivekn | Download |
DependencyParser (Dependency) | dependency_conllu | Download |
TypedDependencyParser (Dependency) | dependency_typed_conllu | Download |
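
As a minimal sketch of the `pretrained(name, lang)` call in Python, the helper below forwards a name and language code from the table to an annotator class. The names `load_model` and `demo` are hypothetical and not part of Spark NLP. `demo` assumes the `spark-nlp` and `pyspark` packages are installed and that the models can be downloaded, so it is defined but not called here.

```python
def load_model(annotator_cls, name, lang="en"):
    """Return annotator_cls.pretrained(name, lang).

    annotator_cls is any class exposing a pretrained(name, lang) method,
    for example a Spark NLP annotator from the Model column.
    """
    return annotator_cls.pretrained(name, lang)


def demo():
    """Load two English models from the table (assumes Spark NLP is installed)."""
    # Imported lazily so the helper above can be used without Spark NLP.
    import sparknlp
    from sparknlp.annotator import LemmatizerModel, PerceptronModel

    sparknlp.start()  # starts a Spark session configured for Spark NLP
    lemmatizer = load_model(LemmatizerModel, "lemma_antbnc", "en")
    pos_tagger = load_model(PerceptronModel, "pos_anc", "en")
    return lemmatizer, pos_tagger
```

The returned annotators still need their input and output columns set before they can be placed in a pipeline.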
Model | Name | fr |
---|---|---|
LemmatizerModel (Lemmatizer) | lemma | Download |
PerceptronModel (POS UD) | pos_ud_gsd | Download |
NerDLModel (glove_840B_300) | wikiner_840B_300 | Download |
Feature | Description |
---|---|
Lemma | Trained by Lemmatizer annotator on lemmatization-lists by Michal Měchura |
POS | Trained by PerceptronApproach annotator on the Universal Dependencies |
NER | Trained by NerDLApproach annotator with Char CNNs - BiLSTM - CRF and GloVe Embeddings on the WikiNER corpus; supports the identification of PER, LOC, ORG and MISC entities |
Model | Name | de |
---|---|---|
LemmatizerModel (Lemmatizer) | lemma | de |
PerceptronModel (POS UD) | pos_ud_hdt | de |
NerDLModel (glove_840B_300) | wikiner_840B_300 | de |
Feature | Description |
---|---|
Lemma | Trained by Lemmatizer annotator on lemmatization-lists by Michal Měchura |
POS | Trained by PerceptronApproach annotator on the Universal Dependencies |
NER | Trained by NerDLApproach annotator with Char CNNs - BiLSTM - CRF and GloVe Embeddings on the WikiNER corpus; supports the identification of PER, LOC, ORG and MISC entities |
Model | Name | it |
---|---|---|
LemmatizerModel (Lemmatizer) | lemma_dxc | Download |
SentimentDetector (Sentiment) | sentiment_dxc | Download |
PerceptronModel (POS UD) | pos_ud_isdt | Download |
NerDLModel (glove_840B_300) | wikiner_840B_300 | Download |
Feature | Description |
---|---|
Lemma | Trained by Lemmatizer annotator on DXC Technology dataset |
POS | Trained by PerceptronApproach annotator on the Universal Dependencies |
NER | Trained by NerDLApproach annotator with Char CNNs - BiLSTM - CRF and GloVe Embeddings on the WikiNER corpus; supports the identification of PER, LOC, ORG and MISC entities |
Model | Name | xx |
---|---|---|
WordEmbeddings (GloVe) | glove_840B_300 | Download |
WordEmbeddings (GloVe) | glove_6B_300 | Download |
BertEmbeddings (multi_cased) | bert_multi_cased | Download |
The models in the following table are also loaded with the `pretrained` function on the annotator listed under Model, but a third argument is required: `pretrained(name, lang, loc)`, where `loc` is the location of these models.
Model | name | language | loc |
---|---|---|---|
NerDLModel | ner_clinical | en | clinical/models |
AssertionLogRegModel | assertion_ml | en | clinical/models |
AssertionDLModel | assertion_dl | en | clinical/models |
NerDLModel | deidentify_dl | en | clinical/models |
DeIdentificationModel | deidentify_rb | en | clinical/models |
WordEmbeddingsModel | embeddings_clinical | en | clinical/models |
BertEmbeddingsModel | biobert_pubmed_cased | en | clinical/models |
BertEmbeddingsModel | biobert_pmc_cased | en | clinical/models |
BertEmbeddingsModel | biobert_pubmed_pmc_cased | en | clinical/models |
BertEmbeddingsModel | biobert_clinical_cased | en | clinical/models |
BertEmbeddingsModel | biobert_discharge_cased | en | clinical/models |
PerceptronModel | pos_clinical | en | clinical/models |
EntityResolverModel | resolve_icd10 | en | clinical/models |
EntityResolverModel | resolve_icd10cm_cl_em | en | clinical/models |
EntityResolverModel | resolve_icd10pcs_cl_em | en | clinical/models |
ContextSpellCheckerModel | context_spell_med | en | clinical/models |
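
The three-argument form `pretrained(name, lang, loc)` can be sketched in Python as follows. The helper `load_from_location` and the constant `CLINICAL_LOC` are hypothetical names introduced for illustration. `demo` assumes that Spark NLP is installed and that your environment has access to the `clinical/models` location, so it is defined but not called here. The arguments are passed positionally so the sketch does not depend on the keyword name of the third parameter.

```python
# Location shared by every row in the table above.
CLINICAL_LOC = "clinical/models"


def load_from_location(annotator_cls, name, lang="en", loc=CLINICAL_LOC):
    """Return annotator_cls.pretrained(name, lang, loc).

    The third positional argument is the model location from the loc column.
    """
    return annotator_cls.pretrained(name, lang, loc)


def demo():
    """Load ner_clinical (assumes Spark NLP and access to clinical/models)."""
    # Imported lazily so the helper above can be used without Spark NLP.
    import sparknlp
    from sparknlp.annotator import NerDLModel

    sparknlp.start()
    return load_from_location(NerDLModel, "ner_clinical", "en")
```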
NOTE: `noncontrib` pipelines are compatible with Windows operating systems.
Pipelines | Name | en |
---|---|---|
Explain Document ML | explain_document_ml | Download |
Explain Document DL | explain_document_dl | Download |
Explain Document DL Win | explain_document_dl_noncontrib | Download |
Explain Document DL Fast | explain_document_dl_fast | Download |
Explain Document DL Fast Win | explain_document_dl_fast_noncontrib | Download |
Recognize Entities DL | recognize_entities_dl | Download |
Recognize Entities DL Win | recognize_entities_dl_noncontrib | Download |
OntoNotes Entities Small | onto_recognize_entities_sm | Download |
OntoNotes Entities Large | onto_recognize_entities_lg | Download |
Match Datetime | match_datetime | Download |
Match Pattern | match_pattern | Download |
Match Chunk | match_chunks | Download |
Match Phrases | match_phrases | Download |
Clean Stop | clean_stop | Download |
Clean Pattern | clean_pattern | Download |
Clean Slang | clean_slang | Download |
Check Spelling | check_spelling | Download |
Analyze Sentiment | analyze_sentiment | Download |
Dependency Parse | dependency_parse | Download |
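
Because the `noncontrib` pipelines are the Windows-compatible ones, a script that runs on several operating systems may want to pick the pipeline name at run time. The following is a minimal sketch using only the Python standard library. The function `pipeline_name_for_os` and the mapping `WINDOWS_VARIANTS` are hypothetical names, and the mapping simply mirrors the three "Win" rows in the table above.

```python
import platform

# Pipelines from the table that have a Windows-compatible "noncontrib" variant.
WINDOWS_VARIANTS = {
    "explain_document_dl": "explain_document_dl_noncontrib",
    "explain_document_dl_fast": "explain_document_dl_fast_noncontrib",
    "recognize_entities_dl": "recognize_entities_dl_noncontrib",
}


def pipeline_name_for_os(name, system=None):
    """Return the noncontrib variant of a pipeline name on Windows.

    system defaults to platform.system(); names without a listed
    variant are returned unchanged on every operating system.
    """
    if system is None:
        system = platform.system()
    if system == "Windows":
        return WINDOWS_VARIANTS.get(name, name)
    return name
```

Passing `system` explicitly, for example `pipeline_name_for_os("recognize_entities_dl", "Windows")`, makes the choice easy to check without running on that platform.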
Pipelines | Name | fr |
---|---|---|
Explain Document Large | explain_document_lg | Download |
Explain Document Medium | explain_document_md | Download |
Entity Recognizer Large | entity_recognizer_lg | Download |
Entity Recognizer Medium | entity_recognizer_md | Download |
Pipelines | Name | it |
---|---|---|
Explain Document Large | explain_document_lg | Download |
Explain Document Medium | explain_document_md | Download |
Entity Recognizer Large | entity_recognizer_lg | Download |
Entity Recognizer Medium | entity_recognizer_md | Download |
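
The pipeline tables can be used in a similar way to the model tables. The sketch below assumes the Spark NLP Python package, where pre-trained pipelines are loaded through the `PretrainedPipeline` class in `sparknlp.pretrained` and applied to a string with `annotate`. This README does not show that class, so treat its use here as an assumption. The helper names `annotate_text` and `demo` are hypothetical, and `demo` is defined but not called because it needs Spark NLP installed and a pipeline download.

```python
def annotate_text(pipeline_cls, name, text, lang="en"):
    """Build pipeline_cls(name, lang=lang) and annotate a single string.

    pipeline_cls is expected to behave like sparknlp.pretrained.PretrainedPipeline:
    constructed from a name and a language code, with an annotate(text) method.
    """
    pipeline = pipeline_cls(name, lang=lang)
    return pipeline.annotate(text)


def demo():
    """Annotate an Italian sentence (assumes Spark NLP is installed)."""
    # Imported lazily so the helper above can be used without Spark NLP.
    import sparknlp
    from sparknlp.pretrained import PretrainedPipeline

    sparknlp.start()
    return annotate_text(
        PretrainedPipeline, "explain_document_md", "Buongiorno, come stai?", lang="it"
    )
```

The name and language code must come from the same table, since names such as `explain_document_md` are listed for more than one language.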