transformers/model_cards/sagorsarker/codeswitch-spaeng-ner-lince
Sagor Sarker 4753816e39
added bangla-bert-base model card and also modified other model cards (#7071)
* added bangla-bert-base

* Apply suggestions from code review

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-09-11 15:17:25 -04:00
..
README.md added bangla-bert-base model card and also modified other model cards (#7071) 2020-09-11 15:17:25 -04:00

language datasets license tags
es
en
lince
MIT
codeswitching
spanish-english
ner

codeswitch-spaeng-ner-lince

This is a pretrained model for Name Entity Recognition of spanish-english code-mixed data used from LinCE

This model is trained for this below repository.

https://github.com/sagorbrur/codeswitch

To install codeswitch:

pip install codeswitch

Name Entity Recognition of Spanish-English Mixed Data

  • Method-1

from transformers import AutoTokenizer, AutoModelForTokenClassification, pipeline

tokenizer = AutoTokenizer.from_pretrained("sagorsarker/codeswitch-spaeng-ner-lince")

model = AutoModelForTokenClassification.from_pretrained("sagorsarker/codeswitch-spaeng-ner-lince")

ner_model = pipeline('ner', model=model, tokenizer=tokenizer)

ner_model("put any spanish english code-mixed sentence")

  • Method-2
from codeswitch.codeswitch import NER
ner = NER('spa-eng')
text = "" # your mixed sentence 
result = ner.tag(text)
print(result)