transformers/model_cards/sagorsarker/codeswitch-nepeng-lid-lince
Sagor Sarker cc4ba79f68
added model card for codeswitch-spaeng-sentiment-analysis-lince (#6727)
* added model card for codeswitch-spaeng-sentiment-analysis-lince model also update other model card

* fixed typo

* fixed typo

* fixed typo

* fixed typo

* fixed typo

* fixed typo

* fixed typo

* Update README.md
2020-08-26 17:26:32 -04:00
..
README.md added model card for codeswitch-spaeng-sentiment-analysis-lince (#6727) 2020-08-26 17:26:32 -04:00

language datasets license tags
ne
en
LinCE
MIT
codeswitching
nepali-english
language-identification

codeswitch-nepeng-lid-lince

This is a pretrained model for language identification of nepali-english code-mixed data used from LinCE.

This model is trained for this below repository.

https://github.com/sagorbrur/codeswitch

To install codeswitch:

pip install codeswitch

Identify Language

  • Method-1

from transformers import AutoTokenizer, AutoModelForTokenClassification, pipeline

tokenizer = AutoTokenizer.from_pretrained("sagorsarker/codeswitch-nepeng-lid-lince")

model = AutoModelForTokenClassification.from_pretrained("sagorsarker/codeswitch-nepeng-lid-lince")
lid_model = pipeline('ner', model=model, tokenizer=tokenizer)

lid_model("put any nepali english code-mixed sentence")

  • Method-2
from codeswitch.codeswitch import LanguageIdentification
lid = LanguageIdentification('nep-eng') 
text = "" # your code-mixed sentence 
result = lid.identify(text)
print(result)