mirror of
https://github.com/huggingface/transformers.git
synced 2025-08-02 19:21:31 +06:00
233 B
233 B
This model is pre-trained XLNET with 12 layers.
It comes with paper: SBERT-WK: A Sentence Embedding Method By Dissecting BERT-based Word Models
Project Page: SBERT-WK