# ArXiv-NLP GPT-2 checkpoint

This is a GPT-2 small checkpoint for PyTorch. It is the official `gpt2-small` model fine-tuned on ArXiv papers from the computational linguistics field.

## Training data

This model was trained on a subset of ArXiv papers that were converted from PDF to plain text. The resulting dataset contains 80MB of text from the computational linguistics (cs.CL) field.
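
For readers who want to assemble a similar corpus, here is a minimal sketch of merging already-converted `.txt` files into one training file and reporting its size. It is not the pipeline used for this checkpoint: the function name `build_corpus`, the directory layout, and the blank-line separator between papers are all assumptions made for illustration, and the PDF-to-text step is assumed to have been done beforehand.

```python
from pathlib import Path


def build_corpus(txt_dir: str, out_file: str) -> float:
    """Concatenate every .txt file under txt_dir into out_file.

    Hypothetical helper: returns the size of the merged file in megabytes.
    """
    out_path = Path(out_file)
    with out_path.open("w", encoding="utf-8", newline="\n") as out:
        # Sort the paths so the corpus is built in a reproducible order.
        for txt_path in sorted(Path(txt_dir).rglob("*.txt")):
            # Skip the output file itself if it lives inside txt_dir.
            if txt_path.resolve() == out_path.resolve():
                continue
            # PDF extraction often leaves invalid bytes; drop them.
            text = txt_path.read_text(encoding="utf-8", errors="ignore").strip()
            if text:
                # Assumed convention: one blank line between papers.
                out.write(text + "\n\n")
    return out_path.stat().st_size / 1_000_000
```

Calling `build_corpus("papers_txt", "train.txt")` (both paths are placeholders) would write the merged file and return its size, which can be compared against the roughly 80MB figure quoted above.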