transformers/examples/glue
Ethan Perez e5c393dceb
[Bug fix] Using loaded checkpoint with --do_predict (instead of… (#3437)
* Using loaded checkpoint with --do_predict

Without this fix, I'm getting near-random validation performance for a trained model, and the validation performance differs per validation run. I think this happens since the `model` variable isn't set with the loaded checkpoint, so I'm using a randomly initialized model. Looking at the model activations, they differ each time I run evaluation (but they don't with this fix).

* Update checkpoint loading

* Fixing model loading
2020-03-30 17:06:08 -04:00
..
README.md [WIP] Lightning glue example (#3290) 2020-03-17 11:46:42 -04:00
run_pl_glue.py [Bug fix] Using loaded checkpoint with --do_predict (instead of… (#3437) 2020-03-30 17:06:08 -04:00
run_pl.sh [WIP] Lightning glue example (#3290) 2020-03-17 11:46:42 -04:00

GLUE Benchmark

Based on the script run_glue.py.

Run PyTorch version using PyTorch-Lightning

Run bash run_pl.sh from the glue directory. This will also install pytorch-lightning and the requirements in examples/requirements.txt. It is a shell pipeline that will automatically download, pre-process the data and run the specified models. Logs are saved in lightning_logs directory.

Pass --n_gpu flag to change the number of GPUs. Default uses 1. At the end, the expected results are: TEST RESULTS {'val_loss': tensor(0.0707), 'precision': 0.852427800698191, 'recall': 0.869537067011978, 'f1': 0.8608974358974358}