mirror of https://github.com/huggingface/transformers.git synced 2025-07-23 14:29:01 +06:00

History

Ethan Perez e5c393dceb [Bug fix] Using loaded checkpoint with --do_predict (instead of… (#3437 ) * Using loaded checkpoint with --do_predict Without this fix, I'm getting near-random validation performance for a trained model, and the validation performance differs per validation run. I think this happens since the `model` variable isn't set with the loaded checkpoint, so I'm using a randomly initialized model. Looking at the model activations, they differ each time I run evaluation (but they don't with this fix). * Update checkpoint loading * Fixing model loading		2020-03-30 17:06:08 -04:00
..
README.md	[WIP] Lightning glue example (#3290 )	2020-03-17 11:46:42 -04:00
run_pl_glue.py	[Bug fix] Using loaded checkpoint with --do_predict (instead of… (#3437 )	2020-03-30 17:06:08 -04:00
run_pl.sh	[WIP] Lightning glue example (#3290 )	2020-03-17 11:46:42 -04:00

README.md

GLUE Benchmark

Based on the script run_glue.py.

Run PyTorch version using PyTorch-Lightning

Run bash run_pl.sh from the glue directory. This will also install pytorch-lightning and the requirements in examples/requirements.txt. It is a shell pipeline that will automatically download, pre-process the data and run the specified models. Logs are saved in lightning_logs directory.

Pass --n_gpu flag to change the number of GPUs. Default uses 1. At the end, the expected results are: TEST RESULTS {'val_loss': tensor(0.0707), 'precision': 0.852427800698191, 'recall': 0.869537067011978, 'f1': 0.8608974358974358}