
Examples

Version 2.9 of transformers introduces a new Trainer class for PyTorch, and its equivalent TFTrainer for TF 2.
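
To give a concrete feel for what the PyTorch scripts do, here is a minimal, hedged sketch of fine-tuning a sequence classifier with Trainer on a two-sentence toy dataset (the model name, output directory and toy data are illustrative placeholders rather than part of any example script, and a recent transformers release is assumed):

import torch
from torch.utils.data import Dataset
from transformers import (
    AutoModelForSequenceClassification,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
)

class ToyDataset(Dataset):
    # Two hand-written sentences, just enough to exercise the training loop.
    def __init__(self, tokenizer):
        texts = ["a delightful film", "a dreadful film"]
        self.labels = [1, 0]
        self.encodings = tokenizer(texts, padding=True, truncation=True)

    def __len__(self):
        return len(self.labels)

    def __getitem__(self, idx):
        item = {key: torch.tensor(val[idx]) for key, val in self.encodings.items()}
        item["labels"] = torch.tensor(self.labels[idx])
        return item

tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")
model = AutoModelForSequenceClassification.from_pretrained("bert-base-cased", num_labels=2)

training_args = TrainingArguments(
    output_dir="./models/toy",        # where checkpoints and logs are written
    num_train_epochs=1,
    per_device_train_batch_size=2,
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=ToyDataset(tokenizer),
    eval_dataset=ToyDataset(tokenizer),
)
trainer.train()
print(trainer.evaluate())

The official example scripts follow this same pattern, building their TrainingArguments from command-line flags.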

Here is the list of all our examples:

  • grouped by task (all official examples work for multiple models)
  • with information on whether they are built on top of Trainer/TFTrainer (if not, they still work; they might just lack some features),
  • whether they also include examples for pytorch-lightning, which is a great fully-featured, general-purpose training library for PyTorch,
  • links to Colab notebooks to walk through the scripts and run them easily,
  • links to Cloud deployments to be able to deploy large-scale trainings in the Cloud with little to no setup.

This is still a work in progress; in particular, documentation is still sparse, so please contribute improvements/pull requests.

The Big Table of Tasks

Task | Example datasets | Trainer support | TFTrainer support | pytorch-lightning | Colab
language-modeling | Raw text | ✅ | - | - | Open In Colab
text-classification | GLUE, XNLI | ✅ | ✅ | ✅ | Open In Colab
token-classification | CoNLL NER | ✅ | ✅ | ✅ | -
multiple-choice | SWAG, RACE, ARC | ✅ | ✅ | - | Open In Colab
question-answering | SQuAD | - | - | - | -
text-generation | - | - | - | - | Open In Colab
distillation | All | - | - | - | -
summarization | CNN/Daily Mail | - | - | - | -
translation | WMT | - | - | - | -
bertology | - | - | - | - | -
adversarial | HANS | - | - | - | -

Important note

Important: To make sure you can successfully run the latest versions of the example scripts, you have to install the library from source and install some example-specific requirements. Execute the following steps in a new virtual environment:

git clone https://github.com/huggingface/transformers
cd transformers
pip install .
pip install -r ./examples/requirements.txt

One-click Deploy to Cloud (wip)

Azure

Deploy to Azure

Running on TPUs

When using TensorFlow, TPUs are supported out of the box as a tf.distribute.Strategy.
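
As a rough illustration (not taken from the example scripts), TF 2 code typically picks up the TPU like this; the empty tpu argument assumes an environment such as Colab or a GCE VM where the TPU address is discovered automatically:

import tensorflow as tf

# Hedged sketch: resolve the TPU cluster, initialize it, and create a
# distribution strategy; variables created inside strategy.scope() are
# placed on the TPU replicas.
resolver = tf.distribute.cluster_resolver.TPUClusterResolver(tpu="")
tf.config.experimental_connect_to_cluster(resolver)
tf.tpu.experimental.initialize_tpu_system(resolver)
strategy = tf.distribute.experimental.TPUStrategy(resolver)

with strategy.scope():
    model = tf.keras.Sequential([tf.keras.layers.Dense(2)])
    model.compile(optimizer="adam", loss="mse")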

When using PyTorch, we support TPUs thanks to pytorch/xla. For more context and information on how to set up your TPU environment, refer to Google's documentation and to the very detailed pytorch/xla README.
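
For reference, with pytorch/xla installed a TPU core behaves like any other torch device; a minimal sketch (assuming torch_xla is available in your environment):

import torch
import torch_xla.core.xla_model as xm

# Hedged sketch: acquire the XLA device and move tensors (or models) onto
# it, exactly as you would with a CUDA device.
device = xm.xla_device()
x = torch.randn(2, 2).to(device)
print(x.device)  # e.g. "xla:1"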

In this repo, we provide a very simple launcher script named xla_spawn.py that lets you run our example scripts on multiple TPU cores without any boilerplate. Just pass a --num_cores flag to this script, then your regular training script with its arguments (this is similar to the torch.distributed.launch helper for torch.distributed).

For example, for run_glue:

python examples/xla_spawn.py --num_cores 8 \
	examples/text-classification/run_glue.py \
	--model_name_or_path bert-base-cased \
	--task_name mnli \
	--data_dir ./data/glue_data/MNLI \
	--output_dir ./models/tpu \
	--overwrite_output_dir \
	--do_train \
	--do_eval \
	--num_train_epochs 1 \
	--save_steps 20000

Feedback and more use cases and benchmarks involving TPUs are welcome; please share with the community.