transformers/examples/research_projects/performer
Duong A. Nguyen 1e8140caad
Fix RESOURCE_EXHAUSTED error when dealing with large datasets in Flax example scripts (#18069)
* Fix RESOURCE_EXHAUSTED error for large datasets on Flax example scripts

* using np.permutation for creating batch_idx

* train_samples_idx -> training_samples_idx

* fix type hints
2022-07-11 15:59:08 +02:00
..
full_script.sh Adding performer fine-tuning research exampke (#9239) 2020-12-21 21:19:41 +01:00
modeling_flax_performer_utils.py Black preview (#17217) 2022-05-12 16:25:55 -04:00
modeling_flax_performer.py Adding performer fine-tuning research exampke (#9239) 2020-12-21 21:19:41 +01:00
README.md Adding performer fine-tuning research exampke (#9239) 2020-12-21 21:19:41 +01:00
run_mlm_performer.py Fix RESOURCE_EXHAUSTED error when dealing with large datasets in Flax example scripts (#18069) 2022-07-11 15:59:08 +02:00
sanity_script.sh Adding performer fine-tuning research exampke (#9239) 2020-12-21 21:19:41 +01:00

Performer fine-tuning

Example authors: @TevenLeScao, @Patrickvonplaten

Paper authors: Krzysztof Choromanski, Valerii Likhosherstov, David Dohan, Xingyou Song, Andreea Gane, Tamas Sarlos, Peter Hawkins, Jared Davis, Afroz Mohiuddin, Lukasz Kaiser, David Belanger, Lucy Colwell, Adrian Weller

Requirements

datasets, flax and jax. wandb integration is built-in if you want to use it.

Examples

sanity_script.sh will launch performer fine-tuning from the bert-base-cased checkpoint on the Simple Wikipedia dataset (a small, easy-language English Wikipedia) from datasets. full_script.sh will launch performer fine-tuning from the bert-large-cased checkpoint on the English Wikipedia dataset from datasets.

Here are a few key arguments:

  • Remove the --performer argument to use a standard Bert model.

  • Add --reinitialize to start from a blank model rather than a Bert checkpoint.

  • You may change the Bert size by passing a different checkpoint to the --model_name_or_path argument.

  • Passing your user name to the --wandb_user_name argument will trigger weights and biases logging.

  • You can choose a dataset with --dataset_name and --dataset_config. Our viewer will help you find what you need.