transformers/examples/tensorflow
Leonid Boytsov c82e017aa9
Misc. fixes for Pytorch QA examples: (#16958)
1. Fixes evaluation errors popping up when you train/eval on squad v2 (one was newly encountered and one that was previously reported Running SQuAD 1.0 sample command raises IndexError #15401 but not completely fixed).
2. Removes boolean arguments that don't use store_true. Please, don't use these: *ANY non-empty string is being converted to True in this case and this clearly is not the desired behavior (and it creates a LOT of confusion).
3. All no-trainer test scripts are now saving metric values in the same way (with the right prefix eval_), which is consistent with the trainer-based versions.
4. Adds forgotten model.eval() in the no-trainer versions. This improved some results, but not everything (see the discussion in the end). Please, see the F1 scores and the discussion below.
2022-04-27 08:51:39 -04:00
..
benchmarking Examples reorg (#11350) 2021-04-21 11:11:20 -04:00
language-modeling fix rum_clm.py seeking text column name twice (#16624) 2022-04-19 14:38:25 +01:00
multiple-choice Dev version 2022-04-06 11:08:12 -04:00
question-answering Misc. fixes for Pytorch QA examples: (#16958) 2022-04-27 08:51:39 -04:00
summarization Dev version 2022-04-06 11:08:12 -04:00
text-classification Dev version 2022-04-06 11:08:12 -04:00
token-classification Add use_auth to load_datasets for private datasets to PT and TF examples (#16521) 2022-04-04 10:27:45 -04:00
translation Dev version 2022-04-06 11:08:12 -04:00
README.md Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00

Examples

This folder contains actively maintained examples of use of 🤗 Transformers organized into different NLP tasks. All examples in this folder are TensorFlow examples, and are written using native Keras rather than classes like TFTrainer, which we now consider deprecated. If you've previously only used 🤗 Transformers via TFTrainer, we highly recommend taking a look at the new style - we think it's a big improvement!

In addition, all scripts here now support the 🤗 Datasets library - you can grab entire datasets just by changing one command-line argument!

A note on code folding

Most of these examples have been formatted with #region blocks. In IDEs such as PyCharm and VSCode, these blocks mark named regions of code that can be folded for easier viewing. If you find any of these scripts overwhelming or difficult to follow, we highly recommend beginning with all regions folded and then examining regions one at a time!

The Big Table of Tasks

Here is the list of all our examples:

Task Example datasets
language-modeling WikiText-2
multiple-choice SWAG
question-answering SQuAD
summarization XSum
text-classification GLUE
token-classification CoNLL NER
translation WMT

Coming soon

  • Colab notebooks to easily run through these scripts!