mirror of https://github.com/huggingface/transformers.git synced 2025-07-04 05:10:06 +06:00

History

Leonid Boytsov c82e017aa9 Misc. fixes for Pytorch QA examples: (#16958 ) 1. Fixes evaluation errors popping up when you train/eval on squad v2 (one was newly encountered and one that was previously reported Running SQuAD 1.0 sample command raises IndexError #15401 but not completely fixed). 2. Removes boolean arguments that don't use store_true. Please, don't use these: *ANY non-empty string is being converted to True in this case and this clearly is not the desired behavior (and it creates a LOT of confusion). 3. All no-trainer test scripts are now saving metric values in the same way (with the right prefix eval_), which is consistent with the trainer-based versions. 4. Adds forgotten model.eval() in the no-trainer versions. This improved some results, but not everything (see the discussion in the end). Please, see the F1 scores and the discussion below.		2022-04-27 08:51:39 -04:00
..
benchmarking	Examples reorg (#11350 )	2021-04-21 11:11:20 -04:00
language-modeling	fix `rum_clm.py` seeking text column name twice (#16624 )	2022-04-19 14:38:25 +01:00
multiple-choice	Dev version	2022-04-06 11:08:12 -04:00
question-answering	Misc. fixes for Pytorch QA examples: (#16958 )	2022-04-27 08:51:39 -04:00
summarization	Dev version	2022-04-06 11:08:12 -04:00
text-classification	Dev version	2022-04-06 11:08:12 -04:00
token-classification	Add use_auth to load_datasets for private datasets to PT and TF examples (#16521 )	2022-04-04 10:27:45 -04:00
translation	Dev version	2022-04-06 11:08:12 -04:00
README.md	Updates the default branch from master to main (#16326 )	2022-03-23 03:46:59 -04:00

README.md

Examples

This folder contains actively maintained examples of use of 🤗 Transformers organized into different NLP tasks. All examples in this folder are TensorFlow examples, and are written using native Keras rather than classes like TFTrainer, which we now consider deprecated. If you've previously only used 🤗 Transformers via TFTrainer, we highly recommend taking a look at the new style - we think it's a big improvement!

In addition, all scripts here now support the 🤗 Datasets library - you can grab entire datasets just by changing one command-line argument!

A note on code folding

Most of these examples have been formatted with #region blocks. In IDEs such as PyCharm and VSCode, these blocks mark named regions of code that can be folded for easier viewing. If you find any of these scripts overwhelming or difficult to follow, we highly recommend beginning with all regions folded and then examining regions one at a time!

The Big Table of Tasks

Here is the list of all our examples:

Task	Example datasets
`language-modeling`	WikiText-2
`multiple-choice`	SWAG
`question-answering`	SQuAD
`summarization`	XSum
`text-classification`	GLUE
`token-classification`	CoNLL NER
`translation`	WMT

Coming soon

Colab notebooks to easily run through these scripts!