transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-04 13:20:12 +06:00

Author	SHA1	Message	Date
Stas Bekman	a73281e3e4	[examples] max samples can't be bigger than the len of dataset (#16501 ) * [examples] max samples can't be bigger than then len of dataset * do tf and flax	2022-03-30 12:33:16 -07:00
Sylvain Gugger	088c1880b7	Big file_utils cleanup (#16396 ) * Big file_utils cleanup * This one still needs to be treated separately	2022-03-25 07:25:20 -04:00
Sylvain Gugger	4975002df5	Reorganize file utils (#16264 ) * Split file_utils in several submodules * Fixes * Add back more objects * More fixes * Who exactly decided to import that from there? * Second suggestion to code with code review * Revert wront move * Fix imports * Adapt all imports * Adapt all imports everywhere * Revert this import, will fix in a separate commit	2022-03-23 10:26:33 -04:00
Lysandre Debut	eca77f4719	Updates the default branch from master to main (#16326 ) * Updates the default branch from master to main * Links from `master` to `main` * Typo * Update examples/flax/README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-23 03:46:59 -04:00
Joao Gante	e7f34ccd4f	Swag example: Update doc format (#16014 )	2022-03-09 13:25:34 +00:00
Joao Gante	62d847602a	Update TF multiple choice example (#15868 )	2022-03-08 13:16:34 +00:00
Sylvain Gugger	79d28e80b6	v4.18.0.dev.0	2022-03-03 10:19:58 -05:00
Joao Gante	05c237ea94	Update TF QA example (#15870 )	2022-03-02 10:38:13 +00:00
Joao Gante	3f2e636850	Update TF LM examples (#15855 )	2022-03-01 14:12:58 +00:00
Joao Gante	3956b133b6	TF text classification examples (#15704 ) * Working example with to_tf_dataset * updated text_classification * more comments	2022-02-21 17:17:59 +00:00
Sylvain Gugger	d0b5ed110a	Harder check for IndexErrors in QA scripts (#15438 ) * Harder check for IndexErrors in QA scripts * Make test stronger	2022-02-01 15:49:13 -05:00
Lysandre	eab338104d	Docs for version v4.16.0	2022-01-27 13:11:51 -05:00
Lysandre	f87db5e412	Release: v4.16.0	2022-01-27 13:06:33 -05:00
Russell Klopfer	27b819b0e3	use block_size instead of max_seq_length in tf run_clm example (#15036 ) * use block_size instead of max_seq_length * fixup * remove pad_to_block_size Co-authored-by: Russell Klopfer <russell@kloper.us>	2022-01-12 08:57:00 -05:00
Patrick von Platen	fa39ff9fc4	Docs for v4.16.0dev0	2021-12-22 20:39:44 +01:00
Patrick von Platen	05fa1a7ac1	Release: v4.15.0	2021-12-22 18:43:15 +01:00
Lysandre	7c9c41f43c	Docs for v4.14.0	2021-12-15 18:29:53 +01:00
Lysandre	960d8cb41d	Release: v4.14.0	2021-12-15 18:20:35 +01:00
Lysandre	ab31b3e41b	Docs for v4.14.0dev0	2021-12-09 17:09:23 +01:00
Lysandre	4da3a696e4	Release: v4.13.0	2021-12-09 16:55:21 +01:00
Julien Chaumond	6cdc3a7844	[urls to hub] Replace outdated model tags with their now-canonical pipeline types (#14617 ) * Replace outdated model tags with their now-canonical pipeline types * spam the CI till it's green	2021-12-06 04:35:01 -05:00
Nicholas Broad	69e16abf98	Switch from using sum for flattening lists of lists in group_texts (#14472 ) * remove sum for list flattening * change to chain() make chain object a list * delete empty lines per sgugger's suggestions Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Nicholas Broad <nicholas@nmbroad.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-11-22 16:17:26 -05:00
Matt	267867e851	Quick fix to TF summarization example (#14401 )	2021-11-15 13:45:51 +00:00
Matt	7f20bf0d43	Fixing requirements for TF LM models and use correct model mappings (#14372 ) * Fixing requirements for TF LM models and use correct model mappings * make style	2021-11-11 15:34:00 +00:00
Lysandre	b8fad022a0	v4.13.0.dev0	2021-10-28 12:56:46 -04:00
Lysandre	62bf536631	Release v4.12.0	2021-10-28 12:09:49 -04:00
Christopher Akiki	f9c16b02e3	Replace "Masked" with "Causal" in TF CLM example (#14014 )	2021-10-21 16:19:30 +01:00
Dhananjay Shettigar	319beb64eb	#12789 Replace assert statements with exceptions (#13909 ) * #12789 Replace assert statements with exceptions * fix-copies: made copy changes to utils_qa.py in examples/pytorch/question-answering and examples/tensorflow/question-answering * minor refactor for clarity	2021-10-07 09:09:01 -04:00
Lysandre	11c69b8045	Docs for version v4.11.0	2021-09-27 14:19:38 -04:00
Lysandre	dc193c906d	Release: v4.11.0	2021-09-27 14:14:09 -04:00
Lysandre	5ee67a4412	Docs for v4.10.0	2021-08-31 16:02:31 +02:00
Lysandre	d12bbe4942	Release: v4.10.0	2021-08-31 15:53:10 +02:00
Matt	702f4a49cd	Fixed CLM model still using MODEL_FOR_MASKED_LM_MAPPING (#13002 )	2021-08-31 13:21:39 +01:00
Sylvain Gugger	139e830158	Update label2id in the model config for run_glue (#13334 )	2021-08-30 10:35:09 -04:00
Stefan Schweter	4046e66e40	examples: only use keep_linebreaks when reading TXT files (#13320 ) * examples: only use keep_linebreaks when reading TXT files for all CLM examples * examples: only use keep_linebreaks when reading TXT files for all CLM examples * examples: only use keep_linebreaks when reading TXT files for all CLM examples	2021-08-28 16:22:29 +02:00
Stefan Schweter	319d840b46	examples: add keep_linebreaks option to CLM examples (#13150 ) * examples: add keep_linebreaks option to text dataset loader for all CLM examples * examples: introduce new keep_linebreaks option as data argument in CLM examples	2021-08-27 11:35:45 +02:00
Sylvain Gugger	3ec851dc5e	Fix QA examples for roberta tokenizer (#12928 )	2021-07-28 09:47:49 -04:00
Elysium1436	f3d0866ed9	Correct validation_split_percentage argument from int (ex:5) to float (0.05) (#12897 ) * Fixed train_test_split test_size argument * `Seq2SeqTrainer` set max_length and num_beams only when non None (#12899) * set max_length and num_beams only when non None * fix instance variables * fix code style * [FLAX] Minor fixes in CLM example (#12914) * readme: fix retrieval of vocab size for flax clm example * examples: fix flax clm example when using training/evaluation files * Fix module path for symbolic_trace example Co-authored-by: cchen-dialpad <47165889+cchen-dialpad@users.noreply.github.com> Co-authored-by: Stefan Schweter <stefan@schweter.it> Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>	2021-07-27 21:01:40 -04:00
Matt	569f61a760	Add TF multiple choice example (#12865 ) * Add new multiple-choice example, remove old one	2021-07-26 15:15:51 +01:00
Lysandre	40de2d5a4f	Docs for v4.10.0dev0	2021-07-22 12:52:25 +02:00
Lysandre	72aee83ced	Release: v4.9.0	2021-07-22 12:11:55 +02:00
Matt	f9ac677eba	Update TF examples README (#12703 ) * Update Transformers README, rename token_classification example to token-classification to be consistent with the others * Update examples/tensorflow/README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Add README for TF token classification * Update examples/tensorflow/token-classification/README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update examples/tensorflow/token-classification/README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-07-14 15:15:25 +01:00
Matt	65bf05cd18	Adding TF translation example (#12667 ) * Adding TF translation example * Fixes and style pass for TF translation example * Remove unused postprocess_text copied from run_summarization * Adding README * Review fixes * Move changes to model.config to after we've initialized the model	2021-07-13 19:08:25 +01:00
Matt	379f649434	TF summarization example (#12617 ) * Adding a TF summarization example * Style pass * Style fixes * Updates for review comments * Adding README * Style pass * Remove unused import	2021-07-12 15:58:38 +01:00
Sylvain Gugger	6f1adc4334	Fix group_lengths for short datasets (#12558 )	2021-07-08 07:23:41 -04:00
Matt	ea55675024	NER example for Tensorflow (#12469 ) * NER example for Tensorflow * Style pass * Style pass * Added metric computation on the evaluation set * Style pass * Fixed label masking * Style pass * Style pass	2021-07-05 15:42:18 +01:00
Souvic Chakraborty	d5b8fe3b90	Validation split added: custom data files @sgugger, @patil-suraj (#12407 ) * Validation split added: custom data files Validation split added in case of no validation file and loading custom data * Updated documentation with custom file usage Updated documentation with custom file usage * Update README.md * Update README.md * Update README.md * Made some suggested stylistic changes * Used logger instead of print. Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Made similar changes to add validation split In case of a missing validation file, a validation split will be used now. * max_train_samples to be used for training only max_train_samples got misplaced, now corrected so that it is applied on training data only, not whole data. * styled * changed ordering * Improved language of documentation Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Improved language of documentation Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Fixed styling issue * Update run_mlm.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-07-01 13:22:42 -04:00
Matt	7e22609e0f	Tensorflow LM examples (#12358 ) * Tensorflow MLM example * Add CLM example * Style fixes, adding missing checkpoint code from the CLM example * Fix TPU training, avoid massive dataset warnings * Fix incorrect training length calculation for multi-GPU training * Fix incorrect training length calculation for multi-GPU training * Refactors and nitpicks from the review * Style pass * Adding README	2021-06-28 19:31:44 +01:00
Sylvain Gugger	276bc149d2	Fix copies	2021-06-28 12:26:40 -04:00
Sylvain Gugger	57461ac0b4	Add possibility to maintain full copies of files (#12312 )	2021-06-28 10:02:53 -04:00

1 2

66 Commits