* MOD: fit Chinese WWM to new datasets
* MOD: move WWM to new folder
* MOD: format code
* Styling
* MOD: add param and recover trainer
* MOD: add token_type_ids method for big bird
* MOD: format code
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
* explain/link to good first issue
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Pass metric_key_prefix as kwarg to on_evaluate
* Replace the hardcoded `eval_loss` key with `{metric_key_prefix}_loss`
* Default to "eval" if metric_key_prefix not in kwargs
* Add kwargs to CallbackHandler.on_evaluate signature
* Revert "Add kwargs to CallbackHandler.on_evaluate signature"
This reverts commit 8d4c85ed51.
* Revert "Pass metric_key_prefix as kwarg to on_evaluate"
This reverts commit 7766bfe271.
* Extract metric_key_prefix from metrics
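The net effect of these commits: `on_evaluate` keeps its original signature, and the prefix is recovered from the metrics dict itself. A minimal sketch of what a `TrainerCallback` can do under that design (the class name and the suffix heuristic are illustrative, not the library's own code):

```python
from transformers import TrainerCallback

class PrefixAwareCallback(TrainerCallback):
    """Recover the metric_key_prefix from the metrics dict itself."""

    def on_evaluate(self, args, state, control, metrics=None, **kwargs):
        if not metrics:
            return
        # The prefix ("eval" by default) is baked into every metric key,
        # e.g. "eval_loss" or "test_loss"; strip the "_loss" suffix to
        # recover it.
        loss_key = next((k for k in metrics if k.endswith("_loss")), None)
        if loss_key is not None:
            prefix = loss_key[: -len("_loss")]
            print(f"prefix={prefix}, loss={metrics[loss_key]}")
```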
* Base move
* Examples reorganization
* Update references
* Put back test data
* Move conftest
* More fixes
* Move test data to test fixtures
* Update path
* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Address review comments and clean
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* `max_length` is no longer mandatory within `generate`.
- Moving on to fully using `StoppingCriteria` for `greedy` and `sample`
modes.
- `max_length` still used for `beam_search` and `group_beam_search`
(Follow up PR)
- Fixes a bug with MaxLengthStoppingCriteria (we should stop as soon as
we hit `max_length`, so the comparison needs to be greater-or-equal; that
affects the tests).
- Added options to use `logits_processor` and `stopping_criteria`
directly within `generate` function (so some users can define their own
`logits_processor` and `stopping_criteria`).
- Modified the backward compat tests to make sure we issue a warning.
* Fix `max_length` argument in `generate`.
* Moving validate to being functional.
- Renamed `smax_length` to `stopping_max_length`.
* Removing `logits_processor` and `stopping_criteria` from `generate`
arguments.
* Deepcopy.
* Fix global variable name.
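Net result of the series above: greedy and sample modes stop through `StoppingCriteria`, and the length check is inclusive. A minimal sketch of the corrected comparison, assuming the `StoppingCriteria` interface from `transformers` (the subclass is illustrative; the library ships its own max-length criterion):

```python
import torch
from transformers import StoppingCriteria

class InclusiveMaxLength(StoppingCriteria):
    """Stop once the sequence has reached max_length tokens.

    The bug fixed above: with a strict '>' the loop ran one step past
    max_length, so the comparison has to be '>='.
    """

    def __init__(self, max_length: int):
        self.max_length = max_length

    def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor, **kwargs) -> bool:
        return input_ids.shape[-1] >= self.max_length
```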
* Initial changes
* Modified evaluation
* Updated evaluation
* Updated evaluation in the text translation example script
* Added translation example script
* Formatted translation example script
* Reformatted translation example
* Fixed evaluation bug and added support for other tokenisers
* Added translation example script
* Formatted summarization example script
* Removed typos from summarization example script
* Update language_modeling.py
in "class TextDatasetForNextSentencePrediction(Dataset)", double considering "self.tokenizer.num_special_tokens_to_add(pair=True)"
so, i remove self.block_size, and add parameter for "def create_examples_from_document". like "class LineByLineWithSOPTextDataset" do
* Update language_modeling.py
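In other words, the pair special-token overhead should be subtracted from the block size exactly once, at chunking time. A hedged sketch of the corrected logic (this helper is a simplified stand-in for the dataset's own `create_examples_from_document`, not the actual implementation):

```python
def create_examples_from_document(document, block_size, tokenizer):
    """Chunk a tokenized document (a list of per-sentence token-id lists)
    so each chunk plus the pair special tokens fits within block_size."""
    # Subtract the special-token overhead once, here, rather than both
    # at dataset init and again per document.
    max_num_tokens = block_size - tokenizer.num_special_tokens_to_add(pair=True)
    chunks, current = [], []
    for sentence in document:
        if current and len(current) + len(sentence) > max_num_tokens:
            chunks.append(current)
            current = []
        current = current + sentence
    if current:
        chunks.append(current)
    return chunks
```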
* Bulk of the work
* Polish and tests
* Update QA Trainer
* Avoid breaking the predict method
* Deprecation warnings
* Store real eval dataloader
* Get eval dataset reference before wrap
* [WIP] Enabling multilingual models for translation pipelines.
* decoder_input_ids -> forced_bos_token_id
* Improve docstring.
* Rebase
* Fixing 2 bugs
- The type of the token_ids coming from `_parse_and_tokenize`
- A wrong index derived from `tgt_lang`.
* Fixing black version.
* Adding tests for _build_translation_inputs and add them for all
tokenizers.
* Mbart actually puts the lang code at the end.
* Fixing m2m100.
* Adding TF support to `deep_round`.
* Update src/transformers/pipelines/text2text_generation.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Adding one line comment.
* Fixing M2M100 `_build_translation_input_ids`, and fix the call site.
* Fixing tests + deep_round -> nested_simplify
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
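For context on the `decoder_input_ids` -> `forced_bos_token_id` switch above: multilingual models such as mBART-50 select the target language by forcing its language code as the first generated token. A small usage sketch with the public API (the checkpoint is chosen for illustration):

```python
from transformers import MBartForConditionalGeneration, MBart50TokenizerFast

name = "facebook/mbart-large-50-many-to-many-mmt"
tokenizer = MBart50TokenizerFast.from_pretrained(name, src_lang="en_XX")
model = MBartForConditionalGeneration.from_pretrained(name)

inputs = tokenizer("Hello world", return_tensors="pt")
# Force French as the first decoder token instead of hand-building
# decoder_input_ids.
generated = model.generate(
    **inputs, forced_bos_token_id=tokenizer.lang_code_to_id["fr_XX"]
)
print(tokenizer.batch_decode(generated, skip_special_tokens=True))
```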