transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-28 16:52:24 +06:00

Author	SHA1	Message	Date
Suraj Patil	44f64132a5	remove final_logits_bias (#10606 )	2021-03-10 09:52:31 +05:30
Allen Wang	6f52fce673	Fixes an issue in `text-classification` where MNLI eval/test datasets are not being preprocessed. (#10621 ) * Fix MNLI tests * Linter fix	2021-03-09 22:13:45 -05:00
Sylvain Gugger	72d9e039f9	Fix tests of TrainerCallback (#10615 ) * Fix tests of TrainerCallback * Update tests/test_trainer_callback.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-03-09 16:25:32 -05:00
Sylvain Gugger	0d909f6bd8	Fairscale FSDP fix model save (#10596 ) * Hotfix fairscale FSDP * Evaluation works * Save on process zero	2021-03-09 14:42:07 -05:00
Bhadresh Savani	ac17f71159	added max_sample args and metrics changes (#10602 )	2021-03-09 12:06:56 -05:00
Philipp Schmid	c19c811a2d	Trigger add sm information (#10610 ) * added sm to ua * update id * removed id * removed comments * added env variable * changed variable name * make quality happy * added sguggers feedback * make styling happy and remove brackets * added sm to ua * update id * removed id * removed comments * added env variable * changed variable name * make quality happy * added sguggers feedback * make styling happy and remove brackets	2021-03-09 17:31:45 +01:00
Suraj Patil	20c10258a4	layerdrop 0 (#10604 )	2021-03-09 17:35:07 +03:00
Lysandre	95ab06778c	Update cache version for github actions	2021-03-09 07:10:58 -05:00
Patrick von Platen	9a06b6b11b	[FeatureExtractorSavingUtils] Refactor PretrainedFeatureExtractor (#10594 ) * save first version * finish refactor * finish refactor * correct naming * correct naming * shorter names * Update src/transformers/feature_extraction_common_utils.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * change name * finish Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-03-09 12:16:59 +03:00
Stas Bekman	b6a28e9ac9	[docs] How to solve "Title level inconsistent" sphinx error (#10600 ) * How to solve: Title level inconsistent * list chars	2021-03-08 20:16:33 -08:00
Lysandre Debut	546cbe7e9e	Speedup tf tests (#10601 ) * Pipeline tests should be slow * Temporarily mark some tests as slow * Temporarily mark Barthez tests as slow	2021-03-08 21:44:07 -05:00
Ratthachat (Jung)	696e8a4365	Add TFRag (#9002 ) * Create modeling_tf_dpr.py * Add TFDPR * Add back TFPegasus, TFMarian, TFMBart, TFBlenderBot last commit accidentally deleted these 4 lines, so I recover them back * Add TFDPR * Add TFDPR * clean up some comments, add TF input-style doc string * Add TFDPR * Make return_dict=False as default * Fix return_dict bug (in .from_pretrained) * Add get_input_embeddings() * Create test_modeling_tf_dpr.py The current version is already passed all 27 tests! Please see the test run at : https://colab.research.google.com/drive/1czS_m9zy5k-iSJbzA_DP1k1xAAC_sdkf?usp=sharing * fix quality * delete init weights * run fix copies * fix repo consis * del config_class, load_tf_weights They shoud be 'pytorch only' * add config_class back after removing it, test failed ... so totally only removing "use_tf_weights = None" on Lysandre suggestion * newline after .. note:: * import tf, np (Necessary for ModelIntegrationTest) * slow_test from_pretrained with from_pt=True At the moment we don't have TF weights (since we don't have official official TF model) Previously, I did not run slow test, so I missed this bug * Add simple TFDPRModelIntegrationTest Note that this is just a test that TF and Pytorch gives approx. the same output. However, I could not test with the official DPR repo's output yet * upload correct tf model * remove position_ids as missing keys * create modeling_tf_rag * add tests for tf * add tf tests * revert wrong pt commit * further refactor * further refactor * refactor * Update modeling_tf_rag.py - input_processing - fix prepare_input_for_generation (mostly fix generate bug) - bring back from_pretrained hack in order to test generate * delete colab pieces of code * Show case of greedy "generate" Temporarily change from beam_search test to greedy_search test to show case that TF and PT do get equivalent output. * cosmetic update * correct typos * update * push some progress * make easy check * fix rag save from pretrained * Update src/transformers/modeling_tf_utils.py * remove commented out lines * delete unnecessary lines * add simple test case for nq_checkpoint Add nq_checkpoint test to show that current version without hack still fails * temporarily put ugly hack back again * Add TFRagSequenceForGeneration!! * __init__.py , import TFRagSequenceForGeneration * Add TFRagSequence tests! * rag init.py - add TFRagSequenceForGeneration * fix from_pretrained * fix prepare_inputs_for_generation * Beam search for RagToken! * minor clean up * add tf.cast in TFRagModel * More tf.cast * Add all remaining tests (still have issues) * delete all T5 related * make style * fix load weight prefix * fix bart * fix return_dict for tf_rag make all tests pass .. Hooray * fix some tests * fix code quality * fix qualtiy check * finish tests tf rag * add tf rag to docs * remove TFT5 from docstring Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * remove TFT5 from docstring Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Delete outdated comments Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * improve doc strings * add generative model classes * fix adjust token logic * refactor generate for TFRag * using shape_list, not _get_shape Co-authored-by: Julien Plu <plu.julien@gmail.com> * axis=[1]->axis=1 * delete NEED_HELP comment * improve readability Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * improve readability Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * improve readability Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Indicating model is in a developing state in docstrings As suggested by Julien * small last changes * apply sylvains suggestions * finish tf rag Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: patrickvonplaten <patrick@huggingface.co> Co-authored-by: Julien Plu <plu.julien@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-03-09 00:49:51 +03:00
Sylvain Gugger	3ced9b3eb9	Check layer types for Optimizer construction (#10598 ) * Check layer types for Optimizer construction * Duplicate class	2021-03-08 16:40:11 -05:00
Sylvain Gugger	821d518e03	Revert "Tests" This reverts commit `b35e7b68ca`.	2021-03-08 16:05:55 -05:00
Sylvain Gugger	4196bfeda0	Revert "Style" This reverts commit `a8ec52efc2`.	2021-03-08 16:05:52 -05:00
Sylvain Gugger	a8ec52efc2	Style	2021-03-08 16:04:46 -05:00
Sylvain Gugger	b35e7b68ca	Tests	2021-03-08 16:04:30 -05:00
Stas Bekman	f284089ec4	[examples tests on multigpu] resolving require_torch_non_multi_gpu_but_fix_me (#10561 ) * batch 1 * this is tpu * deebert attempt * the rest	2021-03-08 11:11:40 -08:00
Bhadresh Savani	dfd16af832	Added max_sample_ arguments (#10551 ) * reverted changes of logging and saving metrics * added max_sample arguments * fixed code * white space diff * reformetting code * reformatted code	2021-03-08 13:57:10 -05:00
Stas Bekman	917f104502	[examples tests] various fixes (#10584 ) * fix sharded ddp enum * test fixes * stronger validation + apex breaks other tests	2021-03-08 10:28:44 -08:00
Stas Bekman	6f84531e61	offline mode for firewalled envs (part 2) (#10569 ) * more readable test * add all the missing places * one more nltk * better exception check * revert	2021-03-08 08:52:20 -08:00
Sylvain Gugger	5469369480	Fix version control with anchors (#10595 ) * Fix version control with anchors * Simplify	2021-03-08 10:19:22 -05:00
Stas Bekman	f882966004	fix double wrapping + test (#10583 )	2021-03-08 10:15:55 -05:00
Mehrad Moradshahi	b880508440	tokenization_marian.py: use current_spm for decoding (#10357 ) * Fix Marian decoding Tokenizer's decode and batch_decode now accepts a new argument (use_source_tokenizer) which indicates whether the source spm should be used to decode ids. This is useful for Marian models specificallly when decoding source input ids. * Adapt docstrings Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>	2021-03-08 08:14:31 -05:00
Lysandre	8fd7eb34e2	Correct YAML	2021-03-08 07:13:49 -05:00
Lysandre Debut	89b8d4f568	Enable torch 1.8.0 on GPU CI (#10593 ) * Enable torch 1.8.0 in GPU CI * Disable torch-scatter	2021-03-08 07:11:43 -05:00
Suraj Patil	2a737bffef	[M2M100] fix positional embeddings (#10590 ) * fix tests * emb should be a parameter * fix positional embeddings * fix make_weights * don't save pos embeds * add comment to describe the clamping	2021-03-08 16:06:19 +05:30
Oren Amsalem	d59464db6b	fix BART Summarization example in doc (#10582 )	2021-03-08 15:45:06 +05:30
Eunhyuk Shin	3b583d02d6	Fix typo in docstring for pipeline (#10591 )	2021-03-08 15:40:03 +05:30
Stas Bekman	e6ce636e02	fix nltk lookup (#10585 )	2021-03-07 22:09:58 -08:00
Yu	9dd054fba2	fix tf doc bug (#10570 )	2021-03-07 22:31:50 -05:00
Suraj Patil	f6e74a63ca	Add m2m100 (#10236 ) * m2m_100 * no layernorm_embedding * sinusoidal positional embeddings * update pos embeddings * add default config values * tokenizer * add conversion script * fix config * fix pos embed * remove _float_tensor * update tokenizer * update lang codes * handle lang codes * fix pos embeds * fix spm key * put embedding weights on device * remove qa and seq classification heads * fix convert script * lang codes pn one line * fix embeds * fix tokenizer * fix tokenizer * add fast tokenizer * style * M2M100MT => M2M100 * fix copyright, style * tokenizer converter * vocab file * remove fast tokenizer * fix embeds * fix tokenizer * fix tests * add tokenizer tests * add integration test * quality * fix model name * fix test * doc * doc * fix doc * add copied from statements * fix tokenizer tests * apply review suggestions * fix urls * fix shift_tokens_right * apply review suggestions * fix * fix doc * add lang code to id * remove unused function * update checkpoint names * fix copy * fix tokenizer * fix checkpoint names * fix merge issue * style	2021-03-06 22:14:16 +05:30
Lysandre	fd01104435	Temporarily disable stale bot	2021-03-06 00:21:50 -05:00
Stas Bekman	88a951e3cc	offline mode for firewalled envs (#10407 ) * offline mode start * add specific values * fix fallback * add test * better values check and range * test that actually works * document the offline mode * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * more strict check * cleaner test * pt-only test * style Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-03-05 17:27:48 -08:00
Daniel Hug	90ecc29656	Refactoring checkpoint names for multiple models (#10527 ) * Refactor checkpoint name in ALBERT and ALBERT_tf * Refactor checkpoint name in BART and BART_tf * Refactor checkpoint name in BERT generation * Refactor checkpoint name in Blenderbot_tf * Refactor checkpoint name in Blenderbot_small_tf * Refactor checkpoint name in ConvBERT AND CONVBERT_TF * Refactor checkpoint name in CTRL AND CTRL_TF * Refactor checkpoint name in DistilBERT AND DistilBERT_TF * Refactor checkpoint name in DistilBERT redo * Refactor checkpoint name in Electra and Electra_tf * Refactor checkpoint name in FlauBERT and FlauBERT_tf * Refactor checkpoint name in FSMT * Refactor checkpoint name in GPT2 and GPT2_tf * Refactor checkpoint name in IBERT * Refactor checkpoint name in LED and LED_tf * Refactor checkpoint name in Longformer and Longformer_tf * Refactor checkpoint name in Lxmert and Lxmert_tf * Refactor checkpoint name in Marian_tf * Refactor checkpoint name in MBART and MBART_tf * Refactor checkpoint name in MobileBERT and MobileBERT_tf * Refactor checkpoint name in mpnet and mpnet_tf * Refactor checkpoint name in openai and openai_tf * Refactor checkpoint name in pegasus_tf * Refactor checkpoint name in reformer * Refactor checkpoint name in Roberta and Roberta_tf * Refactor checkpoint name in SqueezeBert * Refactor checkpoint name in Transformer_xl and Transformer_xl_tf * Refactor checkpoint name in XLM and XLM_tf * Refactor checkpoint name in XLNET and XLNET_tf * Refactor checkpoint name in BERT_tf * run make tests, style, quality, fixup	2021-03-05 18:06:55 -05:00
Lysandre Debut	defe9e20fe	Stale Bot (#10509 ) * Add stale bot to Github Actions * Update message * Message for assignee * Update scripts/stale.py * Uncomment & stop testing	2021-03-05 16:41:50 -05:00
Sylvain Gugger	7da995c00c	Fix embeddings for PyTorch 1.8 (#10549 ) * Fix embeddings for PyTorch 1.8 * Try with PyTorch 1.8.0 * Fix embeddings init * Fix copies * Typo * More typos	2021-03-05 16:18:48 -05:00
Chen Liang	3e056c1003	Typo correction. (#10531 ) DEBERTA_PRETRAINED_MODEL_ARCHIVE_LIST => DEBERTA_V2_PRETRAINED_MODEL_ARCHIVE_LIST in line 31.	2021-03-05 15:27:09 -05:00
Joakim Warholm	9f8bc87cbe	fixed dead link in trainer doc (#10554 )	2021-03-05 14:56:37 -05:00
Lysandre Debut	6b58e15507	Fix torch 1.8.0 segmentation fault (#10546 ) * Only run one test * Patch segfault * Fix summarization pipeline * Ready for merge	2021-03-05 12:10:19 -05:00
Patrick von Platen	395ffcd757	fix run seq2seq (#10547 )	2021-03-05 18:17:12 +03:00
Nicolas Patry	54e55b52d4	Fixing conversation test for torch 1.8 (#10545 )	2021-03-05 09:24:14 -05:00
Lysandre	dc9aaa3848	Pin torch to 1.7.1 in tests while we resolve issues	2021-03-05 07:57:35 -05:00
lewtun	12b66215cf	Fix example of custom Trainer to reflect signature of compute_loss (#10537 )	2021-03-05 07:44:53 -05:00
Lysandre	093b88f4e9	Update scatter to use torch 1.8.0	2021-03-05 07:31:51 -05:00
Patrick von Platen	c503a1c15e	[ProphetNet] Bart-like Refactor (#10501 ) * first step to refactor * make all fast tests pass * make all slow tests pass * save intermediate * correct cache * finish PR * make fp16 work	2021-03-04 23:27:12 +03:00
Sylvain Gugger	6290169eb3	Rework TPU checkpointing in Trainer (#10504 ) * Rework TPU checkpointing in Trainer * Wraps the barrier in a dist test * Address review comments * Remove line	2021-03-04 11:46:11 -05:00
Philipp Schmid	805c5200dc	Removes overwrites for output_dir (#10521 ) * removed overwrites * remove default value for output_dir * adjusted typing	2021-03-04 17:12:37 +01:00
Sylvain Gugger	a5bd40b75c	Not always consider a local model a checkpoint in run_glue (#10517 )	2021-03-04 11:11:39 -05:00
Sylvain Gugger	745ea78dcc	Revert "Not always consider a local model a checkpoint in run_glue" This reverts commit `f3660613bc`.	2021-03-04 09:45:18 -05:00

... 8 9 10 11 12 ...

7165 Commits