transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Bhadresh Savani	dfd16af832	Added max_sample_ arguments (#10551 ) * reverted changes of logging and saving metrics * added max_sample arguments * fixed code * white space diff * reformetting code * reformatted code	2021-03-08 13:57:10 -05:00
Stas Bekman	917f104502	[examples tests] various fixes (#10584 ) * fix sharded ddp enum * test fixes * stronger validation + apex breaks other tests	2021-03-08 10:28:44 -08:00
Stas Bekman	6f84531e61	offline mode for firewalled envs (part 2) (#10569 ) * more readable test * add all the missing places * one more nltk * better exception check * revert	2021-03-08 08:52:20 -08:00
Sylvain Gugger	5469369480	Fix version control with anchors (#10595 ) * Fix version control with anchors * Simplify	2021-03-08 10:19:22 -05:00
Stas Bekman	f882966004	fix double wrapping + test (#10583 )	2021-03-08 10:15:55 -05:00
Mehrad Moradshahi	b880508440	tokenization_marian.py: use current_spm for decoding (#10357 ) * Fix Marian decoding Tokenizer's decode and batch_decode now accept a new argument (use_source_tokenizer) which indicates whether the source spm should be used to decode ids. This is useful for Marian models specifically when decoding source input ids. * Adapt docstrings Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>	2021-03-08 08:14:31 -05:00
Lysandre	8fd7eb34e2	Correct YAML	2021-03-08 07:13:49 -05:00
Lysandre Debut	89b8d4f568	Enable torch 1.8.0 on GPU CI (#10593 ) * Enable torch 1.8.0 in GPU CI * Disable torch-scatter	2021-03-08 07:11:43 -05:00
Suraj Patil	2a737bffef	[M2M100] fix positional embeddings (#10590 ) * fix tests * emb should be a parameter * fix positional embeddings * fix make_weights * don't save pos embeds * add comment to describe the clamping	2021-03-08 16:06:19 +05:30
Oren Amsalem	d59464db6b	fix BART Summarization example in doc (#10582 )	2021-03-08 15:45:06 +05:30
Eunhyuk Shin	3b583d02d6	Fix typo in docstring for pipeline (#10591 )	2021-03-08 15:40:03 +05:30
Stas Bekman	e6ce636e02	fix nltk lookup (#10585 )	2021-03-07 22:09:58 -08:00
Yu	9dd054fba2	fix tf doc bug (#10570 )	2021-03-07 22:31:50 -05:00
Suraj Patil	f6e74a63ca	Add m2m100 (#10236 ) * m2m_100 * no layernorm_embedding * sinusoidal positional embeddings * update pos embeddings * add default config values * tokenizer * add conversion script * fix config * fix pos embed * remove _float_tensor * update tokenizer * update lang codes * handle lang codes * fix pos embeds * fix spm key * put embedding weights on device * remove qa and seq classification heads * fix convert script * lang codes pn one line * fix embeds * fix tokenizer * fix tokenizer * add fast tokenizer * style * M2M100MT => M2M100 * fix copyright, style * tokenizer converter * vocab file * remove fast tokenizer * fix embeds * fix tokenizer * fix tests * add tokenizer tests * add integration test * quality * fix model name * fix test * doc * doc * fix doc * add copied from statements * fix tokenizer tests * apply review suggestions * fix urls * fix shift_tokens_right * apply review suggestions * fix * fix doc * add lang code to id * remove unused function * update checkpoint names * fix copy * fix tokenizer * fix checkpoint names * fix merge issue * style	2021-03-06 22:14:16 +05:30
Lysandre	fd01104435	Temporarily disable stale bot	2021-03-06 00:21:50 -05:00
Stas Bekman	88a951e3cc	offline mode for firewalled envs (#10407 ) * offline mode start * add specific values * fix fallback * add test * better values check and range * test that actually works * document the offline mode * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * more strict check * cleaner test * pt-only test * style Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-03-05 17:27:48 -08:00
Daniel Hug	90ecc29656	Refactoring checkpoint names for multiple models (#10527 ) * Refactor checkpoint name in ALBERT and ALBERT_tf * Refactor checkpoint name in BART and BART_tf * Refactor checkpoint name in BERT generation * Refactor checkpoint name in Blenderbot_tf * Refactor checkpoint name in Blenderbot_small_tf * Refactor checkpoint name in ConvBERT AND CONVBERT_TF * Refactor checkpoint name in CTRL AND CTRL_TF * Refactor checkpoint name in DistilBERT AND DistilBERT_TF * Refactor checkpoint name in DistilBERT redo * Refactor checkpoint name in Electra and Electra_tf * Refactor checkpoint name in FlauBERT and FlauBERT_tf * Refactor checkpoint name in FSMT * Refactor checkpoint name in GPT2 and GPT2_tf * Refactor checkpoint name in IBERT * Refactor checkpoint name in LED and LED_tf * Refactor checkpoint name in Longformer and Longformer_tf * Refactor checkpoint name in Lxmert and Lxmert_tf * Refactor checkpoint name in Marian_tf * Refactor checkpoint name in MBART and MBART_tf * Refactor checkpoint name in MobileBERT and MobileBERT_tf * Refactor checkpoint name in mpnet and mpnet_tf * Refactor checkpoint name in openai and openai_tf * Refactor checkpoint name in pegasus_tf * Refactor checkpoint name in reformer * Refactor checkpoint name in Roberta and Roberta_tf * Refactor checkpoint name in SqueezeBert * Refactor checkpoint name in Transformer_xl and Transformer_xl_tf * Refactor checkpoint name in XLM and XLM_tf * Refactor checkpoint name in XLNET and XLNET_tf * Refactor checkpoint name in BERT_tf * run make tests, style, quality, fixup	2021-03-05 18:06:55 -05:00
Lysandre Debut	defe9e20fe	Stale Bot (#10509 ) * Add stale bot to Github Actions * Update message * Message for assignee * Update scripts/stale.py * Uncomment & stop testing	2021-03-05 16:41:50 -05:00
Sylvain Gugger	7da995c00c	Fix embeddings for PyTorch 1.8 (#10549 ) * Fix embeddings for PyTorch 1.8 * Try with PyTorch 1.8.0 * Fix embeddings init * Fix copies * Typo * More typos	2021-03-05 16:18:48 -05:00
Chen Liang	3e056c1003	Typo correction. (#10531 ) DEBERTA_PRETRAINED_MODEL_ARCHIVE_LIST => DEBERTA_V2_PRETRAINED_MODEL_ARCHIVE_LIST in line 31.	2021-03-05 15:27:09 -05:00
Joakim Warholm	9f8bc87cbe	fixed dead link in trainer doc (#10554 )	2021-03-05 14:56:37 -05:00
Lysandre Debut	6b58e15507	Fix torch 1.8.0 segmentation fault (#10546 ) * Only run one test * Patch segfault * Fix summarization pipeline * Ready for merge	2021-03-05 12:10:19 -05:00
Patrick von Platen	395ffcd757	fix run seq2seq (#10547 )	2021-03-05 18:17:12 +03:00
Nicolas Patry	54e55b52d4	Fixing conversation test for torch 1.8 (#10545 )	2021-03-05 09:24:14 -05:00
Lysandre	dc9aaa3848	Pin torch to 1.7.1 in tests while we resolve issues	2021-03-05 07:57:35 -05:00
lewtun	12b66215cf	Fix example of custom Trainer to reflect signature of compute_loss (#10537 )	2021-03-05 07:44:53 -05:00
Lysandre	093b88f4e9	Update scatter to use torch 1.8.0	2021-03-05 07:31:51 -05:00
Patrick von Platen	c503a1c15e	[ProphetNet] Bart-like Refactor (#10501 ) * first step to refactor * make all fast tests pass * make all slow tests pass * save intermediate * correct cache * finish PR * make fp16 work	2021-03-04 23:27:12 +03:00
Sylvain Gugger	6290169eb3	Rework TPU checkpointing in Trainer (#10504 ) * Rework TPU checkpointing in Trainer * Wraps the barrier in a dist test * Address review comments * Remove line	2021-03-04 11:46:11 -05:00
Philipp Schmid	805c5200dc	Removes overwrites for output_dir (#10521 ) * removed overwrites * remove default value for output_dir * adjusted typing	2021-03-04 17:12:37 +01:00
Sylvain Gugger	a5bd40b75c	Not always consider a local model a checkpoint in run_glue (#10517 )	2021-03-04 11:11:39 -05:00
Sylvain Gugger	745ea78dcc	Revert "Not always consider a local model a checkpoint in run_glue" This reverts commit `f3660613bc`.	2021-03-04 09:45:18 -05:00
Sylvain Gugger	f3660613bc	Not always consider a local model a checkpoint in run_glue	2021-03-04 09:44:02 -05:00
Sylvain Gugger	948b730f97	Remove unsupported methods from ModelOutput doc (#10505 )	2021-03-03 14:55:18 -05:00
Sylvain Gugger	b70f441b72	Smp grad accum (#10488 ) * Fix gradient accumulation for SM Model Parallelism * Style and divide loss by grad accum steps	2021-03-03 12:13:29 -05:00
felixgwu	d064fb5647	Fix the bug in constructing the all_hidden_states of DeBERTa v2 (#10466 ) * fix all_hidden_states * use output_states instead of next_kv	2021-03-03 12:05:21 -05:00
Stas Bekman	188574ac50	remap MODEL_FOR_QUESTION_ANSWERING_MAPPING classes to names auto-generated file (#10487 ) * remap classes to strings * missing new util * style * doc * move the autogenerated file * Trigger CI	2021-03-03 08:54:00 -08:00
Sylvain Gugger	801ff969ce	Refactor checkpoint name in BERT and MobileBERT (#10424 ) * Refactor checkpoint name in BERT and MobileBERT * Add option to check copies * Add QuestionAnswering * Add last models * Make black happy	2021-03-03 11:21:17 -05:00
Jeff Yang	39f70a4058	feat(docs): navigate with left/right arrow keys (#10481 ) * feat(docs): navigate with left/right arrow keys * fix: add missing comma	2021-03-03 11:17:12 -05:00
Patrick von Platen	2d2ed2cc18	[T5] Fix speed degradation bug t5 (#10496 ) * fix speed degradation bug t5 * fix for all models * fix code quality	2021-03-03 12:42:41 +03:00
WybeKoper	5dc303e281	Fixed minor spelling mistakes (#10489 ) Co-authored-by: WybeKoper <WybeKoper@users.noreply.github.com>	2021-03-03 14:17:25 +05:30
Mehrad Moradshahi	1750e62900	Generate can return cross-attention weights too (#10493 )	2021-03-03 13:57:02 +05:30
Martin Schmitt	b013842244	Changed `num_beams` to `num_beams // num_beam_groups` when initialising `PrefixConstrainedLogitsProcessor` in `_get_logits_processor` to fix compatibility issue when constrained decoding is used together with grouped beam search (#10475 )	2021-03-02 10:41:54 +03:00
Lysandre Debut	0c2325198f	Add I-BERT to README (#10462 )	2021-03-01 12:12:31 -05:00
Lysandre Debut	9248e27037	Remove Anthony from the bug reports in Transformers	2021-03-01 10:23:40 -05:00
Suraj Patil	a106bde5a7	[Wav2Vec2FeatureExtractor] smal fixes (#10455 ) * smal fixes * don't check for None	2021-03-01 20:19:52 +05:30
Patrick von Platen	11655fafdd	remove feature extraction config (#10457 )	2021-03-01 12:30:12 +03:00
Patrick von Platen	0234de8418	Add Fine-Tuning for Wav2Vec2 (#10145 ) * add encode labels function to tokenizer * start adding finetuning * init dropout * upload * correct convert script * apply changes * fix second typo * make first dummy training run * adapt convert script * push confg for comparison * remove conf * finish training * adapt data collator * add research folder * update according to fairseq feedback * some minor corrections * refactor masking indices a bit * some minor changes * clean tokenizer * finish clean-up * remove previous logic * update run script * correct training * finish changes * finish model * correct bug * fix training a bit more * add some tests * finish gradient checkpointing * finish example * correct gradient checkpointing * improve tokenization method * revert changes in tokenizer * revert general change * adapt fine-tuning * update * save intermediate test * Update README.md * finish finetuning * delete conversion script * Update src/transformers/models/wav2vec2/configuration_wav2vec2.py * Update src/transformers/models/wav2vec2/processing_wav2vec2.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * finish wav2vec2 script * finish wav2vec2 fine-tuning * finalize test * correct test * adapt tests * finish * remove test file Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-03-01 12:13:17 +03:00
Patrick von Platen	3c733f3208	Update ibert.rst (#10445 )	2021-02-28 19:03:49 +03:00
Darigov Research	aeba4f95bb	Adds terms to Glossary (#10443 ) * feat: Adds three definitions to glossary from @cronoik Needed a definition for transformer which in turn needed 2 more definitions To do with issue https://github.com/huggingface/transformers/issues/9078 * fix: Adjusts definition of neural network to make it easier to read	2021-02-28 08:27:54 -05:00

6697 Commits