transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Julien Chaumond	4f3e93cfaf	[file_utils] do not gobble certain kinds of requests.ConnectionError (#10235 ) * do not gobble certain kinds of requests.ConnectionError * Apply review comments Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2021-03-18 12:37:45 -04:00
James Thomin	ce9724e1bd	Fix bug in input check for LengthGroupSampler (#10783 ) This commit fixes a bug in the LengthGroupSampler where if model_input_name is not set, the default value is None instead of "input_ids"	2021-03-18 10:25:57 -04:00
Suraj Patil	5f19c07a70	add run_common_voice script (#10767 ) * add initial script * finish script * add shell script example * accept chars_to_ignor as cl arg * align the script with other example scripts * add torchaudio dep	2021-03-18 17:21:16 +05:30
Mohamed El-Geish	af8afdc88d	wav2vec2: support datasets other than LibriSpeech (#10581 ) * wav2vec2: support datasets other than LibriSpeech * Formatting run_asr.py to pass code quality test * bundled orthography options and added verbose logs * fixing a typo in timit fine-tuning script * update comment for clarity * resize_lm_head and load custom vocab from file * adding a max_duration_in_seconds filter * do not assign `duration_filter` lambda, use a def * log untransliterated text as well * fix base model for arabic * fix duration filter when target_sr is not set * drop duration_in_seconds when unneeded * script for wav2vec2-large-lv60-timit-asr * fix for "tha" in arabic corpus (huggingface#10581) * adding more options to work with common_voice * PR feedback (huggingface#10581) * small README change	2021-03-18 10:20:26 +03:00
Patrick von Platen	0b98ca368f	[Flax] Adapt Flax models to new structure (#9484 ) * Create modeling_flax_eletra with code copied from modeling_flax_bert * Add ElectraForMaskedLM and ElectraForPretraining * Add modeling test for Flax electra and fix naming and arg in Flax Electra model * Add documentation * Fix code style * Create modeling_flax_eletra with code copied from modeling_flax_bert * Add ElectraForMaskedLM and ElectraForPretraining * Add modeling test for Flax electra and fix naming and arg in Flax Electra model * Add documentation * Fix code style * Fix code quality * Adjust tol in assert_almost_equal due to very small difference between model output, ranging 0.0010 - 0.0016 * Remove redundant ElectraPooler * save intermediate * adapt * correct bert flax design * adapt roberta as well * finish roberta flax * finish * apply suggestions * apply suggestions Co-authored-by: Chris Nguyen <anhtu2687@gmail.com>	2021-03-18 09:44:17 +03:00
Funtowicz Morgan	5c0bf39782	Add support for detecting intel-tensorflow version (#10781 ) Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>	2021-03-18 01:25:47 +01:00
Mansi Mane	0282e24eef	Smmp batch not divisible by microbatches fix (#10778 ) * Added debug prints * Added config * Added prints * Added prints * Added extra samples to SequentialDistributedSampler * Added extra samples to SequentialDistributedSampler Updated SequentialDistributedSampler call * Added deubg prints * Removed extra prints * Making predicitons and labels multiple of batchsize * updated number of microbatches * Removed extra prints * Made start_remainder similar to DistributedSamplerWithLoop * Minor spacing update * Added debug prints Added config Added prints Added prints * Added extra samples to SequentialDistributedSampler Updated SequentialDistributedSampler call Added extra samples to SequentialDistributedSampler Added deubg prints Removed extra prints Making predicitons and labels multiple of batchsize updated number of microbatches Removed extra prints Squashing redundant commits * Made start_remainder similar to DistributedSamplerWithLoop Minor spacing update Made start_remainder similar to DistributedSamplerWithLoop * Test and styling * Rename test Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>	2021-03-17 19:18:11 -04:00
Sylvain Gugger	40b049c701	Check copies blackify (#10775 ) * Apply black before checking copies * Fix for class methods * Deal with lonely brackets * Remove debug and add forward changes * Separate copies and fix test * Add black as a test dependency	2021-03-17 18:11:20 -04:00
Stas Bekman	393739194e	[examples] document resuming (#10776 ) * document resuming in examples * fix * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * put trainer code last, adjust notes Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-03-17 12:48:35 -07:00
Stas Bekman	85a114ef47	[Issue template] need to update/extend who to tag (#10728 ) * [Issue template] need to update/extend who to tag 1. need to update who to tag for `tensorflow` 2. also requesting to add someone to tag for models hub issues - perhaps separate sub-entries for UI and code - e.g. I don't know who to tag for broken models: https://github.com/huggingface/transformers/issues/10726 Thanks. * model hub instructions * s/jplu/LysandreJik/	2021-03-17 11:33:14 -07:00
Stas Bekman	3318c246f3	make failure to find a resume checkpoint fatal + tests (#10777 )	2021-03-17 11:16:37 -07:00
Stas Bekman	cd8c93f701	[DeepSpeed] improve checkpoint loading code plus tests (#10760 ) * deepspeed checkpoint loading code plus tests * style * style	2021-03-17 10:22:58 -07:00
Stas Bekman	01c7fb04be	[DeepSpeed] simplify init (#10762 )	2021-03-17 10:21:03 -07:00
Patrick von Platen	0486ccdd3d	small improvements (#10773 )	2021-03-17 18:10:17 +03:00
Sylvain Gugger	d7e0d59bb7	Fix URLs	2021-03-17 11:03:43 -04:00
Stas Bekman	8715d20c97	[doc] [testing] extend the pytest -k section with more examples (#10761 ) * [doc] [testing] extend -k section This PR adds more examples on using `pytest -k` - I always forget that I want to use `-k A OR B` when I want several tests - I keep trying AND and it doesn't match any. * style	2021-03-17 09:23:38 -04:00
Patrick von Platen	f20d75a13f	up (#10771 )	2021-03-17 16:15:14 +03:00
Cheng Li	c83fbc5f2d	[Deepspeed] Allow HF optimizer and scheduler to be passed to deepspeed (#10464 ) * pass hf optimizer and scheduler to deepspeed if not specified in ds config * pass hf optimizer and scheduler to deepspeed if not specified in ds config * update * make init_deepspeed support config dict * fix docstring formatting * clean up trainer's comments * add new tests * fix type * composit argparse doesn't work * style * add a new test, rename others * document new functionality * complete tests, add docs * style * correct level * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * add new methods to the doc * must tell DS we are using a non-native optimizer * add protection against cpu_offload + HF optimizer combo * fix the cli overrides * sync docs + tests * restore AdamW * better docs * need new version * no longer needed * remove outdate information * refactor duplicated code Co-authored-by: Stas Bekman <stas@stason.org> Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-03-16 15:51:09 -07:00
Lysandre Debut	c23248443c	Patches full import failure when sentencepiece is not installed (#10752 ) * Patches full import failure when sentencepiece is not installed * Dummies :)	2021-03-16 15:58:20 -04:00
Lysandre	73fe40898d	Docs for v4.4.1	2021-03-16 15:41:49 -04:00
Lysandre Debut	2097aa1826	Patches the full import failure and adds a test (#10750 ) * Patches the full import failure and adds a test * Add comment	2021-03-16 15:37:52 -04:00
Lysandre	1b5ce1e63b	Development on v4.5.0dev0	2021-03-16 11:41:15 -04:00
Lysandre	c988db5af2	Release v4.4.0	2021-03-16 11:33:35 -04:00
Sylvain Gugger	5c02b97ca2	Fix URLs from #10744 (#10748 )	2021-03-16 11:31:29 -04:00
Sylvain Gugger	a0a027c2ed	Add DistributedSamplerWithLoop (#10746 ) * Add DistributedSamplerWithLoop * Fix typo * Test and small fix	2021-03-16 11:22:39 -04:00
Lysandre Debut	1449222217	Fix DeBERTa + Conversational pipeline slow tests (#10743 ) * Fix DeBERTa-v2 variable assignment * Fix conversational pipeline test	2021-03-16 11:18:20 -04:00
Suraj Patil	d3d388b934	fix M2M100 example (#10745 )	2021-03-16 20:20:00 +05:30
Sylvain Gugger	b5492582d0	Remove old links to CDN (#10744 )	2021-03-16 10:48:53 -04:00
Lysandre Debut	5dcc08f1df	Fix S2T example (#10741 )	2021-03-16 08:55:07 -04:00
Sylvain Gugger	813d730c46	Release utils (#10735 ) * Examples version update * Refactor a bit * All version updates * Fixes * README cleanup * Post-release/patch * Fixes * More fixes * Tests * More fixes * Moar fixes * Make commands and update setup * Replace spaces with weird tabs * Fix test * Style	2021-03-16 08:41:47 -04:00
Patrick von Platen	9f8619c6aa	Flax testing should not run the full torch test suite (#10725 ) * make flax tests pytorch independent * fix typo * finish * improve circle ci * fix return tensors * correct flax test * re-add sentencepiece * last tokenizer fixes * finish maybe now	2021-03-16 08:05:37 +03:00
Russell Klopfer	87d685b8a9	independent training / eval with local files (#10710 ) * independent training / eval with local files * remove redundant assert	2021-03-15 19:35:26 -04:00
Sylvain Gugger	4c379daf64	Add minimum version check in examples (#10724 ) * Add minimum version check in examples * Style * No need for new line maybe? * Add helpful comment	2021-03-15 19:29:54 -04:00
Joe Davison	966ba081c9	zero-shot pipeline multi_class -> multi_label (#10727 )	2021-03-15 16:02:46 -06:00
Lysandre Debut	58f672e65c	Tests run on Docker (#10681 ) * Tests run on Docker Co-authored-by: Morgan <funtowiczmo@gmail.com> * Comments from code review * Reply to itself * Dependencies Co-authored-by: Morgan <funtowiczmo@gmail.com>	2021-03-15 17:28:01 -04:00
MikeG112	d41dd5359b	[Wav2Vec2] Fix documentation inaccuracy (#10694 ) * Update super class reference * Update default value reference * Update src/transformers/models/wav2vec2/feature_extraction_wav2vec2.py * Fix format style Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-03-15 20:11:17 +03:00
Sylvain Gugger	f5c097fc4d	Fix backward compatibility with EvaluationStrategy (#10718 )	2021-03-15 10:20:38 -04:00
Patrick von Platen	d9e693e1d0	make wav2vec2 test deterministic (#10714 )	2021-03-15 09:50:05 -04:00
Sylvain Gugger	6bef764506	Multiple fixes in SageMakerTrainer (#10687 ) * Handle save differently * Missing imports * Fix typo * Adapt to recent changes in save_pretrained * Forgotten brackets * Optimizer load * Fix world size * Deal wth None * Remove needless self	2021-03-15 09:28:15 -04:00
Adam Pocock	3f1714f8a7	Adding required flags to non-default arguments in hf_argparser (#10688 ) * Adding required flags to non-default arguments. Signed-off-by: Adam Pocock <adam.pocock@oracle.com> * make style fix. * Update src/transformers/hf_argparser.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-03-15 09:27:55 -04:00
Théo Matussière	6f840990a7	split seq2seq script into summarization & translation (#10611 ) * split seq2seq script, update docs * needless diff * fix readme * remove test diff * s/summarization/translation Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * cr * fix arguments & better mbart/t5 refs * copyright Co-authored-by: Suraj Patil <surajp815@gmail.com> * reword readme Co-authored-by: Suraj Patil <surajp815@gmail.com> * s/summarization/translation * short script names * fix tests * fix isort, include mbart doc * delete old script, update tests * automate source prefix * automate source prefix for translation * s/translation/trans Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * fix script name (short version) * typos Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * exact parameter Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * remove superfluous source_prefix calls in docs * rename scripts & warn for source prefix * black * flake8 Co-authored-by: theo <theo@matussie.re> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>	2021-03-15 09:11:42 -04:00
Igor Shalyminov	505494a86f	GPT2DoubleHeadsModel made parallelizable (#10658 ) * GPT2DoubleHeadsModel made parallelizeable * GPT2DoubleHeadsModel added as parallelizeable onto the GPT2 test suite	2021-03-15 09:10:44 -04:00
Sylvain Gugger	e12d6f513e	Distributed barrier before loading model (#10685 )	2021-03-15 08:28:15 -04:00
Sylvain Gugger	339fc51acc	fix styling	2021-03-15 07:59:35 -04:00
cronoik	4c41c6622c	Wrong link to super class (#10709 ) Documentation was referring to slow tokenizer class while it should be the fast tokenizer.	2021-03-15 07:39:10 -04:00
Suraj Patil	fcf10214e0	enable loading Mbart50Tokenizer with AutoTokenizer (#10690 ) * enable auto tokenizer for mbart50 tokenizers * fix imports	2021-03-15 16:20:37 +05:30
Patrick von Platen	bd8f6cafd4	make rag tests smaller (#10679 )	2021-03-15 10:07:12 +03:00
Stas Bekman	4c32f9f26e	AdamW is now supported by default (#9624 )	2021-03-12 13:40:07 -08:00
ymfa	fa35cda91e	Pass encoder outputs into GenerationMixin (#10599 ) * Pass encoder_outputs into generate() * Remove an if-statement * Reformat * Minimize changes to generate() * Comment on input_ids	2021-03-12 21:43:11 +05:30
PaulLerner	00cad2e5c1	fix: #10628 expanduser path in TrainingArguments (#10660 ) * fix: #10628 expanduser path in TrainingArguments * docs: explain why we expand paths in TrainingArguments * Style Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>	2021-03-12 09:18:19 -05:00

1 2 3 4 5 ...

6792 Commits