transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-03 03:31:05 +06:00

Author	SHA1	Message	Date
Yeb Havinga	91fb62d01c	Speedup training by using numpy instead of jnp for batch shuffling (#15963 ) Speedup training by using numpy instead of jnp for batch shuffling Co-authored-by: Yeb Havinga <y.t.havinga@mgrid.net>	2022-03-08 12:18:38 +01:00
Nicolas Patry	ea07064a5c	Returning outputs only when asked for for MaskFormer. (#15936 ) * Returning outputs only when asked for for MaskFormer. * Adding `output_auxiliary_logits` to the config.	2022-03-08 11:17:57 +01:00
NielsRogge	b19f3e69a0	[Tests] Fix ViTMAE integration test (#15949 ) * Fix test across both cpu and gpu * Fix typo	2022-03-08 10:49:44 +01:00
NielsRogge	9879a1d5f0	Fix LayoutLMv2 test (#15939 ) * Fix LayoutLMv2 test * Update black	2022-03-08 10:49:30 +01:00
Yih-Dar	8b9ae45549	Set scale_embedding to False in some TF tests (#15952 ) * set scale_embedding to False to avoid large (> 1e-5) output differences between PT/TF Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-07 22:14:33 +01:00
Steven Liu	38cc35069c	Update training scripts docs (#15931 ) * 📝 first draft * 🖍 apply feedback * 🖍 remove examples from toctree * 🗑 remove examples from docs/source	2022-03-07 13:29:14 -06:00
Sylvain Gugger	c87cfd653c	Better error message when inputs are empty	2022-03-07 13:29:16 -05:00
Francesco Saverio Zuppichini	e9fa7cd5d7	Make is_thing_map in Feature Extractor post_process_panoptic_segmentation defaults to all instances (#15954 ) * is_thing_map defaults to all instances * better naming * control flow * resolving conversations	2022-03-07 19:10:32 +01:00
Sanchit Gandhi	2596f95e84	Fix Embedding Module Bug in Flax Models (#15920 )	2022-03-07 18:17:45 +01:00
Sanchit Gandhi	1a62b25caf	Backprop Test for Freeze FlaxWav2Vec2 Feature Encoder (#15938 ) * Backprop Test for Freeze FlaxWav2Vec2 Feature Encoder * remove jnp.ndarray type suggestion * assert frozen grads are precisely zero	2022-03-07 18:10:15 +01:00
Konstantin Dobler	544fd9876b	Support modern list type hints in HfArgumentParser (#15951 ) * Support modern list type hint in HfArgumentParser * Fix formatting with black	2022-03-07 10:22:48 -05:00
Suraj Patil	60b81dfa6f	remove re-defination of FlaxWav2Vec2ForCTCModule (#15965 )	2022-03-07 14:58:44 +01:00
Chan Woo Kim	ef9c3ca348	[Bug Fix] Beam search example in docs fails & a fix (integrating `max_length` in `BeamScorer.finalize()`) (#15555 ) * added the test and fix * had left out a comment	2022-03-07 09:10:18 +01:00
Francesco Saverio Zuppichini	9932ee4b4b	made MaskFormerModelTest faster (#15942 )	2022-03-04 19:11:48 +01:00
NielsRogge	e8efaecb87	Move dependency to call method (#15941 )	2022-03-04 18:53:54 +01:00
Chan Woo Kim	5c6f57ee75	Constrained Beam Search [With Disjunctive Decoding] (#15761 ) * added classes to get started with constrained beam search * in progress, think i can directly force tokens now but not yet with the round robin * think now i have total control, now need to code the bank selection * technically works as desired, need to optimize and fix design choices leading to undersirable outputs * complete PR #1 without disjunctive decoding * removed incorrect tests * Delete k.txt * Delete test.py * Delete test.sh * revert changes to test scripts * genutils * full implementation with testing, no disjunctive yet * shifted docs * passing all tests realistically ran locally * removing accidentally included print statements * fixed source of error in initial PR test * fixing the get_device() vs device trap * fixed documentation docstrings about constrained_beam_search * fixed tests having failing for Speech2TextModel's floating point inputs * fix cuda long tensor * added examples and testing for them and founx & fixed a bug in beam_search and constrained_beam_search * deleted accidentally added test halting code with assert False * code reformat * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py * fixing based on comments on PR * took out the testing code that should but work fails without the beam search moditification ; style changes * fixing comments issues * docstrings for ConstraintListState * typo in PhrsalConstraint docstring * docstrings improvements * finished adding what is sort of an opinionated implementation of disjunctive generation, but it revealed errors in inner beam search logic during testing. * fixed bug found in constrained beam search that used beam_idx that were not global across all the batches * disjunctive constraint working 100% correctly * passing all tests * Accidentally included mlruns * Update src/transformers/generation_beam_constraints.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/generation_beam_constraints.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * complete overhaul of type complexities and other nits * strict type checks in generate() * fixing second round of feedback by narsil * fixed failing generation test because of type check overhaul * generation test fail fix * fixing test fails Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-03-04 18:18:34 +01:00
Francesco Saverio Zuppichini	040c11f6da	Tests for MaskFormerFeatureExtractor's post_process*** methods (#15929 ) * proper tests for post_process*** methods in feature extractor * mask th == 0 * Update tests/maskformer/test_feature_extraction_maskformer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * make style Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-04 18:04:19 +01:00
Yih-Dar	f0aacc140b	Do not change the output from tuple to list - to match PT's version (#15918 ) * Do not change the output from tuple to list - to match PT's version * Fix the same issues for 5 other models and the template Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-04 17:50:24 +01:00
Patrick von Platen	10b76987fc	[FlaxT5 Example] fix flax t5 example pretraining (#15835 )	2022-03-04 17:04:43 +01:00
Javier de la Rosa	01485ceec3	Add missing support for Flax XLM-RoBERTa (#15900 ) * Adding Flax XLM-RoBERTa * Add Flax to __init__ * Adding doc and dummy objects * Add tests * Add Flax XLM-R models autodoc * Fix tests * Add Flask XLM-RoBERTa to TEST_FILES_WITH_NO_COMMON_TESTS * Update src/transformers/models/xlm_roberta/modeling_flax_xlm_roberta.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * Update tests/xlm_roberta/test_modeling_flax_xlm_roberta.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * Update tests/xlm_roberta/test_modeling_flax_xlm_roberta.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * Remove test on large Flask XLM-RoBERTa * Add tokenizer to the test Co-authored-by: Suraj Patil <surajp815@gmail.com>	2022-03-04 14:36:28 +01:00
Nicolas Patry	89c7d9cfba	Making MaskFormerForInstanceSegmentation. (#15934 ) Small adjustments. Adding in type hint. Last fix ? Only include the default dict thing, not the pipelines.	2022-03-04 13:56:15 +01:00
Nicolas Patry	7ade7c1794	Updating the slow tests: (#15893 ) Linked to https://github.com/huggingface/transformers/pull/15826	2022-03-04 12:32:19 +01:00
ParkSangJun	6b104c5bb0	Support CLIPTokenizerFast for CLIPProcessor (#15913 ) * Fix to support fast tokenizer with `CLIPProcessor` * Update CLIPProcessor test for fast tokenizer * Fix Docstring Style * Rename into meaningful Variable name in test code	2022-03-04 11:57:09 +01:00
Sanchit Gandhi	b71474895d	Update README.md	2022-03-04 09:58:45 +01:00
Nicolas Patry	a6e3b17981	Re-enabling all fast pipeline tests. (#15924 )	2022-03-04 09:53:00 +01:00
Patrick von Platen	a7df656f03	Update README.md (#15926 )	2022-03-04 00:22:38 +01:00
davidleonfdez	c0281feb50	Fix #15898 (#15928 )	2022-03-03 14:41:03 -05:00
NielsRogge	9251427c38	Add vision models to doc tests (#15905 ) * Add vision models to doc tests * Apply suggestions from code review * Add more models Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-03-03 19:46:31 +01:00
Francesco Saverio Zuppichini	742273a52a	fix for the output from post_process_panoptic_segmentation (#15916 )	2022-03-03 19:35:48 +01:00
Sylvain Gugger	7c45fe747f	Mark slow tests as slow	2022-03-03 11:03:24 -05:00
Nicolas Patry	3822e4a563	Enabling MaskFormer in pipelines (#15917 ) * Enabling MaskFormer in ppipelines No AutoModel though :( * Ooops local file.	2022-03-03 16:31:41 +01:00
Sylvain Gugger	79d28e80b6	v4.18.0.dev.0	2022-03-03 10:19:58 -05:00
Patrick von Platen	6cbfa7bf4c	[Doctests] Fix ignore bug and add more doc tests (#15911 ) * finish speech doc tests * finish * boom * Update src/transformers/models/speech_to_text/modeling_speech_to_text.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-03 16:01:56 +01:00
Nicolas Patry	b693cbf99c	The tests were not updated after the addition of `torch.diag` (#15890 ) in the scoring (which is more correct)	2022-03-03 15:33:49 +01:00
Sanchit Gandhi	3c4fbc616f	Freeze FlaxWav2Vec2 Feature Encoder (#15873 ) * Freeze FlaxWav2Vec2 Feature Encoder * add to all module apply * add backprop test	2022-03-03 14:17:13 +01:00
Li-Huai (Allan) Lin	7b3bd1f21a	Fix and improve REALM fine-tuning (#15297 ) * Draft * Add test * Update src/transformers/models/realm/modeling_realm.py * Apply suggestion * Add block_mask * Update * Update * Add block_embedding_to * Remove no_grad * Use AutoTokenizer * Remove model.to overridding	2022-03-03 14:10:15 +01:00
Patrick von Platen	439de3f7f9	[Fix link in pipeline doc] (#15906 )	2022-03-03 07:43:13 -05:00
Yih-Dar	4cd7ed4b3b	Fix a TF Vision Encoder Decoder test (#15896 ) * send PyTorch inputs to the correct device * Fix: TypeError: can't convert cuda:0 device type tensor to numpy. Use Tensor.cpu() to copy the tensor to host memory first. Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-03 13:21:31 +01:00
Sylvain Gugger	39249c9589	Fix doc links in release utils (#15903 )	2022-03-02 18:06:31 -05:00
Sylvain Gugger	3d2242869d	Update delete-dev-doc job to match build-dev-doc (#15891 ) * Update delete-dev-doc job to match build-dev-doc * More debug info * More debug info * Stash if needed * Remove the comment update * Fix paths * Wtf is going on.. * Fix git status test * Try another way * I don't understand what's happening * Bash shell * What's happening now... * What's happening now... * Try like this * Back to trying to use bash * And like that? * Refine tests * Stash after adding new files * Stash after adding new files * Proper commit sha and PR number * Address review comments	2022-03-02 16:18:54 -05:00
NielsRogge	89be34c36c	Fix SegformerForImageClassification (#15895 ) * Fix reshape * Apply suggestion from code review Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-03-02 21:57:39 +01:00
Suraj Patil	130b987880	[XGLM] run sampling test on CPU to be deterministic (#15892 ) * run sampling test on CPU to be deterministic * input_ids on CPU	2022-03-02 17:55:49 +01:00
Joao Gante	baab5e7cdf	TF generate refactor - Sample (#15793 ) * Add TF logits wrappers * Add sample method * add tests for TF logit wrappers * TF generate sample tests now run on CPU Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>	2022-03-02 16:13:54 +00:00
NielsRogge	96ae92be8c	[SegFormer] Add deprecation warning (#15889 ) * Add deprecation warning * Remove from docs and hide in kwargs * Improve implementation Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-03-02 16:20:47 +01:00
Sanchit Gandhi	8fd4731072	Fix Bug in FlaxWav2Vec2 Slow Test (#15887 )	2022-03-02 16:02:26 +01:00
Francesco Saverio Zuppichini	d83d22f578	Maskformer (#15682 ) * maskformer * conflicts * conflicts * minor fixes * feature extractor test fix refactor MaskFormerLoss following conversation MaskFormer related types should not trigger a module time import error missed one removed all the types that are not used update config mapping minor updates in the doc resolved conversation that doesn't need a discussion minor changes resolved conversations fixed DetrDecoder * minor changes minor changes fixed mdx file test feature_extractor return types functional losses -> classes removed the return type test for the feature extractor minor changes + style + quality * conflicts? * rebase master * readme * added missing files * deleded poolformers test that where in the wrong palce * CI * minor changes * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * resolved conversations * minor changes * conversations [Unispeech] Fix slow tests (#15818) * remove soundfile old way of loading audio * Adapt slow test [Barthez Tokenizer] Fix saving (#15815) [TFXLNet] Correct tf xlnet generate (#15822) * [TFXLNet] Correct tf xlnet * adapt test comment Fix the push run (#15807) Fix semantic segmentation pipeline test (#15826) Fix dummy_inputs() to dummy_inputs in symbolic_trace doc (#15776) Add model specific output classes to PoolFormer model docs (#15746) * Added model specific output classes to poolformer docs * Fixed Segformer typo in Poolformer docs Adding the option to return_timestamps on pure CTC ASR models. (#15792) * Adding the option to return_timestamps on pure CTC ASR models. * Remove `math.prod` which was introduced in Python 3.8 * int are not floats. * Reworking the PR to support "char" vs "word" output. * Fixup! * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Quality. Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> HFTracer.trace should use/return self.graph to be compatible with torch.fx.Tracer (#15824) Fix tf.concatenate + test past_key_values for TF models (#15774) * fix wrong method name tf.concatenate * add tests related to causal LM / decoder * make style and quality * clean-up * Fix TFBertModel's extended_attention_mask when past_key_values is provided * Fix tests * fix copies * More tf.int8 -> tf.int32 in TF test template * clean-up * Update TF test template * revert the previous commit + update the TF test template * Fix TF template extended_attention_mask when past_key_values is provided * Fix some styles manually * clean-up * Fix ValueError: too many values to unpack in the test * Fix more: too many values to unpack in the test * Add a comment for extended_attention_mask when there is past_key_values * Fix TFElectra extended_attention_mask when past_key_values is provided * Add tests to other TF models * Fix for TF Electra test: add prepare_config_and_inputs_for_decoder * Fix not passing training arg to lm_head in TFRobertaForCausalLM * Fix tests (with past) for TF Roberta * add testing for pask_key_values for TFElectra model Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> [examples/summarization and translation] fix readme (#15833) Add ONNX Runtime quantization for text classification notebook (#15817) Re-enable doctests for the quicktour (#15828) * Re-enable doctests for the quicktour * Re-enable doctests for task_summary (#15830) * Remove & Framework split model report (#15825) Add TFConvNextModel (#15750) * feat: initial implementation of convnext in tensorflow. * fix: sample code for the classification model. * chore: added checked for from the classification model. * chore: set bias initializer in the classification head. * chore: updated license terms. * chore: removed ununsed imports * feat: enabled argument during using drop_path. * chore: replaced tf.identity with layers.Activation(linear). * chore: edited default checkpoint. * fix: minor bugs in the initializations. * partial-fix: tf model errors for loading pretrained pt weights. * partial-fix: call method updated * partial-fix: cross loading of weights (4x3 variables to be matched) * chore: removed unneeded comment. * removed playground.py * rebasing * rebasing and removing playground.py. * fix: renaming TFConvNextStage conv and layer norm layers * chore: added initializers and other minor additions. * chore: added initializers and other minor additions. * add: tests for convnext. * fix: integration tester class. * fix: issues mentioned in pr feedback (round 1). * fix: how output_hidden_states arg is propoagated inside the network. * feat: handling of arg for pure cnn models. * chore: added a note on equal contribution in model docs. * rebasing * rebasing and removing playground.py. * feat: encapsulation for the convnext trunk. * Fix variable naming; Test-related corrections; Run make fixup * chore: added Joao as a contributor to convnext. * rebasing * rebasing and removing playground.py. * rebasing * rebasing and removing playground.py. * chore: corrected copyright year and added comment on NHWC. * chore: fixed the black version and ran formatting. * chore: ran make style. * chore: removed from_pt argument from test, ran make style. * rebasing * rebasing and removing playground.py. * rebasing * rebasing and removing playground.py. * fix: tests in the convnext subclass, ran make style. * rebasing * rebasing and removing playground.py. * rebasing * rebasing and removing playground.py. * chore: moved convnext test to the correct location * fix: locations for the test file of convnext. * fix: convnext tests. * chore: applied sgugger's suggestion for dealing w/ output_attentions. * chore: added comments. * chore: applied updated quality enviornment style. * chore: applied formatting with quality enviornment. * chore: revert to the previous tests/test_modeling_common.py. * chore: revert to the original test_modeling_common.py * chore: revert to previous states for test_modeling_tf_common.py and modeling_tf_utils.py * fix: tests for convnext. * chore: removed output_attentions argument from convnext config. * chore: revert to the earlier tf utils. * fix: output shapes of the hidden states * chore: removed unnecessary comment * chore: reverting to the right test_modeling_tf_common.py. * Styling nits Co-authored-by: ariG23498 <aritra.born2fly@gmail.com> Co-authored-by: Joao Gante <joao@huggingface.co> Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com> * minor changes * doc fix in feature extractor * doc * typose * removed detr logic from config * removed detr logic from config * removed num_labels * small fix in the config * auxilary -> auxiliary * make style * some test is failing * fix a weird char in config prevending doc-builder * retry to fix the doc-builder issue * make style * new try to fix the doc builder * CI * change weights to facebook Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: ariG23498 <aritra.born2fly@gmail.com> Co-authored-by: Joao Gante <joao@huggingface.co> Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>	2022-03-02 15:48:20 +01:00
Ross Johnstone	e535c389aa	Fix tiny typo (#15884 )	2022-03-02 15:37:05 +01:00
Rahul Huilgol	2eb7bb15e7	Updates in Trainer to support new features in SM Model Parallel library (#15877 ) * Create optimizer after model creation for SMP * update dp_rank to rdp_rank for opt_state_dict * update world_size and process_index for smp * Address comments * Lint fix Co-authored-by: Cavdar <dcavdar@a07817b12d7e.ant.amazon.com>	2022-03-02 07:55:14 -05:00
Joao Gante	05c237ea94	Update TF QA example (#15870 )	2022-03-02 10:38:13 +00:00
Nicolas Patry	6e57a56987	Adding timestamps for CTC with LM in ASR pipeline. (#15863 ) * Adding timestamps for CTC with LM in ASR pipeline. * iRemove print. * Nit change.	2022-03-02 10:49:05 +01:00

1 2 3 4 5 ...

9239 Commits