transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 10:12:23 +06:00

Author	SHA1	Message	Date
Ross Johnstone	e535c389aa	Fix tiny typo (#15884 )	2022-03-02 15:37:05 +01:00
Rahul Huilgol	2eb7bb15e7	Updates in Trainer to support new features in SM Model Parallel library (#15877 ) * Create optimizer after model creation for SMP * update dp_rank to rdp_rank for opt_state_dict * update world_size and process_index for smp * Address comments * Lint fix Co-authored-by: Cavdar <dcavdar@a07817b12d7e.ant.amazon.com>	2022-03-02 07:55:14 -05:00
Joao Gante	05c237ea94	Update TF QA example (#15870 )	2022-03-02 10:38:13 +00:00
Nicolas Patry	6e57a56987	Adding timestamps for CTC with LM in ASR pipeline. (#15863 ) * Adding timestamps for CTC with LM in ASR pipeline. * Remove print. * Nit change.	2022-03-02 10:49:05 +01:00
Joao Gante	8a133490bf	Add TF generate sample tests with all logit processors (#15852 ) * Add GPT2 TF generate sample test with all logits processor * Add T5 generate sample test	2022-03-02 09:48:11 +00:00
Patrick von Platen	40040727ab	[Bart] Fix implementation note doc (#15879 )	2022-03-02 10:24:32 +01:00
Michael Benayoun	4bfe75bd08	M2M100 support for ONNX export (#15193 ) * Add M2M100 support for ONNX export * Delete useless imports * Add M2M100 to tests * Fix protobuf issue	2022-03-02 10:03:14 +01:00
Lysandre Debut	d1a29078c0	Remove stash for now (#15882 )	2022-03-01 22:36:19 -05:00
Stas Bekman	b842d7277a	fix deepspeed tests (#15881 ) * fix deepspeed tests * style * more fixes	2022-03-01 19:27:28 -08:00
Steven Liu	6ccfa2170c	Inference for multilingual models (#15836 ) * 📝 first draft for multilingual models * 🖍 make style	2022-03-01 15:10:31 -06:00
Lysandre Debut	26426923b7	No self-hosted runner for dev documentation (#15710 )	2022-03-01 14:05:54 -05:00
Mishig Davaadorj	00eaffc81f	Bump up doc node version to 16 (#15874 )	2022-03-01 18:37:57 +01:00
Suraj Patil	afca0d5192	use python 3.7 for flax self-push tests (#15865 ) * set python 3.7 for flax tests * setup-python@v2 * python-dev * install -y * python3-dev * install kenlm from source * install cython * cd to kenlm * kenlm install * don't install kenlm * change flax pretrained to run flax tests * cleanup * remove python-dev	2022-03-01 18:26:30 +01:00
NielsRogge	286fdc6b3c	[vision] Add problem_type support (#15851 ) * Add problem_type to missing models * Fix deit test Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-03-01 18:09:52 +01:00
Lysandre Debut	7ff9d450cd	Scatter should run on CUDA (#15872 )	2022-03-01 11:47:17 -05:00
NielsRogge	c008afea3c	Add link to notebooks (#15791 ) Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-03-01 17:44:20 +01:00
Patrick von Platen	e064f08150	Add time stamps for wav2vec2 with lm (#15854 ) * [Wav2Vec2 With LM] add timestamps * correct * correct * Apply suggestions from code review * correct * Update src/transformers/models/wav2vec2_with_lm/processing_wav2vec2_with_lm.py * make style * Update src/transformers/models/wav2vec2_with_lm/processing_wav2vec2_with_lm.py Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> * make style * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-01 17:03:05 +01:00
Joao Gante	3f2e636850	Update TF LM examples (#15855 )	2022-03-01 14:12:58 +00:00
Lysandre Debut	54f0db4066	Add PT + TF automatic builds (#15860 ) * Add PT + TF automatic builds * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Wrap up Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-03-01 08:55:11 -05:00
Patrick von Platen	9863f7d228	[Benchmark tools] Deprecate all (#15848 ) * [Benchmark tools] Deprecate all * up	2022-03-01 11:26:20 +01:00
Eduardo Gonzalez Ponferrada	df5a4094a6	Add Data2Vec (#15507 ) * Add data2vec model cloned from roberta * Add checkpoint conversion script * Fix copies * Update docs * Add checkpoint conversion script * Remove fairseq data2vec_text script and fix format * Add comment on where to get data2vec_text.py * Remove mock implementation cheat.py and fix style * Fix copies * Remove TF and Flax classes from init * Add back copy from fairseq data2vec_text.py and fix style * Update model name in docs/source/index.mdx to be CamelCase * Revert model name in table to lower-case to get check_table test to pass * Update src/transformers/models/data2vec/__init__.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/convert_data2vec_original_pytorch_checkpoint_to_pytorch.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update docs/source/model_doc/data2vec.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/model_doc/data2vec.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/auto/configuration_auto.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/configuration_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Sylvain 
Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update tests/test_modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/configuration_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update documentation * Copy-paste Data2VecConfig from BertConfig * Update config checkpoint to point to edugp/data2vec-nlp-base. Fix style and repo-consistency * Update config special tokens to match RoBERTa * Split multiple assertions and add individual error messages * Rename Data2VecModel to Data2VecForTextModel * Add Data2Vec to _toctree.yml * Rename Data2VecEmbeddings to Data2VecForTextEmbeddings * Add initial Data2VecForAudio model (unfinished). Only matching fairseq's implementation up to the feature encoder (before positional encoding). * finish audio model * finish audio file * Update names and fix style, quality and repo consistency * Remove Data2VecAudioForPretraining. Add tests for Data2VecAudio, mimicking the Wav2Vec2 test suite. Fix bias initilization in positional conv layers. Move back configurations for audio and text to separate files. 
* add inputs to logits to data2vec' * correct autio models * correct config auto * correct tok auto * Update utils/tests_fetcher.py * delete unnecessary files * delete unnecessary files * further renaming * make all tests pass * finish * remove useless test file * Update tests/test_modeling_common.py * Update utils/check_repo.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec_text.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Fix copies * Update docs * Remove fairseq data2vec_text script and fix format * Add comment on where to get data2vec_text.py * Remove mock implementation cheat.py and fix style * Fix copies * Remove TF and Flax classes from init * Add back copy from fairseq data2vec_text.py and fix style * Update model name in docs/source/index.mdx to be CamelCase * Revert model name in table to lower-case to get check_table test to pass * Update documentation * Update src/transformers/models/data2vec/__init__.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/convert_data2vec_original_pytorch_checkpoint_to_pytorch.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/auto/configuration_auto.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/configuration_data2vec.py Co-authored-by: Sylvain 
Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update tests/test_modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/configuration_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Copy-paste Data2VecConfig from BertConfig * Update config checkpoint to point to edugp/data2vec-nlp-base. Fix style and repo-consistency * Update config special tokens to match RoBERTa * Split multiple assertions and add individual error messages * Rename Data2VecModel to Data2VecForTextModel * Add Data2Vec to _toctree.yml * Rename Data2VecEmbeddings to Data2VecForTextEmbeddings * Add initial Data2VecForAudio model (unfinished). Only matching fairseq's implementation up to the feature encoder (before positional encoding). * finish audio model * finish audio file * add inputs to logits to data2vec' * Update names and fix style, quality and repo consistency * Remove Data2VecAudioForPretraining. Add tests for Data2VecAudio, mimicking the Wav2Vec2 test suite. Fix bias initilization in positional conv layers. Move back configurations for audio and text to separate files. 
* correct autio models * correct config auto * correct tok auto * delete unnecessary files * delete unnecessary files * Update utils/tests_fetcher.py * further renaming * make all tests pass * finish * remove useless test file * Update tests/test_modeling_common.py * Update utils/check_repo.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec_text.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Move data2vec tests to new structure * Fix test imports for text tests * Remove fairseq files * Change paper link to arxiv * Modify Data2Vec documentation to reflect that the encoder is not shared across the audio and text models in the current implementation. * Update text model checkpoint to be facebook/data2vec-text-base * Add 'Copy from' statements and update paper links and docs * fix copy from statements * improve copied from * correct more copied from statements * finish copied from stuff * make style * add model to README * add to master Co-authored-by: Eduardo Gonzalez Ponferrada <eduardo@ferrumhealth.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-01 11:09:20 +01:00
Patrick von Platen	ddbb485c41	[TF-PT-Tests] Fix PyTorch - TF tests for different GPU devices (#15846 )	2022-02-28 15:46:46 -05:00
Nicolas Patry	97f9b8a27b	Fixing the timestamps with chunking. (#15843 ) * Fixing the timestamps with chunking. * The changes modified (and fixed) the striding tests. * Adding a tokenizer test. * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Defense -> comment. * Update src/transformers/models/wav2vec2/tokenization_wav2vec2.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-02-28 21:00:21 +01:00
lewtun	410e26c7ad	Fix (deprecated) ONNX exporter to account for new tf2onnx API (#15856 ) * Fix (deprecated) ONNX exporter to account for new tf2onnx API	2022-02-28 20:17:44 +01:00
Sanchit Gandhi	e3342edc4e	Flax Speech-Encoder-Decoder Model (#15613 ) * rebase * Delete shift tokens func * downsample decoder input seq len for init * correct attention mask * add tests * pt flax cross test * make fixup * init file for import * change pt-flax cross test threshold * pt-flax test logits only * move tests * make repo-consistency * consistent indentation Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-02-28 12:22:36 +01:00
Patrick von Platen	935a76d90d	[UniSpeechSat] correct unispeech sat (#15847 )	2022-02-28 11:23:13 +01:00
Sayak Paul	84eaa6acf5	Add TFConvNextModel (#15750 ) * feat: initial implementation of convnext in tensorflow. * fix: sample code for the classification model. * chore: added checked for from the classification model. * chore: set bias initializer in the classification head. * chore: updated license terms. * chore: removed ununsed imports * feat: enabled argument during using drop_path. * chore: replaced tf.identity with layers.Activation(linear). * chore: edited default checkpoint. * fix: minor bugs in the initializations. * partial-fix: tf model errors for loading pretrained pt weights. * partial-fix: call method updated * partial-fix: cross loading of weights (4x3 variables to be matched) * chore: removed unneeded comment. * removed playground.py * rebasing * rebasing and removing playground.py. * fix: renaming TFConvNextStage conv and layer norm layers * chore: added initializers and other minor additions. * chore: added initializers and other minor additions. * add: tests for convnext. * fix: integration tester class. * fix: issues mentioned in pr feedback (round 1). * fix: how output_hidden_states arg is propoagated inside the network. * feat: handling of arg for pure cnn models. * chore: added a note on equal contribution in model docs. * rebasing * rebasing and removing playground.py. * feat: encapsulation for the convnext trunk. * Fix variable naming; Test-related corrections; Run make fixup * chore: added Joao as a contributor to convnext. * rebasing * rebasing and removing playground.py. * rebasing * rebasing and removing playground.py. * chore: corrected copyright year and added comment on NHWC. * chore: fixed the black version and ran formatting. * chore: ran make style. * chore: removed from_pt argument from test, ran make style. * rebasing * rebasing and removing playground.py. * rebasing * rebasing and removing playground.py. * fix: tests in the convnext subclass, ran make style. * rebasing * rebasing and removing playground.py. 
* rebasing * rebasing and removing playground.py. * chore: moved convnext test to the correct location * fix: locations for the test file of convnext. * fix: convnext tests. * chore: applied sgugger's suggestion for dealing w/ output_attentions. * chore: added comments. * chore: applied updated quality enviornment style. * chore: applied formatting with quality enviornment. * chore: revert to the previous tests/test_modeling_common.py. * chore: revert to the original test_modeling_common.py * chore: revert to previous states for test_modeling_tf_common.py and modeling_tf_utils.py * fix: tests for convnext. * chore: removed output_attentions argument from convnext config. * chore: revert to the earlier tf utils. * fix: output shapes of the hidden states * chore: removed unnecessary comment * chore: reverting to the right test_modeling_tf_common.py. * Styling nits Co-authored-by: ariG23498 <aritra.born2fly@gmail.com> Co-authored-by: Joao Gante <joao@huggingface.co> Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>	2022-02-25 18:19:16 +01:00
Lysandre Debut	0b5bf6abef	Framework split model report (#15825 )	2022-02-25 12:00:00 -05:00
Sylvain Gugger	0118c4f6a8	Re-enable doctests for the quicktour (#15828 ) * Re-enable doctests for the quicktour * Re-enable doctests for task_summary (#15830) * Remove &	2022-02-25 17:46:38 +01:00
Ella Charlaix	fd5b05eb81	Add ONNX Runtime quantization for text classification notebook (#15817 )	2022-02-25 11:29:35 -05:00
Suraj Patil	bf1fe32824	[examples/summarization and translation] fix readme (#15833 )	2022-02-25 17:28:16 +01:00
Yih-Dar	8635407bc7	Fix tf.concatenate + test past_key_values for TF models (#15774 ) * fix wrong method name tf.concatenate * add tests related to causal LM / decoder * make style and quality * clean-up * Fix TFBertModel's extended_attention_mask when past_key_values is provided * Fix tests * fix copies * More tf.int8 -> tf.int32 in TF test template * clean-up * Update TF test template * revert the previous commit + update the TF test template * Fix TF template extended_attention_mask when past_key_values is provided * Fix some styles manually * clean-up * Fix ValueError: too many values to unpack in the test * Fix more: too many values to unpack in the test * Add a comment for extended_attention_mask when there is past_key_values * Fix TFElectra extended_attention_mask when past_key_values is provided * Add tests to other TF models * Fix for TF Electra test: add prepare_config_and_inputs_for_decoder * Fix not passing training arg to lm_head in TFRobertaForCausalLM * Fix tests (with past) for TF Roberta * add testing for pask_key_values for TFElectra model Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-02-25 17:11:46 +01:00
Pavel Belevich	4818bf7aed	HFTracer.trace should use/return self.graph to be compatible with torch.fx.Tracer (#15824 )	2022-02-25 15:54:45 +01:00
Nicolas Patry	ad0d7d1745	Adding the option to return_timestamps on pure CTC ASR models. (#15792 ) * Adding the option to return_timestamps on pure CTC ASR models. * Remove `math.prod` which was introduced in Python 3.8 * int are not floats. * Reworking the PR to support "char" vs "word" output. * Fixup! * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Quality. Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-02-25 14:06:45 +01:00
Tanay Mehta	7566734d6f	Add model specific output classes to PoolFormer model docs (#15746 ) * Added model specific output classes to poolformer docs * Fixed Segformer typo in Poolformer docs	2022-02-25 13:43:56 +01:00
Pavel Belevich	7963578fc5	Fix dummy_inputs() to dummy_inputs in symbolic_trace doc (#15776 )	2022-02-25 11:32:23 +01:00
Sylvain Gugger	074645e32a	Fix semantic segmentation pipeline test (#15826 )	2022-02-25 09:21:29 +01:00
Lysandre Debut	b7e292aebd	Fix the push run (#15807 )	2022-02-24 19:30:17 +01:00
Patrick von Platen	cbf4391177	[TFXLNet] Correct tf xlnet generate (#15822 ) * [TFXLNet] Correct tf xlnet * adapt test comment	2022-02-24 19:23:34 +01:00
Patrick von Platen	2f0f9038e2	[Barthez Tokenizer] Fix saving (#15815 )	2022-02-24 19:09:09 +01:00
Patrick von Platen	ca57b45071	[Unispeech] Fix slow tests (#15818 ) * remove soundfile old way of loading audio * Adapt slow test	2022-02-24 19:08:54 +01:00
Sylvain Gugger	35ecf99cc4	Revert changes in logit size for semantic segmentation models (#15722 ) * Revert changes in logit size for semantic segmentation models * Address review comments	2022-02-24 15:52:52 +01:00
Sylvain Gugger	d1fcc90abf	Fix from_pretrained with default base_model_prefix (#15814 )	2022-02-24 11:43:51 +01:00
Sylvain Gugger	7f921bcf47	Fix add-new-model-like when old model checkpoint is not found (#15805 ) * Fix add-new-model-like command when old checkpoint can't be recovered * Style	2022-02-24 08:58:18 +01:00
Lysandre Debut	bb7949b35a	Fix model templates (#15806 ) * Fix model templates * Update paths	2022-02-23 18:27:29 -05:00
Lysandre	309e87e25e	Docker images should only run on a daily basis	2022-02-23 18:01:44 -05:00
Lysandre	c475f3ce2d	Scheduled tests should only run on a daily basis	2022-02-23 17:52:22 -05:00
Eliott C	6336017c15	Fix build_documentation CI (#15803 )	2022-02-23 21:53:51 +01:00
Lysandre Debut	a0e3480699	[Test refactor 5/5] Build docker images (#15729 )	2022-02-23 15:48:19 -05:00
Lysandre Debut	4c737f0e40	[Test refactor 4/5] Improve the scheduled tests (#15728 )	2022-02-23 15:48:05 -05:00


9093 Commits