transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-19 04:28:26 +06:00

Author	SHA1	Message	Date
anruijian	2d506ea4c4	Fix tf random token masking probability in data collator (#21834 ) * fix tf random mask tokens probability * fix tf random mask tokens probability in collator for language modelling	2023-02-28 07:55:47 -05:00
Karim Foda	4fe744f528	Fix gradient checkpointing imagegpt (#21816 ) * Fix gradient checkpointing bug in gptneox * Fix gradient checkpointing bug in modeling_imagegpt.py * Revert gpt neox changes --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-02-28 07:47:04 -05:00
Karim Foda	e07a3d95f8	Fix gradient checkpointing bug in git (#21818 ) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-02-28 07:46:33 -05:00
Andy Ehrenberg	50db741417	check for None forced tokens (#21793 )	2023-02-28 13:24:43 +01:00
saswatmeher	50644cf624	Fix gradient checkpointing bug BioGpt (#21844 ) Co-authored-by: saswatmeher <saswatmeher@cse.iitb.ac.in>	2023-02-28 11:56:25 +00:00
Yih-Dar	a9dd124346	Rename `MobileViTModelTest` to `TFMobileViTModelTest` (#21825 ) Let's give TF a bit more love ❤️ 🙏 Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-02-28 08:10:29 +01:00
Stas Bekman	c7f3abc257	introduce `logger.warning_once` and use it for grad checkpointing code (#21804 ) * logger.warning_once * style	2023-02-27 13:25:06 -08:00
Yih-Dar	f95f60c829	Fix quality with `ruff==0.0.253` (#21828 ) fix quality with ruff 0.0.253 Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-02-27 19:38:44 +01:00
Joao Gante	92dfceb124	Inheritance-based framework detection (#21784 )	2023-02-27 15:31:55 +00:00
Karim Foda	7811bf7e73	Fix gradient checkpointing bug in gptneox (#21815 ) * Fix gradient checkpointing bug in gptneox * Remove use_cache block	2023-02-27 14:49:32 +00:00
fxmarty	0c7f93f5f1	Fix nn.init.trunc_normal_ call on torch.float16 data (#21789 ) fix nn.init.trunc_normal_ call on half data	2023-02-27 13:31:29 +01:00
fxmarty	ebf84f07ba	Fix PyTorch Perceiver `PerceiverFourierPositionEncoding` with fp16 (#21787 ) * fix perceiver fp16 * hopefully fix tests	2023-02-27 11:43:57 +00:00
Younes Belkada	831f3144a6	[`tests`] add `accelerate` marker (#21743 ) * add `accelerate` marker * add to docs * Update docs/source/en/testing.mdx	2023-02-27 12:33:34 +01:00
Arthur	c51dc4f927	[torch] remove deprecated uint8 in favor of bool (#21384 ) * uint8 -> bool * fix copies * style * update test modeling comment when checking attention buffers * style * use logical not on random mask instead of subtraction with 1 * remove torch uint8 * quality * remove modified modeling utils * Update based on review Co-authored-by: sgugger <sylvain.gugger@gmail.com> --------- Co-authored-by: sgugger <sylvain.gugger@gmail.com>	2023-02-27 11:46:02 +01:00
Arthur	cc44e72d14	[Pipeline] Add zero shot audio classification pipeline (#21600 ) * add pipeline * update init * add zero shot to init * update inits and correct checkpoints * update base to support input features * add tests * Update src/transformers/pipelines/zero_shot_audio_classification.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update src/transformers/pipelines/zero_shot_audio_classification.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * update pipeline code * use tiny checkpoint * nits and expected value with tiny model * style * last nit on tests values * fix styling * fix collate fn that was casting to float * update --------- Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>	2023-02-27 11:43:44 +01:00
Tianqi Zhang (张天启)	2ea1ef9090	[FX tracer] Make `concrete_args` from outside available (#21775 ) make concrete_args from outside available	2023-02-27 08:57:57 +01:00
Thomas Paviot	ba2a5f13f7	Fix en documentation typos (#21799 ) * fix wrong url * typos in english documentation	2023-02-27 08:36:36 +01:00
Julian Weber	a36983653e	Fix type in gpt2 config docstring (#21782 ) Fix docstring gpt2 config	2023-02-27 08:19:19 +01:00
bofeng huang	3c0ce60855	[examples/summarization] deal with `max_length` and `num_beams` (#21740 ) * Override the decoding parameters of Seq2SeqTrainer * Fix quality * Fix max_length parameter * Fix quality * Remove redundant parameter max_length * Separate the preprocess of train and validation to use different max_target_length	2023-02-27 08:18:14 +01:00
Moshe Berchansky	9ddf4f4f03	Fix resume_from_checkpoint for deepspeed (#21735 ) * Fix resume_from_checkpoint for deepspeed Fix resume_from_checkpoint for deepspeed, by ensuring that the deepspeed engine is the one to load the checkpoint. * Empty commit to trigger CI * Removed deepspeed skipping Removed deepspeed skipping inside the _load_from_checkpoint function, as it is obsolete * another adjustment * Trigger CI * trigger circleci * style --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Stas Bekman <stas@stason.org>	2023-02-25 11:30:54 -08:00
Sanchit Gandhi	3dae0d7b4f	[SpeechT5] Fix HiFiGAN tests (#21788 )	2023-02-24 16:55:38 +01:00
Yi Heng Lim	59c1d5b96b	[GPT2, ProphetNet] Fix gradient checkpointing bug (#21772 ) * fix gradient checkpointing bug * fix gradient checkpointing bug * ran make fix-copies * fixed bug * fixed bug	2023-02-24 15:37:22 +00:00
Kashif Rasul	ba0e370dc1	[time series] updated expected values for integration test. (#21762 ) * updated expected * prediction_length fix * prediction_length default value * default prediction_length 24 * revert back prediction_length default * move prediction_length test	2023-02-24 12:36:54 +01:00
Joao Gante	440f39754b	Generate - update cookie cutters to not initialize cache with training and gradient checkpointing (#21759 )	2023-02-24 11:21:00 +00:00
Arthur	087436c98e	Fix-ci-whisper (#21767 ) * fix history * input_features instead of input ids for TFWhisper doctest * use translate instead of transcribe	2023-02-24 11:39:25 +01:00
bofeng huang	c8545d2a9c	[Whisper] Add SpecAugment (#21298 ) * Return and rescale attention_mask * Add SpecAugment to Whisper modeling * Fix test * Update docstring * Add SpecAug related parameters to model config * Add the _mask_input_features function to doc * Fix quality * Apply suggestions from code review Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Remove dev comments * Add test * Resolve conflict * feat: mask {feature, time} prob fast tests * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: sanchit-gandhi <sanchit@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-02-24 11:07:52 +01:00
Sanchit Gandhi	75bd49ff88	[Flax] Fix erroneous kwargs being passed to generate config (#21765 )	2023-02-24 09:59:18 +01:00
Arthur	14f33205a7	Different behavior in DistilBERT when using "inputs_embeds" (#21752 ) * Different behavior in DistilBERT when using "inputs_embeds" Fixes #21089 * fix failing test	2023-02-24 09:48:07 +01:00
Sanchit Gandhi	13489248fa	[Examples] Generalise run audio classification for log-mel models (#21756 ) * [Examples] Generalise run audio classification for log-mel models * batch feature extractor * make style	2023-02-24 09:19:07 +01:00
Shubhamai	f7ca656f07	[Flax] adding support for batch norm layers (#21581 ) * [flax] adding support for batch norm layers * fixing bugs related to pt+flax integration * cleanup, batchnorm support in sharded pt to flax * support for batchnorm tests in pt+flax integration * simplifying checking batch norm layer	2023-02-24 08:47:33 +01:00
Connor Henderson	279008adc3	fix: Change is_last chunk calc and add conditional break in chunk_iter (#21612 ) * fix: Change is_last chunk calc and add conditional break * format fix * account for 0 and full stride_rights, add comment * add new test * make style * update slow whisper asr test timestamps * use nested_simplify on output and round timestamp to hundredths place	2023-02-24 08:30:32 +01:00
Clémentine Fourrier	4446b6b094	Graphormer fix (#21699 ) * Removed useless check for backend * fix style check for graphormer * Reverted change and corrected requires_backend for cython * code qual	2023-02-24 08:20:52 +01:00
Stas Bekman	633062639b	[deepspeed tests] fix issues introduced by #21700 (#21769 ) * [deepspeed tests] fix issues introduced by #21700 * fix * fix	2023-02-23 13:22:25 -08:00
Maria Khalusova	04d90ac49e	Auto api Value Error addition to Troubleshoot (#21708 ) * troubleshooting guide: added an error description for missing auto-mapping * minor polishing * changed the example * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/troubleshooting.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-02-23 11:51:18 -05:00
Batese2001	0ffa22f9f6	Added Type Hints for modeling_tf_encoder_decoder.py (#21673 ) * Ran Black formatting * Added imports and reformatted * Update src/transformers/models/encoder_decoder/modeling_tf_encoder_decoder.py --------- Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>	2023-02-23 14:08:26 +00:00
ydshieh	aa3787c8f0	Skip test_log_level for now	2023-02-23 12:11:20 +01:00
Joao Gante	1d4b797852	Generate: Fix GIT batched captioning (#21738 )	2023-02-23 09:50:37 +00:00
Younes Belkada	78a93d17c0	[`GPTNeo`] Fix gradient checkpointing bug (#21733 ) * fix bug * forward contrib credits from discussions * change logic --------- Co-authored-by: edbeeching <edbeeching@users.noreply.github.com>	2023-02-23 09:48:19 +01:00
Yih-Dar	36a6a1adb6	Fix 2 quicktour file doctest (#21742 ) * Update expect output values - as Hub repo. files are updated * Update expect output values - as librosa is from 0.9.2 to 0.10.0 on CI docker * fix * update one more --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-02-23 09:41:28 +01:00
Yih-Dar	ff143ae10e	Update doctest GH workflow file (#21744 ) update Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-02-23 09:40:53 +01:00
Naga Sai Abhinay	448e050b0d	Make ImageProcessorMixin compatible with subfolder kwarg (#21725 ) * Add subfolder support * Add kwarg docstring * formatting fix * Add test	2023-02-23 09:28:18 +01:00
Thomas Paviot	064f374874	typos in french documentation (#21750 )	2023-02-23 09:17:01 +01:00
Maria Khalusova	619d51e01f	Added "Open in Colab" to task guides (#21729 ) added Open in Colab to task guides	2023-02-22 08:32:35 -05:00
Matt	d913f4aa40	Fix to KerasMetricCallback when the model returns unstructured output (#21727 ) * Stop doing dict-things to non-dict inputs * Add a debug check * Add a debug check * Remove debug checks, looks good now! * make fixup	2023-02-22 13:15:14 +00:00
Sanchit Gandhi	82e61f3445	[SpeechT5HifiGan] Handle batched inputs (#21702 ) * [SpeechT5HifiGan] Handle batched inputs * fix docstring * rebase and new ruff style	2023-02-22 11:16:56 +01:00
Yih-Dar	09127c5713	Fix `GPTSanJapaneseModel` (#21731 ) * fix * skip test_model_parallelism * skip test_model_parallelism --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-02-22 11:09:04 +01:00
Yih-Dar	aff87da15b	Fix `ErnieMEmbeddings` device issue (#21726 ) * remove .parameters()).device * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-02-22 10:57:34 +01:00
Yih-Dar	2f2b19ff40	Change doc example for `BigBirdForQuestionAnswering` (#21723 ) Change doc example for BigBirdForQuestionAnswering Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-02-22 10:55:12 +01:00
Yih-Dar	354b338316	Remove `gptsan_japanese` from doctest list to avoid GPU OOM (#21722 ) remove from doctest list to avoid GPU OOM Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-02-22 10:51:00 +01:00
Sylvain Gugger	b19d64d852	Respect documentation on passive log level (#21700 ) * Respect documentation on passive log level * Fix test and set log level in examples * Add doc	2023-02-22 09:39:18 +01:00

15053 Commits