anruijian
2d506ea4c4
Fix tf random token masking probability in data collator ( #21834 )
...
* fix tf random mask tokens probability
* fix tf random mask tokens probability in collator for langauge modelling
2023-02-28 07:55:47 -05:00
Karim Foda
4fe744f528
Fix gradient checkpointing imagegpt ( #21816 )
...
* Fix gradient checkpointing bug in gptneox
* Fix gradient checkpointing bug in modeling_imagegpt.py
* Revert gpt neox changes
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-02-28 07:47:04 -05:00
Karim Foda
e07a3d95f8
Fix gradient checkpointing bug in git ( #21818 )
...
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-02-28 07:46:33 -05:00
Andy Ehrenberg
50db741417
check for None forced tokens ( #21793 )
2023-02-28 13:24:43 +01:00
saswatmeher
50644cf624
Fix gradient checkpointing bug BioGpt ( #21844 )
...
Co-authored-by: saswatmeher <saswatmeher@cse.iitb.ac.in>
2023-02-28 11:56:25 +00:00
Yih-Dar
a9dd124346
Rename MobileViTModelTest
to TFMobileViTModelTest
( #21825 )
...
Let's give TF a bit more love ❤️ 🙏
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-02-28 08:10:29 +01:00
Stas Bekman
c7f3abc257
introduce logger.warning_once
and use it for grad checkpointing code ( #21804 )
...
* logger.warning_once
* style
2023-02-27 13:25:06 -08:00
Yih-Dar
f95f60c829
Fix quality with ruff==0.0.253
( #21828 )
...
fix quality with ruff 0.0.253
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-02-27 19:38:44 +01:00
Joao Gante
92dfceb124
Inheritance-based framework detection ( #21784 )
2023-02-27 15:31:55 +00:00
Karim Foda
7811bf7e73
Fix gradient checkpointing bug in gptneox ( #21815 )
...
* Fix gradient checkpointing bug in gptneox
* Remove use_cache block
2023-02-27 14:49:32 +00:00
fxmarty
0c7f93f5f1
Fix nn.init.trunc_normal_ call on torch.float16 data ( #21789 )
...
fix nn.init.trunc_normal_ call on half data
2023-02-27 13:31:29 +01:00
fxmarty
ebf84f07ba
Fix PyTorch Perceiver PerceiverFourierPositionEncoding
with fp16 ( #21787 )
...
* fix perceiver fp16
* hopefully fix tests
2023-02-27 11:43:57 +00:00
Younes Belkada
831f3144a6
[tests
] add accelerate
marker ( #21743 )
...
* add `accelerate` marker
* add to docs
* Update docs/source/en/testing.mdx
2023-02-27 12:33:34 +01:00
Arthur
c51dc4f927
[torch] remove deprecated uint8 in favor of bool ( #21384 )
...
* uint8 -> bool
* fix copies
* style
* update test modeling commen when checking attention buffers
* style
* use logical not on random mask instead of subtraction with 1
* remove torch uint8
* quality
* remove modified modeling utils
* Update based on review
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
---------
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
2023-02-27 11:46:02 +01:00
Arthur
cc44e72d14
[Pipeline] Add zero shot audio classificatoin pipeline ( #21600 )
...
* add pipeline
* update init
* add zero shot to init
* update inits and correct checkpoints
* update base to support input features
* add tests
* Update src/transformers/pipelines/zero_shot_audio_classification.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
* Update src/transformers/pipelines/zero_shot_audio_classification.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
* update pieline code
* use tiny checkpoint
* nits and expected value with tiny model
* style
* last nit on tests values
* fix styling
* fix collate fn that was casting t float
* update
---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
2023-02-27 11:43:44 +01:00
Tianqi Zhang (张天启)
2ea1ef9090
[FX tracer] Make concrete_args
from outside available ( #21775 )
...
make concrete_args from outside available
2023-02-27 08:57:57 +01:00
Thomas Paviot
ba2a5f13f7
Fix en documentation typos ( #21799 )
...
* fix wrong url
* typos in english documentation
2023-02-27 08:36:36 +01:00
Julian Weber
a36983653e
Fix type in gpt2 config docstring ( #21782 )
...
Fix docstring gpt2 config
2023-02-27 08:19:19 +01:00
bofeng huang
3c0ce60855
[examples/summarization] deal with max_length
and num_beams
( #21740 )
...
* Override the decoding parameters of Seq2SeqTrainer
* Fix quality
* Fix max_length parameter
* Fix quality
* Remove redundant parameter max_length
* Separate the preprocess of train and validation to use different max_target_length
2023-02-27 08:18:14 +01:00
Moshe Berchansky
9ddf4f4f03
Fix resume_from_checkpoint for deepspeed ( #21735 )
...
* Fix resume_from_checkpoint for deepspeed
Fix resume_from_checkpoint for deepspeed, by ensuring that the deepspeed engine is the one to load the checkpoint.
* Empty commit to trigger CI
* Removed deepspeed skipping
Removed deepspeed skipping inside the _load_from_checkpoint function, as it is obsolete
* another adjustment
* Trigger CI
* trigger circleci
* style
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>
2023-02-25 11:30:54 -08:00
Sanchit Gandhi
3dae0d7b4f
[SpeechT5] Fix HiFiGAN tests ( #21788 )
2023-02-24 16:55:38 +01:00
Yi Heng Lim
59c1d5b96b
[GPT2, ProphetNet] Fix gradient checkpointing bug ( #21772 )
...
* fix gradient checkpointing bug
* fix gradient checkpointing bug
* ran make fix-copies
* fixed bug
* fixed bug
2023-02-24 15:37:22 +00:00
Kashif Rasul
ba0e370dc1
[time series] updated expected values for integration test. ( #21762 )
...
* updated expected
* prediction_length fix
* prediction_length default value
* default prediction_length 24
* revert back prediction_length default
* move prediction_length test
2023-02-24 12:36:54 +01:00
Joao Gante
440f39754b
Generate - update cookie cutters to not initialize cache with training and gradient checkpointing ( #21759 )
2023-02-24 11:21:00 +00:00
Arthur
087436c98e
Fix-ci-whisper ( #21767 )
...
* fix history
* input_features instead of input ids for TFWhisport doctest
* use translate intead of transcribe
2023-02-24 11:39:25 +01:00
bofeng huang
c8545d2a9c
[Whisper] Add SpecAugment ( #21298 )
...
* Return and rescale attention_mask
* Add SpecAugment to Whisper modeling
* Fix test
* Update docstring
* Add SpecAug related parameters to model config
* Add the _mask_input_features function to doc
* Fix quality
* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Remove dev comments
* Add test
* Resolve conflict
* feat: mask {feature, time} prob fast tests
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-02-24 11:07:52 +01:00
Sanchit Gandhi
75bd49ff88
[Flax] Fix erroneous kwargs being passed to generate config ( #21765 )
2023-02-24 09:59:18 +01:00
Arthur
14f33205a7
Different behavior in DistilBERT when using "inputs_embeds" ( #21752 )
...
* Different behavior in DistilBERT when using "inputs_embeds"
Fixes #21089
* fix failing test
2023-02-24 09:48:07 +01:00
Sanchit Gandhi
13489248fa
[Examples] Generalise run audio classification for log-mel models ( #21756 )
...
* [Examples] Generalise run audio classification for log-mel models
* batch feature extractor
* make style
2023-02-24 09:19:07 +01:00
Shubhamai
f7ca656f07
[Flax] adding support for batch norm layers ( #21581 )
...
* [flax] adding support for batch norm layers
* fixing bugs related to pt+flax integration
* cleanup, batchnorm support in sharded pt to flax
* support for batchnorm tests in pt+flax integration
* simplifying checking batch norm layer
2023-02-24 08:47:33 +01:00
Connor Henderson
279008adc3
fix: Change is_last chunk calc and add conditional break in chunk_iter ( #21612 )
...
* fix: Change is_last chunk calc and add conditional break
* format fix
* account for 0 and full stride_rights, add comment
* add new test
* make style
* update slow whisper asr test timestamps
* use nested_simplify on output and round timestamp to hundreths place
2023-02-24 08:30:32 +01:00
Clémentine Fourrier
4446b6b094
Graphormer fix ( #21699 )
...
* Removed useless check for backend
* fix style check for graphormer
* Reverted change and corrected requires_backend for cython
* code qual
2023-02-24 08:20:52 +01:00
Stas Bekman
633062639b
[deepspeed tests] fix issues introduced by #21700 ( #21769 )
...
* [deepspeed tests] fix issues introduced by #21700
* fix
* fix
2023-02-23 13:22:25 -08:00
Maria Khalusova
04d90ac49e
Auto api Value Error addition to Troubleshoot ( #21708 )
...
* troubleshooting guide: added an error description for missing auto-mapping
* minor polishing
* changed the example
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/troubleshooting.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-02-23 11:51:18 -05:00
Batese2001
0ffa22f9f6
Added Type Hints for modeling_tf_encoder_decoder.py ( #21673 )
...
* Ran Black formatting
* Added imports and reformatted
* Update src/transformers/models/encoder_decoder/modeling_tf_encoder_decoder.py
---------
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
2023-02-23 14:08:26 +00:00
ydshieh
aa3787c8f0
Skip test_log_level for now
2023-02-23 12:11:20 +01:00
Joao Gante
1d4b797852
Generate: Fix GIT batched captioning ( #21738 )
2023-02-23 09:50:37 +00:00
Younes Belkada
78a93d17c0
[GPTNeo
] Fix gradient checkpointing bug ( #21733 )
...
* fix bug
* forward contrib credits from discussions
* change logic
---------
Co-authored-by: edbeeching <edbeeching@users.noreply.github.com>
2023-02-23 09:48:19 +01:00
Yih-Dar
36a6a1adb6
Fix 2 quicktour file doctest ( #21742 )
...
* Update expect output values - as Hub repo. files are updated
* Update expect output values - as librosa is from 0.9.2 to 0.10.0 on CI docker
* fix
* update one more
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-02-23 09:41:28 +01:00
Yih-Dar
ff143ae10e
Update doctest GH workflow file ( #21744 )
...
update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-02-23 09:40:53 +01:00
Naga Sai Abhinay
448e050b0d
Make ImageProcessorMixin compatible with subfolder kwarg ( #21725 )
...
* Add subfolder support
* Add kwarg docstring
* formatting fix
* Add test
2023-02-23 09:28:18 +01:00
Thomas Paviot
064f374874
typos in french documentation ( #21750 )
2023-02-23 09:17:01 +01:00
Maria Khalusova
619d51e01f
Added "Open in Colab" to task guides ( #21729 )
...
added Open in Colab to task guides
2023-02-22 08:32:35 -05:00
Matt
d913f4aa40
Fix to KerasMetricCallback when the model returns unstructured output ( #21727 )
...
* Stop doing dict-things to non-dict inputs
* Add a debug check
* Add a debug check
* Remove debug checks, looks good now!
* make fixup
2023-02-22 13:15:14 +00:00
Sanchit Gandhi
82e61f3445
[SpeechT5HifiGan] Handle batched inputs ( #21702 )
...
* [SpeechT5HifiGan] Handle batched inputs
* fix docstring
* rebase and new ruff style
2023-02-22 11:16:56 +01:00
Yih-Dar
09127c5713
Fix GPTSanJapaneseModel
( #21731 )
...
* fix
* skip test_model_parallelism
* skip test_model_parallelism
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-02-22 11:09:04 +01:00
Yih-Dar
aff87da15b
Fix ErnieMEmbeddings
device issue ( #21726 )
...
* remove .parameters()).device
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-02-22 10:57:34 +01:00
Yih-Dar
2f2b19ff40
Change doc example for BigBirdForQuestionAnswering
( #21723 )
...
Change doc example for BigBirdForQuestionAnswering
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-02-22 10:55:12 +01:00
Yih-Dar
354b338316
Remove gptsan_japanese
from doctest list to avoid GPU OOM ( #21722 )
...
remove from doctest list to avoid GPU OOM
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-02-22 10:51:00 +01:00
Sylvain Gugger
b19d64d852
Respect documentation on passive log level ( #21700 )
...
* Respect documentation on passive log level
* Fix test and set log level in examples
* Add doc
2023-02-22 09:39:18 +01:00