NielsRogge
dde718e7a6
[DETR and friends] Remove is_timm_available ( #21814 )
...
* First draft
* Fix to_dict
* Improve conversion script
* Update config
* Remove timm dependency
* Fix dummies
* Fix typo, add integration test
* Upload 101 model as well
* Remove timm dummies
* Fix style
---------
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
2023-03-07 15:19:39 -05:00
Arthur
2156662dea
[TF] Fix creating a PR while pushing in TF framework ( #21968 )
...
* add create pr arg
* style
* add test
* fixup
* update test
* last nit fix typo
* add `is_pt_tf_cross_test` marker for the tests
2023-03-07 17:32:08 +01:00
Matt
d128f2ffab
Stop requiring Torch for our TF examples! ( #21997 )
...
* Stop requiring Torch for our TF examples!
* Slight tweak to logging in the example itself
2023-03-07 15:54:10 +00:00
Sanchit Gandhi
7c39318136
[Whisper] Add model for audio classification ( #21754 )
...
* [Whisper] Add model for audio classification
* make fix-copies
* add to docs
* add docstring
* empty returns
* add code example
* switch to fleurs
* stick everything on one line
2023-03-07 16:20:21 +01:00
Yih-Dar
9402788b34
Skip test_multi_gpu_data_parallel_forward for some model tests ( #21991 )
...
skip test_multi_gpu_data_parallel_forward for some model tests
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-03-07 14:23:36 +01:00
Yih-Dar
99c5c6079d
Update notification_service.py ( #21992 )
...
* better check
* better check
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-03-07 14:20:39 +01:00
regisss
10bcbcae30
Remove unneeded casts to bool ( #21983 )
...
Remove cast to Bool
2023-03-07 07:35:49 -05:00
NielsRogge
95408e9953
[DETR, YOLOS] Fix device bug ( #21974 )
...
* Fix integration test
* Add test
* Add test
2023-03-07 07:34:04 -05:00
Elad Segal
eec46b4f75
Fix MinNewTokensLengthLogitsProcessor when used with a list of eos tokens ( #21959 )
...
* Fix MinNewTokensLengthLogitsProcessor when used with a list of eos tokens
* fix docs
* Empty commit
* formatting
2023-03-07 11:59:22 +00:00
amyeroberts
4063fd9cba
Add check before int casting for PIL conversion ( #21969 )
...
* Add check before int casting for PIL conversion
* Line length
* Tidier logic
2023-03-07 11:14:09 +00:00
Yih-Dar
5b28b78332
Update Jukebox tests ( #21984 )
...
* update expected values for jukebox
* update expected values for jukebox
* update expected values for jukebox
* update expected values for jukebox
* update expected values for jukebox
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-03-07 04:20:14 +01:00
PD Hall
31e3c6c393
docs: improve clarity for language modeling ( #21952 )
...
* docs: improve clarity for clm/mlm
* docs: remove incorrect explanation
* docs: remove incorrect explanation
---------
Co-authored-by: pdhall99 <pdhall99>
2023-03-06 13:13:43 -05:00
Karim Foda
0ce5236dd1
Fix gradient checkpointing bug in ESM ( #21980 )
2023-03-06 17:44:53 +00:00
Karim Foda
de496ef08b
Fix gradient checkpointing bug in Codegen ( #21979 )
2023-03-06 17:44:31 +00:00
Karim Foda
4a545d18e2
Fix gradient checkpointing bug in BlipText ( #21978 )
...
Make Format
2023-03-06 17:43:52 +00:00
Karim Foda
451263b841
Fix gradient checkpointing bug in Blenderbot Small ( #21977 )
2023-03-06 17:43:25 +00:00
Karim Foda
4f84dedc03
Fix gradient checkpointing bug in BigBird Pegasus ( #21976 )
2023-03-06 17:42:52 +00:00
Yih-Dar
f2a2616b74
Update expected values for test_xglm_sample ( #21975 )
...
update expected values for xglm
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-03-06 18:07:31 +01:00
Matt
5d8efc79db
Add TF contrastive image text finetuning example ( #21939 )
...
* Initial commit
* stash commit
* Add model checkpointing and pushing
* Fix model name inference
* Update README
* Update README
* Remove a couple of Torch references
* Update copyright date
* make fixup
* Update PushToHubCallback args!
* Remove the torch summary
* Add strategy.scope
2023-03-06 16:57:40 +00:00
Yih-Dar
9474abdf47
Use larger atol in torch.allclose for some tests ( #21966 )
...
Use larger atol
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-03-06 17:41:00 +01:00
Aayush Neupane
64d95c44ec
Add missing parameter definition in layoutlm config ( #21960 )
...
Four parameters in the `LayoutLM` config were missing definitions; added them (copied from BertConfig).
2023-03-06 15:20:11 +00:00
Srimanth Agastyaraju
f3c75f8b44
[Generate] Fix gradient_checkpointing and use_cache bug for BLOOM ( #21956 )
...
Step 1 - Change use_cache fix
2023-03-06 14:56:40 +00:00
saswatmeher
934d0b8bdd
Fix bert issue ( #21963 )
...
Co-authored-by: saswatmeher <saswatmeher@cse.iitb.ac.in>
2023-03-06 14:55:31 +00:00
aws-sangeetha
0bb17295f0
Disable DDP for neuron ( #21953 )
...
Disable DDP for neuron
Co-authored-by: EC2 Default User <ec2-user@ip-172-31-42-72.us-west-2.compute.internal>
2023-03-06 09:33:44 -05:00
Arthur
bc33fbf956
[CI] Fix ci ( #21940 )
...
* fix `get_proposal_pos_embed`
* fix order
* style
* zero shot simplify test
* add approximate values for zero shot audio classification
2023-03-06 15:22:27 +01:00
Yih-Dar
fcf813417a
Update expected values in XLMProphetNetModelIntegrationTest ( #21957 )
...
update values
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-03-06 09:15:44 +01:00
Batese2001
699a2293cc
Fixed gradient_checkpointing/use_cache bug in blenderbot ( #21833 )
...
* Fixed gradient_checkpointing/use_cache bug in blenderbot
* Update modeling_blenderbot.py
* Added back if statement
* Formatted using black
2023-03-04 15:45:53 +00:00
Karim Foda
6feb39b43c
Fix gradient checkpointing bug in Roformer ( #21946 )
2023-03-04 15:44:33 +00:00
Karim Foda
6386eb9721
Fix gradient checkpointing bug in Rembert ( #21945 )
2023-03-04 15:44:06 +00:00
Karim Foda
f12c74f51e
Fix gradient checkpointing bug in Pegasus ( #21944 )
2023-03-04 15:43:32 +00:00
Karim Foda
f932ee61b9
Fix gradient checkpointing bug in OPT ( #21943 )
2023-03-04 15:42:57 +00:00
bofeng huang
003a7cc608
[Whisper] Fix feature normalization in WhisperFeatureExtractor ( #21938 )
...
Fix feature normalization in WhisperFeatureExtractor
2023-03-03 14:21:13 -05:00
Arthur
718e9d777f
[CLAP] Support batched inputs for CLAP. Fixes pipeline issues ( #21931 )
...
* fix pipeline
* fix feature_extraction clap
* you can now batch the `is_longer` attribute
* add tests
* fixup
* add expected scores
* comment on is_longer
2023-03-03 18:42:18 +01:00
Victor Muštar
c5fe06c59d
Update README logo ( #21933 )
2023-03-03 11:57:39 -05:00
Arthur
82aac00e0f
[Flan-UL2] Add-flan-ul2 ( #21929 )
...
* add doc and readme
* add model docs
* update toctree and fix copies
* update
* update doc file
* fix
* add FLAN-UL2 to configuration mapping
* fixup
* Apply suggestions from code review
* more clarification
---------
Co-authored-by: younesbelakda <younesbelkada@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
2023-03-03 17:57:24 +01:00
substanc3
956ae62139
Fix wrong documentation about DataCollator padding defaults ( #21919 )
...
* Fix wrong documentation about DataCollator padding defaults
* Fix styling
2023-03-03 11:51:54 -05:00
Yih-Dar
8c40ba73d8
Avoid failure in check_repo.py due to missing backends ( #21930 )
...
* Update utils/check_repo.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update utils/check_repo.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-03-03 15:34:20 +01:00
Yih-Dar
d4306daea1
Fix AlignModelTest tests ( #21923 )
...
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-03-03 14:47:09 +01:00
Zach Nussbaum
c5a1ff9ef0
feat: filter try/except when looking at custom code ( #21914 )
...
* feat: filter try/except
* Update src/transformers/dynamic_module_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-03-03 08:43:59 -05:00
Yih-Dar
02a77fa04c
Cleanup more auto mapping names ( #21909 )
...
* fix auto 2
* fix auto 2
* fix task guide issue
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-03-03 14:43:44 +01:00
Yih-Dar
b05e0bec88
Use large VM for repo_utils_job ( #21928 )
...
upgrade to large VM
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-03-03 14:43:03 +01:00
Yih-Dar
fa9d2ad7ec
Update model_split_percents for WhisperModelTest ( #21922 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-03-03 14:35:08 +01:00
Karim Foda
c82bd37169
Fix gradient checkpointing megatron bert ( #21921 )
2023-03-03 11:50:21 +00:00
Karim Foda
99a62347fb
Fix gradient checkpointing bug in mvp ( #21920 )
2023-03-03 11:49:49 +00:00
Karim Foda
e407b5a323
Fix gradient checkpointing bug in MBart ( #21918 )
2023-03-03 11:49:27 +00:00
Arthur
dcec3277cd
faster forward following what is done for images ( #21906 )
...
* faster forward following what is done for images
* add missing licence
2023-03-03 06:18:18 +01:00
Matt
37e0974afc
Fix doctests for TFVisionTextDualEncoder ( #21910 )
2023-03-03 00:18:11 +00:00
Yih-Dar
9f5bfe1b99
Avoid modeling tests run in pipeline CI jobs ( #21911 )
...
* rework is_pipeline_test
* bring back 3 tests
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-03-02 21:23:06 +01:00
Kashif Rasul
db979f7588
[time series] Add Time series inputs tests ( #21846 )
...
* initial test of inputs
* added test for generation
* remove asserts
* fixed test
* Update tests/models/time_series_transformer/test_modeling_time_series_transformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
---------
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
2023-03-02 20:43:35 +01:00
Nicolas Patry
b2a41d2be4
Faster zero shot image ( #21897 )
...
* Make ZeroShotImageClassificationPipeline faster
The pipeline makes separate calls to model for each candidate label.
This commit combines all labels into one call.
Original code takes more than 60 seconds to process one image and 1000
candidate labels. Updated code takes less than 2 seconds.
* implement batching
* code formatting
* Creating an even faster zero-shot-image-classification.
Unfortunately super tailored towards CLIP.
Co-Authored-By: Yessen Kanapin <yessen@deepinfra.com>
* Quality.
* Cleanup.
* Order different on the CI it seems.
* Cleanup.
* Quality.
---------
Co-authored-by: Yessen Kanapin <yessen@deepinfra.com>
2023-03-02 19:46:22 +01:00
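
Note on "Faster zero shot image" ( #21897 ): the commit body describes replacing one model call per candidate label with a single call covering all labels. The sketch below illustrates only that idea and is not the pipeline's actual code. `CountingModel`, `classify_per_label`, and `classify_batched` are hypothetical names, and the "model" is a toy scoring function standing in for a CLIP-like forward pass.

```python
import math
from typing import List, Sequence


class CountingModel:
    """Hypothetical stand-in for a CLIP-like model; counts forward calls."""

    def __init__(self) -> None:
        self.calls = 0

    def __call__(self, image: Sequence[float], labels: Sequence[str]) -> List[float]:
        # One "forward pass" returns one logit per label passed in.
        self.calls += 1
        # Toy logit (an assumption for illustration, not a real similarity score).
        return [float(len(label)) + sum(image) for label in labels]


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a flat list of logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify_per_label(model: CountingModel, image: Sequence[float],
                       labels: Sequence[str]) -> List[float]:
    """Old behaviour as described: one model call per candidate label."""
    logits = [model(image, [label])[0] for label in labels]
    return softmax(logits)


def classify_batched(model: CountingModel, image: Sequence[float],
                     labels: Sequence[str]) -> List[float]:
    """New behaviour as described: all candidate labels in a single call."""
    return softmax(model(image, list(labels)))


if __name__ == "__main__":
    image = [0.1, 0.2]
    labels = ["cat", "dog", "horse"]
    slow, fast = CountingModel(), CountingModel()
    classify_per_label(slow, image, labels)
    classify_batched(fast, image, labels)
    print(f"per-label calls: {slow.calls}, batched calls: {fast.calls}")
```

Both functions return the same probabilities; only the number of forward passes differs, which is where the speed-up reported in the commit message would come from.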