Steven Liu
0a75717602
Fix task guide formatting ( #21409 )
...
fix formatting
2023-02-02 10:06:26 -08:00
Yih-Dar
a6d8a149a8
Fix some pipeline tests ( #21401 )
...
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-02-02 19:03:31 +01:00
Yih-Dar
145bf41c13
Allow to add more information in is_flaky ( #21426 )
...
* Allow to add more information
* fix style
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-02-02 17:41:22 +01:00
Younes Belkada
8298e4ec02
[bnb] Fine-tuning HF 8-bit models ( #21290 )
...
* force `memory_efficient_backward=True`
* enhancements
- trainer support
- add new flag
* some changes
- internal changes in `Trainer`
- small refactor
* make quality
* Fixes
- add new testing util
- add new test
- change test in Trainer
* fix CI test
* educate users on how to ft 8bit models
* more checks
* fix `logger` error
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* adapt from review
* fix
* add comment
* use return instead
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-02-02 16:39:23 +01:00
Clémentine Fourrier
67a3920d85
Fix Graphormer test suite ( #21419 )
...
* [FIX] path for Graphormer checkpoint
* [FIX] Test suite for graphormer
* [FIX] Update graphormer default num_classes
2023-02-02 16:29:13 +01:00
Joel Lamy-Poirier
e006ab51ac
Add the GeLU activation from pytorch with the tanh approximation ( #21345 )
...
* gelu_python_tanh
* rename
* Version check, add test
* Pr comment
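
For reference, the tanh approximation named in the title is usually written as 0.5 * x * (1 + tanh(sqrt(2/pi) * (x + 0.044715 * x^3))). The sketch below is a plain-Python illustration of that formula, not the code added by this commit; `gelu_tanh` and `gelu_exact` are hypothetical names chosen here, and the exact erf-based form is included only for comparison.

```python
import math


def gelu_tanh(x: float) -> float:
    """Tanh approximation of GELU for a single float (illustrative only)."""
    inner = math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)
    return 0.5 * x * (1.0 + math.tanh(inner))


def gelu_exact(x: float) -> float:
    """Exact GELU via the error function, used here only as a reference."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


if __name__ == "__main__":
    # Print both forms side by side for a few sample inputs.
    for value in (-3.0, -1.0, 0.0, 1.0, 3.0):
        print(value, gelu_tanh(value), gelu_exact(value))
```

The two forms should agree closely over typical activation ranges, which is why the approximation is often used as a cheaper substitute.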
2023-02-02 09:33:04 -05:00
Matt
53d374f1b9
Add distinct section names for PyTorch and TF ( #21422 )
...
* Add distinct section names for PyTorch and TF
* Remove extra space
2023-02-02 14:29:58 +00:00
Shikhar Tuli
0ae8dc0adf
Fix image_processor_class bug ( #21410 )
...
Co-authored-by: Shreshth Tuli <shreshthtuli@gmail.com>
2023-02-02 09:20:52 -05:00
Yih-Dar
db572b3854
Use torch 1.13.1 in push/schedule CI ( #21421 )
...
Use torch 1.13.1 in push/scheduled CI
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-02-02 14:58:52 +01:00
Joao Gante
92ce53aab8
Generate: decoder-only models can generate with inputs_embeds ( #21405 )
2023-02-01 21:50:38 +00:00
amyeroberts
e5db7051a8
Add TF image classification example script ( #19956 )
...
* TF image classification script
* Update requirements
* Fix up
* Add tests
* Update test fetcher
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Fix directory path
* Adding `zero-shot-object-detection` pipeline doctest. (#20274 )
* Adding `zero-shot-object-detection` pipeline doctest.
* Remove nested_simplify.
* Add generate kwargs to `AutomaticSpeechRecognitionPipeline` (#20952 )
* Add generate kwargs to AutomaticSpeechRecognitionPipeline
* Add test for generation kwargs
* Trigger CI
* Data collator returns np
* Update feature extractor -> image processor
* Bug fixes - updates to reflect changes in API
* Update flags to match PT & run faster
* Update instructions - Maria's comment
* Update examples/tensorflow/image-classification/README.md
* Remove slow decorator
---------
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
Co-authored-by: bofeng huang <bofenghuang7@gmail.com>
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>
2023-02-01 19:09:36 +00:00
Jinen Setpal
3fadb4b211
Added DagshubCallback ( #21404 )
...
* integrated logger
* bugfix
* added data
* bugfix
* model + state artifacts should log
* fixed paths
* i lied, trying again
* updated function call
* typo
this is painful :( what a stupid error
* typo
this is painful :( what a stupid error
* pivoted to adding a directory
* silly path bug
* multiple experiments
* migrated to getattr
* syntax fix
* syntax fix
* fixed repo pointer
* fixed path error
* added dataset if dataloader is present, uploaded artifacts
* variable in scope
* removed unnecessary line
* updated error type
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* trimmed unused variables, imports
* style formatting
* removed type conversion reliance
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* reverted accidental line deletion
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-02-01 13:51:46 -05:00
Sylvain Gugger
8d580779a3
Skip batches fast with accelerate ( #21390 )
...
* Skip batches fast with Accelerate
* remove debug statement
* Hack seed reload at the right time
* Reorganize RNG sync
* Fix accelerate version comp
2023-02-01 10:22:05 -05:00
raghavanone
77db257e2a
Fix the issue of using only inputs_embeds in convbert model ( #21398 )
...
* Fix the input embeds issue with tests
* Fix black and isort issue
* Clean up tests
* Add slow tag to the test introduced
* Incorporate PR feedbacks
2023-02-01 09:47:25 -05:00
Maria Khalusova
65b5035a1d
Moved LiLT under multimodal models in TOC ( #21393 )
...
moved LiLT under multimodal models
2023-02-01 08:03:00 -05:00
Patrick von Platen
90cddfa824
Add variant to transformers ( #21332 )
...
* Bump onnx in /examples/research_projects/decision_transformer
Bumps [onnx](https://github.com/onnx/onnx ) from 1.11.0 to 1.13.0.
- [Release notes](https://github.com/onnx/onnx/releases )
- [Changelog](https://github.com/onnx/onnx/blob/main/docs/Changelog.md )
- [Commits](https://github.com/onnx/onnx/compare/v1.11.0...v1.13.0 )
---
updated-dependencies:
- dependency-name: onnx
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
* adapt
* finish
* Update examples/research_projects/decision_transformer/requirements.txt
* up
* add tests
* Apply suggestions from code review
Co-authored-by: Lucain <lucainp@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* fix test
---------
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Lucain <lucainp@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
2023-02-01 09:21:52 +01:00
Yih-Dar
bc44e947f3
Update Graphormer and fix its torchscript test failures ( #21380 )
...
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-01-31 17:32:25 +01:00
Joao Gante
19d67bfecb
Generate: fix TF XLA tests on models with max_position_embeddings or max_target_positions ( #21389 )
2023-01-31 15:49:34 +00:00
Yih-Dar
6342427353
Remove more unused attributes in config classes ( #21327 )
...
* remove unused classifier_dropout
* remove unused dropout
* remove unused pooler_fn
* remove unnecessary is_encoder_decoder
* remove unnecessary drop_rate
* remove unused classifier_dropout
* remove unused classifier_dropout
* remove unused dropout
* remove unused dropout
* remove unused summary_* attributes
* remove unused tie_word_embeddings
* remove unused summary_* attributes
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-01-31 16:35:38 +01:00
raghavanone
da2a4d95a2
Add support of backward_prefetch and forward_prefetch ( #21237 )
...
* Add support of backward_prefetch and forward_prefetch
* Fix format issue
* Fix isort issue
* Fix doc style issue
* Update src/transformers/trainer.py
Co-authored-by: Sourab Mangrulkar <13534540+pacman100@users.noreply.github.com>
* Update src/transformers/training_args.py
Co-authored-by: Sourab Mangrulkar <13534540+pacman100@users.noreply.github.com>
* Update src/transformers/training_args.py
Co-authored-by: Sourab Mangrulkar <13534540+pacman100@users.noreply.github.com>
* Update src/transformers/training_args.py
Co-authored-by: Sourab Mangrulkar <13534540+pacman100@users.noreply.github.com>
* Fix black issue
* Fix doc-style issue
* Make additional fsdp parameters into fsdp config
* Fix black issue
* Remove unused imports
* Fix doc style issues
* Incorporate PR feedbacks
* Remove unused imports
* Fix tests
* Fix tests
* Fix tests
* Fix tests
* Fix tests
* Update src/transformers/training_args.py
Co-authored-by: Sourab Mangrulkar <13534540+pacman100@users.noreply.github.com>
* Fix tests
* Incorporate PR feedbacks
* Incorporate PR feedbacks
* Fix black issues
---------
Co-authored-by: Sourab Mangrulkar <13534540+pacman100@users.noreply.github.com>
2023-01-31 09:51:35 -05:00
Quentin Lhoest
074d6b75fd
Simplify column_names in run_clm/mlm ( #21382 )
...
* simplify column_names in run_clm
* simplify column_names in run_mlm
* minor
2023-01-31 15:23:47 +01:00
NielsRogge
c21298a69b
[Docs] Minor fixes ( #21383 )
...
* Improve docs
* Add DETA resources
---------
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
2023-01-31 15:13:12 +01:00
regisss
d31497b196
Do not log the generation config for each prediction step in TrainerSeq2Seq ( #21385 )
...
Do not log the generation config for each iteration
2023-01-31 09:05:22 -05:00
Yih-Dar
98d40fed3a
Cleanup the usage of layer_norm_eps in some models ( #21336 )
...
* fix
* fix
* make style
* For CLIP
* For OwlViT
* For XCLIP
* For CLIPSeg
* For GroupViT
* fix docstrings
* fix docstrings
* For AltCLIP
* For ChineseCLIP
* For Blip
* For GiT
* make style
* update
* update
* update
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-01-31 13:54:16 +01:00
Joao Gante
623346ab18
Template for framework-agnostic tests ( #21348 )
2023-01-31 11:33:18 +00:00
NielsRogge
5451f8896c
Add DETA ( #20983 )
...
* First draft
* Add initial draft of conversion script
* Convert all weights
* Fix config
* Add image processor
* Fix DetaImageProcessor
* Run make fix copies
* Remove timm dependency
* Fix dummy objects
* Improve loss function
* Remove conv_encoder attribute
* Update conversion scripts
* Improve postprocessing + docs
* Fix copied from statements
* Add tests
* Improve postprocessing
* Improve postprocessing
* Update READMEs
* More improvements
* Fix rebase
* Add is_torchvision_available
* Add torchvision dependency
* Fix typo and README
* Fix bug
* Add copied from
* Fix style
* Apply suggestions
* Fix thanks to @ydshieh
* Fix another dependency check
* Simplify image processor
* Add scipy
* Improve code
* Add threshold argument
* Fix bug
* Set default threshold
* Improve integration test
* Add another integration test
* Update setup.py
* Address review
* Improve deformable attention function
* Improve copied from
* Use relative imports
* Address review
* Replace assertions
* Address review
* Update dummies
* Remove dummies
* Address comments, update READMEs
* Remove custom kernel code
* Add image processor tests
* Add requires_backends
* Add minor comment
* Update scripts
* Update organization name
* Fix defaults, add doc tests
* Add id2label for object 365
* Fix tests
* Update task guide
2023-01-31 10:43:10 +01:00
Stas Bekman
98d88b23f5
[run_(clm|mlm).py examples] add streaming dataset support ( #21343 )
...
* [run_clm example] add streaming dataset support
* unrefactor kwargs
* fix
* fix
* require datasets>=2.0.0
* port to mlm
2023-01-30 14:01:35 -08:00
BFSS
95be242adc
translate index to zh( #20095 ) ( #21351 )
...
translate index to zh
Co-authored-by: bfss <bfss@bfss.com>
2023-01-30 16:50:57 -05:00
Adit Krishnan
914e5009fa
Adding resource section to GPT-J docs ( #21270 )
...
* Added resource section to GPT-J docs
* Added most of the links found
* Addressing review comments
* Fixing formatting
* Update docs/source/en/model_doc/gptj.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Fixing one of the labels
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2023-01-30 16:48:04 -05:00
Clémentine Fourrier
14d989a91d
Fixes path for Graphormer checkpoint ( #21367 )
...
[FIX] path for Graphormer checkpoint
2023-01-30 21:48:04 +01:00
Joao Gante
42b60f8b02
Generate: Relaxed max_length and max_new_tokens coexistence ( #21347 )
...
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2023-01-30 17:53:54 +00:00
Sylvain Gugger
6eb3c66a96
Add cPython files in build ( #21372 )
2023-01-30 11:19:30 -05:00
amyeroberts
59611a0f3a
Fix DETR tests after #21144 ( #21365 )
...
* Fix annotation check
* Fix annotation check
* Update type annotations
2023-01-30 15:55:00 +00:00
Yichao 'Peak' Ji
7a2e13204f
Remove duplicate declarations in dummy inputs for TFLongformer ( #21352 )
...
Remove duplicate declarations
2023-01-30 10:03:19 -05:00
简律纯
96addecff8
Corrected ( #21350 )
2023-01-30 09:38:15 -05:00
Wang, Yi
f3a7befffa
fix the issue that the output dict of jit model could not get [0] ( #21354 )
2023-01-30 09:23:55 -05:00
Yih-Dar
c749bd405e
Pipeline testing - using tiny models on Hub ( #20426 )
...
* rework pipeline tests
* run pipeline tests
* fix
* fix
* fix
* revert the changes in get_test_pipeline() parameter list
* fix expected error message
* skip a test
* clean up
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-01-30 10:39:43 +01:00
Yih-Dar
a582cfce3c
Fix GitModelIntegrationTest.test_batched_generation device issue ( #21362 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-01-30 10:37:56 +01:00
Maria Khalusova
73a2ff6974
Automated compatible models list for task guides ( #21338 )
...
* initial commit. added tip placeholders and a script
* removed unused imports, fixed paths
* fixed generated links
* make style
* split language modeling doc into two: causal language modeling and masked language modeling
* added check_task_guides.py to make fix-copies
* review feedback addressed
2023-01-27 13:19:28 -05:00
Lucain
8f3b4a1d5b
Little cleanup: let huggingface_hub manage token retrieval ( #21333 )
...
* Let huggingface_hub manage token retrieval
* flake8
* code quality
* adapt in every PushToHubMixin children
* add explicit return type
2023-01-27 12:09:49 -05:00
Arthur
0dff407d71
[Whisper] another patch ( #21324 )
...
* another patch
* fix timestamp test modeling
* let it be negative when the token is None
2023-01-27 16:35:16 +01:00
Yih-Dar
e5eb3e22ea
Fix RobertaPreLayerNorm doctest ( #21337 )
...
* add mask="<mask>"
* update
* update
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-01-27 16:20:25 +01:00
dependabot[bot]
36b668fa06
Bump onnx from 1.11.0 to 1.13.0 in /examples/research_projects/decision_transformer ( #21331 )
...
Bump onnx in /examples/research_projects/decision_transformer
Bumps [onnx](https://github.com/onnx/onnx ) from 1.11.0 to 1.13.0.
- [Release notes](https://github.com/onnx/onnx/releases )
- [Changelog](https://github.com/onnx/onnx/blob/main/docs/Changelog.md )
- [Commits](https://github.com/onnx/onnx/compare/v1.11.0...v1.13.0 )
---
updated-dependencies:
- dependency-name: onnx
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-01-27 10:13:13 -05:00
Michael Benayoun
938f437c53
Fix M2M100 positional embedding creation for ONNX ( #21328 )
...
* Fix M2M100 positional embedding creation for ONNX
* Restore READMEs
* Trigger CI
2023-01-27 10:43:19 +01:00
altryne
7d2a5fa749
Update Hebrew language code to he per IANA registry ( #21310 )
...
Here's my original PR into whisper that makes the same change:
https://github.com/openai/whisper/pull/401
Per the [IANA registry](https://www.iana.org/assignments/language-subtag-registry/language-subtag-registry ), `iw` was deprecated as the code for Hebrew in 1989, and the preferred code is `he`.
The correct subtag:
```
%%
Type: language
Subtag: he
Description: Hebrew
Added: 2005-10-16
Suppress-Script: Hebr
%%
```
And the deprecation:
```
%%
Type: language
Subtag: iw
Description: Hebrew
Added: 2005-10-16
Deprecated: 1989-01-01
Preferred-Value: he
Suppress-Script: Hebr
%%
```
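
As a rough illustration of what the registry entries above imply for calling code, a deprecated subtag can be mapped to its `Preferred-Value` before lookup. This is only a sketch under assumptions: `DEPRECATED_LANGUAGE_SUBTAGS` and `normalize_language_code` are hypothetical names, and this is not the actual change made to the tokenizer in this commit.

```python
# Hypothetical mapping from deprecated subtags to their Preferred-Value,
# populated only with the entry quoted from the IANA registry above.
DEPRECATED_LANGUAGE_SUBTAGS = {"iw": "he"}


def normalize_language_code(code: str) -> str:
    """Return the preferred subtag for a language code, if one is known."""
    cleaned = code.strip().lower()
    # Codes that are not deprecated pass through unchanged.
    return DEPRECATED_LANGUAGE_SUBTAGS.get(cleaned, cleaned)


if __name__ == "__main__":
    for code in ("iw", "he", "EN"):
        print(code, "->", normalize_language_code(code))
```

Keeping the mapping in one table means older inputs that still use `iw` can keep working while everything downstream sees only `he`.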
2023-01-26 13:34:39 -05:00
Younes Belkada
b225ee6ea0
[Doctest] Fix Perceiver doctest ( #21318 )
...
fix `Perceiver` doctest
2023-01-26 17:16:37 +01:00
Joao Gante
2b8feffad5
Generate: better compute_transition_scores examples ( #21323 )
2023-01-26 16:06:05 +00:00
Yih-Dar
449df41f01
Fix TFEncoderDecoder tests ( #21301 )
...
remove max_length=None
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-01-26 16:56:42 +01:00
Yih-Dar
857bad6e53
check paths in utils/documentation_tests.txt ( #21315 )
...
* check paths in utils/documentation_tests.txt
* check paths in utils/documentation_tests.txt
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-01-26 15:33:47 +01:00
Nicolas Patry
fd0ef8b66d
Small QoL for qa. ( #21316 )
2023-01-26 14:50:09 +01:00