Sylvain Gugger
74690b62a1
Pin ffspec ( #18837 )
...
* Pin ffspec
* Typo
2022-08-31 19:04:04 +02:00
NielsRogge
3b6943e7a3
[DETR] Add num_channels attribute ( #18714 )
...
* Add num_channels attribute
* Fix code quality
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
2022-08-31 18:04:42 +02:00
Shu Takayama
811c4c9f79
fix bug: register_for_auto_class should be defined on TFPreTrainedModel instead of TFSequenceSummary ( #18607 )
2022-08-31 16:37:18 +02:00
Lysandre Debut
ee407024c4
Update location identification ( #18834 )
2022-08-31 15:10:25 +02:00
Zachary Mueller
e4910213be
Warn on TPUs when the custom optimizer and model device are not the same ( #18668 )
...
* Check optimizer for device on TPU
* Typo
2022-08-31 08:46:31 -04:00
Wang, Yi
cdde85a0a0
oob performance improvement for cpu DDP ( #18595 )
...
* oob performance improvement for cpu DDP
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
* add is_psutil_available check
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
2022-08-31 14:35:10 +02:00
Peter Jung
c3be98ebab
Fix cost condition in DetrHungarianMatcher and YolosHungarianMatcher to allow zero-cost ( #18647 )
...
* Fix loss condition in DetrHungarianMatcher
* Fix costs condition in YolosHungarianMatcher
2022-08-31 14:28:58 +02:00
Joao Gante
fea4636cfa
Pin max tf version ( #18818 )
2022-08-31 10:07:53 +02:00
Ankur Goyal
5c4c869014
Add LayoutLMForQuestionAnswering model ( #18407 )
...
* Add LayoutLMForQuestionAnswering model
* Fix output
* Remove TF TODOs
* Add test cases
* Add docs
* TF implementation
* Fix PT/TF equivalence
* Fix loss
* make fixup
* Fix up documentation code examples
* Fix up documentation examples + test them
* Remove LayoutLMForQuestionAnswering from the auto mapping
* Docstrings
* Add better docstrings
* Undo whitespace changes
* Update tokenizers in comments
* Fixup code and remove `from_pt=True`
* Fix tests
* Revert some unexpected docstring changes
* Fix tests by overriding _prepare_for_class
Co-authored-by: Ankur Goyal <ankur@impira.com>
2022-08-31 10:05:33 +02:00
Yih-Dar
e88e9ff045
Disable nightly CI temporarily ( #18820 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-08-30 18:33:09 +02:00
Nicolas Patry
73c6273d48
Improving the documentation for "word", within the pipeline. ( #18763 )
...
* Improving the documentation for "word", within the pipeline.
* Quality.
2022-08-30 15:29:48 +02:00
Dan Tegzes
5727dfcebe
Added Docstrings for Deberta and DebertaV2 [PyTorch] ( #18610 )
...
* Added Doctest for Deberta Pytorch
* Added path in documentation test file
* Added docstrings for DebertaV2
* Revert "Added docstrings for DebertaV2"
This reverts commit 307185e62a
.
* Added DebertaV2 Docstrings
2022-08-30 14:46:21 +02:00
anthony2261
a98f6a1da0
LayoutXLMProcessor: ensure 1-to-1 mapping between samples and images, and add test for it ( #18774 )
2022-08-30 14:43:14 +02:00
Dhruv Karan
220da3b8a1
Adds GroupViT to models exportable with ONNX ( #18628 )
...
* groupvit to onnx
* dynamic shape for pixel values dim
2022-08-30 14:31:35 +02:00
Dhruv Karan
46d0e26a27
Adds OWLViT to models exportable with ONNX ( #18588 )
...
* onnx conversion for owlvit
* .T to .t()
* dynamic shapes for pixel values
2022-08-30 14:30:59 +02:00
NielsRogge
b83796ded7
Remove ViltForQuestionAnswering from check_repo ( #18762 )
...
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
2022-08-30 14:15:36 +02:00
amyeroberts
ef91a2d135
Run tests if skip condition not met ( #18764 )
...
* Run tests if skip condition not met
* Update comment - remove outdated ref to TF 2.8
2022-08-30 14:03:28 +02:00
Christoffer Koo Øhrstrøm
de8548ebf3
[LayoutLMv3] Add TensorFlow implementation ( #18678 )
...
Co-authored-by: Esben Toke Christensen <esben.christensen@visma.com>
Co-authored-by: Lasse Reedtz <lasse.reedtz@visma.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
2022-08-30 11:48:11 +01:00
NielsRogge
7320d95d98
[Swin, Swinv2] Fix attn_mask dtype ( #18803 )
...
* Add dtype
* Fix Swinv2 as well
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
2022-08-30 12:31:34 +02:00
Li-Huai (Allan) Lin
5c702175eb
up ( #18805 )
2022-08-30 12:30:46 +02:00
Ekagra Ranjan
da02b4035c
Add docstring for BartForCausalLM ( #18795 )
...
* add docstring for BartForCausalLM
* doc-style fic
2022-08-30 12:19:03 +02:00
amyeroberts
8c4a11493f
Revert to and safely handle flag in owlvit config ( #18750 )
2022-08-29 18:48:24 +02:00
Yih-Dar
da5bb29219
send model to the correct device ( #18800 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-08-29 18:46:30 +02:00
NielsRogge
f1fd460694
Add SegFormer and ViLT links ( #18808 )
...
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
2022-08-29 18:46:07 +02:00
Lucain
169b8cde47
Fix mock in test_cached_files_are_used_when_internet_is_down
( #18804 )
2022-08-29 15:56:08 +02:00
Yih-Dar
8b67f20935
Fix memory leak issue in torch_fx
tests ( #18547 )
...
Co-authored-by: Lysandre Debut <hi@lysand.re>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-08-29 11:43:20 +02:00
fatih
b10a3b3760
fix a possible typo in auto feature extraction ( #18779 )
2022-08-29 11:24:53 +02:00
Yih-Dar
5f06a09b9f
fix missing block when there is no failure ( #18775 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-08-29 09:10:13 +02:00
Philipp Schmid
f2fbe44753
Fix broken link DeepSpeed documentation link ( #18783 )
...
* Fix broken link
* Trigger CI
Co-authored-by: Stas Bekman <stas@stason.org>
2022-08-28 19:32:19 -07:00
Duong A. Nguyen
21f6f58721
Fix incomplete outputs of FlaxBert ( #18772 )
...
* Fix incomplete FlaxBert outputs
* fix big_bird electra roberta
2022-08-26 21:04:18 +02:00
Patrick von Platen
62ceb4d661
[Wav2vec2 + LM Test] Improve wav2vec2 with lm tests and make torch version dependent for now ( #18749 )
...
* add first generation tutorial
* remove generation
* make version dependent expected values
* Apply suggestions from code review
* Update tests/models/wav2vec2_with_lm/test_processor_wav2vec2_with_lm.py
* fix typo
2022-08-26 14:11:55 +02:00
Patrick von Platen
8869bf41fe
[VisionEncoderDecoder] Add gradient checkpointing ( #18697 )
...
* add first generation tutorial
* VisionEnocderDecoder gradient checkpointing
* remove generation
* add tests
2022-08-26 14:11:27 +02:00
Joao Gante
06a6a4bd51
CLI: Improved error control and updated hub requirement ( #18752 )
2022-08-25 17:08:05 +01:00
Rahul A R
e9442440fc
streamlining 'checkpointing_steps' parsing ( #18755 )
2022-08-25 11:00:38 -04:00
Craig Chan
fbf382c84d
Determine framework automatically before ONNX export ( #18615 )
...
* Automatic detection for framework to use when exporting to ONNX
* Log message change
* Incorporating PR comments, adding unit test
* Adding tf for pip install for run_tests_onnxruntime CI
* Restoring past changes to circleci yaml and test_onnx_v2.py, tests moved to tests/onnx/test_features.py
* Fixup
* Adding test to fetcher
* Updating circleci config to log more
* Changing test class name
* Comment typo fix in tests/onnx/test_features.py
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
* Moving torch_str/tf_str to self.framework_pt/tf
* Remove -rA flag in circleci config
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
2022-08-25 16:31:34 +02:00
Patrick Deutschmann
3223d49354
Add ONNX support for Longformer ( #17176 )
...
* Implement ONNX support for Longformer
Fix repo consistency check complaints
Fix value mismatches
Add pooler output for default model
Increase validation atol to accommodate multiple-choice error
Fix copies
Fix chunking for longer sequence lengths
Add future comment
* Fix issue in mask_invalid_locations
* Remove torch imports in configuration_longformer
* Change config access to fix LED
* Push opset version to support tril
* Work in review comments (mostly style)
* Add Longformer to ONNX tests
2022-08-25 08:34:42 +02:00
Rahul A R
c55d6e4e10
examples/run_summarization_no_trainer: fixed incorrect param to hasattr ( #18720 )
...
* fixed incorrect param to hasattr
* simplified condition checks
* code cleanup
2022-08-24 12:12:42 -04:00
SaulLu
6667b0d7bf
add warning to let the user know that the __call__
method is faster than encode
+ pad
for a fast tokenizer ( #18693 )
...
* add warning to let the user know that the method is slower that for a fast tokenizer
* user warnings
* fix layoutlmv2
* fix layout*
* change warnings into logger.warning
2022-08-24 06:27:56 -04:00
Juyoung Kim
dcff504e18
fixed docstring typos ( #18739 )
...
* fixed docstring typos
* Added missing colon
Co-authored-by: 김주영 <juyoung@zezedu.com>
2022-08-24 06:20:27 -04:00
dependabot[bot]
e49c71fc4c
Bump nbconvert from 6.3.0 to 6.5.1 in /examples/research_projects/lxmert ( #18742 )
...
Bumps [nbconvert](https://github.com/jupyter/nbconvert ) from 6.3.0 to 6.5.1.
- [Release notes](https://github.com/jupyter/nbconvert/releases )
- [Commits](https://github.com/jupyter/nbconvert/compare/6.3.0...6.5.1 )
---
updated-dependencies:
- dependency-name: nbconvert
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-08-24 06:12:56 -04:00
dependabot[bot]
5b24949669
Bump nbconvert in /examples/research_projects/visual_bert ( #18741 )
...
Bumps [nbconvert](https://github.com/jupyter/nbconvert ) from 6.3.0 to 6.5.1.
- [Release notes](https://github.com/jupyter/nbconvert/releases )
- [Commits](https://github.com/jupyter/nbconvert/compare/6.3.0...6.5.1 )
---
updated-dependencies:
- dependency-name: nbconvert
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-08-24 06:12:48 -04:00
Daniel Stancl
c72d7d91bf
Add TF implementation of XGLMModel
( #16543 )
...
* Add TFXGLM models
* Add todo: self.supports_xla_generation = False
Co-authored-by: Daniel Stancl <stancld@Daniels-MacBook-Pro.local>
Co-authored-by: Daniel Stancl <stancld@daniels-mbp.home>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Daniel <daniel.stancl@rossum.ai>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2022-08-24 10:51:05 +01:00
Yih-Dar
cecf9f9b27
fix pipeline_tutorial.mdx doctest ( #18717 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-08-24 05:38:03 -04:00
Constantin Hütterer
a442884b87
Add minor doc-string change to include hp_name param in hyperparameter_search ( #18700 )
...
* Add minor doc-string change to include hp_name
* fix: missing type-information for kwargs
* fix: missing white-space in hyperparameter_search doc-strings
2022-08-24 05:07:17 -04:00
Mishig Davaadorj
c12dbdc246
Update perf_infer_gpu_many.mdx ( #18744 )
2022-08-24 10:37:52 +02:00
Joao Gante
6faf283288
CLI: Don't check the model head when there is no model head ( #18733 )
2022-08-23 15:38:59 +01:00
SaulLu
438698085c
improve add_tokens
docstring ( #18687 )
...
* improve add_tokens documentation
* format
2022-08-23 07:23:51 -04:00
Nicolas Patry
891704b3c2
Removing warning of model type for microsoft/tapex-base-finetuned-wtq
( #18711 )
...
and friends.
2022-08-23 13:17:06 +02:00
Yih-Dar
84beb8a49b
Unpin detectron2 ( #18727 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-08-23 11:10:07 +02:00
Atharva Ingle
d90a36d192
remove check for main process for trackers initialization ( #18706 )
2022-08-22 11:16:27 -04:00