TilmannR
6e3b014471
Fix docstring of CLIPTokenizer(Fast) ( #20233 )
2022-11-15 10:00:16 -05:00
Yih-Dar
cf7b98b807
Fix run_clip.py ( #20234 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-11-15 15:45:21 +01:00
Kendall
683cbc4c34
fixed spelling error in testing.mdx ( #20220 )
2022-11-15 09:40:06 -05:00
Yih-Dar
6ed6ed29b1
fix device issue ( #20227 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-11-15 15:21:16 +01:00
Matt
d3d5fa3e85
Add missing ESM autoclass ( #20177 )
...
* Add missing ESM autoclass
* Correct ESMFold checkpoint
2022-11-15 14:20:22 +00:00
Arthur
92cfe8b074
Remove authorized_missing_keys in favor of _keys_to_ignore_on_load_missing ( #20228 )
2022-11-15 15:12:41 +01:00
Yong woo Song
2d92001076
Typo on docstring in ElectraTokenizer ( #20192 )
...
* chore: typo on docstring in tokenization_electra
* chore: typo on docstring in tokenization_electra
* update for check copies
2022-11-15 09:10:20 -05:00
amyeroberts
4c7e8d0900
Add object detection + segmentation transforms ( #20003 )
...
* Add transforms for object detection
* Update src/transformers/image_transforms.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Better var names & docstring
* Remove unused var desc in docstring
* Update src/transformers/image_transforms.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-11-15 12:50:03 +00:00
Younes Belkada
163ac3d3ee
Add Switch transformers ( #19323 )
...
* first commit
* add more comments
* add router v1
* clean up
- remove `tf` modeling files
* clean up
- remove `tf` modeling files
* clean up
* v0 routers
* added more router
- Implemented `ExpertsChooseMaskedRouter`
- added tests
- 2 more routers to implement
* last router
* improved docstring
- completed the docstring in `router.py`
- added more args in the config
* v0 sparse mlp
* replace wrong naming
* forward pass run
* update MOE layer
* small router update
* fixup
* consistency
* remove scatter router
* remove abstract layer
* update test and model for integration testing
* v1 conversion
* update
* hardcode hack
* all keys match
* add gin conversion, without additional libraries
* update conversion script
* delete router file
* update tests wrt router deletion
* fix router issues
* update expert code
* update, logits match, code needs REFACTORING
* Refactor code
Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com>
* add generate tests
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
* add support for router loss
Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com>
* fix forward error
* refactor a bit
* remove `FlaxSwitchTransformers` modules
* more tests pass
* Update code
Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com>
* fixup
* fix tests
* fix doc
* fix doc + tokenization
* fix tokenizer test
* fix test
* fix loss output
* update code for backward pass
* add loss support
* update documentation
* fix documentation, clean tokenizer
* more doc fix, cleanup example_switch
* fix failing test
* fix test
* fix test
* fix loss issue
* move layer
* update doc and fix router capacity usage
* fixup
* add sparse mlp index for documentation on hub
* fixup
* test sparse mix architecture
* Apply suggestions from code review
* Update docs/source/en/model_doc/switch_transformers.mdx
* fixup on update
* fix tests
* fix another test
* attempt fix
* Update src/transformers/models/switch_transformers/configuration_switch_transformers.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update src/transformers/models/switch_transformers/convert_switch_transformers_original_flax_checkpoint_to_pytorch.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* try
* all tests pass
* fix jitter noise
* Apply suggestions from code review
* doc tests pass
* Update src/transformers/models/switch_transformers/modeling_switch_transformers.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update src/transformers/models/switch_transformers/modeling_switch_transformers.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* remove assert
* change config order
* fix readme japanese
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* remove parallelizable tests + add one liners
* remove ONNX config
* fix nits
- add `T5Tokenizer` in auto mapping
- remove `Switch Transformers` from ONNX supported models
* remove `_get_router`
* remove asserts
* add check in test for `router_dtype`
* add `SwitchTransformersConfig` in `run_pipeline_test`
* Update tests/pipelines/test_pipelines_summarization.py
* add huge model conversion script
* fix slow tests
- add better casting for `Linear8bitLt`
- remove `torchscript` tests
* add make dir
* style on new script
* fix nits
- doctest
- remove `_keys_to_ignore_on_load_unexpected`
* Update src/transformers/models/switch_transformers/configuration_switch_transformers.py
* add google as authors
* fix year
* remove last `assert` statements
* standardize vertical spaces
* fix failing import
* fix another failing test
* Remove strange `authorized_keys`
* removing todo and padding that is never used
Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>
Co-authored-by: ybelkada <younes@huggingface.co>
Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Arthur Zucker <arthur@huggingface.co>
2022-11-15 13:06:45 +01:00
amyeroberts
55ba31908a
Add param_name to size_dict logs & tidy ( #20205 )
2022-11-15 10:52:58 +00:00
Younes Belkada
f1e8c48c5e
Add accelerate support for ViT family ( #20174 )
...
* add `accelerate` support for `ViT` family
- add `_no_split_modules`
- manually cast to the right `dtype`: to change
* enable `float16` for `deit`
* fix `make fixup`
* add `slow` test for `fp16` inference
* another safety check
* Update src/transformers/models/deit/modeling_deit.py
2022-11-15 11:06:01 +01:00
Arthur
11b2e45ccc
[WHISPER] Update modeling tests ( #20162 )
...
* Update modeling tests
* update tokenization test
* typo
* nit
* fix expected attention outputs
* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
* Update tests from review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
* remove problematic kwargs passed to the padding function
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-11-15 11:04:58 +01:00
Arthur
f60eec4003
update relative positional embedding ( #20203 )
...
* update relative positional embedding
* make fix copies
* add `use_cache` to list of arguments
* fixup
* 1line function
* add `test_decoder_model_past_with_large_inputs_relative_pos_emb`
* add relative pos embedding test for more models
* style
2022-11-15 10:46:34 +01:00
Yih-Dar
f9909fbf85
Make ImageSegmentationPipelineTests less flaky ( #20147 )
...
* Fix ImageSegmentationPipelineTests
* Use 0.9
* no zip
* links to show images
* links to show images
* rebase
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-11-15 09:14:55 +01:00
bofeng huang
9625924c60
Update tokenizer_summary.mdx ( #20135 )
2022-11-15 01:18:13 +01:00
Wonhyeong Seo
8fadfd5035
[docs] set overflowing image width to auto-scale ( #20197 )
...
* docs: fix: set overflowing image width to auto-scale
* docs: fix: new language Korean is also affected
* docs: fix: unnecessary line break in index page
2022-11-15 01:13:40 +01:00
Nicolas Patry
25c451e5a0
Adding chunking for whisper (all seq2seq actually). Very crude matching algorithm. ( #20104 )
...
* Very crude matching algorithm.
* Fixing tests.
* Removing comments
* Adding warning + fix short matches.
* Cleanup tests.
* Quality.
* Less noisy.
* Fixup.
2022-11-14 22:32:50 +01:00
Joao Gante
938cb04789
Generate: add Bloom fixes for contrastive search ( #20213 )
2022-11-14 18:34:11 +00:00
amyeroberts
fda125638f
Downgrade log warning -> info ( #20202 )
2022-11-14 17:56:52 +00:00
Ming Liu
36b063ed4f
Update README.md ( #20188 )
...
There is a typo in the original hyperlink.
Below is the original version:
Based on the script [`run_translation_no_trainer.py`](https://github.com/huggingface/transformers/blob/main/examples/pytorch/translation/**run_translationn_no_trainer.py **).
2022-11-14 12:53:02 -05:00
Yih-Dar
536e60d2c7
mark test_save_load_fast_init_from_base as is_flaky ( #20200 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-11-14 18:51:33 +01:00
Sanchit Gandhi
af1a7c8ca3
[Examples] Generalise Seq2Seq ASR to handle Whisper ( #19519 )
...
* merge conflicts
* bos and eos in datacollator
* (temp) hardcode removal of attention mask
* freeze encoder
* actually freeze encoder
* set max length / num beams according to gen kwargs
* (temp) fix tests
* don't pop attn mask
* override return attention mask config from Hub
* Hub configs updated 🤗
* final fixes
* update type annotations
* backward comp
2022-11-14 17:45:46 +00:00
Wonhyeong Seo
7ecb039176
feat: add i18n issue template ( #20199 )
...
Part of #20183
docs: add relevant labels to i18n issue template
fix: typo on completion count
2022-11-14 12:36:58 -05:00
Wonhyeong Seo
07d8d6e2f7
docs: translated index page to korean ( #20180 )
...
docs: i18n: first draft of index page
docs: fix: first revision of index page
docs: i18n: missed section - supported frameworks
docs: fix: second revision of index page
review by @ArthurZucker
refactor: remove untranslated files from korean
docs: fix: remove untranslated references from toctree.yml
feat: enable korean docs in gh actions
docs: feat: add in_translation page as placeholder
docs: bug: testing if internal toc need alphabet chars
docs: fix: custom english anchor for non-alphanumeric headings
review by @sgugger
docs: i18n: translate comments on install methods in _config.py
docs: refactor: more concise wording for translations
2022-11-14 12:09:21 -05:00
Arthur
c149d366bb
add _keys_to_ignore_on_load_unexpected = [r"pooler"] ( #20210 )
2022-11-14 18:05:19 +01:00
Younes Belkada
8dcf494ef1
[ROC_BERT] Make CI happy ( #20175 )
...
* fix slow test
* Update tests/models/roc_bert/test_modeling_roc_bert.py
2022-11-14 18:04:25 +01:00
Joao Gante
7b55bb4540
Generate: TF sample doctest result update ( #20208 )
2022-11-14 15:42:48 +00:00
IMvision12
d24e84d9ed
Pytorch type hints ( #20112 )
...
* initial commit
* Update modeling_whisper.py
* Fixing Tests
* modeling_vision_text_dual_encoder
* modeling_vision_encoder_decoder
* Update modeling_vit.py
* Update modeling_vit_msn.py
* Update modeling_trajectory_transformer.py
* style
* Update modeling_time_series_transformer.py
* Update modeling_time_series_transformer.py
* Update modeling_segformer.py
* Update modeling_plbart.py
* Update modeling_dpt.py
* Update modeling_deit.py
* Update modeling_dpt.py
* Update modeling_esm.py
* Update modeling_fnet.py
* Update modeling_fnet.py
* Update modeling_fnet.py
* Update modeling_flava.py
* Update modeling_flava.py
* Update modeling_layoutlmv3.py
* Update modeling_levit.py
2022-11-14 12:39:18 +00:00
Nicolas Patry
03bc6ece1b
Proposal Remove the weird inspect in ASR pipeline and make WhisperEncoder just nice to use. ( #19571 )
...
* Proposal Remove the weird `inspect` in ASR pipeline and make
WhisperEncoder just nice to use.
It seems that accepting `attention_mask` is kind of an invariant of our
models. For Seq2Seq ASR models, we had a special comment on how it
actually was important to send it.
`inspecting` seems a pretty brittle way to handle this case.
My suggestion is to simply add it as a kwarg and just ignore
it, with the docstring explaining why it's ignored.
* Fixup.
* Update src/transformers/models/whisper/modeling_whisper.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Doc fixing.
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2022-11-14 09:34:30 +01:00
code-with-rajeev
2308f3d42c
Update README.md ( #19530 )
...
Fixed a grammatical error.
2022-11-14 01:36:38 -05:00
Bartosz Szmelczynski
78a471ff71
Fix tapas scatter ( #20149 )
...
* First draft
* Remove scatter dependency
* Add require_torch
* update vectorized sum test, add clone call
* remove artifacts
* fix style
* fix style v2
* remove "scatter" mentions from the code base
* fix isort error
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-11-14 01:04:26 -05:00
Matthijs Hollemans
f711d683b5
add MobileNetV2 model ( #17845 )
...
* add model files etc for MobileNetV2
* rename files for MobileNetV1
* initial implementation of MobileNetV1
* fix conversion script
* cleanup
* write docs
* tweaks
* fix conversion script
* extract hidden states
* fix test cases
* make fixup
* fixup it all
* rename V1 to V2
* fix checkpoints
* fixup
* implement first block + weight conversion
* add remaining layers
* add output stride and dilation
* fixup
* add tests
* add deeplabv3+ head
* a bit of fixup
* finish deeplab conversion
* add link to doc
* fix issue with JIT trace
in_height and in_width would be Tensor objects during JIT trace, which caused Core ML conversion to fail on the remainder op. By making them ints, the result of the padding calculation becomes a constant value.
* cleanup
* fix order of models
* fix rebase error
* remove main from doc link
* add image processor
* remove old feature extractor
* fix converter + other issues
* fixup
* fix unit test
* add to onnx tests (but these appear broken now)
* add post_process_semantic_segmentation
* use google org
* remove unused imports
* move args
* replace weird assert
2022-11-14 01:00:10 -05:00
amyeroberts
6cc06d1739
Fix type - update any PIL.Image.Resampling ( #20172 )
2022-11-11 16:55:59 +00:00
NielsRogge
cbbeca3d17
[OWL-ViT] Make model consistent with CLIP ( #20144 )
...
* Apply fix
* Fix test
* Remove another argument which is not used
* Fix pipeline test
* Add argument back, add deprecation warning
* Add warning at other location
* Use warnings instead
* Add num_channels to config
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>
2022-11-11 11:36:17 +01:00
Nicolas Patry
d3c0566679
Fix object-detection bug (height, width inversion). ( #20167 )
2022-11-11 10:14:48 +01:00
Arthur
61a51f5f23
Add Jukebox model (replaces #16875 ) ( #17826 )
2022-11-10 21:05:27 +01:00
Sylvain Gugger
9740a03f61
Skip broken test
2022-11-10 14:59:32 -05:00
Sanchit Gandhi
905e5773a3
[processor] Add 'model input names' property ( #20117 )
...
* [processor] Add 'model input names' property
* add test
* no f string
* add generic property method to mixin
* copy to multimodal
* copy to vision
* tests for all audio
* remove ad-hoc tests
* style
* fix flava test
* fix test
* fix processor code
2022-11-10 19:29:20 +00:00
Matt
68187c4642
Fix arg names for our models ( #20166 )
...
* Fix arg names for our models
* Clean out the other uses of "residx" in infer()
* make fixup
2022-11-10 16:47:58 +00:00
Joao Gante
6dda14dc47
Generate: fix TF doctests ( #20159 )
2022-11-10 15:30:39 +00:00
Yih-Dar
e0d7c831c7
Update OnnxConfig.generate_dummy_inputs to check ImageProcessingMixin ( #20157 )
...
* Check ImageProcessingMixin in OnnxConfig.generate_dummy_inputs
* Check ImageProcessingMixin in OnnxConfig.generate_dummy_inputs
* Add back
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-11-10 16:04:51 +01:00
Matthijs Hollemans
daf4436e07
doc comment fix: Args was in wrong place ( #20164 )
2022-11-10 10:02:24 -05:00
NielsRogge
9f0c72f93b
Add doc tests ( #20158 )
...
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>
2022-11-10 15:25:30 +01:00
Nicolas Patry
d066c3731b
Adding support for LayoutLMvX variants for object-detection. ( #20143 )
...
* Adding support for LayoutLMvX variants for `object-detection`.
* Revert bogus `layoutlm` feature extractor which does not exist (it was a
V2 model).
* Updated condition.
* Handling the comments.
2022-11-10 11:33:38 +01:00
Yih-Dar
7ec1dc8817
Add RoCBertTokenizer to TOKENIZER_MAPPING_NAMES ( #20141 )
...
* Add RoCBertTokenizer to TOKENIZER_MAPPING_NAMES
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-11-09 20:58:56 +01:00
Mario Šaško
67b3789133
Make DummyObject more robust ( #20146 )
2022-11-09 12:57:27 -05:00
NielsRogge
93e14486d6
[CLIPSeg] Add resources ( #20118 )
...
* Add resource
* Add tag
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
2022-11-09 18:31:22 +01:00
amyeroberts
f3d99e49d4
Update VisionEncoderDecoder to use an image processor ( #20137 )
...
* TrOCR processor uses an image processor
* Update VisionEncoderDecoder
* Add feature_extractor_class property
2022-11-09 16:31:05 +00:00
Steven Liu
a44985b41c
add cv + audio labels ( #20114 )
2022-11-09 07:40:15 -08:00
Joao Gante
f270b960d6
Generate: move generation_*.py src files into generation/*.py ( #20096 )
...
* move generation_*.py src files into generation/*.py
* populate generation.__init__ with lazy loading
* move imports and references from generation.xxx.object to generation.object
2022-11-09 15:34:08 +00:00