Sean Moriarity
279bc5849b
Allow saved_model export of TFCLIPModel in save_pretrained ( #16886 )
...
* CLIP Serving
* Add type hints per code review
* Use black, flake8, and isort
* Update src/transformers/models/clip/modeling_tf_clip.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Rollback serving_output and add TODO
* Remove irrelevant portions of failing tests
* Revert "Rollback serving_output and add TODO"
This reverts commit a4abfa6ba3b7875a13538dbc2ddc4eb17dfcca8d.
* Rollback to original test/serving_output
* Fix unused var
* Apply suggestions from code review
* Update formatting with black
* Fix style again from rebase
* Update tests/models/clip/test_modeling_tf_clip.py
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Sean Moriarity <sean.l.moriarity.mil@army.mil>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
2022-05-04 16:37:58 +02:00
Zachary Mueller
ef20390291
Update to build via git for accelerate ( #17084 )
2022-05-04 09:42:36 -04:00
Sylvain Gugger
bb8d40529e
Deprecate model templates ( #17062 )
...
* Deprecate model templates
* Address review comments
2022-05-04 09:36:38 -04:00
karthikrangasai
9c5ae87f13
Type hint complete Albert model file. ( #16682 )
...
* Type hint complete Albert model file.
* Update typing.
* Update src/transformers/models/albert/modeling_albert.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
2022-05-04 14:35:12 +01:00
dependabot[bot]
2bf95e2b09
Bump notebook from 6.4.1 to 6.4.10 in /examples/research_projects/lxmert ( #16634 )
...
Bumps [notebook](http://jupyter.org ) from 6.4.1 to 6.4.10.
---
updated-dependencies:
- dependency-name: notebook
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-05-04 08:27:40 -04:00
dependabot[bot]
7a229ef446
Bump notebook in /examples/research_projects/visual_bert ( #16635 )
...
Bumps [notebook](http://jupyter.org ) from 6.4.1 to 6.4.10.
---
updated-dependencies:
- dependency-name: notebook
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-05-04 08:27:27 -04:00
Sayak Paul
049e791758
Add Data2Vec for Vision in TF ( #17008 )
...
* add utilities till TFData2VecVisionLayer.
* chore: pass window_size to attention layer.
* feat: add TFData2VecVisionRelativePositionBias.
* feat: initial implementation ready for tf data2vec.
* fix: relative position bias index, table to be fixed.
* chore: implementation added, tests remaining.
* add: tests, other PR files.
* fix: code quality.
* fix: import structure in init.
* chore: run make fix-copies.
* chore: address PR feedback (round I).
* chore: styling nit.
* fix: tests due to removal of to_2tuple().
* chore: rebase with upstream main and move the test.
* Update src/transformers/models/auto/modeling_tf_auto.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/models/auto/modeling_tf_auto.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* fix: layer call.
* chore: remove from_pt=True and rerun test.
* chore: remove cast and tf.divide.
* chore: minor edits to the test script.
* Update src/transformers/models/data2vec/modeling_tf_data2vec_vision.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
* fix: expand() on TF tensors with broadcast_to().
* fix: test import.
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
2022-05-04 08:08:25 -04:00
Sylvain Gugger
d76d2a2af7
Make sure telemetry arguments are not returned as unused kwargs ( #17063 )
...
* Make sure telemetry arguments are not returned as unused kwargs
* Fix test
2022-05-04 07:47:57 -04:00
lewtun
675e2d1663
Remove masked image modeling from BEIT ONNX export ( #16980 )
...
* Add masked image modelling to task mapping
* Refactor ONNX features to be listed alphabetically
* Add warning about BEiT masked image modeling
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-05-04 10:05:24 +02:00
lewtun
4bb1d0ec84
Skip RoFormer ONNX test if rjieba not installed ( #16981 )
...
* Skip RoFormer ONNX test if rjieba not installed
* Update deps table
* Skip RoFormer serialization test
* Fix RoFormer vocab
* Add rjieba to CircleCI
2022-05-04 10:04:10 +02:00
Thomas Wang
db034660fb
Fix hashing for deduplication ( #17048 )
2022-05-04 08:40:24 +02:00
Pavel Belevich
39f8eafc1b
Remove device parameter from create_extended_attention_mask_for_decoder ( #16894 )
2022-05-03 11:06:11 -04:00
Sylvain Gugger
dd739f7045
Remove fetch in model templates test
2022-05-03 10:49:12 -04:00
Sylvain Gugger
1c9fcd0e04
Fix RNG reload in resume training from epoch checkpoint ( #17055 )
...
* Fix RNG reload in resume training from epoch checkpoint
* Fix test
2022-05-03 10:31:24 -04:00
Sylvain Gugger
6e17ba6aa5
Remove Python and use v2 action ( #17059 )
2022-05-03 10:12:17 -04:00
Sylvain Gugger
a8fa2f91f4
Make Trainer compatible with sharded checkpoints ( #17053 )
...
* Make Trainer compatible with sharded checkpoints
* Add doc
2022-05-03 09:55:10 -04:00
Yih-Dar
19420fd99e
Move test model folders ( #17034 )
...
* move test model folders (TODO: fix imports and others)
* fix (potentially partially) imports (in model test modules)
* fix (potentially partially) imports (in tokenization test modules)
* fix (potentially partially) imports (in feature extraction test modules)
* fix import utils.test_modeling_tf_core
* fix path ../fixtures/
* fix imports about generation.test_generation_flax_utils
* fix more imports
* fix fixture path
* fix get_test_dir
* update module_to_test_file
* fix get_tests_dir from wrong transformers.utils
* update config.yml (CircleCI)
* fix style
* remove missing imports
* update new model script
* update check_repo
* update SPECIAL_MODULE_TO_TEST_MAP
* fix style
* add __init__
* update self-scheduled
* fix add_new_model scripts
* check one way to get location back
* python setup.py build install
* fix import in test auto
* update self-scheduled.yml
* update slack notification script
* Add comments about artifact names
* fix for yolos
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-05-03 14:42:02 +02:00
Sanchit Gandhi
cd9274d010
[FlaxBert] Add ForCausalLM ( #16995 )
...
* [FlaxBert] Add ForCausalLM
* make style
* fix output attentions
* Add RobertaForCausalLM
* remove comment
* fix fx-to-pt model loading
* remove comment
* add modeling tests
* add enc-dec model tests
* add big_bird
* add electra
* make style
* make repo-consitency
* add to docs
* remove roberta test
* quality
* amend cookiecutter
* fix attention_mask bug in flax bert model tester
* tighten pt-fx thresholds to 1e-5
* add 'copied from' statements
* amend 'copied from' statements
* amend 'copied from' statements
* quality
2022-05-03 11:26:19 +02:00
Patrick von Platen
31616b8d61
[T5 Tokenizer] Model has no fixed position ids - there is no hardcode… ( #16990 )
...
* [T5 Tokenizer] Model has no fixed position ids - there is no hardcoded max length
* [T5 Tokenizer] Model has no fixed position ids - there is no hardcoded max length
* correct t5 tokenizer
* correct t5 tokenizer
* fix test
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* finish
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-05-02 21:27:34 +02:00
Sylvain Gugger
1073f00d4e
Clean up setup.py ( #17045 )
...
* Clean up setup.py
* Trigger CI
* Upgrade Python used
2022-05-02 12:58:17 -04:00
Lysandre Debut
30ca529902
Make the sacremoses dependency optional ( #17049 )
...
* Make sacremoses optional
* Pickle
2022-05-02 12:47:47 -04:00
Lysandre Debut
bb2e088be7
Allow all imports from transformers ( #17050 )
2022-05-02 12:47:39 -04:00
NielsRogge
1ac698744c
Add YOLOS ( #16848 )
...
* First draft
* Add YolosForObjectDetection
* Make forward pass work
* Add mid position embeddings
* Add interpolation of position encodings
* Add expected values
* Add YOLOS to tests
* Add integration test
* Support tiny model as well
* Support all models in conversion script
* Remove mid_pe_size attribute
* Make more tests pass
* Add model to README and fix config
* Add copied from statements
* Rename base_model_prefix to vit
* Add missing YOLOS_PRETRAINED_CONFIG_ARCHIVE_MAP
* Apply suggestions from code review
* Apply more suggestions from code review
* Convert remaining checkpoints
* Improve docstrings
* Add YolosFeatureExtractor
* Add feature extractor to docs
* Add corresponding tests
* Fix style
* Fix docs
* Apply suggestion from code review
* Fix bad rebase
* Fix some more bad rebase
* Fix missing character
* Improve docs and variable names
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
2022-05-02 18:30:55 +02:00
Zachary Mueller
f275e593bf
Fix no_trainer examples to properly calculate the number of samples ( #17046 )
...
* Update all examples to properly calculate progress bar
2022-05-02 11:56:25 -04:00
Zachary Mueller
35d48db881
Update no_trainer examples to use new logger ( #17044 )
...
* Propagate and fix imports
2022-05-02 11:56:15 -04:00
calpt
daecae1f1c
[Trainer] Move logic for checkpoint loading into separate methods for easy overriding ( #17043 )
2022-05-02 10:40:37 -04:00
NielsRogge
2de2c9ecca
Clean up vision tests ( #17024 )
...
* Clean up tests
* Make fixup
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
2022-05-02 16:28:58 +02:00
Sylvain Gugger
4be8b95a9f
Disable Flax GPU tests on push ( #17042 )
2022-05-02 10:25:53 -04:00
yujun
bdd690a74d
add torch.no_grad when in eval mode ( #17020 )
...
* add torch.no_grad when in eval mode
* make style quality
2022-05-02 07:49:19 -04:00
Martin Pömsl
9586e222af
Fix typo in RetriBERT docstring ( #17018 )
2022-05-02 07:48:20 -04:00
Sanchit Gandhi
93b802c43e
[Flax(Speech)EncoderDecoder] Fix bug in decoder_module
( #17036 )
...
* [FlaxSpeechEncoderDecoder] Fix bug in `decoder_module`
* [FlaxEncoderDecoder] Fix bug in `decoder_module`
2022-05-02 13:06:45 +02:00
Sylvain Gugger
1ae182d9a6
Fix style
2022-05-02 06:19:31 -04:00
Michael Benayoun
2c2a2169b6
Fx with meta ( #16836 )
...
* Add meta proxy
* Uses meta data to trace data dependent control-flow
* Remove commented class
* Handles torch creating functions
* Added type annotation to fix tracing
* Tracing works for everything but T5 and GPT-J
* Almost all previously supported models pass
* All architectures can be traced except T5
* Intermediate commit to have a trace of the comparison operators for HFProxy
* Everything works, except loss computation
* Everything works
* Removed unused import
* Overriden methods do not use underlying ops (linear and torch.matmul), and model attributes are copied to the traced version
* Fix torch_matmul_override
* Change attributes reference to deepcopy
* Remove breakpoint and add torch_index_override
* Small fix
* Fix typo
* Replace asserts by explicit exceptions
2022-05-02 11:46:52 +02:00
Sanchit Gandhi
ff846e9b28
[FlaxGenerate] Fix bug in decoder_start_token_id ( #17035 )
2022-05-02 11:05:27 +02:00
Manan Dey
eb877f1fd0
update docs of length_penalty ( #17022 )
2022-05-02 11:01:18 +02:00
Omar U. Espejel
da47c264f9
Add translating guide ( #17004 )
...
* Add translating guide
2022-04-30 17:43:38 -05:00
Yih-Dar
ede5e04191
Add a check on config classes docstring checkpoints ( #17012 )
...
* Add the check
* add missing ckpts
* add a list to ignore
* call the added check script
* better regex pattern
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-04-30 10:40:46 +02:00
Sylvain Gugger
7152ed2bae
Result of new doc style with fixes ( #17015 )
...
* Result of new doc style with fixes
* Add last two files
* Bump hf-doc-builder
2022-04-29 17:42:15 -04:00
Sylvain Gugger
18df440709
Replace dict/BatchEncoding instance checks by Mapping ( #17014 )
...
* Replace dict/BatchEncoding instance checks by Mapping
* Typo
2022-04-29 17:20:52 -04:00
Nicolas Patry
b8dffd1f3e
Revert "Updating variable names. ( #16445 )" ( #17011 )
...
This reverts commit 4f3a14e3c2
.
2022-04-29 12:26:45 -04:00
Nicolas Patry
4f3a14e3c2
Updating variable names. ( #16445 )
2022-04-29 17:44:28 +02:00
tarzan
20fb5d51ea
Update README_zh-hans.md ( #16977 )
2022-04-29 11:05:03 -04:00
Pavel Belevich
63fbed5c59
Make create_extended_attention_mask_for_decoder static method ( #16893 )
2022-04-29 10:57:09 -04:00
Joao Gante
fb0ae12947
TF: XLA bad words logits processor and list of processors ( #16974 )
2022-04-29 15:54:58 +01:00
Zachary Mueller
57e6464ac9
Update all require decorators to use skipUnless when possible ( #16999 )
2022-04-29 08:55:38 -04:00
Yih-Dar
e952e049b4
use scale=1.0 in floats_tensor called in speech model testers ( #17007 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-04-29 14:41:33 +02:00
Sylvain Gugger
e6f00a11d7
Update README to latest release ( #16997 )
2022-04-28 14:17:44 -04:00
Zachary Mueller
3486a92a57
Fix savedir for by epoch ( #16996 )
2022-04-28 13:49:45 -04:00
Yih-Dar
5af5735f62
set eos_token_id to None to generate until max length ( #16989 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-04-28 19:47:38 +02:00
amyeroberts
01562dac7e
Rename a class to reflect framework pattern AutoModelXxx -> TFAutoModelXxx ( #16993 )
2022-04-28 18:11:54 +01:00