Matt
e3cb7a0b60
Tensorflow QA example ( #12252 )
...
* New Tensorflow QA example!
* Style pass
* Updating README.md for the new example
* flake8 fixes
* Update examples/tensorflow/question-answering/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2021-06-21 16:37:28 +01:00
Patrick von Platen
4e9a6796c7
[Flax] Fix flax test save pretrained ( #12256 )
...
* fix_torch_device_generate_test
* remove @
* fix flax save pretrained test
2021-06-21 16:37:13 +01:00
Stas Bekman
b75b5605c9
[DeepSpeed] don't ignore --adafactor ( #12257 )
2021-06-21 08:17:00 -07:00
Suraj Patil
eb881674f2
[Flax] [WIP] allow loading head model with base model weights ( #12255 )
...
* boom boom
* remove flax clip example
* allow loading head model with base model weights
* add test
* fix imports
* disable save, load test for clip
* add test_save_load_to_base
2021-06-21 15:56:42 +01:00
Suraj Patil
8d5b7f36e5
[FlaxClip] fix test from/save pretrained test ( #12284 )
...
* boom boom
* remove flax clip example
* fix from_save_pretrained
2021-06-21 15:54:34 +01:00
Vishal Burman
b53bc55ba9
Fix for making student ProphetNet for Seq2Seq Distillation ( #12130 )
...
* make_student.py: fix to make student ProphetNet
* reformat
2021-06-21 09:36:44 -04:00
Lysandre Debut
b76850a808
Better CI feedback ( #12279 )
...
* Better run ID
* Only part of CI
* Revert "Only part of CI"
This reverts commit 29f7f248d2
.
2021-06-21 02:52:12 -04:00
Lysandre
30a5521c0b
Fix the scheduled CI
2021-06-21 08:27:25 +02:00
Stas Bekman
2e5dbdf2db
[t5 doc] make the example work out of the box ( #12239 )
...
* [run_clm.py] restore caching
* style
* [t5 doc] make the example work out of the box
This PR expands the training example to include the correct model type for the example to work, e.g. with `T5Model` this example will break.
* Update docs/source/model_doc/t5.rst
Co-authored-by: Suraj Patil <surajp815@gmail.com>
* expand the other example
Co-authored-by: Suraj Patil <surajp815@gmail.com>
2021-06-18 10:00:19 -07:00
Xa9aX ツ
f3558bbcfd
Depreciate pythonic Mish and support PyTorch 1.9 version of Mish ( #12240 )
...
* Moved Mish to Torch 1.9 version
* Run black formatting
2021-06-18 09:13:45 -04:00
Suraj Patil
47a9768334
[FlaxBart] few small fixes ( #12247 )
...
* boom boom
* remove flax clip example
* few small fixes
2021-06-18 10:29:42 +01:00
Suraj Patil
f74655cd9b
[Flax] FlaxAutoModelForSeq2SeqLM ( #12228 )
...
* add FlaxAutoModelForSeq2SeqLM
2021-06-18 13:20:09 +05:30
Bhavitvya Malik
e43e11260f
update desc for map in all examples ( #12226 )
...
* update desc for map in all examples
* added plm
* suggestions
2021-06-17 15:37:31 -04:00
Sylvain Gugger
adb70eda4d
AutoTokenizer: infer the class from the tokenizer config if possible ( #12208 )
...
* AutoTokenizer: infer the class from the tokenizer config if possible
* Add tests
* Update src/transformers/models/auto/tokenization_auto.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2021-06-17 12:39:22 -04:00
Lysandre
0daadc1919
Docs for v4.8.0
2021-06-17 18:17:42 +02:00
Lysandre
7a6c9fab8e
Release: v4.7.0
2021-06-17 17:57:42 +02:00
Stas Bekman
d6ea91c96a
fix pt-1.9.0 add_
deprecation ( #12217 )
...
* fix pt-1.9.0 add_ deprecation
* add () for clarity
* Trigger CI
* require_version(torch
2021-06-17 08:53:59 -07:00
Lysandre Debut
3a960c4857
Support for torch 1.9.0 ( #12224 )
...
* Support for torch 1.9.0
* Torch scatter for 1.9.0
* Github Actions run on 1.9.0
2021-06-17 11:29:01 -04:00
Sylvain Gugger
afdd9e3663
Add link to the course ( #12229 )
2021-06-17 11:14:53 -04:00
NielsRogge
29b0aef871
Improve detr ( #12147 )
...
* Remove unused variables
* Improve docs
* Fix docs of segmentation masks
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2021-06-17 10:37:54 -04:00
Lysandre Debut
b56848c8c8
Pipeline update & tests ( #12207 )
2021-06-17 09:41:16 +02:00
Bhadresh Savani
700cee3446
[Docs] fixed broken link ( #12205 )
...
* fixed broken link
* Update docs/source/benchmarks.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update docs/source/benchmarks.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2021-06-16 15:14:53 -04:00
Sylvain Gugger
255a17a089
Use yaml to create metadata ( #12185 )
...
* Use yaml to create metadata
* Fix typo
* Remove pin
2021-06-16 13:17:45 -04:00
Nicolas Patry
15ef0dc5c6
Enabling AutoTokenizer for HubertConfig. ( #12198 )
2021-06-16 15:28:46 +01:00
Philipp Schmid
afa414d060
updated DLC images and sample notebooks ( #12191 )
2021-06-16 07:24:00 -04:00
Patrick von Platen
ccca510276
Hubert ( #11889 )
...
* fix_torch_device_generate_test
* remove @
* add hubert
* add first test file
* more docs
* fix bugs
* fix bug
* finish
* finish
* finish docstring
* fix
* fix
* finalize
* add to ignored
* finish
* Apply suggestions from code review
* correct naming
* finish
* fix auto config
* finish
* correct convert script
* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
* apply suggestions lysandre & suraj
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
2021-06-16 12:14:12 +01:00
Patrick von Platen
c3c39f7e84
[Flax] Add Beam Search ( #12131 )
...
* fix_torch_device_generate_test
* remove @
* push new logit processors
* add processors
* save first working version
* save intermediate
* finish
* make style
* make fix-copies
* finish
* Update tests/test_modeling_flax_bart.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Patrick von Platen <patrick@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
2021-06-16 09:43:54 +01:00
Sylvain Gugger
802ffaff0d
Temporarily deactivate torchhub test ( #12184 )
2021-06-15 16:16:51 -04:00
Lysandre Debut
52c7ca0488
Temporarily deactivate torch-scatter while we wait for new release ( #12181 )
...
* Temporarily deactivate torch-scatter while we wait for new release
* torch-1.8.1 binary for scatter
* Revert to 1.8.0
* Pin torch dependency
* torchaudio and torchvision
2021-06-15 16:03:58 -04:00
Sylvain Gugger
7d7ceca396
Model card defaults ( #12122 )
...
* [WIP] Model card defaults
* finetuned_from default value
* Add all mappings to the mapping file
* Be more defensive on finetuned_from arg
* Add default task tag
* Separate tags from tasks
* Edge case for dataset
* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2021-06-15 16:01:37 -04:00
Stas Bekman
6e7cc5cc51
[testing] ensure concurrent pytest workers use a unique port for torch.dist ( #12166 )
...
* ensure concurrent pytest workers use a unique port for torch.distributed.launch
* reword
2021-06-15 11:12:59 -07:00
Amog Kamsetty
b9d66f4c4b
Ray Tune Integration Updates ( #12134 )
...
* fix
* fixes
* add back to scheduled tests
* formatting
* Update integrations.py
2021-06-15 14:11:29 -04:00
Kilian Kluge
a79585bbf9
Update AutoModel classes in summarization example ( #12178 )
...
- Convert use of deprecated AutoModelWithLMHead to AutoModelForSeq2SeqLM
- Add newly required `truncation=True` to `tokenizer.encode` with `max_length`
This silences all warnings.
2021-06-15 10:36:10 -04:00
Sylvain Gugger
d6c929e200
Merge remote-tracking branch 'origin/master'
2021-06-15 09:37:46 -04:00
Sylvain Gugger
a8694b8850
Adjust banner width
2021-06-15 09:37:15 -04:00
kumapo
955b2b97a6
Enable add_prefix_space if model_type is roberta or gpt2 ( #12116 )
2021-06-15 09:33:21 -04:00
Sylvain Gugger
60b1d6b45b
Add course banner ( #12157 )
...
* Add course banner
* Update course banner
2021-06-15 09:25:49 -04:00
Lysandre Debut
d07b540a37
Have dummy processors have a from_pretrained
method ( #12145 )
2021-06-15 08:39:05 -04:00
Avital Oliver
9b393240a2
Use a released version of optax rather than installing from Git. ( #12173 )
...
Use a released version of optax rather than installing from Git
2021-06-15 16:42:51 +05:30
Patrick von Platen
9bc9e59869
[Flax generate] Add params to generate ( #12171 )
...
* fix_torch_device_generate_test
* remove @
* add params as input
* finish
2021-06-15 11:50:12 +01:00
Sylvain Gugger
a55dc157e3
Add video links to the documentation ( #12162 )
2021-06-15 06:37:37 -04:00
Stas Bekman
040283170c
consistent nn. and nn.functional: part 5 docs ( #12161 )
2021-06-14 13:34:32 -07:00
Stas Bekman
88e84186e5
[style] consistent nn. and nn.functional: part 4 examples
( #12156 )
...
* consistent nn. and nn.functional: p4 examples
* restore
2021-06-14 12:28:24 -07:00
Stas Bekman
372ab9cd6d
[style] consistent nn. and nn.functional: part 3 tests
( #12155 )
...
* consistent nn. and nn.functional: p3 templates
* restore
2021-06-14 12:18:22 -07:00
Vasudev Gupta
d9c0d08f9a
Flax Big Bird ( #11967 )
...
* add flax bert
* bert -> bigbird
* original_full ported
* add debugger
* init block sparse
* fix copies ; gelu_fast -> gelu_new
* block sparse port
* fix block sparse
* block sparse working
* all ckpts working
* fix-copies
* make quality
* init tests
* temporary fix for FlaxBigBirdForMultipleChoice
* skip test_attention_outputs
* fix
* gelu_fast -> gelu_new ; fix multiple choice model
* remove nsp
* fix sequence classifier
* fix
* make quality
* make fix-copies
* finish
* Delete debugger.ipynb
* Update src/transformers/models/big_bird/modeling_flax_big_bird.py
* make style
* finish
* bye bye jit flax tests
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2021-06-14 20:01:03 +01:00
Stas Bekman
a156da9a23
consistent nn. and nn.functional: p2 templates ( #12153 )
2021-06-14 11:41:24 -07:00
Patrick von Platen
007be9e402
[Flax] Fix flax pt equivalence tests ( #12154 )
...
* fix_torch_device_generate_test
* remove @
* upload
2021-06-14 19:19:10 +01:00
Will Rice
d438eee030
Adding TFWav2Vec2Model ( #11617 )
...
* [WIP] Add TFWav2Vec2Model
Work in progress for adding a tensorflow version of Wav2Vec2
* feedback changes
* small fix
* Test Feedback Round 1
* Add SpecAugment and CTC Loss
* correct spec augment mask creation
* docstring and correct copyright
* correct bugs
* remove bogus file
* finish tests correction
* del unnecessary layers
* Update src/transformers/models/wav2vec2/modeling_tf_wav2vec2.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* make style
* correct final bug
* Feedback Changes
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2021-06-14 18:58:54 +01:00
Stas Bekman
1ed2ebf60d
[style] consistent nn. and nn.functional ( #12124 )
...
* consistent nn. and nn.functional
* fix glitch
* fix glitch #2
2021-06-14 09:44:28 -07:00
Stas Bekman
ff7c81687a
[optim] implement AdafactorSchedule ( #12123 )
...
* implement AdafactorSchedule
* typo
* fix
* Update src/transformers/optimization.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2021-06-14 09:43:48 -07:00