Anton Lozhkov
3883e3a75e
Add SD and SV heads for WavLM ( #14847 )
...
* Add converted heads
* Add dummies
2021-12-20 16:40:56 +03:00
Patrick von Platen
cd583bdaa5
[WavLM] Fix slow tests ( #14845 )
2021-12-20 12:06:42 +01:00
Patrick von Platen
281e1fba75
up ( #14829 )
2021-12-20 11:47:32 +01:00
Patrick von Platen
091693b494
[Seq2SeqTrainer] Remove model input name hack ( #14802 )
...
* [Seq2SeqTrainer] Remove model input name hack
* Update src/transformers/trainer_seq2seq.py
* make style
* finish
2021-12-20 10:53:48 +01:00
Patrick von Platen
84ea427f46
[ImageGPT] Deprecate pixel_values input name to input_ids ( #14801 )
...
* [ImageGPT] Deprecate pixel_values input name to input_ids
* up
* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* correct
* finish
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
2021-12-17 20:05:22 +01:00
Patrick von Platen
c4a96cecbc
Wav2Vec2 meets phonemes ( #14353 )
...
* up
* add tokenizer
* improve more
* finish tokenizer
* finish
* adapt speech recognition script
* adapt convert
* more fixes
* more fixes
* update phonemizer wav2vec2
* better naming
* fix more tests
* more fixes swedish
* correct tests
* finish
* improve script
* remove file
* up
* lets get those 100 model architectures until the end of the month
* make fix-copies
* correct more
* correct script
* more fixes
* more fixes
* add to docs
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* replace assert
* fix copies
* fix docs
* new try docs
* boom boom
* update
* add phonemizer to audio tests
* make fix-copies
* up
* upload models
* some changes
* Update tests/test_tokenization_wav2vec2_phoneme.py
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
* more fixes
* remove @
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
2021-12-17 19:56:44 +01:00
Lysandre Debut
77d6c826d8
Convert rst to mdx bert ( #14806 )
...
* BERT to mdx
mdx :)
c
* Update docs/source/model_doc/bert.mdx
Co-authored-by: Julien Chaumond <julien@huggingface.co>
* Remove all
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Julien Chaumond <julien@huggingface.co>
2021-12-17 11:13:34 -05:00
Sylvain Gugger
0b4ea79a0c
Trigger doc building
2021-12-17 11:14:18 -05:00
Daniel Stancl
ff066119ca
Implement head_mask for Flax BERT and other models copied from BERT ( #14620 )
...
* Implement head_mask for Flax BERT and other models copied from BERT
* Remove `from jax._src.nn.functions import sigmoid`
Remove `from jax._src.nn.functions import sigmoid` unintentionally added by IDE
* Remove no more valid copy statement
* Apply patil-suraj's suggestions from code review
* Apply suggestions from the code review
* Update Flax template
* Fix a typo
* Also update template for CausalLM modules
2021-12-17 17:06:59 +01:00
Patrick von Platen
95119ad7b0
[Generate] Correct input_ids detection ( #14815 )
...
* [Generate] Correct input_ids detection
* correct
2021-12-17 16:08:54 +01:00
Patrick von Platen
bdbe3df869
[WavLM] Layerdrop is not allowed for first layer ( #14811 )
...
* [WavLM] Layerdrop is not allowed for first layer
* Apply suggestions from code review
2021-12-17 13:30:18 +01:00
NielsRogge
cbf036f7ae
Add test ( #14810 )
2021-12-17 04:33:27 -05:00
Patrick von Platen
c4a0fb5199
[WavLM] Correct position bias computation ( #14805 )
2021-12-16 22:42:57 +01:00
Lysandre Debut
d194d639ab
Remove datasets requirement ( #14795 )
2021-12-16 14:34:14 -05:00
Patrick von Platen
bef1e3e4a0
Add WavLM ( #14354 )
...
* first commit
* fix some stuff
* fix more readme
* Apply suggestions from code review
* update
* correct
* up
* attn layer works
* push code
* make modedls work
* Small change
* more refactor
* finish
* up
* fix convertsion
* fix position bias
* Fix style
* fix conversion
* make fix-copies
* add
* clean
* fix docs
* fix
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* apply final changes
* make fix-copies
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2021-12-16 18:57:05 +01:00
Patrick von Platen
b18d8534ea
[Generate] Make generate multi-modal ( #14784 )
...
* finish refactor
* refactor
* add tests
* add more tests
* up
* finish tests
* finish
* up
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* improve docstring
* fix docs
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2021-12-16 18:03:55 +01:00
Anton Lozhkov
48463ebb33
Add Speaker Diarization and Verification heads ( #14723 )
...
* Models
* Squashed commit of the following:
commit 72278e1e931a16d0879acc77f65762f3364833d0
Author: anton-l <aglozhkov@gmail.com>
Date: Fri Dec 10 21:45:08 2021 +0300
* Add unispeech heads
* Add sd/sv automodels
* Docs cleanup
* Fix docstrings
* rename xvector classes
* examples
* Tests cleanup
* Style
* Better checkpoints for tests
* leftover docs
* apply review suggestions
* Style + init tests
* Update unispeech-sat tdnn downsampling
2021-12-16 19:22:14 +03:00
Matt
2e07180cba
Train step fix ( #14796 )
...
* Fix for TF train step when no "labels" key in input
* make style
2021-12-16 16:08:13 +00:00
Kamal Raj
465a8b8d10
Update CONTRIBUTING.md ( #14800 )
...
fix pip installation cmd
2021-12-16 10:40:56 -05:00
Kamal Raj
8ae24e19b2
Update CONTRIBUTING.md ( #14799 )
...
typo
2021-12-16 10:24:26 -05:00
Sylvain Gugger
12e1b4c6df
Fix the build documentation job ( #14788 )
...
* Fix the build documentation job
* Fix install
* Address review comment
2021-12-16 09:35:20 -05:00
Sylvain Gugger
5061a9fd55
Post sphinx-clean up and contributing guide updates ( #14790 )
...
* Clean up sphinx
* Update contributing guide
* Update docs README
* No example title
* Fix copies
* Update CONTRIBUTING.md
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2021-12-16 09:29:26 -05:00
Lysandre Debut
8010fda9bf
Removes images to put them in a dataset ( #14781 )
...
* First try
* Update instructions
2021-12-16 04:42:02 -05:00
Sylvain Gugger
459677aebe
PoC for conserving old links ( #14754 )
...
* PoC for conserving old links
* Do the same for other links
* remap the redirects section
* add instructions on how to move sections
* improve
Co-authored-by: Stas Bekman <stas@stason.org>
2021-12-15 11:40:47 -08:00
Sylvain Gugger
c40ecfd740
Move import ( #14787 )
2021-12-15 13:34:42 -05:00
Lysandre
7c9c41f43c
Docs for v4.14.0
2021-12-15 18:29:53 +01:00
Lysandre
960d8cb41d
Release: v4.14.0
2021-12-15 18:20:35 +01:00
NielsRogge
aece7badc1
Improve Perceiver docs ( #14786 )
...
* Fix docs
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Code quality
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
2021-12-15 12:02:05 -05:00
NielsRogge
50bc57cef8
Update Perceiver code examples ( #14783 )
...
* Fix code examples
* Fix code example
2021-12-15 11:06:38 -05:00
Matt
48d4827697
TF model cards ( #14720 )
...
* Initial commit for Keras model cards
* Revert accidental change
* make style
* make style
* make style
* Fix PR comments
* Move repo creation to __init__
* Fixes to README.md creation
* Partial progress for proper card creation on `push_to_hub`
* Proper card creation from `push_to_hub` plus fixes for malformed model cards
* Fixes for model card creation outside the callback
* Adding a model card creation test
* Putting the model card creation test in the right file.
Good job, Matt.
* make style
* Fix model card test temp dir usage
* Fix model card creation when no optimizer present
* Fixes for when training history not present
* Fix accidental edit to test_modeling_common
2021-12-15 14:57:52 +00:00
Xing Han Lu
72c6e8b8bf
Update t5.rst ( #14776 )
2021-12-15 14:59:11 +01:00
Yih-Dar
a94105f95f
Fix preprocess_function in run_summarization_flax.py ( #14769 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2021-12-15 11:36:28 +01:00
Sylvain Gugger
7e61d56a45
Fix the doc_build_test job ( #14774 )
...
* Fake new model
* Fix doc-building test job
* Is this the problem?
* Another try
* Typo
* Clean up
* Can we do without -e ?
* Clean setup
2021-12-15 03:40:17 -05:00
Stas Bekman
fdf3ce2827
[doc] performance: groups of operations by compute-intensity ( #14757 )
2021-12-14 19:01:23 -08:00
Amit Chaudhary
851a78978a
Fix broken links to distillation on index page of documentation ( #14722 )
...
* Fix broken links to distillation on index page of documentation
* Fix broken link for distillation in main README
* Run make fixup
2021-12-14 21:55:33 -05:00
Nicolas Patry
e7ed7ffdcb
Adding support for multiple mask tokens. ( #14716 )
...
* Adding support for multiple mask tokens.
- Original implem: https://github.com/huggingface/transformers/pull/10222
Co-authored-by: njafer <naveen.jafer@oracle.com>
* In order to accomodate optionally multimodal models like Perceiver
we add information to the tasks to specify tasks where we know for sure
if we need the tokenizer/feature_extractor or not.
* Adding info in the documentation about multi masks.
+ marked as experimental.
* Add a copy() to prevent overriding the same tensor over and over.
* Fixup.
* Adding small test for multi mask with real values..
Co-authored-by: njafer <naveen.jafer@oracle.com>
2021-12-14 16:46:16 +01:00
Benjamin Minixhofer
2a606f9974
Make data shuffling in run_clm_flax.py
respect global seed ( #13410 )
...
* use jax and jnp instead of numpy in data_loader
* return batches as np.ndarray
2021-12-14 11:04:43 +01:00
Nicolas Patry
546a91abe9
Fixing tests for Perceiver ( #14739 )
...
* Adding some slow test to check for perceiver at least from a high level.
* Re-enabling fast tests for Perceiver ImageClassification.
* Perceiver might try to run without Tokenizer (Fast doesn't exist) and
with FeatureExtractor some text only pipelines.
* Oops.
* Adding a comment for `update_config_with_model_class`.
* Remove `model_architecture` to get `tiny_config`.
* Finalize rebase.
* Smarter way to handle undefined FastTokenizer.
* Remove old code.
* Addressing some nits.
* Don't instantiate `None`.
2021-12-14 09:43:07 +01:00
Sylvain Gugger
322d416916
Update Table of Contents ( #14755 )
2021-12-13 17:15:19 -05:00
Sylvain Gugger
7533d30acd
Convert Trainer doc page to MarkDown ( #14753 )
...
* Convert Trainer doc page to MarkDown
* Fix repo consistency
* Fix the doc build test job
2021-12-13 13:09:50 -05:00
NielsRogge
e926ea2bdd
Improve perceiver ( #14750 )
...
* First draft
* Improve docstring + clean up tests
* Remove unused code
* Add check in case one doesn't provide a preprocessor
2021-12-13 18:46:49 +01:00
Josué Nascimento
971e36667a
Change how to load config of XLNetLMHeadModel ( #14746 )
2021-12-13 12:34:26 -05:00
Yih-Dar
15a9d01519
Avoid using tf.tile in embeddings for TF models ( #14735 )
...
* avoid tf.tile in embeddings
* remove more tf.tile in embeddings
* clean
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2021-12-13 17:30:46 +00:00
Lysandre Debut
6ac0fac85a
Mention no images added to repository ( #14738 )
...
* Mention no images added to repository
* Update CONTRIBUTING.md
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
2021-12-13 12:21:26 -05:00
Sylvain Gugger
e4666bff06
Fix name
2021-12-13 12:01:37 -05:00
Sylvain Gugger
64e92ed224
Update transformers metadata ( #14724 )
...
* Wip on metadata update
* Most of the script
* Add a job to auto-update the transformers metadata
* Style
2021-12-13 11:46:03 -05:00
Sylvain Gugger
c3cd88a9ba
Small fixes for the doc ( #14751 )
2021-12-13 11:17:01 -05:00
Yih-Dar
12d9b95723
Fix: change tooslow to slow ( #14734 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2021-12-13 16:12:58 +00:00
Yih-Dar
ca0b82bbd7
Fix doc examples: cannot import name ( #14698 )
...
* Fix doc examples: cannot import name
* remove copy because of some necessary minor changes (maybe add copy to the individual methods instead)
* Keep copy with some modifications
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2021-12-13 10:36:50 -05:00
Lucien
fc74c84537
Swap TF and PT code inside two blocks ( #14742 )
2021-12-13 10:31:11 -05:00