Spacefish
2af36f957f
Add pillow to layoutlmv3 example requirements.txt ( #19663 )
2022-10-17 08:41:57 -04:00
Duong A. Nguyen
4212bb0d60
[Re-submit] Compute true loss Flax examples ( #19504 )
...
* Compute true loss
* fixup
* final
* final
* final
* Update examples/flax/language-modeling/run_bart_dlm_flax.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
* jax.tree_map => jax.tree_util.tree_map
* Compute true loss
* final
* fixup
* final
* final
* Update examples/flax/language-modeling/run_bart_dlm_flax.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
* jax.tree_map => jax.tree_util.tree_map
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
2022-10-13 11:33:36 +01:00
Sanchit Gandhi
eefcecaa35
[Examples] Fix typos in run speech recognition seq2seq ( #19514 )
2022-10-12 15:33:22 +01:00
FilipposVentirozos
4ed0fa3676
Fix pytorch seq2seq qa ( #19258 )
...
* fixed typo for SQuAD
* Fixed the preprocess_validation_function function for the labels to reflect the remaining truncated instances
* Rolled back the trainer_seq2seq_qa.py for UnboundLocalError: local variable 'metrics' referenced before assignment
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-10-12 08:33:44 -04:00
regisss
bb2cfd1824
Add multi-node conditions in trainer_qa.py and trainer_seq2seq.py ( #19502 )
...
* Add multi-node conditions in trainer_qa.py and trainer_seq2seq.py
* Code improvement
2022-10-11 22:48:56 -04:00
Lysandre
10100979ed
Dev version
2022-10-10 17:25:40 -04:00
Kaiyu Yang
e150c4e2fe
Fix the error message in run_t5_mlm_flax.py ( #19282 )
2022-10-10 14:51:11 +01:00
wei zhao
7d5ce6802e
Fix typo in image-classification/README.md ( #19424 )
...
Fix link typo of the following content.
PyTorch version, Trainer
PyTorch version, no Trainer
2022-10-10 09:16:58 -04:00
ddobokki
fa4bcd5274
edit: cast attention_mask to long in DataCollatorCTCWithPadding ( #19369 )
...
* edit: casting attention_mask to long in DataCollatorCTCWithPadding
* edit: casting attention_mask to long in DataCollatorCTCWithPadding
2022-10-07 10:05:48 -04:00
Ilaygoldman
ce2620194b
Change link of repojacking vulnerable link ( #19393 )
...
The link to https://github.com/vasudevgupta7/bigbird is vulnerable to repojacking (it redirects to the orignial project that changed name), you should change the link to the current name of the project. if you won't change the link, an attacker can open the linked repository and attacks users that trust your links
2022-10-06 23:06:39 +02:00
Zachary Mueller
ad98642a82
Fix gather for metrics ( #19360 )
2022-10-05 14:52:01 -04:00
dependabot[bot]
4c962d5e79
Bump joblib in /examples/research_projects/visual_bert ( #19269 )
...
Bumps [joblib](https://github.com/joblib/joblib ) from 0.16.0 to 1.2.0.
- [Release notes](https://github.com/joblib/joblib/releases )
- [Changelog](https://github.com/joblib/joblib/blob/master/CHANGES.rst )
- [Commits](https://github.com/joblib/joblib/compare/0.16.0...1.2.0 )
---
updated-dependencies:
- dependency-name: joblib
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-10-03 23:57:50 +02:00
dependabot[bot]
c7ec0afce0
Bump joblib in /examples/research_projects/decision_transformer ( #19270 )
...
Bumps [joblib](https://github.com/joblib/joblib ) from 1.1.0 to 1.2.0.
- [Release notes](https://github.com/joblib/joblib/releases )
- [Changelog](https://github.com/joblib/joblib/blob/master/CHANGES.rst )
- [Commits](https://github.com/joblib/joblib/compare/1.1.0...1.2.0 )
---
updated-dependencies:
- dependency-name: joblib
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-10-03 23:57:37 +02:00
dependabot[bot]
ca26277e33
Bump joblib from 0.16.0 to 1.2.0 in /examples/research_projects/lxmert ( #19268 )
...
Bumps [joblib](https://github.com/joblib/joblib ) from 0.16.0 to 1.2.0.
- [Release notes](https://github.com/joblib/joblib/releases )
- [Changelog](https://github.com/joblib/joblib/blob/master/CHANGES.rst )
- [Commits](https://github.com/joblib/joblib/compare/0.16.0...1.2.0 )
---
updated-dependencies:
- dependency-name: joblib
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-10-03 23:49:35 +02:00
Divyanshu Kumar
c28d04e9e2
Update no_trainer script for summarization ( #19277 )
...
* Update no_trainer script for summarization
* removed unnecessary import
* fixes notation mistake
* removed: unused variable
2022-10-03 09:21:51 -04:00
Sylvain Gugger
0fc68a7e14
Fix seq2seq QA example
2022-09-28 15:45:49 -04:00
Tatsuki Okada
4a0b958d61
Fix trainer seq2seq qa.py evaluate log and ft script ( #19208 )
...
* fix args option
* fix trainer eval log
* fix out of memory qa script
* do isort, black, flake
* fix tokenize target
* take it back.
* fix: comment
2022-09-28 10:55:46 -04:00
Arijit Mukherjee
ea540a5977
add wav2vec2_alignment ( #16782 )
...
* add wav2vec2_alignment
* Update alignment.py
* Update examples/research_projects/wav2vec2/alignment.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update examples/research_projects/wav2vec2/alignment.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update examples/research_projects/wav2vec2/alignment.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update examples/research_projects/wav2vec2/alignment.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update README.md
* fix style
* fix imports
* fix multithread
* fix bash script
* [@anton-l] Style fixes and docstrings
* [@anton-l] Style fixes and docstrings
* Update alignment.py
fix blank id in backtrack
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: anton-l <aglozhkov@gmail.com>
2022-09-27 13:12:56 +02:00
Sylvain Gugger
c20b2c7e18
Use repo_type instead of deprecated datasets repo IDs ( #19202 )
...
* Use repo_type instead of deprecated datasets repo IDs
* Add missing one in doc
2022-09-26 09:50:48 -04:00
dependabot[bot]
98af4f9b54
Bump protobuf in /examples/research_projects/decision_transformer ( #19176 )
...
Bumps [protobuf](https://github.com/protocolbuffers/protobuf ) from 3.19.4 to 3.19.5.
- [Release notes](https://github.com/protocolbuffers/protobuf/releases )
- [Changelog](https://github.com/protocolbuffers/protobuf/blob/main/generate_changelog.py )
- [Commits](https://github.com/protocolbuffers/protobuf/compare/v3.19.4...v3.19.5 )
---
updated-dependencies:
- dependency-name: protobuf
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-09-26 14:55:16 +02:00
Enze
5da6afdd8d
Update run_clip.py ( #19130 )
...
The overwrite_cache parameter is declared twice.
2022-09-23 20:48:41 +02:00
dependabot[bot]
74a3ea4737
Bump oauthlib in /examples/research_projects/decision_transformer ( #19080 )
...
Bumps [oauthlib](https://github.com/oauthlib/oauthlib ) from 3.2.0 to 3.2.1.
- [Release notes](https://github.com/oauthlib/oauthlib/releases )
- [Changelog](https://github.com/oauthlib/oauthlib/blob/master/CHANGELOG.rst )
- [Commits](https://github.com/oauthlib/oauthlib/compare/v3.2.0...v3.2.1 )
---
updated-dependencies:
- dependency-name: oauthlib
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-09-22 17:01:40 +02:00
Matt
83dc6377d0
Reduce LR for TF MLM example test ( #19156 )
2022-09-22 08:51:27 -04:00
Leandro von Werra
ef6741fe65
Fix GLUE MNLI when using max_eval_samples
( #18722 )
2022-09-21 09:33:22 +02:00
Santiago Castro
06f341de4f
Add a missing space in a script arg documentation ( #19113 )
2022-09-20 21:43:32 +02:00
Ryan X
36e356caa4
Fix: update ltp word segmentation call in mlm_wwm ( #19047 )
...
* Fix: update ltp word segmentation call in mlm_wwm
* Fix: update ltp word segmentation call in mlm_wwm
* Fix: update ltp word segmentation call in mlm_wwm
2022-09-20 09:20:38 -04:00
dependabot[bot]
7d0486c106
Bump mako in /examples/research_projects/decision_transformer ( #19077 )
...
Bumps [mako](https://github.com/sqlalchemy/mako ) from 1.2.0 to 1.2.2.
- [Release notes](https://github.com/sqlalchemy/mako/releases )
- [Changelog](https://github.com/sqlalchemy/mako/blob/main/CHANGES )
- [Commits](https://github.com/sqlalchemy/mako/commits )
---
updated-dependencies:
- dependency-name: mako
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-09-16 22:15:02 +02:00
Lysandre
16913b3c92
Dev version
2022-09-14 14:58:20 -04:00
Rahul A R
00fc9217d1
Fixed bug which caused overwrite_cache to always be True ( #19000 )
...
* fixed bug which caused overwrite_cache to always be True (#18967 ).
* reformatting changes
2022-09-13 11:29:48 -04:00
Rafał Jankowski
85125fcffd
Neptune.ai integration improvements ( #18934 )
...
* NeptuneCallback improvements
* After review suggestions and deduplication of initial run
* Added volatile checkpoints support due to missing post-rebase commit
* Update README per review comments
- Remove list formatting
- Correct Neptune docs link
Co-authored-by: Sabine <sabine.nyholm@neptune.ai>
2022-09-09 11:37:34 -04:00
Sanchit Gandhi
e6f221c8d4
[JAX] Replace all jax.tree_* calls with jax.tree_util.tree_* ( #18361 )
...
* [JAX] Replace all jax.tree_* calls with jax.tree_util.tree_*
* fix double tree_util
2022-09-09 15:18:56 +02:00
Nicholas Broad
4f299b2446
Accelerator end training ( #18910 )
...
* add accelerator.end_training()
Some trackers need this to end their runs.
* fixup and quality
* add space
* add space again ?!?
2022-09-07 07:46:26 -04:00
arun99481
3b19c0317b
updating gather function with gather_for_metrics in run_wav2vec2_pretraining ( #18877 )
...
Co-authored-by: Arun Rajaram <arunrajaram@Aruns-MacBook-Pro.local>
2022-09-06 07:36:37 -04:00
Sylvain Gugger
c61f116b63
Tie weights after preparing the model in run_clm ( #18855 )
2022-09-01 12:06:56 -04:00
Rahul A R
e9442440fc
streamlining 'checkpointing_steps' parsing ( #18755 )
2022-08-25 11:00:38 -04:00
Rahul A R
c55d6e4e10
examples/run_summarization_no_trainer: fixed incorrect param to hasattr ( #18720 )
...
* fixed incorrect param to hasattr
* simplified condition checks
* code cleanup
2022-08-24 12:12:42 -04:00
dependabot[bot]
e49c71fc4c
Bump nbconvert from 6.3.0 to 6.5.1 in /examples/research_projects/lxmert ( #18742 )
...
Bumps [nbconvert](https://github.com/jupyter/nbconvert ) from 6.3.0 to 6.5.1.
- [Release notes](https://github.com/jupyter/nbconvert/releases )
- [Commits](https://github.com/jupyter/nbconvert/compare/6.3.0...6.5.1 )
---
updated-dependencies:
- dependency-name: nbconvert
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-08-24 06:12:56 -04:00
dependabot[bot]
5b24949669
Bump nbconvert in /examples/research_projects/visual_bert ( #18741 )
...
Bumps [nbconvert](https://github.com/jupyter/nbconvert ) from 6.3.0 to 6.5.1.
- [Release notes](https://github.com/jupyter/nbconvert/releases )
- [Commits](https://github.com/jupyter/nbconvert/compare/6.3.0...6.5.1 )
---
updated-dependencies:
- dependency-name: nbconvert
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-08-24 06:12:48 -04:00
Atharva Ingle
d90a36d192
remove check for main process for trackers initialization ( #18706 )
2022-08-22 11:16:27 -04:00
Atharva Ingle
e54a1b49aa
model.tie_weights()
should be applied after accelerator.prepare()
(#18676 )
...
* `model.tie_weights()` should be applied after `accelerator.prepare`
Weight tying should be done after the model has been moved to XLA device as mentioned on PyTorch/XLA Troubleshooting guide [here](https://github.com/pytorch/xla/blob/master/TROUBLESHOOTING.md#xla-tensor-quirks )
* format code
2022-08-18 13:46:57 -04:00
Loubna Ben Allal
bbbb453e58
Add an examples folder for code downstream tasks ( #18679 )
...
* add examples subfolder
* mention examples in codeparrot readme
* use Trainer optimizer and scheduler type and add output_dir as argument
* add example of text-to-python and python-to-text models
* mention the downstream examples in the readme
* fix typo
2022-08-18 18:24:24 +02:00
Zachary Mueller
358fc18613
Add evaluate to examples requirements ( #18666 )
2022-08-18 10:57:39 -04:00
Stefan Schweter
358478e729
Examples: add Bloom support for token classification ( #18632 )
...
* examples: add Bloom support for token classification (FLAX, PyTorch and TensorFlow)
* examples: remove support for Bloom in token classication (FLAX and TensorFlow currently have no support for it)
2022-08-17 09:50:57 +02:00
zhoutang776
25e651a2de
Update run_translation_no_trainer.py ( #18637 )
...
* Update run_translation_no_trainer.py
found an error in selecting `no_decay` parameters and some small modifications when the user continues to train from a checkpoint
* fixs `no_decay` and `resume_step` issue
1. change `no_decay` list
2. if use continue to train their model from provided checkpoint, the `resume_step` will not be initialized properly if `args.gradient_accumulation_steps != 1`
2022-08-16 13:25:57 -04:00
Karim Foda
d6eeb87170
Flax Remat for LongT5 ( #17994 )
...
* [Flax] Add remat (gradient checkpointing)
* fix variable naming in test
* flip: checkpoint using a method
* fix naming
* fix class naming
* apply PVP's suggestions from code review
* add gradient_checkpointing to examples
* Add gradient_checkpointing to run_mlm_flax
* Add remat to longt5
* Add gradient checkpointing test longt5
* Fix args errors
* Fix remaining tests
* Make fixup & quality fixes
* replace kwargs
* remove unecessary kwargs
* Make fixup changes
* revert long_t5_flax changes
* Remove return_dict and copy to LongT5
* Remove test_gradient_checkpointing
Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>
2022-08-14 16:27:13 +01:00
dependabot[bot]
05d3a43c59
Bump nbconvert in /examples/research_projects/visual_bert ( #18566 )
...
Bumps [nbconvert](https://github.com/jupyter/nbconvert ) from 6.0.1 to 6.3.0.
- [Release notes](https://github.com/jupyter/nbconvert/releases )
- [Commits](https://github.com/jupyter/nbconvert/compare/6.0.1...6.3.0 )
---
updated-dependencies:
- dependency-name: nbconvert
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-08-11 10:47:31 -04:00
dependabot[bot]
713ab6fde5
Bump nbconvert from 6.0.1 to 6.3.0 in /examples/research_projects/lxmert ( #18565 )
...
Bumps [nbconvert](https://github.com/jupyter/nbconvert ) from 6.0.1 to 6.3.0.
- [Release notes](https://github.com/jupyter/nbconvert/releases )
- [Commits](https://github.com/jupyter/nbconvert/compare/6.0.1...6.3.0 )
---
updated-dependencies:
- dependency-name: nbconvert
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-08-11 10:47:19 -04:00
Matt
6eb51450fa
TF Examples Rewrite ( #18451 )
...
* Finished QA example
* Dodge a merge conflict
* Update text classification and LM examples
* Update NER example
* New Keras metrics WIP, fix NER example
* Update NER example
* Update MC, summarization and translation examples
* Add XLA warnings when shapes are variable
* Make sure batch_size is consistently scaled by num_replicas
* Add PushToHubCallback to all models
* Add docs links for KerasMetricCallback
* Add docs links for prepare_tf_dataset and jit_compile
* Correct inferred model names
* Don't assume the dataset has 'lang'
* Don't assume the dataset has 'lang'
* Write metrics in text classification
* Add 'framework' to TrainingArguments and TFTrainingArguments
* Export metrics in all examples and add tests
* Fix training args for Flax
* Update command line args for translation test
* make fixup
* Fix accidentally running other tests in fp16
* Remove do_train/do_eval from run_clm.py
* Remove do_train/do_eval from run_mlm.py
* Add tensorflow tests to circleci
* Fix circleci
* Update examples/tensorflow/language-modeling/run_mlm.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Update examples/tensorflow/test_tensorflow_examples.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Update examples/tensorflow/translation/run_translation.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Update examples/tensorflow/token-classification/run_ner.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Fix save path for tests
* Fix some model card kwargs
* Explain the magical -1000
* Actually enable tests this time
* Skip text classification PR until we fix shape inference
* make fixup
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
2022-08-10 16:49:51 +01:00
Rasmus Arpe Fogh Jensen
a765b68aa6
Update no_trainer.py scripts to include accelerate gradient accumulation wrapper ( #18473 )
...
* Added accelerate gradient accumulation wrapper to run_image_classification_no_trainer.py example script
* make fixup changes
* PR comments
* changed input to Acceletor based on PR comment, ran make fixup
* Added comment explaining the sync_gradients statement
* Fixed lr scheduler max steps
* Changed run_clm_no_trainer.py script to use accelerate gradient accum wrapper
* Fixed all scripts except wav2vec2 pretraining to use accelerate gradient accum wrapper
* Added accelerate gradient accum wrapper for wav2vec2_pretraining_no_trainer.py script
* make fixup and lr_scheduler step inserted back into run_qa_beam_search_no_trainer.py
* removed changes to run_wav2vec2_pretraining_no_trainer.py script and fixed using wrong constant in qa_beam_search_no_trainer.py script
2022-08-08 15:52:47 -04:00
Sylvain Gugger
70b0d4e193
Fix compatibility with 1.12 ( #17925 )
...
* Fix compatibility with 1.12
* Remove pin from examples requirements
* Update torch scatter version
* Fix compatibility with 1.12
* Remove pin from examples requirements
* Update torch scatter version
* fix torch.onnx.symbolic_opset12 import
* Reject bad version
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-08-08 09:53:08 -04:00