NielsRogge
78ba9f4617
[Docs] Add video section ( #28958 )
...
Add video section
2024-02-12 19:50:31 +01:00
Klaus Hipp
fe3df9d5b3
[Docs] Add language identifiers to fenced code blocks ( #28955 )
...
Add language identifiers to code blocks
2024-02-12 10:48:31 -08:00
Yunxuan Xiao
c617f988f8
Clean up staging tmp checkpoint directory ( #28848 )
...
clean up remaining tmp checkpoint dir
Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>
2024-02-12 15:47:21 +00:00
JB (Don)
136cd893dc
Always initialize tied output_embeddings if it has a bias term ( #28947 )
...
Continue to initialize tied output_embeddings if it has a bias term
The bias term is not tied, and so will need to be initialized accordingly.
2024-02-12 15:47:08 +00:00
Alexey Fadeev
792819f6cf
Updated requirements for image-classification samples: datasets>=2.14.0 ( #28974 )
...
Updated datasets requirements. Need a package version >= 2.14.0
2024-02-12 14:57:25 +00:00
Joao Gante
e30bbb2685
Tests: tag test_save_load_fast_init_from_base
as flaky ( #28930 )
2024-02-12 14:43:34 +00:00
cmahmut
1709886eba
[pipelines
] updated docstring with vqa alias ( #28951 )
...
updated docstring with vqa alias
2024-02-12 14:34:08 +00:00
Kossai Sbai
cf4c20b9fb
Convert torch_dtype
as str
to actual torch data type (i.e. "float16" …to torch.float16
) ( #28208 )
...
* Convert torch_dtype as str to actual torch data type (i.e. "float16" to torch.float16)
* Check if passed torch_dtype is an attribute in torch
* Update src/transformers/pipelines/__init__.py
Check type via isinstance
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2024-02-12 14:04:53 +00:00
NielsRogge
ef5ab72f4b
[Docs] Update README and default pipelines ( #28864 )
...
* Update README and docs
* Update README
* Update README
2024-02-12 10:21:36 +01:00
NielsRogge
f278ef20ed
[Nougat] Fix pipeline ( #28242 )
...
* Fix pipeline
* Remove print statements
* Address comments
* Address issue
* Remove unused imports
2024-02-12 10:21:15 +01:00
Klaus Hipp
58e3d23e97
[i18n-de] Translate README.md to German ( #28933 )
...
* Translate README.md to German
* Add links to README_de.md
* Remove invisible characters in README
* Change to a formal tone and fix punctuation marks
2024-02-09 12:56:22 -08:00
Philip Blair
d123e661e4
Fix type annotations on neftune_noise_alpha and fsdp_config TrainingArguments parameters ( #28942 )
2024-02-09 15:42:01 +00:00
Yuki Watanabe
ebf3ea2788
Fix a wrong link to CONTRIBUTING.md section in PR template ( #28941 )
2024-02-09 15:10:47 +00:00
Karl Hajjar
de11e654c9
Fix max_position_embeddings default value for llama2 to 4096 #28241 ( #28754 )
...
* Changed max_position_embeddings default value from 2048 to 4096
* force push
* Fixed formatting issues. Fixed missing argument in write_model.
* Reverted to the default value 2048 in the Llama config. Added comments for the llama_version argument.
* Fixed issue with default value value of max_position_embeddings in docstring
* Updated help message for llama versions
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2024-02-09 10:24:01 +00:00
Klaus Hipp
2749e479f3
[Docs] Fix broken links and syntax issues ( #28918 )
...
* Fix model documentation links in attention.md
* Fix external link syntax
* Fix target anchor names of section links
* Fix copyright statement comments
* Fix documentation headings
2024-02-08 14:13:35 -08:00
Raushan Turganbay
d628664688
Support batched input for decoder start ids ( #28887 )
...
* support batched input for decoder start ids
* Fix typos
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* minor changes
* fix: decoder_start_id as list
* empty commit
* empty commit
* empty commit
* empty commit
* empty commit
* empty commit
* empty commit
* empty commit
* empty commit
---------
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
2024-02-08 16:00:53 +00:00
Raushan Turganbay
cc309fd406
pass kwargs in stopping criteria list ( #28927 )
2024-02-08 15:38:29 +00:00
vodkaslime
0b693e90e0
fix: torch.int32 instead of torch.torch.int32 ( #28883 )
2024-02-08 16:28:17 +01:00
Matt
693667b8ac
Remove dead TF loading code ( #28926 )
...
Remove dead code
2024-02-08 14:17:33 +00:00
Arthur
115ac94d06
[Core generation
] Adds support for static KV cache ( #27931 )
...
Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
2024-02-08 11:50:34 +01:00
Javier
4b236aed76
Fix utf-8 yaml load for marian conversion to pytorch in Windows ( #28618 )
...
Fix utf-8 yaml in marian conversion
2024-02-08 08:23:15 +01:00
Klaus Hipp
33df036917
[Docs] Revert translation of '@slow' decorator ( #28912 )
2024-02-08 03:31:47 +01:00
Klaus Hipp
328ade855b
[Docs] Fix placement of tilde character ( #28913 )
...
Fix placement of tilde character
2024-02-07 17:19:39 -08:00
Huazhong Ji
5f96855761
Add npu device for pipeline ( #28885 )
...
add npu device for pipeline
Co-authored-by: unit_test <test@unit.com>
2024-02-07 17:27:01 +00:00
Yih-Dar
308d2b9004
Update the cache number ( #28905 )
...
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2024-02-07 16:37:09 +01:00
Daniel Korat
abf8f54a01
⚠️ Raise Exception
when trying to generate 0 tokens ⚠️ ( #28621 )
...
* change warning to exception
* Update src/transformers/generation/utils.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* validate `max_new_tokens` > 0 in `GenerationConfig`
* fix truncation test parameterization in `TextGenerationPipelineTests`
---------
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
2024-02-07 13:42:01 +01:00
Matt
349a6e8542
Fix Keras scheduler import so it works for older versions of Keras ( #28895 )
...
Fix our schedule import so it works for older versions of Keras
2024-02-07 12:28:24 +00:00
Sourab Mangrulkar
d9deddb4c1
fix Starcoder FA2 implementation ( #28891 )
2024-02-07 14:10:10 +05:30
Sai-Suraj-27
64d1518cbf
fix: Fixed the documentation for logging_first_step
by removing "evaluate" ( #28884 )
...
Fixed the documentation for logging_first_step by removing evaluate.
2024-02-07 08:46:36 +01:00
Klaus Hipp
1c31b7aa3b
[Docs] Add missing language options and fix broken links ( #28852 )
...
* Add missing entries to the language selector
* Add links to the Colab and AWS Studio notebooks for ONNX
* Use anchor links in CONTRIBUTING.md
* Fix broken hyperlinks due to spaces
* Fix links to OpenAI research articles
* Remove confusing footnote symbols from author names, as they are also considered invalid markup
2024-02-06 12:01:01 -08:00
Yih-Dar
40658be461
Hotfix - make torchaudio
get the correct version in torch_and_flax_job
( #28899 )
...
* check
* check
* check
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2024-02-06 21:00:42 +01:00
Klaus Hipp
4830f26965
[Docs] Fix backticks in inline code and documentation links ( #28875 )
...
Fix backticks in code blocks and documentation links
2024-02-06 11:15:44 -08:00
Lucain
a1afec9e17
Explicit server error on gated model ( #28894 )
2024-02-06 17:45:20 +00:00
Yih-Dar
89439fea64
unpin torch ( #28892 )
...
* unpin torch
* check
* check
* check
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2024-02-06 17:21:05 +01:00
Yih-Dar
76b4f666f5
Revert "[WIP] Hard error when ignoring tensors." ( #28898 )
...
Revert "[WIP] Hard error when ignoring tensors. (#27484 )"
This reverts commit 2da28c4b41
.
2024-02-06 17:18:30 +01:00
Yih-Dar
6529a5b5c1
Fix FastSpeech2ConformerModelTest
and skip it on CPU ( #28888 )
...
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2024-02-06 11:05:23 +01:00
Sourab Mangrulkar
5346db1684
Raise error when using save_only_model
with load_best_model_at_end
for DeepSpeed/FSDP ( #28866 )
...
* Raise error when using `save_only_model` with `load_best_model_at_end` for DeepSpeed/FSDP
* Update trainer.py
2024-02-06 11:25:44 +05:30
Eran Hirsch
ee2a3400f2
Fix LongT5ForConditionalGeneration initialization of lm_head ( #28873 )
2024-02-06 04:24:20 +01:00
Klaus Hipp
1ea0bbd73c
[Docs] Update project names and links in awesome-transformers ( #28878 )
...
Update project names and repository links in awesome-transformers
2024-02-06 04:06:29 +01:00
dependabot[bot]
e83227d76e
Bump cryptography from 41.0.2 to 42.0.0 in /examples/research_projects/decision_transformer ( #28879 )
...
Bump cryptography in /examples/research_projects/decision_transformer
Bumps [cryptography](https://github.com/pyca/cryptography ) from 41.0.2 to 42.0.0.
- [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst )
- [Commits](https://github.com/pyca/cryptography/compare/41.0.2...42.0.0 )
---
updated-dependencies:
- dependency-name: cryptography
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-02-06 03:53:08 +01:00
nakranivaibhav
2e7c942c81
Adds LlamaForQuestionAnswering class in modeling_llama.py along with AutoModel Support ( #28777 )
...
* This is a test commit
* testing commit
* final commit with some changes
* Removed copy statement
* Fixed formatting issues
* Fixed error added past_key_values in the forward method
* Fixed a trailing whitespace. Damn the formatting rules are strict
* Added the copy statement
2024-02-06 03:41:42 +01:00
xkszltl
ac51e59e47
Do not use mtime for checkpoint rotation. ( #28862 )
...
Resolve https://github.com/huggingface/transformers/issues/26961
2024-02-06 03:21:50 +01:00
eajechiloae
06901162b5
ClearMLCallback enhancements: support multiple runs and handle logging better ( #28559 )
...
* add clearml tracker
* support multiple train runs
* remove bad code
* add UI entries for config/hparams overrides
* handle models in different tasks
* run ruff format
* tidy code based on code review
---------
Co-authored-by: Eugen Ajechiloae <eugenajechiloae@gmail.com>
2024-02-05 20:04:17 +00:00
amyeroberts
ba3264b4e8
Image Feature Extraction pipeline ( #28216 )
...
* Draft pipeline
* Fixup
* Fix docstrings
* Update doctest
* Update pipeline_model_mapping
* Update docstring
* Update tests
* Update src/transformers/pipelines/image_feature_extraction.py
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
* Fix docstrings - review comments
* Remove pipeline mapping for composite vision models
* Add to pipeline tests
* Remove for flava (multimodal)
* safe pil import
* Add requirements for pipeline run
* Account for super slow efficientnet
* Review comments
* Fix tests
* Swap order of kwargs
* Use build_pipeline_init_args
* Add back FE pipeline for Vilt
* Include image_processor_kwargs in docstring
* Mark test as flaky
* Update TODO
* Update tests/pipelines/test_pipelines_image_feature_extraction.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Add license header
---------
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2024-02-05 14:50:07 +00:00
Yoach Lacombe
7addc9346c
Correct wav2vec2-bert inputs_to_logits_ratio ( #28821 )
...
* Correct wav2vec2-bert inputs_to_logits_ratio
* correct ratio
* correct ratio, clean asr pipeline
* refactor on one line
2024-02-05 13:14:47 +00:00
Arthur
3f9f749325
[Doc
] update contribution guidelines ( #28858 )
...
update guidelines
2024-02-05 21:19:21 +09:00
Nicolas Patry
2da28c4b41
[WIP] Hard error when ignoring tensors. ( #27484 )
...
* [WIP] Hard error when ignoring tensors.
* Better selection/error when saving a checkpoint.
- Find all names we should normally drop (those are in the transformers
config)
- Find all disjoint tensors (for those we can safely trigger a copy to
get rid of the sharing before saving)
- Clone those disjoint tensors getting rid of the issue
- Find all identical names (those should be declared in the config
but we try to find them all anyway.)
- For all identical names:
- If they are in the config, just ignore them everything is fine
- If they are not, warn about them.
- For all remainder tensors which are shared yet neither identical NOR
disjoint. raise a hard error.
* Adding a failing test on `main` that passes here.
* We don't need to keep the subfolder logic in this test.
* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2024-02-05 09:17:24 +01:00
w4ffl35
0466fd5ca2
Ability to override clean_code_for_run ( #28783 )
...
* Add clean_code_for_run function
* Call clean_code_for_run from agent method
2024-02-05 03:48:41 +01:00
Zizhao Chen
c430d6eaee
[Docs] Fix bad doc: replace save with logging ( #28855 )
...
Fix bad doc: replace save with logging
2024-02-05 03:38:08 +01:00
Ziyang
7b702836af
Support custom scheduler in deepspeed training ( #26831 )
...
Reuse trainer.create_scheduler to create scheduler for deepspeed
2024-02-05 03:33:55 +01:00