transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
NielsRogge	78ba9f4617	[Docs] Add video section (#28958 ) Add video section	2024-02-12 19:50:31 +01:00
Klaus Hipp	fe3df9d5b3	[Docs] Add language identifiers to fenced code blocks (#28955 ) Add language identifiers to code blocks	2024-02-12 10:48:31 -08:00
Yunxuan Xiao	c617f988f8	Clean up staging tmp checkpoint directory (#28848 ) clean up remaining tmp checkpoint dir Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>	2024-02-12 15:47:21 +00:00
JB (Don)	136cd893dc	Always initialize tied output_embeddings if it has a bias term (#28947 ) Continue to initialize tied output_embeddings if it has a bias term The bias term is not tied, and so will need to be initialized accordingly.	2024-02-12 15:47:08 +00:00
Alexey Fadeev	792819f6cf	Updated requirements for image-classification samples: datasets>=2.14.0 (#28974 ) Updated datasets requirements. Need a package version >= 2.14.0	2024-02-12 14:57:25 +00:00
Joao Gante	e30bbb2685	Tests: tag `test_save_load_fast_init_from_base` as flaky (#28930 )	2024-02-12 14:43:34 +00:00
cmahmut	1709886eba	[`pipelines`] updated docstring with vqa alias (#28951 ) updated docstring with vqa alias	2024-02-12 14:34:08 +00:00
Kossai Sbai	cf4c20b9fb	Convert `torch_dtype` as `str` to actual torch data type (i.e. "float16" to `torch.float16`) (#28208 ) * Convert torch_dtype as str to actual torch data type (i.e. "float16" to torch.float16) * Check if passed torch_dtype is an attribute in torch * Update src/transformers/pipelines/__init__.py Check type via isinstance Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-02-12 14:04:53 +00:00
NielsRogge	ef5ab72f4b	[Docs] Update README and default pipelines (#28864 ) * Update README and docs * Update README * Update README	2024-02-12 10:21:36 +01:00
NielsRogge	f278ef20ed	[Nougat] Fix pipeline (#28242 ) * Fix pipeline * Remove print statements * Address comments * Address issue * Remove unused imports	2024-02-12 10:21:15 +01:00
Klaus Hipp	58e3d23e97	[i18n-de] Translate README.md to German (#28933 ) * Translate README.md to German * Add links to README_de.md * Remove invisible characters in README * Change to a formal tone and fix punctuation marks	2024-02-09 12:56:22 -08:00
Philip Blair	d123e661e4	Fix type annotations on neftune_noise_alpha and fsdp_config TrainingArguments parameters (#28942 )	2024-02-09 15:42:01 +00:00
Yuki Watanabe	ebf3ea2788	Fix a wrong link to CONTRIBUTING.md section in PR template (#28941 )	2024-02-09 15:10:47 +00:00
Karl Hajjar	de11e654c9	Fix max_position_embeddings default value for llama2 to 4096 #28241 (#28754 ) * Changed max_position_embeddings default value from 2048 to 4096 * force push * Fixed formatting issues. Fixed missing argument in write_model. * Reverted to the default value 2048 in the Llama config. Added comments for the llama_version argument. * Fixed issue with default value of max_position_embeddings in docstring * Updated help message for llama versions Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-02-09 10:24:01 +00:00
Klaus Hipp	2749e479f3	[Docs] Fix broken links and syntax issues (#28918 ) * Fix model documentation links in attention.md * Fix external link syntax * Fix target anchor names of section links * Fix copyright statement comments * Fix documentation headings	2024-02-08 14:13:35 -08:00
Raushan Turganbay	d628664688	Support batched input for decoder start ids (#28887 ) * support batched input for decoder start ids * Fix typos Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * minor changes * fix: decoder_start_id as list * empty commit * empty commit * empty commit * empty commit * empty commit * empty commit * empty commit * empty commit * empty commit --------- Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>	2024-02-08 16:00:53 +00:00
Raushan Turganbay	cc309fd406	pass kwargs in stopping criteria list (#28927 )	2024-02-08 15:38:29 +00:00
vodkaslime	0b693e90e0	fix: torch.int32 instead of torch.torch.int32 (#28883 )	2024-02-08 16:28:17 +01:00
Matt	693667b8ac	Remove dead TF loading code (#28926 ) Remove dead code	2024-02-08 14:17:33 +00:00
Arthur	115ac94d06	[`Core generation`] Adds support for static KV cache (#27931 ) Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>	2024-02-08 11:50:34 +01:00
Javier	4b236aed76	Fix utf-8 yaml load for marian conversion to pytorch in Windows (#28618 ) Fix utf-8 yaml in marian conversion	2024-02-08 08:23:15 +01:00
Klaus Hipp	33df036917	[Docs] Revert translation of '@slow' decorator (#28912 )	2024-02-08 03:31:47 +01:00
Klaus Hipp	328ade855b	[Docs] Fix placement of tilde character (#28913 ) Fix placement of tilde character	2024-02-07 17:19:39 -08:00
Huazhong Ji	5f96855761	Add npu device for pipeline (#28885 ) add npu device for pipeline Co-authored-by: unit_test <test@unit.com>	2024-02-07 17:27:01 +00:00
Yih-Dar	308d2b9004	Update the cache number (#28905 ) * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-02-07 16:37:09 +01:00
Daniel Korat	abf8f54a01	⚠️ Raise `Exception` when trying to generate 0 tokens ⚠️ (#28621 ) * change warning to exception * Update src/transformers/generation/utils.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * validate `max_new_tokens` > 0 in `GenerationConfig` * fix truncation test parameterization in `TextGenerationPipelineTests` --------- Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>	2024-02-07 13:42:01 +01:00
Matt	349a6e8542	Fix Keras scheduler import so it works for older versions of Keras (#28895 ) Fix our schedule import so it works for older versions of Keras	2024-02-07 12:28:24 +00:00
Sourab Mangrulkar	d9deddb4c1	fix Starcoder FA2 implementation (#28891 )	2024-02-07 14:10:10 +05:30
Sai-Suraj-27	64d1518cbf	fix: Fixed the documentation for `logging_first_step` by removing "evaluate" (#28884 ) Fixed the documentation for logging_first_step by removing evaluate.	2024-02-07 08:46:36 +01:00
Klaus Hipp	1c31b7aa3b	[Docs] Add missing language options and fix broken links (#28852 ) * Add missing entries to the language selector * Add links to the Colab and AWS Studio notebooks for ONNX * Use anchor links in CONTRIBUTING.md * Fix broken hyperlinks due to spaces * Fix links to OpenAI research articles * Remove confusing footnote symbols from author names, as they are also considered invalid markup	2024-02-06 12:01:01 -08:00
Yih-Dar	40658be461	Hotfix - make `torchaudio` get the correct version in `torch_and_flax_job` (#28899 ) * check * check * check --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-02-06 21:00:42 +01:00
Klaus Hipp	4830f26965	[Docs] Fix backticks in inline code and documentation links (#28875 ) Fix backticks in code blocks and documentation links	2024-02-06 11:15:44 -08:00
Lucain	a1afec9e17	Explicit server error on gated model (#28894 )	2024-02-06 17:45:20 +00:00
Yih-Dar	89439fea64	unpin torch (#28892 ) * unpin torch * check * check * check --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-02-06 17:21:05 +01:00
Yih-Dar	76b4f666f5	Revert "[WIP] Hard error when ignoring tensors." (#28898 ) Revert "[WIP] Hard error when ignoring tensors. (#27484)" This reverts commit `2da28c4b41`.	2024-02-06 17:18:30 +01:00
Yih-Dar	6529a5b5c1	Fix `FastSpeech2ConformerModelTest` and skip it on CPU (#28888 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-02-06 11:05:23 +01:00
Sourab Mangrulkar	5346db1684	Raise error when using `save_only_model` with `load_best_model_at_end` for DeepSpeed/FSDP (#28866 ) * Raise error when using `save_only_model` with `load_best_model_at_end` for DeepSpeed/FSDP * Update trainer.py	2024-02-06 11:25:44 +05:30
Eran Hirsch	ee2a3400f2	Fix LongT5ForConditionalGeneration initialization of lm_head (#28873 )	2024-02-06 04:24:20 +01:00
Klaus Hipp	1ea0bbd73c	[Docs] Update project names and links in awesome-transformers (#28878 ) Update project names and repository links in awesome-transformers	2024-02-06 04:06:29 +01:00
dependabot[bot]	e83227d76e	Bump cryptography from 41.0.2 to 42.0.0 in /examples/research_projects/decision_transformer (#28879 ) Bump cryptography in /examples/research_projects/decision_transformer Bumps [cryptography](https://github.com/pyca/cryptography) from 41.0.2 to 42.0.0. - [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst) - [Commits](https://github.com/pyca/cryptography/compare/41.0.2...42.0.0) --- updated-dependencies: - dependency-name: cryptography dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-02-06 03:53:08 +01:00
nakranivaibhav	2e7c942c81	Adds LlamaForQuestionAnswering class in modeling_llama.py along with AutoModel Support (#28777 ) * This is a test commit * testing commit * final commit with some changes * Removed copy statement * Fixed formatting issues * Fixed error added past_key_values in the forward method * Fixed a trailing whitespace. Damn the formatting rules are strict * Added the copy statement	2024-02-06 03:41:42 +01:00
xkszltl	ac51e59e47	Do not use mtime for checkpoint rotation. (#28862 ) Resolve https://github.com/huggingface/transformers/issues/26961	2024-02-06 03:21:50 +01:00
eajechiloae	06901162b5	ClearMLCallback enhancements: support multiple runs and handle logging better (#28559 ) * add clearml tracker * support multiple train runs * remove bad code * add UI entries for config/hparams overrides * handle models in different tasks * run ruff format * tidy code based on code review --------- Co-authored-by: Eugen Ajechiloae <eugenajechiloae@gmail.com>	2024-02-05 20:04:17 +00:00
amyeroberts	ba3264b4e8	Image Feature Extraction pipeline (#28216 ) * Draft pipeline * Fixup * Fix docstrings * Update doctest * Update pipeline_model_mapping * Update docstring * Update tests * Update src/transformers/pipelines/image_feature_extraction.py Co-authored-by: Omar Sanseviero <osanseviero@gmail.com> * Fix docstrings - review comments * Remove pipeline mapping for composite vision models * Add to pipeline tests * Remove for flava (multimodal) * safe pil import * Add requirements for pipeline run * Account for super slow efficientnet * Review comments * Fix tests * Swap order of kwargs * Use build_pipeline_init_args * Add back FE pipeline for Vilt * Include image_processor_kwargs in docstring * Mark test as flaky * Update TODO * Update tests/pipelines/test_pipelines_image_feature_extraction.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Add license header --------- Co-authored-by: Omar Sanseviero <osanseviero@gmail.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2024-02-05 14:50:07 +00:00
Yoach Lacombe	7addc9346c	Correct wav2vec2-bert inputs_to_logits_ratio (#28821 ) * Correct wav2vec2-bert inputs_to_logits_ratio * correct ratio * correct ratio, clean asr pipeline * refactor on one line	2024-02-05 13:14:47 +00:00
Arthur	3f9f749325	[`Doc`] update contribution guidelines (#28858 ) update guidelines	2024-02-05 21:19:21 +09:00
Nicolas Patry	2da28c4b41	[WIP] Hard error when ignoring tensors. (#27484 ) * [WIP] Hard error when ignoring tensors. * Better selection/error when saving a checkpoint. - Find all names we should normally drop (those are in the transformers config) - Find all disjoint tensors (for those we can safely trigger a copy to get rid of the sharing before saving) - Clone those disjoint tensors getting rid of the issue - Find all identical names (those should be declared in the config but we try to find them all anyway.) - For all identical names: - If they are in the config, just ignore them everything is fine - If they are not, warn about them. - For all remainder tensors which are shared yet neither identical NOR disjoint. raise a hard error. * Adding a failing test on `main` that passes here. * We don't need to keep the subfolder logic in this test. * Apply suggestions from code review Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2024-02-05 09:17:24 +01:00
w4ffl35	0466fd5ca2	Ability to override clean_code_for_run (#28783 ) * Add clean_code_for_run function * Call clean_code_for_run from agent method	2024-02-05 03:48:41 +01:00
Zizhao Chen	c430d6eaee	[Docs] Fix bad doc: replace save with logging (#28855 ) Fix bad doc: replace save with logging	2024-02-05 03:38:08 +01:00
Ziyang	7b702836af	Support custom scheduler in deepspeed training (#26831 ) Reuse trainer.create_scheduler to create scheduler for deepspeed	2024-02-05 03:33:55 +01:00

15098 Commits