transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-28 16:52:24 +06:00

Author	SHA1	Message	Date
dependabot[bot]	6e59b30841	Bump zipp from 3.7.0 to 3.19.1 in /examples/research_projects/decision_transformer (#31871 ) Bump zipp in /examples/research_projects/decision_transformer Bumps [zipp](https://github.com/jaraco/zipp) from 3.7.0 to 3.19.1. - [Release notes](https://github.com/jaraco/zipp/releases) - [Changelog](https://github.com/jaraco/zipp/blob/main/NEWS.rst) - [Commits](https://github.com/jaraco/zipp/compare/v3.7.0...v3.19.1) --- updated-dependencies: - dependency-name: zipp dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-07-09 21:44:48 +01:00
Merve Noyan	e3a7d9bd47	Update depth estimation task guide (#31860 ) --------- Co-authored-by: Merve Noyan <mervenoyan@Merve-MacBook-Pro.local> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-07-09 22:13:30 +03:00
Yih-Dar	4c8149d643	Fix `_init_weights` for `ResNetPreTrainedModel` (#31851 ) * init * test --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-07-09 20:09:08 +02:00
Yung-Sung Chuang	d094d8d9ec	Generate: Add new decoding strategy "DoLa" in `.generate()` (#29619 ) Co-authored-by: Joao Gante <joao@huggingface.co>	2024-07-09 17:37:38 +01:00
chenk	99c0e55335	docs: typo in tf qa example (#31864 ) Signed-off-by: chenk <hen.keinan@gmail.com>	2024-07-09 16:30:06 +01:00
Joao Gante	4c2538b863	Test loading generation config with safetensor weights (#31550 ) fix test	2024-07-09 16:22:43 +02:00
kallewoof	cffa2b9c1d	save_pretrained: use tqdm when saving checkpoint shards from offloaded params (#31856 )	2024-07-09 12:55:57 +01:00
hatti	350aed7076	chore: remove duplicate words (#31853 ) remove duplicate words	2024-07-09 10:38:29 +01:00
NielsRogge	bd760cd13d	[Grounding DINO] Add processor to auto mapping (#31845 ) Add model	2024-07-09 11:28:53 +02:00
fxmarty	0abf5e8eae	FX symbolic_trace: do not test decoder_inputs_embeds (#31840 ) only test input_embeds, not decoder_input_embeds	2024-07-09 08:07:46 +02:00
Raushan Turganbay	952dfd4867	Deprecate `vocab_size` in other two VLMs (#31681 ) * deprrecate `vocab_size` in other two VLMs * Update src/transformers/models/fuyu/configuration_fuyu.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * depracate until 4.44 --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-07-09 10:40:06 +05:00
Joao Gante	594c1610fa	Mamba & RecurrentGemma: enable strict signature (#31549 ) * enable strict signature * this should not have been deleted * recurrent_gemma too	2024-07-08 15:48:32 +01:00
André Storhaug	ae9dd02ee1	Fix incorrect accelerator device handling for MPS in `TrainingArguments` (#31812 ) * Fix wrong acclerator device setup when using MPS * More robust TrainingArguments MPS handling * Update training_args.py * Cleanup	2024-07-08 12:49:30 +01:00
Yih-Dar	4879ac2b33	Avoid failure `TFBlipModelTest::test_pipeline_image_to_text` (#31827 ) * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-07-08 13:49:21 +02:00
fxmarty	ba743700f4	transformers.fx.symbolic_trace supports inputs_embeds (#31574 ) * symbolic trace supports inputs_embeds * fix test? * Update tests/test_modeling_common.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-07-08 19:17:28 +08:00
omahs	e5ca9b057c	Fix typos (#31819 ) * fix typo * fix typo * fix typos * fix typo * fix typos	2024-07-08 11:52:47 +01:00
dependabot[bot]	f4711844a3	Bump certifi from 2023.7.22 to 2024.7.4 in /examples/research_projects/lxmert (#31838 ) Bump certifi in /examples/research_projects/lxmert Bumps [certifi](https://github.com/certifi/python-certifi) from 2023.7.22 to 2024.7.4. - [Commits](https://github.com/certifi/python-certifi/compare/2023.07.22...2024.07.04) --- updated-dependencies: - dependency-name: certifi dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-07-08 11:17:49 +01:00
dependabot[bot]	9f3f58c905	Bump transformers from 4.26.1 to 4.38.0 in /examples/tensorflow/language-modeling-tpu (#31837 ) Bump transformers in /examples/tensorflow/language-modeling-tpu Bumps [transformers](https://github.com/huggingface/transformers) from 4.26.1 to 4.38.0. - [Release notes](https://github.com/huggingface/transformers/releases) - [Commits](https://github.com/huggingface/transformers/compare/v4.26.1...v4.38.0) --- updated-dependencies: - dependency-name: transformers dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-07-08 11:12:33 +01:00
Pavel Iakubovskii	a177821b24	Add FA2 and `sdpa` support for SigLIP (#31499 ) * Rebase to main * Fix attention implementation autoset for tex and vision configs * Fixup * Minor fixes * Fix copies * Fix attention_mask for FA2 * Add eqvivalence tests for siglip * Remove right padding test * Uncomment flaky * Fix import * Add to docs * Fix test message * Add sdpa * Add sdpa equivalence test * Add siglip sdpa to docs * Fix typing for attention output * Add sdpa tests * Fix signature of FA2 * Autoset attn_implementation in config * Rename bsz -> batch_size * Move back autoset attn method * Mark as flaky * Correct attention mask padding * [run-slow] siglip * Add FA2 and sdpa docs * Style fix * Remove flaky for FA2 test * Change attention implementation set * Change attn_implementaiton propogation * Fix typos * Add modality to assert message * Add more sdpa backends in test * [run slow] siglip * Add math sdpa backend for all options * [run slow] siglip	2024-07-08 11:10:02 +01:00
dependabot[bot]	076e66e479	Bump certifi from 2023.7.22 to 2024.7.4 in /examples/research_projects/decision_transformer (#31813 ) Bump certifi in /examples/research_projects/decision_transformer Bumps [certifi](https://github.com/certifi/python-certifi) from 2023.7.22 to 2024.7.4. - [Commits](https://github.com/certifi/python-certifi/compare/2023.07.22...2024.07.04) --- updated-dependencies: - dependency-name: certifi dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-07-08 10:52:10 +01:00
Dingli Yang	c1cda0ee2c	Fix Seq2SeqTrainer crash when BatchEncoding data is None (#31418 ) avoiding crash when BatchEncoding data is None	2024-07-08 10:51:23 +01:00
NielsRogge	06fd7972ac	Add ZoeDepth (#30136 ) * First draft * Add docs * Clean up code * Convert model * Add image processor * Convert Zoe_K * More improvements * Improve variable names and docstrings * Improve variable names * Improve variable names * Replace nn.sequential * More improvements * Convert ZoeD_NK * Fix most tests * Verify pixel values * Verify pixel values * Add squeeze * Update beit to support arbitrary window sizes * Improve image processor * Improve docstring * Improve beit * Improve model outputs * Add figure * Fix beit * Update checkpoint * Fix repo id * Add _keys_to_ignore_on_load_unexpected * More improvements * Address comments * Address comments * Address comments * Address comments * Rename variable name * Add backbone_hidden_size * Vectorize * Vectorize more * Address comments * Clarify docstring * Remove backbone_hidden_size * Fix image processor * Remove print statements * Remove print statement * Add integration test * Address comments * Address comments * Address comments * Address comments * Add requires_backends * Clean up * Simplify conversion script * Simplify more * Simplify more * Simplify more * Clean up * Make sure beit is loaded correctly * Address comment * Address bin_configurations * Use bin_configurations * Convert models, add integration tests * Fix doc test * Address comments * Unify regressor classes * Clarify arguments * Improve resize_image * Add num_relative_features * Address comment * [run-slow]beit,data2vec,zoedepth * [run-slow]beit,data2vec,zoedepth * Address comments * Address comment * Address comment * Replace nn.TransformerEncoderLayer and nn.TransformerEncoder * Replace nn.MultiheadAttention * Add attributes for patch transformer to config * Add tests for ensure_multiple_of * Update organization * Add tests * [run-slow] beit data2vec * Update ruff * [run-slow] beit data2vec * Add comment * Improve docstrings, add test * Fix interpolate_pos_encoding * Fix slow tests * Add docstring * Update src/transformers/models/zoedepth/image_processing_zoedepth.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/zoedepth/image_processing_zoedepth.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Improve tests and docstrings * Use run_common_tests * Improve docstrings * Improve docstrings * Improve tests * Improve tests * Remove print statements --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-07-08 11:43:33 +02:00
Pedro Cuenca	1082361a19	Depth Anything: update conversion script for V2 (#31522 ) * Depth Anything: update conversion script for V2 * Update docs * Style * Revert "Update docs" This reverts commit `be0ca47ea1`. * Add docs for depth anything v2 * Add depth_anything_v2 to MODEL_NAMES_MAPPING Done similarly to Flan-T5: https://github.com/huggingface/transformers/pull/19892/files * Add tip in original docs	2024-07-05 19:28:41 +01:00
Thien Tran	a8fa6fbbec	Fix Wav2Vec2 Fairseq conversion (weight norm state dict keys) (#31714 ) * handle new weight norm * fix * fix trailing space	2024-07-05 19:26:21 +01:00
Anton Vlasjuk	a01b033cb4	Fix galore lr display with schedulers (#31710 ) * fix galore lr display with lr schedulers * style * add some tests to check for displayed lrs * copy-paste err for warmup steps * standardize the default lr to be only in the optimizer * trying out my luck with the reads	2024-07-05 18:59:09 +01:00
Billy Cao	ac26260436	Allow FP16 or other precision inference for Pipelines (#31342 ) * cast image features to model.dtype where needed to support FP16 or other precision in pipelines * Update src/transformers/pipelines/image_feature_extraction.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Use .to instead * Add FP16 pipeline support for zeroshot audio classification * Remove unused torch imports * Add docs on FP16 pipeline * Remove unused import * Add FP16 tests to pipeline mixin * Add fp16 placeholder for mask_generation pipeline test * Add FP16 tests for all pipelines * Fix formatting * Remove torch_dtype arg from is_pipeline_test_to_skip* * Fix format * trigger ci --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-07-05 17:21:50 +01:00
Matt	e786844425	Repeating an important warning in the chat template docs (#31796 ) * Repeating an important warning in the chat template docs * Update docs/source/en/chat_templating.md Co-authored-by: Lysandre Debut <hi@lysand.re> * Reword for clarity * Reword for clarity --------- Co-authored-by: Lysandre Debut <hi@lysand.re>	2024-07-05 15:30:24 +01:00
Billy Cao	1d3eaa6f7e	Add training support for SigLIP (#31495 ) * Add siglip loss function * Update docs * Enable training tests [experimental] enable GC training tests as it has worked for my own data * Remove test_training* overrides to enable training tests [run_slow] siglip * Skip training tests for Siglip text model and ImageClassificationModel [run_slow] siglip * Skip GC training tests for SiglipForImageClassification * Explicitly skip training tests for SiglipVisionModel Add skip reason for training tests for SiglipTextModel * Remove copied from to fix CI	2024-07-05 14:50:39 +01:00
Aymeric Roucher	1556025271	Code agent: allow function persistence between steps (#31769 ) * Code agent: allow function persistence between steps	2024-07-05 11:09:11 +02:00
Yih-Dar	eef0507f3d	Fix gemma tests (#31794 ) * skip 3 7b tests * fix * fix * fix * [run-slow] gemma --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-07-05 10:17:59 +02:00
Boris Feld	9e599d1d94	Update CometCallback to allow reusing of the running experiment (#31366 ) * Update CometCallback to allow reusing of the running experiment * Fixups * Remove useless TODO * Add checks for minimum version of the Comet SDK * Fix documentation and links. Also simplify how the Comet Experiment name is passed	2024-07-05 08:13:46 +02:00
xiangdong	d19b5a90c2	Exclude torch.compile time from metrics computation (#31443 ) * exclude compile time from metrics computation * fix the quality issue	2024-07-05 08:11:55 +02:00
Kazuaki Ishizaki	2aa2a14481	Make tensor device correct when ACCELERATE_TORCH_DEVICE is defined (#31751 ) return correct device when ACCELERATE_TORCH_DEVICE is defined	2024-07-05 08:09:04 +02:00
Marc Sun	8c5c180de0	Fix serialization for offloaded model (#31727 ) * Fix serialization * style * add test	2024-07-05 08:07:07 +02:00
mxkopy	eaa5f41439	Fix ClapProcessor to merge feature_extractor output into the returned BatchEncoding (#31767 ) * fixed ClapProcessor to merge all values output from the feature extractor into the returned BatchEncoding. * fixed trailing whitespace	2024-07-05 07:55:47 +02:00
Billy Cao	43ffb785c0	Add torch_empty_cache_steps to TrainingArguments (#31546 ) * Add torch_empty_cache_steps to TrainingArguments * Fix formatting * Add torch_empty_cache_steps to docs on single gpu training * Remove check for torch_empty_cache_steps <= max_steps * Captalize Tip * Be device agnostic * Fix linting	2024-07-04 13:20:49 -04:00
hoshi-hiyouga	cee768d97e	Fix Gemma2 types (#31779 ) Update __init__.py	2024-07-04 15:37:32 +02:00
Yih-Dar	87726a08ed	`pytest_num_workers=4` for some CircleCI jobs (#31764 ) pytest_num_workers=4 Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-07-04 14:44:58 +02:00
Pavel Iakubovskii	048f599f35	Fix RT-DETR weights initialization (#31724 ) * Fix init for rt-detr heads * Fixup * Add separate prior_prob value to config for initialization * Add bbox init * Change to 1 / num_labels init * Adjust weights init test * Fix style for test	2024-07-03 14:29:02 +01:00
Pavel Iakubovskii	b97521614a	Fix RT-DETR cache for generate_anchors (#31671 ) * Fix cache and type conversion * Add test * Fixup * nit * [run slow] rt_detr * Fix test * Fixup * [run slow] rt_detr * Update src/transformers/models/rt_detr/modeling_rt_detr.py	2024-07-03 14:19:57 +01:00
Willard Sheen	534cbf8a5d	[fix bug] logits's shape different from label's shape in preprocess_logits_for_metrics (#31447 ) * [fix BUG] pad labels before use it in preprocess_logits_for_metrics * a more readable fix labels can't use `gather` before pass to `preprocess_logits_for_metrics`, so must split into 2 if-block * add a comment * oh code quality check	2024-07-03 06:58:27 -04:00
Nate Brake	65a02cd27d	Add ignore_errors=True to trainer.py rmtree in _inner_training_loop (#31668 ) Update trainer.py	2024-07-03 06:54:49 -04:00
Joao Gante	ddfaf11926	Gemma 2: Update slow tests (#31759 ) gemma 2 slow tests	2024-07-03 11:43:44 +02:00
Pablo Montalvo	c1fe12595e	handle (processor_class, None) returned by ModelPatterns (#31753 )	2024-07-03 11:42:30 +02:00
Aymeric Roucher	0fd885b91c	Adds final answer tool for all agents (#31703 ) * Adds final answer tool for all agents * Typo * Add clarification in doc * Put final_answer tool adition in agent for clarity	2024-07-03 11:36:09 +02:00
Ella Charlaix	dc72fd7edd	Requires for torch.tensor before casting (#31755 )	2024-07-03 11:12:51 +02:00
jiqing-feng	7f91f168a1	fix assisted decoding (#31401 ) * fix assisted decoding * check None * fix typo * fix _prepare_special_tokens * fix style * fix lint * add tests for assisted decoding * fix style * fix tests check	2024-07-03 09:22:56 +01:00
Jörg Bornschein	f91c16d270	Fix documentation for Gemma2. (#31682 ) * Fix documentation for Gemma2. Model sizes and Blog post URL are wrong in the documentation. * Update docs/source/en/model_doc/gemma2.md Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-07-02 23:04:53 +01:00
Matt	cd0935dd55	Make tool JSON schemas consistent (#31756 ) Make the order of array items consistent using sorted()	2024-07-02 20:00:42 +01:00
Joao Gante	82486e5995	🚨🚨 TextGenerationPipeline: rely on the tokenizer default kwargs (#31747 ) * rely on the tokenizer default kwargs * fix a few tests	2024-07-02 16:17:42 +02:00

... 3 4 5 6 7 ...

16502 Commits