* remove the restriction for 4-bit models
* Update src/transformers/modeling_utils.py
Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>
* bitsandbytes: prevent dtype casting while allowing device movement with .to or .cuda
* quality fix
* Improve warning message for .to() and .cuda() on bnb quantized models (behavior sketched below)
---------
Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>
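The `.to()`/`.cuda()` behavior described above comes down to honoring device arguments while refusing dtype casts on quantized weights. A minimal sketch of the idea, not the actual `modeling_utils.py` code; the `is_bnb_quantized` flag is illustrative:

```python
import warnings

import torch


def quantized_to(model: torch.nn.Module, *args, **kwargs):
    """Allow device movement on a bnb-quantized model, but drop dtype casts."""
    # Separate any dtype request from the device movement.
    wants_dtype = "dtype" in kwargs or any(isinstance(a, torch.dtype) for a in args)
    if getattr(model, "is_bnb_quantized", False) and wants_dtype:
        warnings.warn(
            "`.to(dtype)` is not supported for 4-bit/8-bit bitsandbytes models; "
            "the requested dtype is ignored and only the device is changed."
        )
        args = tuple(a for a in args if not isinstance(a, torch.dtype))
        kwargs.pop("dtype", None)
    return torch.nn.Module.to(model, *args, **kwargs)
```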
* don't run custom when not needed?
* update test fetcher filtering
* fixup and updates
* update
* update
* reduce burden
* nit
* nit
* missing comma
* this?
* this?
* more parallelism
* more
* nit for real parallelism on tf and torch examples
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update to make it more custom
* update to make it more custom
* update to make it more custom
* update to make it more custom
* update
* update
* update
* update
* update
* update
* use correct path
* fix path to test files and examples
* filter-tests
* filter?
* filter?
* filter?
* nits
* fix naming of the artifacts to be pushed
* list vs files
* list vs files
* fixup
* fix list of all tests
* fix the install steps
* fix the install steps
* fix the config
* fix the config
* only split if needed
* only split if needed
* extend should fix it
* extend should fix it
* arg
* arg
* update
* update
* run tests
* run tests
* run tests
* more nits
* update
* update
* update
* update
* update
* update
* update
* simpler way to show the test, reduces the complexity of the generated config
* simpler way to show the test, reduces the complexity of the generated config
* style
* oups
* oups
* fix import errors
* skip some tests for now
* update doctestjob
* more parallelism
* fixup
* test only the test in examples
* test only the test in examples
* nits
* from Arthur
* fix generated config
* update
* update
* show tests
* oups
* oups
* fix torch job for now
* use single upload step
* oups
* fu**k
* fix
* nit
* update
* nit
* fix
* fixes
* [test-all]
* add generate marker and generate job (marker usage sketched after this PR's commit list)
* oups
* torch job runs non-generate tests
* let repo utils test all utils
* Update
* styling
* fix repo utils test
* more parallel please
* don't test
* update
* bit more verbose sir
* more
* hub were skipped
* split by classname
* revert
* maybe?
* Amazing catch
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
* fix
* update
* update
* maybe non capturing
* manual convert?
* pass artifacts as parameters as otherwise the config is too long
* artifact.json
* store output
* might not be safe?
* my token
* mmm?
* use CI job ID
* can't get a proper id?
* ups
* build num
* update
* echo url
* this?
* this!
* fix
* wget
* ish
* dang
* update
* there we go
* update
* update
* pass all
* not .txt
* update
* fetch
* fix naming
* fix
* up
* update
* update
* ??
* update
* more updates
* update
* more
* skip
* oups
* pr documentation tests are currently created differently
* update
* hmmmm
* oups
* curl -L
* update
* ????
* nit
* mmmm
* ish
* ouf
* update
* ish
* update
* update
* update
* nit
* nit
* up
* oups
* documentation_test fix
* test_hub tests everything, selected just by marker
* update
* fix
* test_hub is the only annoying one now
* tf threads?
* oups
* not sure what is happening?
* fix?
* just use folder for staging hub
* I am getting fucking annoyed
* fix the test?
* update
* update
* ?
* fixes
* add comment!
* nit
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
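Most of the job splitting above rides on pytest markers; the `generate` marker and the torch-job exclusion follow the standard pattern. A generic sketch, with the registration wording as an assumption (the repo's actual pytest config may phrase it differently):

```python
# conftest.py: register the marker so `pytest --strict-markers` accepts it.
def pytest_configure(config):
    config.addinivalue_line("markers", "generate: marks tests that exercise generation")


# In a test module, tag generation tests with the marker:
import pytest


@pytest.mark.generate
def test_greedy_decoding():
    assert True  # placeholder body


# CI then splits the suite:
#   pytest -m generate        -> the dedicated generate job
#   pytest -m "not generate"  -> the torch job, which skips generate tests
```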
* first attempt at allowing both conversions from codestral and from the original mamba ssm
* allow fp16, seems default for mamba2
* dtype fix
* simplify codestral check, don't overwrite pad/eos/bos when codestral
* change file -> directory
* use path join to be safe
* style
* apply code review
- add util mamba2 tokenizer (gptneox with left padding)
- add models dict
* fix copies
* add tokenizer to docs
* empty commit to check for weird err
* make conversion depend on user-specified model type, with defaults for the original paper models
* small comment nit
* remove norm_before_gate in conversion
* simplify model dict by using shared keys directly + remove unnecessary attributes
* fix tokenization: remove separate mamba2 tokenizer, add padding option as kwarg to gptneox one and reuse it for the conversion script
* simplify even further as we pass padding side via **kwargs already
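The tokenizer simplification above works because `padding_side` already flows through the tokenizer `**kwargs`, so the conversion script can reuse the GPT-NeoX tokenizer directly; a small sketch (checkpoint id illustrative):

```python
from transformers import AutoTokenizer

# Mamba2 reuses the GPT-NeoX tokenizer; left padding is just a kwarg,
# so no dedicated Mamba2 tokenizer class is needed.
tokenizer = AutoTokenizer.from_pretrained(
    "EleutherAI/gpt-neox-20b", padding_side="left"
)
```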
* pass module to Params4bit.from_prequantized to ensure quant_state is set
* make sure to check bnb version
* revert min bnb version and use inspect on method instead
* use version instead of inspect to prevent performance hit
* make the property name readable
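The last three commits converge on a version gate: `inspect.signature` gave the same answer but cost too much when evaluated per weight. A hedged sketch; the minimum version actually gated in `transformers` may differ:

```python
import importlib.metadata

from packaging import version


def bnb_supports_module_arg(minimum: str = "0.43.3") -> bool:
    """Check once whether Params4bit.from_prequantized accepts `module`.

    A version comparison is effectively free, whereas inspecting the
    method signature on every parameter load caused a measurable hit.
    """
    installed = version.parse(importlib.metadata.version("bitsandbytes"))
    return installed >= version.parse(minimum)
```

The loader then forwards `module=...` only when this returns True, which is what ensures the `quant_state` ends up attached to the right module.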
* Customising the separator used when packing sequences in DataCollatorWithFlattening (usage sketched below)
* update DataCollatorWithFlattening docs
---------
Co-authored-by: weifangyuan <i.weifangyuan@yuewen.com>
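A usage sketch for the customisable separator, assuming the new keyword is `separator_id` (a -100 default keeps example boundaries masked out of the loss):

```python
from transformers import DataCollatorWithFlattening

# Packs all examples of a batch into a single row; the separator id is
# written into the labels at each example boundary so the loss cannot
# leak across packed examples.
collator = DataCollatorWithFlattening(separator_id=-100)

features = [
    {"input_ids": [101, 7592, 102]},
    {"input_ids": [101, 2088, 102]},
]
batch = collator(features)
print(batch["input_ids"].shape)  # (1, 6): one flattened row
```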
* Adding SDPA support for RoBERTa-based models (usage sketched after this PR's commit list)
* add not is_cross_attention
* fix copies
* fix test
* add minimal test for camembert and xlm_roberta as their test class does not inherit from ModelTesterMixin
* address some review comments
* use copied from
* style
* consistency
* fix lists
---------
Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
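Once merged, RoBERTa-family checkpoints can opt into PyTorch's scaled-dot-product-attention kernel like other SDPA-enabled models; a minimal usage sketch:

```python
import torch

from transformers import AutoModelForMaskedLM

# Explicitly request the SDPA attention implementation added for
# RoBERTa-based models (roberta, camembert, xlm-roberta, ...).
model = AutoModelForMaskedLM.from_pretrained(
    "roberta-base",
    attn_implementation="sdpa",
    torch_dtype=torch.float16,
)
```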
* init fix
* fix mask during cached forward, move mask related stuff to own function
* adjust tests since left padding no longer changes logits as much + batch generation (with a TODO on logits comparison)
* revert overwriting new integration tests
* move some comments to docstring
* add Blip2ForImageTextRetrieval (usage sketched at the end of this PR's commit list)
* use one line and remove unnecessary space in tests
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* use value from the config, rather than hardcoded
* change order of params in Blip2QFormerModel.forward
* update docstring
* fix style
* update test_inference_opt
* move embeddings out of Blip2QFormerModel
* remove from_vision_qformer_configs
* remove autocast float16 in Blip2QFormerModel
* rename fields to vision_projection, text_projection, use_image_text_matching_head
* use CLIPOutput for Blip2ImageTextMatchingModelOutput
* remove past_key_values_length from Blip2TextEmbeddings
* fix small typo in the CLIPOutput docstring
* add Blip2ForImageTextRetrieval to Zero Shot Image Classification mapping
* update docstring and add require_torch_fp16
* rollback test_inference_opt
* use use_image_text_matching_head=True in convert
* skip test_model_get_set_embeddings
* fix create_rename_keys error on new itm fields
* revert to do scale after dot product between "query" and "key"
* fix ValueError on convert script for blip2-opt-2.7b
* update org of paths to Salesforce
* add is_pipeline_test_to_skip for VisualQuestionAnsweringPipelineTests
* [run_slow] blip_2
* removed Blip2ForImageTextRetrieval from IGNORE_NON_AUTO_CONFIGURED
* fix docstring of Blip2ImageTextMatchingModelOutput
* [run_slow] blip_2
* fix multi-gpu tests
* [run_slow] blip_2
* [run_slow] blip_2
---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
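A usage sketch for the new retrieval model; the checkpoint id follows the Salesforce org paths mentioned above but should be verified on the Hub, and `logits_per_image` is taken from the `Blip2ImageTextMatchingModelOutput` referenced in these commits:

```python
import requests
from PIL import Image

from transformers import AutoProcessor, Blip2ForImageTextRetrieval

processor = AutoProcessor.from_pretrained("Salesforce/blip2-itm-vit-g")
model = Blip2ForImageTextRetrieval.from_pretrained("Salesforce/blip2-itm-vit-g")

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)
inputs = processor(images=image, text="two cats sleeping", return_tensors="pt")

# With use_image_text_matching_head=True the model returns match/no-match
# logits; without it, contrastive image/text similarity scores.
outputs = model(**inputs, use_image_text_matching_head=True)
match_probs = outputs.logits_per_image.softmax(dim=1)
```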
Very small change to one of the parameters
np.random.randint's second parameter is an exclusive upper bound, so it is never included in the possible outputs. We therefore want the upper bound to be 2 so that the generated classification labels include some 1s as well.
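The exclusive bound is easy to verify directly:

```python
import numpy as np

np.random.randint(0, 1, size=5)  # high=1 is exclusive: always array([0, 0, 0, 0, 0])
np.random.randint(0, 2, size=5)  # e.g. array([0, 1, 1, 0, 1]) -- both labels appear
```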
* fix redundant checkpointing in example scripts (see the sketch after this commit list)
* Update examples/pytorch/image-classification/run_image_classification_no_trainer.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update examples/pytorch/translation/run_translation_no_trainer.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update examples/pytorch/token-classification/run_ner_no_trainer.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update examples/pytorch/text-classification/run_glue_no_trainer.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update examples/pytorch/summarization/run_summarization_no_trainer.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update examples/pytorch/semantic-segmentation/run_semantic_segmentation_no_trainer.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update examples/pytorch/language-modeling/run_mlm_no_trainer.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update examples/pytorch/language-modeling/run_fim_no_trainer.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update examples/pytorch/language-modeling/run_clm_no_trainer.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update examples/pytorch/image-pretraining/run_mim_no_trainer.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update examples/pytorch/instance-segmentation/run_instance_segmentation_no_trainer.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update examples/pytorch/multiple-choice/run_swag_no_trainer.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update examples/pytorch/question-answering/run_qa_no_trainer.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update examples/pytorch/object-detection/run_object_detection_no_trainer.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update examples/pytorch/question-answering/run_qa_beam_search_no_trainer.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
---------
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
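What "redundant" means here: with gradient accumulation, `completed_steps` only advances on real optimizer steps, so a bare modulo check fires once per micro-batch in between and re-saves the same checkpoint. A sketch of the corrected pattern with placeholder setup, not a verbatim excerpt from the scripts:

```python
import os

from accelerate import Accelerator

accelerator = Accelerator(gradient_accumulation_steps=8)
checkpointing_steps = 500
completed_steps = 0

for batch in train_dataloader:  # placeholder: a prepared dataloader/model/optimizer
    with accelerator.accumulate(model):
        ...  # forward, backward, optimizer step

    # Guarding on sync_gradients counts (and saves) once per optimizer
    # step, not once per accumulation micro-batch.
    if accelerator.sync_gradients:
        completed_steps += 1
        if completed_steps % checkpointing_steps == 0:
            accelerator.save_state(os.path.join("checkpoints", f"step_{completed_steps}"))
```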
* Add a fix for the case when tokenizers are passed as a string (usage sketched after this PR's commit list)
* Support image processors and feature extractors as well
* Reverting load_feature_extractor and load_image_processor
* Add test
* Test is torch-only
* Add tests for preprocessors and feature extractors and move test
* Extremely experimental fix
* Revert that change, wrong branch!
* Typo!
* Split tests
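The fix makes string identifiers behave like already-loaded objects across preprocessor types; a minimal usage sketch:

```python
from transformers import pipeline

# The tokenizer is given as a Hub id string; after this fix, image
# processors and feature extractors passed as strings resolve the same way.
classifier = pipeline(
    "text-classification",
    model="distilbert-base-uncased-finetuned-sst-2-english",
    tokenizer="distilbert-base-uncased",
)
print(classifier("This works with a string tokenizer too."))
```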
* update ExportableState callbacks state before saving trainer_state on save_checkpoint
* run make fixup and fix format
* manage multiple stateful callbacks of the same class
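A hedged sketch of the two fixes together: snapshot exportable callbacks right before `trainer_state.json` is written, and keep same-class callbacks in a list so one cannot clobber another (the real logic lives inside `Trainer`'s checkpoint saving, with different names):

```python
import os


def snapshot_callbacks(callbacks, trainer_state, output_dir):
    # Refresh every exportable callback's state *before* serializing, so the
    # saved trainer_state.json reflects this checkpoint, not a stale one.
    stateful = {}
    for cb in callbacks:
        if hasattr(cb, "state"):  # stands in for isinstance(cb, ExportableState)
            stateful.setdefault(cb.__class__.__name__, []).append(cb.state())
    trainer_state.stateful_callbacks = stateful
    trainer_state.save_to_json(os.path.join(output_dir, "trainer_state.json"))
```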