transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-20 04:58:22 +06:00

Author	SHA1	Message	Date
Sangbum Daniel Choi	874ac129bb	fix the get_size_with_aspect_ratio in max_size situation (#30902 ) * fix the get_size_with_aspect_ratio in max_size situation * make fix-up * add more general solution * consider when max_size is not defined * fix typo * fix typo * simple fix * fix error * fix if else error * fix error of size overwrite * fix yolos image processing * fix detr image processing * make * add longest related test script * Update src/transformers/models/yolos/image_processing_yolos.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * add more test * add test script about longest size * remove deprecated --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-06-03 16:12:08 +01:00
Isotr0py	e4628434d8	Add Qwen2 GGUF loading support (#31175 ) * add qwen2 gguf support * Update docs * fix qwen2 tokenizer * add qwen2 gguf test * fix typo in qwen2 gguf test * format code * Remove mistral, clarify the error message * format code * add typing and update docstring	2024-06-03 14:55:10 +01:00
Yih-Dar	df848acc5d	Fix `test_compile_static_cache` (#30991 ) * fix * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-06-03 15:16:28 +02:00
fxmarty	221aaec6ec	Ignore non-causal mask in more cases with SDPA (#30138 ) * update non-causal mask for sdpa * add test * update docstrings * add one more test * fix cross attention bug * gentler atol/rtol	2024-06-03 19:08:41 +08:00
Ahmed Moubtahij	39b2ff69d6	Token healing (#30081 ) * token healing impl + trie with extensions * make fixup * prefix-robust space tokenization * examples readme and requirements * make fixup * allow input prompt and model * redundant defaults * Specialized Trie * make fixup * updated tests with new inherited Tree * input ids to auto device_map * rm unused import * Update src/transformers/generation/utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * naming convention * Revert "naming convention" This reverts commit dd39d9c5b7a969e2d8a8d2a8e54f121b82dc44f0. * naming convention * last -hopefully- changes --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2024-06-03 10:53:15 +02:00
Aymeric Roucher	9837a25481	Add streaming, various fixes (#30838 ) * Implement streaming run in ReAct agents * Allow additional imports in code agents * Python interpreter: support classes and exceptions, fixes	2024-05-31 14:16:23 +02:00
Marc Sun	48cada87c3	Fix quantized cache output (#31143 )	2024-05-31 12:08:55 +02:00
zspo	cda9c82a63	fix get_scheduler when name is warmup_stable_decay (#31128 ) fix get_scheduler args	2024-05-30 15:25:43 +01:00
Younes Belkada	5e5c4d629d	FIX / Quantization: Add extra validation for bnb config (#31135 ) add validation for bnb config	2024-05-30 11:45:03 +02:00
Dhruv Pai	5c88253556	Add on_optimizer_step to callback options (#31095 ) * Modified test * Added on_optimizer_step to callbacks * Move callback after step is called * Added on optimizer step callback	2024-05-29 16:20:59 +02:00
Lucain	c3044ec2f3	Use `HF_HUB_OFFLINE` + fix has_file in offline mode (#31016 ) * Fix has_file in offline mode * harmonize env variable for offline mode * Switch to HF_HUB_OFFLINE * fix test * revert test_offline to test TRANSFORMERS_OFFLINE * Add new offline test * merge conflicts * docs	2024-05-29 11:55:43 +01:00
amyeroberts	a564d10afe	Deprecate low use models (#30781 ) * Deprecate models - graphormer - time_series_transformer - xlm_prophetnet - qdqbert - nat - ernie_m - tvlt - nezha - mega - jukebox - vit_hybrid - x_clip - deta - speech_to_text_2 - efficientformer - realm - gptsan_japanese * Fix up * Fix speech2text2 imports * Make sure message isn't indented * Fix docstrings * Correctly map for deprecated models from model_type * Uncomment out * Add back time series transformer and x-clip * Import fix and fix-up * Fix up with updated ruff	2024-05-28 18:07:07 +01:00
Younes Belkada	3264be4114	TST: Fix instruct-blip tests (#31088 ) * fix flan t5 tests * better format	2024-05-28 18:29:11 +02:00
Yih-Dar	3af7bf30ad	skip `test_multi_gpu_data_parallel_forward` for `vit` and `deit` (#31086 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-05-28 17:44:52 +02:00
Raushan Turganbay	779bc360ff	Watermark: fix tests (#30961 ) * fix tests * style * Update tests/generation/test_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-05-28 17:07:42 +05:00
Lysandre Debut	a3c7b59e31	Fix failing tokenizer tests (#31083 ) * Fix failing tokenizer tests * Use small tokenizer * Fix remaining reference	2024-05-28 13:34:23 +02:00
Pavel Iakubovskii	98e2d48e9a	Fix OWLv2 post_process_object_detection for multiple images (#31082 ) * Add test for multiple images * [run slow] owlv2 * Fix box rescaling * [run slow] owlv2	2024-05-28 12:06:06 +01:00
oOraph	936ab7bae5	fix from_pretrained in offline mode when model is preloaded in cache (#31010 ) * Unit test to verify fix Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com> * fix from_pretrained in offline mode when model is preloaded in cache Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com> * minor: fmt Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com> --------- Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com> Co-authored-by: Raphael Glon <oOraph@users.noreply.github.com>	2024-05-28 11:56:05 +02:00
Yih-Dar	8e3b1fef97	Remove `ninja` from docker image build (#31080 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-05-28 11:36:26 +02:00
Yih-Dar	9d35edbb30	skip `test_model_parallelism` for 2 model test classes (#31067 ) skip Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-05-27 18:36:39 +02:00
Yoach Lacombe	d355741eca	Fix pad_to_max_length Whisper (#30787 ) * fix pad_to_max_length Whisper * add tests * make style	2024-05-27 16:09:05 +02:00
Marc Sun	b84cd67526	Fix quanto tests (#31062 ) fix quanto tests	2024-05-27 15:53:45 +02:00
Ita Zaporozhets	deba7655e6	Add split special tokens (#30772 ) * seems like `split_special_tokens` is used here * split special token * add new line at end of file * moving split special token test to common tests * added assertions * test * fixup * add co-author * passing rest of args to gptsan_japanese, fixing tests * removing direct comparison of fast and slow models * adding test support for UDOP and LayoutXLM * ruff fix * readd check if slow tokenizer * modify test to handle bos tokens * removing commented function * trigger build * applying review feedback - updated docstrings, var names, and simplified tests * ruff fixes * Update tests/test_tokenization_common.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * applying feedback, comments * shutil temp directory fix --------- Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com> Co-authored-by: Ita Zaporozhets <itazaporozhets@Itas-MBP.localdomain> Co-authored-by: itazap <itazap@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Ita Zaporozhets <itazaporozhets@Itas-MacBook-Pro.local>	2024-05-24 08:38:58 -07:00
BHUVAN M	e5103a76cc	added interpolation for vitmae model in pytorch as well as tf. (#30732 ) * added interpolation for vitmae model in pytorch as well as tf. * Update modeling_vit_mae.py irreugalr import fixed * small changes and proper formatting * changes suggested in review. * modified decoder interpolate_func * arguments and docstring fix * Apply suggestions from code review doc fixes Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-05-24 16:20:09 +01:00
Younes Belkada	658b849aeb	Quantization / TST: Fix remaining quantization tests (#31000 ) * Fix remaining quant tests * Update test_quanto.py	2024-05-24 14:35:59 +02:00
Lucain	fd3c128040	Fix resume_download future warning (#31007 ) * Fix resume_download future warning * better like this * Add regression test	2024-05-24 14:35:40 +02:00
Marc Sun	ae87f9797b	FIX / TST: Fix expected results on Mistral AWQ test (#30971 ) fix awq mistral test	2024-05-24 14:06:31 +02:00
Fanli Lin	04c7c176d7	[tests] make `test_model_parallelism` device-agnostic (#30844 ) * enable on xpu * fix style * add comment and mps	2024-05-24 11:51:51 +01:00
Yixiang Gao	42d8dd8716	Perceiver interpolate position embedding (#30979 ) * add test that currently fails * test passed * all perceiver passed * fixup, style, quality, repo-consistency, all passed * Apply suggestions from code review: default to False + compute sqrt once only Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * fix a minor bracket * replace dim with self._num_channels * add arguments to the rest preprocessors --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-05-24 11:13:58 +01:00
Ita Zaporozhets	7f6e87413f	add prefix space ignored in llama #29625 (#30964 ) * add prefix space ignored in llama #29625 * adding test with add_prefix_space=False * ruff --------- Co-authored-by: Ita Zaporozhets <itazaporozhets@Itas-MBP.localdomain>	2024-05-24 01:03:00 -07:00
Yasmin Moslem	6d3d5b1039	Remove deprecated properties in tokenization_nllb.py and tokenization_nllb_fast.py (#29834 ) * Fix typo in tokenization_nllb.py Change `adder_tokens_decoder` into `added_tokens_decoder` and improve the warning's readability. * Fix typo in tokenization_nllb_fast.py Change `adder_tokens_decoder` into `added_tokens_decoder` and improve the warning's readability. * Remove deprecated attributes in tokenization_nllb.py Remove deprecated attributes: `lang_code_to_id`, `fairseq_tokens_to_ids`, `id_to_lang_code`, and `fairseq_ids_to_tokens` * Remove deprecated attribute in tokenization_nllb_fast.py Remove deprecated attribute `lang_code_to_id` * Remove deprecated properties in tokenization_nllb.py Remove deprecated properties - fix format * Remove deprecated properties in tokenization_nllb_fast.py Remove deprecated properties - fix format * Update test_tokenization_nllb.py * update test_tokenization_nllb.py * Update tokenization_nllb.py * Update test_tokenization_seamless_m4t.py * Update test_tokenization_seamless_m4t.py	2024-05-23 18:53:26 +02:00
Aritra Roy Gosthipaty	965e98dc54	[Port] TensorFlow implementation of Mistral (#29708 ) * chore: initial commit * chore: adding imports and inits * chore: adding the causal and classification code * chore: adding names to the layers * chore: using single self attn layer * chore: built the model and layers * chore: start with testing * chore: docstring change, transpose fix * fix: rotary embedding * chore: adding cache implementation * remove unused torch * chore: fixing the indexing issue * make fix-copies * Use modeling_tf_utils.keras * make fixup * chore: fixing tests * chore: adding past key value logic * chore: adding multi label classfication test * fix: switching on the built parameters in the layers * fixing repo consistency * ruff formats * style changes * fix: tf and pt equivalence * removing returns from docstrings * fix docstrings * fix docstrings * removing todos * fix copies * fix docstring * fix docstring * chore: using easier rotate_half * adding integration tests * chore: addressing review related to rotary embedding layer * review changes * [run-slow] mistral * skip: test save load after resize token embedding * style --------- Co-authored-by: Matt <rocketknight1@gmail.com>	2024-05-23 17:48:49 +01:00
Yih-Dar	2a89673fe5	Update 4 `MptIntegrationTests` expected outputs (#30989 ) * fix * fix * fix * fix * fix * [run-slow] mpt --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-05-23 18:27:54 +02:00
Fanli Lin	21339a5213	[tests] add `torch.use_deterministic_algorithms` for XPU (#30774 ) * add xpu check * add marker * add documentation * update doc * fix ci * remove from global init * fix	2024-05-23 16:53:07 +01:00
Marc Sun	8366b57241	Fix accelerate failing tests (#30836 ) * Fix accelerate tests * fix clip * skip dbrx tests * fix GPTSan * fix M2M100Model * same fix as jamba * fix mt5 * Fix T5Model * Fix umt5 model * fix switch_transformers * fix whisper * fix gptsan again * fix siglip recent test * skip siglip tests * wrong place fixed	2024-05-23 17:18:58 +02:00
Poedator	6739e1d261	test_custom_4d_attention_mask skip with sliding window attn (#30833 )	2024-05-23 15:22:10 +02:00
Raushan Turganbay	d583f1317b	Quantized KV Cache (#30483 ) * clean-up * Update src/transformers/cache_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/cache_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/cache_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * fixup * Update tests/quantization/quanto_integration/test_quanto.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update src/transformers/generation/configuration_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * more suggestions * mapping if torch available * run tests & add 'support_quantized' flag * fix jamba test * revert, will be fixed by another PR * codestyle * HQQ and versatile cache classes * final update * typo * make tests happy --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>	2024-05-23 17:25:20 +05:00
Kamil Akesbi	eb1a77bbb0	Using assistant in AutomaticSpeechRecognitionPipeline with different encoder size (#30637 ) * fiw input to generate in pipeline * fixup * pass input_features to generate with assistant * error if model and assistant with different enc size * fix * apply review suggestions * use self.config.is_encoder_decoder * pass inputs to generate directly * add slow tests * Update src/transformers/generation/utils.py Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * Update tests/pipelines/test_pipelines_automatic_speech_recognition.py Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * Update tests/pipelines/test_pipelines_automatic_speech_recognition.py Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * Update tests/pipelines/test_pipelines_automatic_speech_recognition.py Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * Update tests/pipelines/test_pipelines_automatic_speech_recognition.py Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * Update tests/pipelines/test_pipelines_automatic_speech_recognition.py Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * apply review * Update src/transformers/generation/utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update tests/pipelines/test_pipelines_automatic_speech_recognition.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * apply code review * update attributes encoder_xyz to check * Update src/transformers/generation/utils.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/generation/utils.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/generation/utils.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * add slow test * solve conflicts --------- Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>	2024-05-23 09:59:38 +01:00
Pablo Montalvo	a25f7d3c12	Paligemma causal attention mask (#30967 ) * PaliGemma working causal attention * Formatting * Style * Docstrings + remove commented code * Update docstring for PaliGemma Config * PaliGemma - add separator ind to model/labels * Refactor + docstring paligemma processor method * Style * return token type ids when tokenizing labels * use token type ids when building causal mask * add token type ids to tester * remove separator from config * fix style * don't ignore separator * add processor documentation * simplify tokenization * fix causal mask * style * fix label propagation, revert suffix naming * fix style * fix labels tokenization * [run-slow]paligemma * add eos if suffixes are present * [run-slow]paligemma * [run-slow]paligemma * add misssing tokens to fast version * Apply suggestions from code review Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * fix style * [run-slow]paligemma --------- Co-authored-by: Peter Robicheaux <peter@roboflow.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2024-05-22 19:37:15 +02:00
Sanchit Gandhi	0948c827de	[Whisper] Strip prompt before finding common subsequence (#27836 )	2024-05-22 17:25:47 +01:00
Raushan Turganbay	b1065aa08a	Generation: get special tokens from model config (#30899 ) * fix * let's do this way? * codestyle * update * add tests	2024-05-22 18:15:41 +02:00
amyeroberts	dff54ad2d9	🚨 out_indices always a list (#30941 ) * out_indices always a list * Update src/transformers/utils/backbone_utils.py * Update src/transformers/utils/backbone_utils.py * Move type casting * nit	2024-05-22 15:23:04 +01:00
Pablo Montalvo	250ae9f746	Paligemma - fix slow tests, add bf16 and f16 slow tests (#30851 ) * fix slow tests, add bf16 and f16 slow tests * few fixes * [run-slow]paligemma * add gate decorator * [run-slow]paligemma * add missing gating * [run-slow]paligemma * [run-slow]paligemma	2024-05-22 16:20:07 +02:00
Jonatan Kłosko	1518508467	Avoid extra chunk in speech recognition (#29539 )	2024-05-22 14:07:51 +01:00
Marc Sun	5c186003b8	Fix low cpu mem usage tests (#30808 ) * Fix tests * fix udop failing test * remove skip * style	2024-05-22 14:09:01 +02:00
Arthur	673440d073	update ruff version (#30932 ) * update ruff version * fix research projects * Empty * Fix errors --------- Co-authored-by: Lysandre <lysandre@huggingface.co>	2024-05-22 06:40:15 +02:00
Matthew Beckers	3b09d3f05f	fix: center_crop occasionally outputs off-by-one dimension matrix (#30934 ) If required padding for a crop larger than input image is odd-numbered, the padding would be rounded down instead of rounded up, causing the output dimension to be one smaller than it should be.	2024-05-21 13:56:52 +01:00
Zach Mueller	daf281f44f	Enforce saving at end of training if saving option chosen (#30160 ) * Enforce saving at end of training * Fix test * Rework test * Fixup tests' * Update comment based on sourab feedback * Clean	2024-05-21 07:50:11 -04:00
Mohit Sharma	7a4792e6b3	CI: AMD MI300 tests fix (#30797 ) * add fix * update import * updated dicts and comments * remove prints * Update testing_utils.py	2024-05-21 12:46:07 +01:00
Younes Belkada	8871b26150	FEAT / Trainer: LOMO optimizer support (#30178 ) * add V1 - adalomo not working yet * add todo docs + refactor from comments * adjust LR * add docs * add more elaborated test * Apply suggestions from code review Co-authored-by: Zach Mueller <muellerzr@gmail.com> * fix * push * add accelerate check * fix DDP case * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * fix * init kwargs * safely add attribute * revert to enum logic * Update src/transformers/trainer.py --------- Co-authored-by: Zach Mueller <muellerzr@gmail.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-05-21 10:16:37 +02:00
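
Note on 3b09d3f05f (center_crop off-by-one): the commit message says that when a crop is larger than the input and the required padding is odd, rounding the leading pad down rather than up leaves the output one element short. The sketch below is a simplified one-dimensional model of that arithmetic, not the library's code; the function name cropped_length and the round_up flag are hypothetical and exist only for illustration.

```python
import math


def cropped_length(orig: int, crop: int, round_up: bool) -> int:
    """Length of one axis after a pad-then-slice center crop (illustrative model only)."""
    # Crop window in the coordinates of the unpadded axis.
    # Floor division makes `start` negative when crop > orig.
    start = (orig - crop) // 2
    end = start + crop

    # The axis is padded up to the crop size when the crop is larger.
    padded = max(orig, crop)
    diff = padded - orig

    # Leading pad: rounding up is the fixed behaviour, rounding down the buggy one.
    pad = math.ceil(diff / 2) if round_up else diff // 2

    # Shift the window into padded coordinates and clamp it to the padded axis.
    start += pad
    end += pad
    return min(padded, end) - max(0, start)


# Input of length 4, crop of length 7: the required padding (3) is odd.
buggy = cropped_length(4, 7, round_up=False)
fixed = cropped_length(4, 7, round_up=True)
print(buggy, fixed)
```

With an even padding amount (for example length 4 cropped to 6) both roundings agree, which is consistent with the bug being described as occasional.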

3756 Commits