transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Cyril Vallez	e2ac16b28a	Large modular logic refactoring (#34487 ) * rework converter * Update modular_model_converter.py * Update modular_model_converter.py * Update modular_model_converter.py * Update modular_model_converter.py * cleaning * cleaning * finalize imports * imports * Update modular_model_converter.py * Better renaming to avoid visiting same file multiple times * start converting files * style * address most comments * style * remove unused stuff in get_needed_imports * style * move class dependency functions outside class * Move main functions outside class * style * Update modular_model_converter.py * rename func * add augmented dependencies * Update modular_model_converter.py * Add types_to_file_type + tweak annotation handling * Allow assignment dependency mapping + fix regex * style + update modular examples * fix modular_roberta example (wrong redefinition of __init__) * slightly correct order in which dependencies will appear * style * review comments * Performance + better handling of dependencies when they are imported * style * Add advanced new classes capabilities * style * add forgotten check * Update modeling_llava_next_video.py * Add prority list ordering in check_conversion as well * Update check_modular_conversion.py * Update configuration_gemma.py	2024-11-01 10:13:51 +01:00
Pablo Montalvo	86701f2b6f	🔴 🔴 fix `query_pre_attn_scalar` different of `num_heads` in default gemma2 config (#34540 ) * fix query_pre_attn_scalar different of num_heads in default config * propagate modular changes * fix copies * fix modular copies * fix copies? * correct copies fix	2024-11-01 09:06:17 +01:00
Raushan Turganbay	4cc0813e28	BLIP: enable generation tests (#34174 ) * blip2 tests * instructblips * copies * fix slow tests * fix * uncomment this * clean up after rebase * should be model main input * fix overwritten tests * oops len should be multiple of frame number * style * fix some tests	2024-11-01 08:54:48 +01:00
Raushan Turganbay	6beb3f1691	Blip: get/set input embeddings correctly (#34152 ) * set-get embeds * add tests * fix tests * remove * return dict True * fix tests * why did i remove this * enabel torchscript tests	2024-11-01 08:39:39 +01:00
Ahmed Almaghz	b53e44e847	[i18n-ar] Translated file : `docs/source/ar/multilingual.md` into Arabic (#33048 ) * Add docs/source/ar/multilingual.md to Add_docs_source_ar_multilingual.md * Update docs/source/ar/multilingual.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update _toctree.yml * Update _toctree.yml * Add Translated files to branch for merg * Update _toctree.yml * Update _toctree.yml * Update custom_models.md * Update chat_templating.md * Update docs/source/ar/create_a_model.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update create_a_model.md * Update gguf.md * Update gguf.md * Update gguf.md * Update gguf.md --------- Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-10-31 16:10:09 -07:00
jiqing-feng	2801d7bcf6	update doc (#34478 ) * update doc * Update docs/source/en/perf_train_cpu.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * delete closing tip --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-10-31 15:59:23 -07:00
NielsRogge	df8640cedb	[CLIPSeg] Make interpolate_pos_encoding default to True (#34419 ) * Remove interpolate_pos_encoding * Make fixup * Make interpolate_pos_encoding default to True * Reuse existing interpolation * Add integration test	2024-10-31 22:15:04 +01:00
Yoni Gozlan	203e27059b	Add image text to text pipeline (#34170 ) * Standardize image-text-to-text-models-output add post_process_image_text_to_text to chameleon and cleanup Fix legacy kwarg behavior and deprecation warning add post_process_image_text_to_text to qwen2_vl and llava_onevision Add post_process_image_text_to_text to idefics3, mllama, pixtral processor * nit var name post_process_image_text_to_text udop * nit fix deprecation warnings * Add image-text-to-text pipeline * add support for image url in chat template for pipeline * Reformat to be fully compatible with chat templates * Add tests chat template * Fix imports and tests * Add pipeline tag * change logic handling of single prompt ans multiple images * add pipeline mapping to models * fix batched inference * fix tests * Add manual batching for preprocessing * Fix outputs with nested images * Add support for all common processing kwargs * Add default padding when multiple text inputs (batch size>1) * nit change version deprecation warning * Add support for text only inference * add chat_template warnings * Add pipeline tests and add copied from post process function * Fix batched pipeline tests * nit * Fix pipeline tests blip2 * remove unnecessary max_new_tokens * revert processing kosmos2 and remove unnecessary max_new_tokens * fix pipeline tests idefics * Force try loading processor if pipeline supports it * revert load_processor change * hardcode loading only processor * remove unnecessary try except * skip imagetexttotext tests for kosmos2 as tiny model causes problems * Make code clearer * Address review comments * remove preprocessing logic from pipeline * fix fuyu * add BC resize fuyu * Move post_process_image_text_to_text to ProcessorMixin * add guard in post_process * fix zero shot object detection pipeline * add support for generator input in pipeline * nit * change default image-text-to-text model to llava onevision * fix owlv2 size dict * Change legacy deprecation warning to only show when True	2024-10-31 15:48:11 -04:00
fpgaminer	c443d8d536	Bug Fix for issue #34294 (#34295 ) Update SiglipVisionEmbeddings.forward to cast input to correct dtype before embedding it.	2024-10-31 18:51:15 +01:00
Yih-Dar	114dd812dd	make `test_eager_matches_sdpa_inference` less flaky (#34512 ) * try * try * try * try * try * try * update * update * update * update * update * update * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-10-31 18:34:00 +01:00
Luc Georges	294c170ff9	feat: add benchmarks pg indexes (#34536 ) * feat: add benchmarks pg indexes * refactor: remove debug `df -h`	2024-10-31 17:41:06 +01:00
Phillip Kuznetsov	b5919e12f7	fix(DPT,Depth-Anything) Address expected_slice errors inside inference tests (#34518 ) * fix(DPT,Depth-Anything) Address expected_slice errors inside inference tests Signed-off-by: Phillip Kuznetsov <philkuz@gimletlabs.ai> * [run_slow] dpt, depth_anything --------- Signed-off-by: Phillip Kuznetsov <philkuz@gimletlabs.ai>	2024-10-31 16:47:58 +01:00
Joao Gante	4ca004eac6	Qwen2VL: skip base `input_ids`-`inputs_embeds` equivalence check (#34535 ) it has complex inputs_embeds computation	2024-10-31 15:42:13 +00:00
Yih-Dar	ab98f0b0a1	avoid calling `gc.collect` and `cuda.empty_cache` (#34514 ) * update * update * update * update * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-10-31 16:36:13 +01:00
kibitzing	dca93ca076	Fix step shifting when accumulate gradient (#33673 ) * replace total_batched_samples with step while counting grad accum step * remove unused variable * simplify condition for update step * fix format by ruff * simplify update step condition using accelerator.sync_gradients * simplify update condition using do_sync_step * remove print for test --------- Co-authored-by: Zach Mueller <muellerzr@gmail.com>	2024-10-31 09:53:23 -04:00
jp	1b86772de5	Fix: img size mismatch caused by incorrect unpadding in LLaVA-Next (#34522 ) Fix: unpadding img mismatch	2024-10-31 14:32:45 +01:00
jiqing-feng	f38531619d	enable QA bf16 pipeline (#34483 ) * enable QA bf16 pipeline * add tests	2024-10-31 12:55:53 +00:00
anshumangahlot	405b562698	UPDATE Documentation for #TRANSLATING.md Documentation into Multiple Languages.(Changes made) (#34226 ) * Update TRANSLATING.md * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update TRANSLATING.md --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-10-30 12:37:39 -07:00
Yoni Gozlan	48872fd6ae	Add Image Processor Fast RT-DETR (#34354 ) * add fast image processor rtdetr * add gpu/cpu test and fix docstring * remove prints * add to doc * nit docstring * avoid iterating over images/annotations several times * change torch typing * Add image processor fast documentation	2024-10-30 13:49:47 -04:00
fzyzcjy	9f06fb0505	Fix super tiny extra space typo (#34440 ) Update training_args.py	2024-10-30 16:55:16 +01:00
Vladislav Bronzov	5251fe6271	Add GGUF for Mamba (#34200 ) * add mamba architecture for gguf * add logic for weights conversion, some fixes and refactoring * add lm_head layers, unit test refactoring * more fixes for tests * remove lm_head creation * remove unused comments	2024-10-30 16:52:17 +01:00
Yih-Dar	eab6c491d4	Use torch 2.5 in scheduled CI (#34465 ) * torch 2.5 * try --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-10-30 14:54:10 +01:00
Pablo Montalvo	241d79026f	fix pixtral processor (#34486 ) * fix pixtral processor * test out full length batches + remove undue ValueError * fix up processing * fix tests * fix * last fixup * style * [run-slow] pixtral * [run-slow] pixtral * fix config key * skip torchscript tests * [run-slow] pixtral * add missing key * [run-slow] pixtral * fix docs * [run-slow] pixtral * fix wrong url for integration test * [run-slow] pixtral * pixtralVisionModel does not have a lm head * [run-slow] pixtral	2024-10-30 14:17:20 +01:00
Joao Gante	8a734ea2c3	Tests: move `generate` tests to the right mixin and delete redundant tests (#34464 ) * tmp commit * tmp commit * cull overwrites of deleted tests * typo * more specific docstring * make fixup * parameterize at the top? * correction * more deletions :D * tmp commit * for VLMs too * fix _check_outputs * test nit * make fixup * fix another flaky * test_generate_from_inputs_embeds -- handle missing attention mask	2024-10-30 10:59:08 +00:00
Raushan Turganbay	913330ca9f	VLMs: fix number of image tokens (#34332 ) * fix * fix tests * add tests * style * style * fix qwen after rebase * fix video llava	2024-10-30 10:21:37 +01:00
Raushan Turganbay	0f764a5af7	Mllama: update docs (#34334 ) * update docs * be more explicit * use avaialble methods	2024-10-30 10:11:50 +01:00
Pethő Gergely	25a9fc584a	Fix format mistake in string repr of tokenizer objects (#34493 ) * fix repr string format for tokenizer objects The repr of tokenizer tokens looks confusing and just stupid, like this: `Tokenizer(...), added_tokens_decoder={1: ..., 2: ...}`. The dict that is the value of the added_tokens_decoder attribute is outside of the parentheses of the tokenizer object, whereas all other attributes are inside the parentheses like they should be. This commit fixes this bug. * cos: add newline before closing parenthesis of repr string	2024-10-30 10:03:41 +01:00
Guang Yang	cd277618d4	Roberta is ExecuTorch compatible (#34425 ) * Roberta is ExecuTorch compatible * [run_slow] roberta --------- Co-authored-by: Guang Yang <guangyang@fb.com>	2024-10-30 08:36:45 +00:00
Matt	9bee9ff5db	Un-deprecate timeout arg in pipelines (#34382 ) * Un-deprecate timeout * Put "timeout" on the allowed list * make fixup	2024-10-29 18:45:14 +00:00
Yoni Gozlan	e4449bb790	fix incorrect warning (#34416 )	2024-10-29 14:08:42 -04:00
Aleksey Lobanov	f55595b177	Fix performance in get_imports regexp (#34298 ) * fix: Fix performance in get_imports regexp * Minimize get_imports content regexp	2024-10-29 17:29:24 +00:00
dependabot[bot]	4e2e8809ff	Bump werkzeug from 3.0.3 to 3.0.6 in /examples/research_projects/decision_transformer (#34420 ) Bump werkzeug in /examples/research_projects/decision_transformer Bumps [werkzeug](https://github.com/pallets/werkzeug) from 3.0.3 to 3.0.6. - [Release notes](https://github.com/pallets/werkzeug/releases) - [Changelog](https://github.com/pallets/werkzeug/blob/main/CHANGES.rst) - [Commits](https://github.com/pallets/werkzeug/compare/3.0.3...3.0.6) --- updated-dependencies: - dependency-name: werkzeug dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-10-29 16:42:40 +00:00
Apoorv Khandelwal	e9ad460494	Adding `optimizer_cls_and_kwargs` to `Trainer.__init__` (#34358 ) * Adding `optimizer_cls_and_kwargs` to `Trainer.__init__` * formatting * make fix-copies docstring * added more docs for optimizer_cls_and_kwargs * add docs for Trainer(optimizer_cls_and_kwargs) * reverting anchor names	2024-10-29 16:23:16 +01:00
Guang Yang	f339042b0b	Albert is ExecuTorch compatible (#34476 ) Co-authored-by: Guang Yang <guangyang@fb.com>	2024-10-29 16:22:13 +01:00
Guang Yang	34620e8f0a	MobileBERT is ExecuTorch compatible (#34473 ) Co-authored-by: Guang Yang <guangyang@fb.com>	2024-10-29 16:14:31 +01:00
Abhijit Deo	56c45d5757	Bug fix for drop path decay rate in swin transformer (#34291 ) * potential bug fix for drop path * variable name change * forgot to rename the variables * back to original * modify dpr properly * check_copies auto fix * corresponsing swin2 changes * auto fix * linting * default value for drop_path_rate as 0.0 * Update src/transformers/models/glm/modeling_glm.py * maskformer fix * ruff format * changes made to tf code as well * lint --------- Co-authored-by: abhijit deo <167164474+deo-abhijit@users.noreply.github.com>	2024-10-29 16:09:18 +01:00
Shijie	0ab0a42651	fix-qwen2vl-no-position_ids (#33487 )	2024-10-29 15:27:34 +01:00
Doohae Jung	8755dd26b7	manual `head_dim` for `mixtral` model (#34281 )	2024-10-29 14:31:36 +01:00
Guang Yang	5392f12e16	Bert is ExecuTorch compatible (#34424 ) Co-authored-by: Guang Yang <guangyang@fb.com>	2024-10-29 14:30:02 +01:00
Marc Sun	004530aa05	Fix regression loading dtype (#34409 ) * fix regression * add test for torchao * expected output * better fix	2024-10-29 11:41:04 +01:00
hlky	9e3d704e23	Fixes for Modular Converter on Windows (#34266 ) * Separator in regex * Standardize separator for relative path in auto generated message * open() encoding * Replace `\` on `os.path.abspath` --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2024-10-29 11:40:41 +01:00
Martin Gubri	626c610a4d	Fix perplexity computation in perplexity.md (#34387 ) fix average NLL in perplexity.md	2024-10-29 11:10:10 +01:00
Yih-Dar	439334c8fb	Simplify running tests in a subprocess (#34213 ) * check * check * check * check * add docstring --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-10-29 10:48:57 +01:00
StevenBucaille	a1835195d1	🚨🚨🚨 [SuperPoint] Fix keypoint coordinate output and add post processing (#33200 ) * feat: Added int conversion and unwrapping * test: added tests for post_process_keypoint_detection of SuperPointImageProcessor * docs: changed docs to include post_process_keypoint_detection method and switched from opencv to matplotlib * test: changed test to not depend on SuperPointModel forward * test: added missing require_torch decorator * docs: changed pyplot parameters for the keypoints to be more visible in the example * tests: changed import torch location to make test_flax and test_tf * Revert "tests: changed import torch location to make test_flax and test_tf" This reverts commit `39b32a2f69`. * tests: fixed import * chore: applied suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * tests: fixed import * tests: fixed import (bis) * tests: fixed import (ter) * feat: added choice of type for target_size and changed tests accordingly * docs: updated code snippet to reflect the addition of target size type choice in post process method * tests: fixed imports (...) * tests: fixed imports (...) * style: formatting file * docs: fixed typo from image[0] to image.size[0] * docs: added output image and fixed some tests * Update docs/source/en/model_doc/superpoint.md Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com> * fix: included SuperPointKeypointDescriptionOutput in TYPE_CHECKING if statement and changed tests results to reflect changes to SuperPoint from absolute keypoints coordinates to relative * docs: changed SuperPoint's docs to print output instead of just accessing * style: applied make style * docs: added missing output type and precision in docstring of post_process_keypoint_detection * perf: deleted loop to perform keypoint conversion in one statement * fix: moved keypoint conversion at the end of model forward * docs: changed SuperPointInterestPointDecoder to SuperPointKeypointDecoder class name and added relative (x, y) coordinates information to its method * fix: changed type hint * refactor: removed unnecessary brackets * revert: SuperPointKeypointDecoder to SuperPointInterestPointDecoder * Update docs/source/en/model_doc/superpoint.md Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com> --------- Co-authored-by: Steven Bucaille <steven.bucaille@buawei.com> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>	2024-10-29 09:36:03 +00:00
kang sheng	655bec2da7	use a tinymodel to test generation config which aviod timeout (#34482 ) * use a tinymodel to test generation config which aviod timeout * remove tailing whitespace	2024-10-29 09:39:06 +01:00
Raushan Turganbay	63ca6d9771	Fix CI (#34458 ) * fix * fix mistral	2024-10-29 08:26:04 +01:00
Raushan Turganbay	808d6c50f8	Generation: fix test (#34369 ) * fix test * fix copies	2024-10-29 07:57:10 +01:00
Raushan Turganbay	fe76b60370	LLaVA: latency issues (#34460 ) * fix llavas * code style * green ci	2024-10-29 07:54:51 +01:00
Alexandros Benetatos	a769ed45e1	Add `post_process_depth_estimation` for GLPN (#34413 ) * add depth postprocessing for GLPN * remove previous temp fix for glpn tests * Style changes for GLPN's `post_process_depth_estimation` Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * additional style fix --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2024-10-28 19:44:20 +01:00
Luc Georges	6cc4a67b3d	feat: run benchmarks on A100 (#34287 )	2024-10-28 19:33:17 +01:00

1 2 3 4 5 ...

17318 Commits