transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-02 19:21:31 +06:00

Author	SHA1	Message	Date
Marc Sun	ef10dbce5c	remove torch_dtype override (#25894 ) * remove torch_dtype override * style * Update src/transformers/modeling_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-08-31 17:38:14 -04:00
Sylvain Gugger	0f08cd205a	Smarter check for `is_tensor` (#25871 ) * Smarter check for `is_tensor` * Use protected functions * Do others too * Apply suggestions from code review Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Address review comments --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-08-31 13:14:18 -04:00
Yih-Dar	3fb1535b09	Update `setup.py` (#25893 ) update Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-08-31 18:54:01 +02:00
David Reguera	eaf5e98ec0	Add type hints for tf models batch 1 (#25853 ) * Add type hints to `TFBlipTextModel` * Add missing type hints to DPR family models * Add type hints to `TFLEDModel` * Add type hints to `TFLxmertForPreTraining` * Add missing type hints to `TFMarianMTModel` and `TFMarianModel` * Add missing type hints to `TFRagModel` & `TFRagTokenForGeneration` * Make type hints annotations consistent	2023-08-31 17:00:03 +01:00
Younes Belkada	9c5acca002	[`InstructBlip`] FINAL Fix instructblip test (#25887 ) fix instructblip test	2023-08-31 17:01:27 +02:00
raghavanone	2be8a9098e	Save image_processor while saving pipeline (ImageSegmentationPipeline) (#25884 ) * Save image_processor while saving pipeline (ImageSegmentationPipeline) * Fix black issues	2023-08-31 16:08:20 +02:00
Arthur	a39ebbf879	[`CodeLlama`] Fix CI (#25890 ) * Fix coellama * style	2023-08-31 16:06:56 +02:00
Arthur	3b39b90618	[`TokenizerFast`] `can_save_slow_tokenizer` as a property for when `vocab_file`'s folder was removed (#25626 ) * pad token should be None by default * fix tests * nits * check if isfile vocabfile * add warning if sp model folder was deleted * save SPM when missing folder for sloz * update the ` can_save_slow_tokenizer` to be a property * first batch * second batch * missing one	2023-08-31 14:17:26 +02:00
Vibhor Kumar	99fc3ac8ac	Modify efficient GPU training doc with now-available adamw_bnb_8bit optimizer (#25807 ) * Modify single-GPU efficient training doc with now-available adamw_bnb_8bit optimizer * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-08-31 10:55:10 +01:00
Sourab Mangrulkar	e95bcaeef0	fix ds z3 checkpointing when `stage3_gather_16bit_weights_on_model_save=False` (#25817 ) * fix ds z3 checkpointing when `stage3_gather_16bit_weights_on_model_save=False` * refactoring	2023-08-31 15:17:53 +05:30
qihqi	f8468b4fac	For xla tensors, use an alternative way to get a unique id (#25802 ) * For xla tensors, use an alternative way to get a unique id Because xla tensors don't have storage. * add is_torch_tpu_available check	2023-08-31 10:31:16 +01:00
NielsRogge	716bb2e391	[ViTDet] Fix doc tests (#25880 ) Fix docstrings	2023-08-30 22:49:03 +02:00
Yih-Dar	1c6f072db0	Reduce CI output (#25876 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-08-30 18:15:07 +02:00
Yih-Dar	9219d1427b	pin pandas==2.0.3 (#25875 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-08-30 18:10:01 +02:00
Joao Gante	459bc6738c	Docs: fix example failing doctest in `generation_strategies.md` (#25874 )	2023-08-30 16:23:44 +01:00
Marc Sun	72298178bc	fix max_memory for bnb (#25842 )	2023-08-30 11:00:36 -04:00
Yih-Dar	f73c20970c	Fix imports (#25869 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-08-30 16:11:54 +02:00
Lysandre Debut	ed290b0837	Remote tools are turned off (#25867 )	2023-08-30 09:40:39 -04:00
Juan Pizarro	09dc99517f	Add Blip2 model in VQA pipeline (#25532 ) * Add Blip2 model in VQA pipeline * use require_torch_gpu for test_large_model_pt_blip2 * use can_generate in vqa pipeline * test Blip2ForConditionalGeneration using float16 * remove custom can_generate from Blip2ForConditionalGeneration	2023-08-30 14:16:16 +01:00
Yih-Dar	62399d6f35	Add flax installation in daily doctest workflow (#25860 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-08-30 15:13:50 +02:00
Aman Gupta Karmani	52574026b6	minor typo fix in PeftAdapterMixin docs (#25829 ) fix minor documentation typo	2023-08-30 11:56:05 +01:00
Nino Risteski	1bf2f36daf	Update README.md (#25832 ) deleted unnecessary comma in the Adding a new model section.	2023-08-30 10:52:41 +01:00
Joao Gante	07998ef399	Generate: models with custom `generate()` return `True` in `can_generate()` (#25838 )	2023-08-29 20:10:46 +01:00
Nino Risteski	8c75cfdaee	Update README.md (#25834 ) _toctree.yml file. broken link, now fixed.	2023-08-29 20:02:57 +01:00
Haylee Schäfer	dbc16f4404	Support loading base64 images in pipelines (#25633 ) * support loading base64 images * add test * mention in docs * remove the logging * sort imports * update error message * Update tests/utils/test_image_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * restructure to catch base64 exception * doesn't like the newline * download files * format * optimize imports * guess it needs a space? * support loading base64 images * add test * remove the logging * sort imports * restructure to catch base64 exception * doesn't like the newline * download files * optimize imports * guess it needs a space? --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-08-29 19:24:24 +01:00
amyeroberts	ce2d4bc6a1	MaskFormer,Mask2former - reduce memory load (#25741 ) Allocate result array ahead of time	2023-08-29 18:49:15 +01:00
Sanchit Gandhi	0daeeb40a1	[AutoTokenizer] Add data2vec to mapping (#25835 )	2023-08-29 18:26:41 +01:00
Susnato Dhar	0e59c93983	update remaining `Pop2Piano` checkpoints (#25827 ) update checkpoints	2023-08-29 18:00:40 +01:00
Arthur	245dcc49ef	🤦update warning to If you want to use the new behaviour, set `legacy=… (#25833 ) 🤦update warning to If you want to use the new behaviour, set `legacy=False`. instead of True	2023-08-29 18:01:43 +02:00
Sohyun Sim	aade754b27	🌐 [i18n-KO] Translated `community.md` to Korean (#25674 ) * docs: ko: community.md * feat: deepl draft * fix: manual edits * fix: resolve suggestions Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com> Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> --------- Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com> Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>	2023-08-29 11:47:24 -04:00
heuristicwave	d97fd871e5	🌐 [i18n-KO] Translated `add_new_pipeline.md` to Korean (#25498 ) * dos: ko: add_new_pipeline.mdx * feat: chatgpt draft * fix: manual edits * docs: ko: add_new_pipeline Update _toctree * Update docs/source/ko/add_new_pipeline.md Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> * Update docs/source/ko/add_new_pipeline.md Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> * Update docs/source/ko/add_new_pipeline.md Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> * Update docs/source/ko/add_new_pipeline.md Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> * Update docs/source/ko/add_new_pipeline.md Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> * Update docs/source/ko/add_new_pipeline.md Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> * Update docs/source/ko/add_new_pipeline.md Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> * Update docs/source/ko/add_new_pipeline.md Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> * Update docs/source/ko/add_new_pipeline.md Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> * Update docs/source/ko/add_new_pipeline.md Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> * Update docs/source/ko/add_new_pipeline.md Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> --------- Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>	2023-08-29 11:38:44 -04:00
Joao Gante	a35f889acc	Tests: detect lines removed from "utils/not_doctested.txt" and doctest ALL generation files (#25763 )	2023-08-29 16:15:05 +01:00
Chau Nguyen	483861d52d	Error with checking args.eval_accumulation_steps to gather tensors (#25819 ) * Update trainer.py (error with checking steps in args.eval_accumulation_steps to gather tensors) While the deprecated code has the correct check (line 3772): "if args.eval_accumulation_steps is not None and (step + 1) % args.eval_accumulation_steps == 0:" The current code does not (line 3196): "if args.eval_accumulation_steps is not None and self.accelerator.sync_gradients:" We need to check "(step + 1) % args.eval_accumulation_steps == 0". Hence, the line 3196 should be modified to: "if args.eval_accumulation_steps is not None and (step + 1) % args.eval_accumulation_steps == 0 and self.accelerator.sync_gradients:" * Fix error with checking args.eval_accumulation_steps to gather tensors	2023-08-29 15:06:41 +01:00
MinJae Kang	33aa0af70c	🌐 [i18n-KO] `model_memory_anatomy.md` to Korean (#25755 ) * docs: ko-model_memory_anatomy.md * feat: chatgpt draft * feat: manual edits * feat: change document title * feat: manual edits * fix: resolve suggestion Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> * fix: resolve suggestion Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> * fix: resolve suggestion Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> * fix: resolve suggestion Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> * fix: resolve suggestion Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> * fix: resolve suggestion Co-authored-by: heuristicwave <31366038+heuristicwave@users.noreply.github.com> * fix: resolve suggestion Co-authored-by: heuristicwave <31366038+heuristicwave@users.noreply.github.com> * fix: resolve suggestion Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> * fix: resolve suggestion Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> * fix: resolve suggestion Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> * fix: resolve suggestion Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> * fix: resolve suggestion Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> * fix: resolve suggestion Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> * fix: resolve suggestion Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> * fix: resolve suggestion --------- Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> Co-authored-by: heuristicwave <31366038+heuristicwave@users.noreply.github.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>	2023-08-29 09:48:51 -04:00
SeongWooChoi	173fa7da9c	🌐 [i18n-KO] Translated peft.md to Korean (#25706 ) * docs: ko: peft.mdx * feat: chatgpt draft * fix: manual edits * fix: resolve suggestions Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: heuristicwave <31366038+heuristicwave@users.noreply.github.com> * fix: resolve suggestions Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> --------- Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: heuristicwave <31366038+heuristicwave@users.noreply.github.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>	2023-08-29 09:10:00 -04:00
Dongkeun Yoon	2ee60b757e	fix warning trigger for embed_positions when loading xglm (#25798 ) * fix warning triggering for xglm.embed_positions * Make TF variable a tf.constant to match (and fix some spelling) --------- Co-authored-by: Matt <rocketknight1@gmail.com>	2023-08-29 14:09:07 +01:00
Arthur	5b5ee235f3	[`LlamaTokenizer`] `tokenize` nits. (#25793 ) * return when length is zero * Add tests Co-authored-by: Avnish Narayan <38871737avnishn@users.noreply.github.com> * Co-authored-by: avnishn <38871737+avnishn@users.noreply.github.com> * codeLlama doc should not be on Main * update test --------- Co-authored-by: Avnish Narayan <38871737avnishn@users.noreply.github.com>	2023-08-29 15:08:14 +02:00
Omar Sanseviero	9525515cd4	Minor wording changes for Code Llama (#25815 ) * Update code_llama.md * Update code_llama.md	2023-08-29 15:02:57 +02:00
zspo	3dd030d264	fix register (#25779 )	2023-08-29 14:11:48 +02:00
Younes Belkada	dc0c102954	[`Docs`] More clarifications on BT + FA (#25823 )	2023-08-29 13:52:25 +02:00
Sourab Mangrulkar	c9bae84eb5	Resolving Attribute error when using the FSDP ram efficient feature (#25820 ) fix bug	2023-08-29 17:02:19 +05:30
NielsRogge	77713d11f6	[DINOv2] Add backbone class (#25520 ) * First draft * More improvements * Fix all tests * More improvements * Add backbone test * Improve docstring * Address comments * Rename attribute * Remove expected output * Update src/transformers/models/dinov2/modeling_dinov2.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Fix style --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-08-29 11:05:27 +01:00
NielsRogge	4c21da5e34	Add ViTDet (#25524 ) * First draft * Fix READMEs * Update return_dict * Add more tests * Fix docstrings * Address comments * Address more comments * Address more comments * Address more comments, fix test * Fix test	2023-08-29 10:03:52 +01:00
Lorenzo Battistela	99c3d44906	fixing name position_embeddings to object_queries (#24652 ) * fixing name position_embeddings to object_queries * [fix] renaming variable and docstring do object queries * [fix] comment position_embedding to object queries * [feat] changes from make-fix-copies to keep consistency * Revert "[feat] changes from make-fix-copies to keep consistency" This reverts commit `56e3e9ede1`. * [tests] fix wrong expected score * [fix] wrong assignment causing wrong tensor shapes * [fix] fixing position_embeddings to object queries to keep consistency (make fix copies) * [fix] make fix copies, renaming position_embeddings to object_queries * [fix] positional_embeddingss to object queries, fixes from make fix copies * [fix] comments frmo make fix copies * [fix] adding args validation to keep version support * [fix] adding args validation to keep version support -conditional detr * [fix] adding args validation to keep version support - maskformer * [style] make fixup style fixes * [feat] adding args checking * [feat] fixcopies and args checking * make fixup * make fixup --------- Co-authored-by: Lorenzobattistela <lorenzobattistela@gmail.com>	2023-08-29 09:09:45 +01:00
Aman Gupta Karmani	39c37fe45c	Fix incorrect Boolean value in deepspeed example (#25788 )	2023-08-29 09:22:37 +02:00
Arup De	738ecd17d8	Arde/fsdp activation checkpointing (#25771 ) * add FSDP config option to enable activation-checkpointing * update docs * add checks and remove redundant code * fix formatting error	2023-08-29 12:52:14 +05:30
Stas Bekman	50573c648a	[idefics] fix vision's `hidden_act` (#25787 ) [idefics] fix vision's hidden_act	2023-08-28 07:37:37 -07:00
David Reguera	886b6be081	Add type hints for several pytorch models (batch-4) (#25749 ) * Add type hints for MGP STR model * Add missing type hints for plbart model * Add type hints for Pix2struct model * Add missing type hints to Rag model and tweak the docstring * Add missing type hints to Sam model * Add missing type hints to Swin2sr model * Fix a type hint for Pix2StructTextModel Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Fix typo on Rag model docstring Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Fix linter --------- Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>	2023-08-28 14:31:33 +01:00
David Reguera	ed915cff97	Add type hints for pytorch models (final batch) (#25750 ) * Add type hints for table_transformer * Add type hints to Timesformer model * Add type hints to Timm Backbone model * Add type hints to TVLT family models * Add type hints to Vivit family models * Use the typing instance instead of the python builtin. * Fix the `replace_return_docstrings` decorator for Vivit model Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> --------- Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>	2023-08-28 14:31:22 +01:00
David Reguera	cb91ec67b5	Add type hints for several pytorch models (batch-2) (#25557 ) * Add missing type hint to cpmant * Add type hints to decision_transformer model * Add type hints to deformable_detr models * Add type hints to detr models * Add type hints to deta models * Add type hints to dpr models * Update attention mask type hint Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Update remaining attention masks type hints * Update docstrings' type hints related to attention masks --------- Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>	2023-08-28 13:58:23 +01:00
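
The condition discussed in commit 483861d52d (#25819) comes down to a modulo check on the evaluation step counter. A minimal sketch of that check in plain Python, outside the Trainer: the function name `gather_steps` and the `sync_gradients` argument are hypothetical stand-ins for illustration, not the library's API.

```python
from typing import List, Optional


def gather_steps(
    num_steps: int,
    eval_accumulation_steps: Optional[int],
    sync_gradients: bool = True,  # hypothetical stand-in for accelerator.sync_gradients
) -> List[int]:
    """Return the 0-based step indices at which accumulated tensors would be gathered."""
    if eval_accumulation_steps is None:
        # No periodic gathering was requested.
        return []
    return [
        step
        for step in range(num_steps)
        # Condition quoted in the commit message: gather every N-th step.
        if (step + 1) % eval_accumulation_steps == 0 and sync_gradients
    ]


if __name__ == "__main__":
    print(gather_steps(10, 4))     # [3, 7]
    print(gather_steps(10, None))  # []
```

Without the modulo term, the check would pass on every step where the flag is true, so tensors would be gathered far more often than `eval_accumulation_steps` asks for.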

13916 Commits