transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-02 19:21:31 +06:00

Author	SHA1	Message	Date
Gema Parreño	52c4276e44	Fix link to documentation in Install from Source (#24336 ) Update __init__.py Fix link to documentation to install Transformers from source Probably the title changed at some point from 'Installing' to 'Install'	2023-06-19 17:12:55 +01:00
amyeroberts	7e71eb2ef7	Fix ImageGPT doctest (#24353 ) Fix doctest	2023-06-19 15:23:29 +01:00
Yih-Dar	a4de24f691	Make `AutoFormer` work with previous torch version (#24357 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-19 16:02:06 +02:00
Vineel Pratap	7761b1893a	Update MMS integration docs (#24311 ) * Update mms.mdx * Update mms.mdx * Update docs/source/en/model_doc/mms.mdx Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update mms.mdx * Update docs/source/en/model_doc/mms.mdx Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>	2023-06-19 14:49:01 +01:00
Yih-Dar	5fca839fef	Fix device issue in `SwitchTransformers` (#24352 ) * fix * Update src/transformers/models/switch_transformers/modeling_switch_transformers.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-06-19 15:06:05 +02:00
Matěj Kripner	3b5a56e595	Fix `KerasMetricCallback`: pass `generate_kwargs` even if `use_xla_generation` is False (#24333 ) * Fix `KerasMetricCallback`: always pass `generate_kwargs`. * Reformat code using Black.	2023-06-19 12:51:25 +01:00
Yih-Dar	0b259a3b7e	Clean up disk space during docker image build for `transformers-pytorch-gpu` (#24346 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-19 12:54:02 +02:00
Yih-Dar	691b60db90	byebye Hub connection timeout (#24350 ) byebye timeout Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-19 12:50:20 +02:00
Yih-Dar	17e3e7d686	pin `apex` to a specific commit (for DeepSpeed CI docker image) (#24351 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-19 12:48:53 +02:00
Sohyun Sim	3c124df579	🌐 [i18n-KO] Fixed `tutorial/preprocessing.mdx` (#24156 ) * fix: revise translations * fix: resolve suggestions Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com> --------- Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>	2023-06-19 11:43:57 +01:00
Xiaoyang Sun	881c0df952	error bug on saving distributed optim state when using data parallel (#24108 ) Update checkpoint_reshaping_and_interoperability.py	2023-06-19 16:04:21 +05:30
Teven	ee88ae5994	Adding ddp_broadcast_buffers argument to Trainer (#24326 ) adding ddp_broadcast_buffers argument	2023-06-16 15:14:03 -04:00
Matt	9138995025	Add test for proper TF input signatures (#24320 ) * Add test for proper input signatures * No more signature pruning * Test the dummy inputs are valid too * fine-tine -> fine-tune * Fix indent in test_dataset_conversion	2023-06-16 17:03:13 +01:00
amyeroberts	bdfd57d1d1	Fix ImageGPT doc example (#24317 ) * Fix ImageGPT doc example * Update src/transformers/models/imagegpt/image_processing_imagegpt.py * Fix types	2023-06-16 17:01:22 +01:00
Sylvain Gugger	096f2cf126	Tied weights load (#24310 ) * Use tied weight keys * More * Fix tied weight missing warning * Only give info on unexpected keys with different classes * Deal with empty archs * Fix tests * Refine test	2023-06-16 10:55:42 -04:00
Nicolas Patry	61ffdeba38	Fix ner average grouping with no groups (#24319 ) Fixes #https://github.com/huggingface/transformers/issues/24314	2023-06-16 16:43:19 +02:00
Matt	3403712958	Big TF test cleanup (#24282 ) * Fix one BLIP arg not being optional, remove misspelled arg * Remove the lxmert test overrides and just use the base test_saved_model_creation * saved_model_creation fixes and re-enabling tests across the board * Remove unnecessary skip * Stop caching sinusoidal embeddings in speech_to_text * Fix transfo_xl compilation * Fix transfo_xl compilation * Fix the conditionals in xglm * Set the save spec only when building * Clarify comment * Move comment correctly * Correct embeddings generation for speech2text * Mark RAG generation tests as @slow * Remove redundant else: * Add comment to clarify the save_spec line in build() * Fix size tests for XGLM at last! * make fixup * Remove one band_part operation * Mark test_keras_fit as @slow	2023-06-16 15:40:49 +01:00
Yih-Dar	896a58de15	Byebye pytorch 1.9 (#24080 ) byebye --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-16 16:38:23 +02:00
Matt	62d71f4083	Fix functional TF Whisper and modernize tests (#24301 ) * Revert whisper change and modify the test_compile_tf_model test * make fixup * Tweak test slightly * Add functional model saving to test * Ensure TF can infer shapes for data2vec * Add override for efficientformer * Mark test as slow	2023-06-16 14:43:43 +01:00
Arthur	ba3fb4b8d7	[`SwitchTransformers`] Fix return values (#24300 ) * clean history * remove other changes * fix * fix coipes	2023-06-16 15:40:33 +02:00
Sayed Qaiser Ali	0b7b4429c7	Update test versions on README.md (#24307 ) Update README.md Updated the tested versions	2023-06-15 18:01:11 +01:00
Yih-Dar	6134b9b4c7	Make `can_generate` as class method (#24299 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-15 18:31:38 +02:00
jprivera44	e45bc14350	Beam search type (#24288 ) * test check in * adding in type hint fix on beam search * fixed code quality issue	2023-06-15 16:48:02 +01:00
Belladore	1a113fcf65	Update tokenizer_summary.mdx (grammar) (#24286 )	2023-06-15 16:31:47 +01:00
hitchhicker	c3ca346b49	[Docs] Fix the paper URL for MMS model (#24302 ) Fix the paper URL for MMS model	2023-06-15 15:45:49 +01:00
Sanchit Gandhi	4124a09f8b	[EnCodec] Changes for 32kHz ckpt (#24296 ) * [EnCodec] Changes for 32kHz ckpt * Update src/transformers/models/encodec/convert_encodec_checkpoint_to_pytorch.py * Update src/transformers/models/encodec/convert_encodec_checkpoint_to_pytorch.py	2023-06-15 14:36:19 +01:00
Sourab Mangrulkar	01b55779d3	deepspeed init during eval fix (#24298 ) * deepspeed init during eval fix * commit suggestions Co-Authored-By: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-06-15 18:47:09 +05:30
Cooper	6a081c512a	Update README_zh-hans.md (#24181 ) * Update README_zh-hans.md update document link * Update README_zh-hans.md Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-06-15 13:50:40 +01:00
Patrick von Platen	604a21b1e6	[Docs] Improve docs for MMS loading of other languages (#24292 ) * Improve docs * Apply suggestions from code review * upload readme * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-06-15 14:29:32 +02:00
amyeroberts	e6122c3f40	Fix image segmentation tool bug (#23897 ) * Image segmentation tool bug * Remove resizing in the tests	2023-06-15 08:09:31 -04:00
jiangmingyan	6cd34d451c	[fix] bug in BatchEncoding.__getitem__ (#24293 ) Co-authored-by: luchen <luchen@luchendeMBP.lan>	2023-06-15 12:33:37 +01:00
Sylvain Gugger	372f50030b	Split common test from core tests (#24284 )	2023-06-15 07:30:24 -04:00
JayL0321	a611ac9b3f	remove unused is_decoder parameter in DetrAttention (#24226 ) * issue#24161 remove unused is_decoder parameter in DetrAttention * #24161 fix check_repository_consistency fail	2023-06-15 11:39:32 +01:00
Fei Wang	33196b459c	Fix LLaMa beam search when using parallelize (#24224 ) * Fix LLaMa beam search when using parallelize same issue as T5 #11717 * fix code format in modeling_llama.py * fix format of _reorder_cache in modeling_llama.py	2023-06-15 11:28:48 +01:00
Yih-Dar	7504be35ab	Fix `check_config_attributes`: check all configuration classes (#24231 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-15 11:39:20 +02:00
Stephan Tulkens	6793f0cfe0	Fix bug in slow tokenizer conversion, make it a lot faster (#24266 ) * Make conversion faster, fix None vs 0 bug * Add second sort for consistency * Update src/transformers/convert_slow_tokenizer.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-06-15 09:41:57 +01:00
Patrick von Platen	1609a436ec	Add MMS CTC Fine-Tuning (#24281 ) * Add mms ctc fine tuning * make style * More fixes that are needed * make fix-copies * make draft for README * add new file * move to new file * make style * make style * add quick test * make style * make style	2023-06-15 01:10:27 +02:00
Matthijs Hollemans	0c3fdccf2f	[WIP] add EnCodec model (#23655 ) * boilerplate stuff * messing around with the feature extractor * fix feature extractor * unit tests for feature extractor * rename speech to audio * quick-and-dirty import of Meta's code * import weights (sort of) * cleaning up * more cleaning up * move encoder/decoder args into config * cleanup model * rename EnCodec -> Encodec * RVQ parameters in config * add slow test * add lstm init and test_init * Add save & load * finish EncodecModel * remove decoder_input_values as they are ont used anywhere (not removed from doc yet) * fix test feature extraction model name * Add better slow test * Fix tests * some fixup and cleaning * Improve further * cleaning up quantizer * fix up conversion script * test don't pass, _encode_fram does not work * update tests with output per encode and decode * more cleanup * rename _codebook * remove old config cruft * ratios & hop_length * use ModuleList instead of Sequential * clean up resnet block * update types * update tests * fixup * quick cleanup * fix padding * more styl,ing * add patrick feedback * fix copies * fixup * fix lstm * fix shape issues * fixup * rename conv layers * fixup * fix decoding * small conv refactoring * remove norm_params * simplify conv layers * rename conv layers * stuff * Clean up * Add padding logic use padding mask small conv refactoring remove norm_params simplify conv layers rename conv layers stuff add batched test update Clean up merge and update for padding fix padding fixup * clean up more * clean up more * More clean ups * cleanup convolutions * typo * fix typos * fixup * build PR doc? 
* start refactoring docstring * fix don't pad when no strid and chunk * update docstring * update docstring * nits * update going to lunch * update config and model * fix broken testse (becaue of the config changes) * fix scale computation * fixu[ * only return dict if speciefied or if config returns it * remove todos * update defaults in config * update conversion script * fix doctest * more docstring + fixup * nits on batched_tests * more nits * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * update basxed on review * fix update * updaet tests * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fixup * add overlap and chunl_length_s * cleanup feature extraction * teste edge cases truncation and padding * correct processor values * update config encodec, nits * fix tests * fixup * fix 24Hz test * elle tests are green * fix fixup * Apply suggestions from code review * revert readme changes * fixup * add example * use facebook checkpoints * fix typo * no pipeline tests * use slef.pad everywhere we can * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * update based on review * update * update mdx * fix bug and tests * fixup * fix doctest * remove comment * more nits * add more coverage for `test_truncation_and_padding` * fixup * add last test * fix text * nits * Update tests/models/encodec/test_modeling_encodec.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * take care of the last comments * typo * fix test * nits * fixup * Update src/transformers/models/encodec/feature_extraction_encodec.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: arthur.zucker@gmail.com <arthur.zucker@gmail.com> Co-authored-by: Arthur 
<48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-06-14 18:57:23 +02:00
Sylvain Gugger	26a2ec56d7	Clean up old Accelerate checks (#24279 ) * Clean up old Accelerate checks * Put back imports	2023-06-14 12:44:09 -04:00
Wissam Antoun	860d11ff7c	Fix Debertav2 embed_proj (#24205 ) * MLM prediction head output size from embed_size Take the output size of the dense projection layer from embedding_size instead of hidden_size since there could be a projection of the input embedding into hidden_size if they are different * project TFDebertaV2 mlm output to embedding size embedding size can be different that hidden_size, so the final layer needs to project back to embedding size. like in ELECTRA or DeBERTaV3 style pertaining. This should solve an error that occurs when loading models like "almanach/camemberta-base-generator". * fix the same issue for reshaping after projection * fix layernorm size * add self.embedding_size to scope * fix embed_proj scope name * apply the same changes to TF Deberta * add the changes to deberta * added self.embedding_size instead of config.embedding_size * added the same change to debertav2 * added coppied from deberta to deberta2 model * config.embedding_size fix * black * fix deberta config name	2023-06-14 17:24:53 +01:00
Yih-Dar	a04ebc8b33	`Pix2StructImageProcessor` requires `torch>=1.11.0` (#24270 ) * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-14 17:05:40 +02:00
Sylvain Gugger	8978b696d7	Update check of core deps (#24277 )	2023-06-14 10:06:31 -04:00
Patrick von Platen	c4fec38bc7	Adapt Wav2Vec2 conversion for MMS lang identification (#24234 ) * Add conversion for mms lid * make style	2023-06-14 16:02:36 +02:00
Joao Gante	4626df5077	TF: CTRL with native embedding layers (#23456 )	2023-06-14 14:39:02 +01:00
Yih-Dar	eac8dede83	Skip some `TQAPipelineTests` tests in past CI (#24267 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-14 14:25:24 +02:00
ByronHsu	91b62f5a78	QA doc: import torch before it is used (#24228 ) * import torch before it is used * style Signed-off-by: byhsu <byhsu@linkedin.com> --------- Signed-off-by: byhsu <byhsu@linkedin.com> Co-authored-by: byhsu <byhsu@linkedin.com>	2023-06-14 11:23:55 +01:00
TAE YOUNGDON	6ab045d6fe	Fix URL in comment for contrastive loss function (#24271 ) * Update language_modeling.py in "class TextDatasetForNextSentencePrediction(Dataset)", double considering "self.tokenizer.num_special_tokens_to_add(pair=True)" so, i remove self.block_size, and add parameter for "def create_examples_from_document". like "class LineByLineWithSOPTextDataset" do * Update language_modeling.py * Fix URL in comment for contrastive loss function	2023-06-14 11:08:31 +01:00
Sourab Mangrulkar	b89fcccd44	update FSDP save and load logic (#24249 ) * update fsdp save and load logic * fix * see if this resolves the failing tests	2023-06-14 00:49:15 +05:30
Sourab Mangrulkar	e0603d894d	docs wrt using accelerate launcher with trainer (#24250 ) * update docs * missing part * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * address comments * address Zach's comment --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-06-14 00:31:06 +05:30
Yih-Dar	233113149b	Skip `GPT-J` fx tests for torch < 1.12 (#24256 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-13 20:33:26 +02:00

13362 Commits