transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-14 01:58:22 +06:00

Author	SHA1	Message	Date
Julien Chaumond	6316ce8d27	[doc] Always call it Agents for consistency (#25958 )	2023-09-05 12:27:20 +01:00
Yih-Dar	391f26459a	Use main in conversion script (#25973 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-09-05 13:04:49 +02:00
Kai	6f125aaa48	fix typo (#25981 ) rename doanloading to downloading	2023-09-05 11:13:06 +01:00
Susnato Dhar	52a46dc57b	Add `Pop2Piano` space demo. (#25975 ) Update pop2piano.md	2023-09-05 11:07:02 +01:00
Huazhong Ji	1cc3bc22fe	nn.Identity is not required to be compatible with PyTorch < 1.1.0 as the minimum PyTorch version we currently support is 1.10.0 (#25974 ) nn.Identity is not required to be compatible with PyTorch < 1.1.0 as the minimum PyTorch version we currently support is 1.10.0	2023-09-05 11:37:54 +02:00
Yih-Dar	fbbe1b8a40	Fix `test_load_img_url_timeout` (#25976 ) * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-09-05 11:34:28 +02:00
Yih-Dar	feec56959a	Fix Detr CI (#25972 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-09-05 11:19:56 +02:00
Susnato Dhar	404ff8fc17	Fix typo (#25966 ) * Update feature_extraction_clap.py * changed all lenght to length	2023-09-05 10:12:25 +02:00
Lysandre	d8e13b3e04	v4.34.dev.0	2023-09-04 15:12:11 -04:00
Younes Belkada	49b69fe0d4	[`Falcon`] Remove SDPA for falcon to support earlier versions of PyTorch (< 2.0) (#25947 ) * remove SDPA for falcon * revert previous behaviour and add warning * nit * Update src/transformers/models/falcon/modeling_falcon.py Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Update src/transformers/models/falcon/modeling_falcon.py --------- Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> Co-authored-by: Lysandre Debut <hi@lysand.re>	2023-09-04 14:34:04 -04:00
Lysandre Debut	22a69f1d7d	Put Falcon back (#25960 ) * Put Falcon back * Update src/transformers/models/auto/configuration_auto.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update test --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-09-04 14:17:09 -04:00
David Reguera	040c4613c2	Add type hints for tf models final batch (#25883 ) * Add missing type hints and consistency to `RegNet` models * Add missing type hints and consistency to `TFSamModel` * Add missing type hints to `TFSegformerDecodeHead` * Add missing type hints and consistency to `TransfoXL` family models * Add missing type hints and consistency to `TFWav2Vec2ForSequenceClassification` * Add type hints to `TFXLMModel` * Fix linter * Revert the type hints for `RegNet` to python 3.8 compliant * Remove the redundant np.ndarray type hint.	2023-09-04 18:16:10 +01:00
Yih-Dar	44d2c199f6	Fix smart check (#25955 ) * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-09-04 18:54:34 +02:00
Lysandre Debut	3a479672ea	Fix failing test (#25963 )	2023-09-04 12:53:50 -04:00
Matt	034bc5d26a	Add proper Falcon docs and conversion script (#25954 ) * Add proper Falcon docs and conversion script * Autodetect the decoder architecture instead of using an arg * Update docs now that we can autodetect * Fix doc error * Add doc to toctree * Quick doc update	2023-09-04 17:18:34 +01:00
Sanchit Gandhi	d750eff627	[VITS] Fix init test (#25945 ) * [VITS] Fix init test * add flaky decorator * style * max attempts Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * style --------- Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>	2023-09-04 17:09:26 +01:00
Nino Risteski	7cd01d4e38	Update README.md (#25922 ) fixed a typo	2023-09-04 16:11:00 +02:00
Omar Sanseviero	bfb1895e33	Import deepspeed utilities from integrations (#25919 ) Follow up from #25599	2023-09-04 14:03:48 +01:00
Sanchit Gandhi	eb984418e2	[VITS] Handle deprecated weight norm (#25946 )	2023-09-04 11:54:03 +01:00
Sanchit Gandhi	f435003e0c	[MMS] Fix pip install in docs (#25949 )	2023-09-04 11:53:41 +01:00
Nino Risteski	604a6c51ae	Update README.md (#25941 ) fixed a typo	2023-09-04 11:28:21 +01:00
Nino Risteski	d4407a3bd1	Update autoclass_tutorial.md (#25929 ) fixed typos	2023-09-04 11:16:49 +01:00
Nino Risteski	51e1e8120b	Update community.md (#25928 ) fixed a few typos	2023-09-04 11:16:34 +01:00
omahs	0f0e1a2c2b	Fix typos (#25936 ) * fix typo * fix typo * fix typo * fix typos * fix typos * fix typo * fix typo * fix typo * fix typos * fix typo * fix typo * fix typo * fix typos * fix typos	2023-09-04 11:15:12 +01:00
Yih-Dar	b1d475f6d2	Skip offload tests for `ViTDet` (#25913 ) * update * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-09-04 11:35:39 +02:00
ydshieh	ab8cba824e	CI: hotfix (skip VitsModelTest::test_initialization)	2023-09-04 09:06:11 +02:00
Nino Risteski	0afa5071bd	Update model_memory_anatomy.md (#25896 ) typo fixes	2023-09-01 12:27:01 -07:00
Arthur	a4dd53d88e	Update-llama-code (#25826 ) * some bug fixes * updates * Update code_llama.md Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com> * Add co author Co-authored-by: pcuenca <pedro@latenitesoft.com> * add a test * fixup * nits * some updates * fix-coies * adress comments * nits * nits * fix docsting * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * update * add int for https://huggingface.co/spaces/hf-accelerate/model-memory-usage --------- Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com> Co-authored-by: pcuenca <pedro@latenitesoft.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-09-01 20:40:40 +02:00
Sanchit Gandhi	3587769c08	[VITS] Only trigger tokenizer warning for uroman (#25915 )	2023-09-01 19:27:01 +01:00
Sanchit Gandhi	1fa2d89a9b	[MMS] Update docs with HF TTS implementation (#25907 ) * [MMS] Update docs with HF TTS implementation * Update docs/source/en/model_doc/mms.md Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * add uromanise to docs --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-09-01 16:50:59 +01:00
Sanchit Gandhi	b439129e74	[VITS] Add to TTA pipeline (#25906 ) * [VITS] Add to TTA pipeline * Update tests/pipelines/test_pipelines_text_to_audio.py Co-authored-by: Yoach Lacombe <52246514+ylacombe@users.noreply.github.com> * remove extra spaces --------- Co-authored-by: Yoach Lacombe <52246514+ylacombe@users.noreply.github.com>	2023-09-01 16:39:00 +01:00
Zach Mueller	be0e189bd3	Revert frozen training arguments (#25903 ) * Revert frozen training arguments * TODO	2023-09-01 11:24:12 -04:00
Omar Sanseviero	69c5b8f186	Remove broken docs for MusicGen (#25905 ) Update musicgen.md	2023-09-01 15:26:42 +01:00
Yih-Dar	16d6e3087c	Better error message for pipeline loading (#25912 ) * update * update * update * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-09-01 16:09:12 +02:00
Joao Gante	53e2fd785b	Falcon: Add RoPE scaling (#25878 )	2023-09-01 12:05:53 +01:00
pkumc	024acd271b	fix FSDP model resume optimizer & scheduler (#25852 ) * fix FSDP resume optimizer & scheduler * improve trainer code quality --------- Co-authored-by: machi04 <machi04@meituan.com>	2023-09-01 15:20:42 +05:30
Matthijs Hollemans	4ece3b9433	add VITS model (#24085 ) * add VITS model * let's vits * finish TextEncoder (mostly) * rename VITS to Vits * add StochasticDurationPredictor * ads flow model * add generator * correctly set vocab size * add tokenizer * remove processor & feature extractor * add PosteriorEncoder * add missing weights to SDP * also convert LJSpeech and VCTK checkpoints * add training stuff in forward * add placeholder tests for tokenizer * add placeholder tests for model * starting cleanup * let the great renaming begin! * use config * global_conditioning * more cleaning * renaming variables * more renaming * more renaming * it never ends * reticulating the splines * more renaming * HiFi-GAN * doc strings for main model * fixup * fix-copies * don't make it a PreTrainedModel * fixup * rename config options * remove training logic from forward pass * simplify relative position * use actual checkpoint * style * PR review fixes * more review changes * fixup * more unit tests * fixup * fix doc test * add integration test * improve tokenizer tests * add tokenizer integration test * fix tests on GPU (gave OOM) * conversion script can handle repos from hub * add conversion script for all MMS-TTS checkpoints * automatically create a README for the converted checkpoint * small changes to config * push README to hub * only show uroman note for checkpoints that need it * remove conversion script because code formatting breaks the readme * make WaveNet layers configurable * rename variables * simplifying the math * output attentions and hidden states * remove VitsFlip in flow model * also got rid of the other flip * fix tests * rename more variables * rename tokenizer, add phonemization * raise error when phonemizer missing * re-order config docstrings to match method * change config naming * remove redundant str -> list * fix copyright: vits authors -> kakao enterprise * (mean, log_variances) -> (prior_mean, prior_log_variances) * if return dict -> if not return dict * speed -> speaking rate * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * update fused tanh sigmoid * reduce dims in tester * audio -> output_values * audio -> output_values in tuple out * fix return type * fix return type * make _unconstrained_rational_quadratic_spline a function * all nn's to accept a config * add spectro to output * move {speaking rate, noise scale, noise scale duration} to config * path -> attn_path * idxs -> valid idxs -> padded idxs * output values -> waveform * use config for attention * make generation work * harden integration test * add spectrogram to dict output * tokenizer refactor * make style * remove 'fake' padding token * harden tokenizer tests * ron norm test * fprop / save tests deterministic * move uroman to tokenizer as much as possible * better logger message * fix vivit imports * add uroman integration test * make style * up * matthijs -> sanchit-gandhi * fix tokenizer test * make fix-copies * fix dict comprehension * fix config tests * fix model tests * make outputs consistent with reverse/not reverse * fix key concat * more model details * add author * return dict * speaker error * labels error * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/vits/convert_original_checkpoint.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * remove uromanize * add docstrings * add docstrings for tokenizer * upper-case skip messages * fix return dict * style * finish tests * update checkpoints * make style * remove doctest file * revert * fix docstring * fix tokenizer * remove uroman integration test * add sampling rate * fix docs / docstrings * style * add sr to model output * fix outputs * style / copies * fix docstring * fix copies * remove sr from model outputs * Update utils/documentation_tests.txt Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * add sr as allowed attr --------- Co-authored-by: sanchit-gandhi <sanchit@huggingface.co> Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-09-01 10:50:06 +01:00
Marc Sun	ef10dbce5c	remove torch_dtype override (#25894 ) * remove torch_dtype override * style * Update src/transformers/modeling_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-08-31 17:38:14 -04:00
Sylvain Gugger	0f08cd205a	Smarter check for `is_tensor` (#25871 ) * Smarter check for * Use protected functions * Do others too * Apply suggestions from code review Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Address review comments --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-08-31 13:14:18 -04:00
Yih-Dar	3fb1535b09	Update `setup.py` (#25893 ) update Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-08-31 18:54:01 +02:00
David Reguera	eaf5e98ec0	Add type hints for tf models batch 1 (#25853 ) * Add type hints to `TFBlipTextModel` * Add missing type hints to DPR family models * Add type hints to `TFLEDModel` * Add type hints to `TFLxmertForPreTraining` * Add missing type hints to `TFMarianMTModel` and `TFMarianModel` * Add missing type hints to `TFRagModel` & `TFRagTokenForGeneration` * Make type hints annotations consistent	2023-08-31 17:00:03 +01:00
Younes Belkada	9c5acca002	[`InstructBlip`] FINAL Fix instructblip test (#25887 ) fix instructblip test	2023-08-31 17:01:27 +02:00
raghavanone	2be8a9098e	Save image_processor while saving pipeline (ImageSegmentationPipeline) (#25884 ) * Save image_processor while saving pipeline (ImageSegmentationPipeline) * Fix black issues	2023-08-31 16:08:20 +02:00
Arthur	a39ebbf879	[`CodeLlama`] Fix CI (#25890 ) * Fix coellama * style	2023-08-31 16:06:56 +02:00
Arthur	3b39b90618	[`TokenizerFast`] `can_save_slow_tokenizer` as a property for when `vocab_file`'s folder was removed (#25626 ) * pad token should be None by default * fix tests * nits * check if isfile vocabfile * add warning if sp model folder was deleted * save SPM when missing folder for sloz * update the ` can_save_slow_tokenizer` to be a property * first batch * second batch * missing one	2023-08-31 14:17:26 +02:00
Vibhor Kumar	99fc3ac8ac	Modify efficient GPU training doc with now-available adamw_bnb_8bit optimizer (#25807 ) * Modify single-GPU efficient training doc with now-available adamw_bnb_8bit optimizer * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-08-31 10:55:10 +01:00
Sourab Mangrulkar	e95bcaeef0	fix ds z3 checkpointing when `stage3_gather_16bit_weights_on_model_save=False` (#25817 ) * fix ds z3 checkpointing when `stage3_gather_16bit_weights_on_model_save=False` * refactoring	2023-08-31 15:17:53 +05:30
qihqi	f8468b4fac	For xla tensors, use an alternative way to get a unique id (#25802 ) * For xla tensors, use an alternative way to get a unique id Because xla tensors don't have storage. * add is_torch_tpu_available check	2023-08-31 10:31:16 +01:00
NielsRogge	716bb2e391	[ViTDet] Fix doc tests (#25880 ) Fix docstrings	2023-08-30 22:49:03 +02:00
Yih-Dar	1c6f072db0	Reduce CI output (#25876 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-08-30 18:15:07 +02:00

... 22 23 24 25 26 ...

15053 Commits