ydshieh
ab8cba824e
CI: hotfix (skip VitsModelTest::test_initialization)
2023-09-04 09:06:11 +02:00
Nino Risteski
0afa5071bd
Update model_memory_anatomy.md ( #25896 )
...
typo fixes
2023-09-01 12:27:01 -07:00
Arthur
a4dd53d88e
Update-llama-code ( #25826 )
...
* some bug fixes
* updates
* Update code_llama.md
Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com>
* Add co author
Co-authored-by: pcuenca <pedro@latenitesoft.com>
* add a test
* fixup
* nits
* some updates
* fix-copies
* address comments
* nits
* nits
* fix docstring
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* update
* add int for https://huggingface.co/spaces/hf-accelerate/model-memory-usage
---------
Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com>
Co-authored-by: pcuenca <pedro@latenitesoft.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-09-01 20:40:40 +02:00
Sanchit Gandhi
3587769c08
[VITS] Only trigger tokenizer warning for uroman ( #25915 )
2023-09-01 19:27:01 +01:00
Sanchit Gandhi
1fa2d89a9b
[MMS] Update docs with HF TTS implementation ( #25907 )
...
* [MMS] Update docs with HF TTS implementation
* Update docs/source/en/model_doc/mms.md
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* add uromanise to docs
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2023-09-01 16:50:59 +01:00
Sanchit Gandhi
b439129e74
[VITS] Add to TTA pipeline ( #25906 )
...
* [VITS] Add to TTA pipeline
* Update tests/pipelines/test_pipelines_text_to_audio.py
Co-authored-by: Yoach Lacombe <52246514+ylacombe@users.noreply.github.com>
* remove extra spaces
---------
Co-authored-by: Yoach Lacombe <52246514+ylacombe@users.noreply.github.com>
2023-09-01 16:39:00 +01:00
Zach Mueller
be0e189bd3
Revert frozen training arguments ( #25903 )
...
* Revert frozen training arguments
* TODO
2023-09-01 11:24:12 -04:00
Omar Sanseviero
69c5b8f186
Remove broken docs for MusicGen ( #25905 )
...
Update musicgen.md
2023-09-01 15:26:42 +01:00
Yih-Dar
16d6e3087c
Better error message for pipeline loading ( #25912 )
...
* update
* update
* update
* update
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-09-01 16:09:12 +02:00
Joao Gante
53e2fd785b
Falcon: Add RoPE scaling ( #25878 )
2023-09-01 12:05:53 +01:00
pkumc
024acd271b
fix FSDP model resume optimizer & scheduler ( #25852 )
...
* fix FSDP resume optimizer & scheduler
* improve trainer code quality
---------
Co-authored-by: machi04 <machi04@meituan.com>
2023-09-01 15:20:42 +05:30
Matthijs Hollemans
4ece3b9433
add VITS model ( #24085 )
...
* add VITS model
* let's vits
* finish TextEncoder (mostly)
* rename VITS to Vits
* add StochasticDurationPredictor
* add flow model
* add generator
* correctly set vocab size
* add tokenizer
* remove processor & feature extractor
* add PosteriorEncoder
* add missing weights to SDP
* also convert LJSpeech and VCTK checkpoints
* add training stuff in forward
* add placeholder tests for tokenizer
* add placeholder tests for model
* starting cleanup
* let the great renaming begin!
* use config
* global_conditioning
* more cleaning
* renaming variables
* more renaming
* more renaming
* it never ends
* reticulating the splines
* more renaming
* HiFi-GAN
* doc strings for main model
* fixup
* fix-copies
* don't make it a PreTrainedModel
* fixup
* rename config options
* remove training logic from forward pass
* simplify relative position
* use actual checkpoint
* style
* PR review fixes
* more review changes
* fixup
* more unit tests
* fixup
* fix doc test
* add integration test
* improve tokenizer tests
* add tokenizer integration test
* fix tests on GPU (gave OOM)
* conversion script can handle repos from hub
* add conversion script for all MMS-TTS checkpoints
* automatically create a README for the converted checkpoint
* small changes to config
* push README to hub
* only show uroman note for checkpoints that need it
* remove conversion script because code formatting breaks the readme
* make WaveNet layers configurable
* rename variables
* simplifying the math
* output attentions and hidden states
* remove VitsFlip in flow model
* also got rid of the other flip
* fix tests
* rename more variables
* rename tokenizer, add phonemization
* raise error when phonemizer missing
* re-order config docstrings to match method
* change config naming
* remove redundant str -> list
* fix copyright: vits authors -> kakao enterprise
* (mean, log_variances) -> (prior_mean, prior_log_variances)
* if return dict -> if not return dict
* speed -> speaking rate
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* update fused tanh sigmoid
* reduce dims in tester
* audio -> output_values
* audio -> output_values in tuple out
* fix return type
* fix return type
* make _unconstrained_rational_quadratic_spline a function
* all nn's to accept a config
* add spectro to output
* move {speaking rate, noise scale, noise scale duration} to config
* path -> attn_path
* idxs -> valid idxs -> padded idxs
* output values -> waveform
* use config for attention
* make generation work
* harden integration test
* add spectrogram to dict output
* tokenizer refactor
* make style
* remove 'fake' padding token
* harden tokenizer tests
* ron norm test
* fprop / save tests deterministic
* move uroman to tokenizer as much as possible
* better logger message
* fix vivit imports
* add uroman integration test
* make style
* up
* matthijs -> sanchit-gandhi
* fix tokenizer test
* make fix-copies
* fix dict comprehension
* fix config tests
* fix model tests
* make outputs consistent with reverse/not reverse
* fix key concat
* more model details
* add author
* return dict
* speaker error
* labels error
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update src/transformers/models/vits/convert_original_checkpoint.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* remove uromanize
* add docstrings
* add docstrings for tokenizer
* upper-case skip messages
* fix return dict
* style
* finish tests
* update checkpoints
* make style
* remove doctest file
* revert
* fix docstring
* fix tokenizer
* remove uroman integration test
* add sampling rate
* fix docs / docstrings
* style
* add sr to model output
* fix outputs
* style / copies
* fix docstring
* fix copies
* remove sr from model outputs
* Update utils/documentation_tests.txt
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* add sr as allowed attr
---------
Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-09-01 10:50:06 +01:00
Marc Sun
ef10dbce5c
remove torch_dtype override ( #25894 )
...
* remove torch_dtype override
* style
* Update src/transformers/modeling_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2023-08-31 17:38:14 -04:00
Sylvain Gugger
0f08cd205a
Smarter check for `is_tensor` ( #25871 )
...
* Smarter check for `is_tensor`
* Use protected functions
* Do others too
* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Address review comments
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2023-08-31 13:14:18 -04:00
Yih-Dar
3fb1535b09
Update `setup.py` ( #25893 )
...
update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-08-31 18:54:01 +02:00
David Reguera
eaf5e98ec0
Add type hints for tf models batch 1 ( #25853 )
...
* Add type hints to `TFBlipTextModel`
* Add missing type hints to DPR family models
* Add type hints to `TFLEDModel`
* Add type hints to `TFLxmertForPreTraining`
* Add missing type hints to `TFMarianMTModel` and `TFMarianModel`
* Add missing type hints to `TFRagModel` & `TFRagTokenForGeneration`
* Make type hints annotations consistent
2023-08-31 17:00:03 +01:00
Younes Belkada
9c5acca002
[`InstructBlip`] FINAL Fix instructblip test ( #25887 )
...
fix instructblip test
2023-08-31 17:01:27 +02:00
raghavanone
2be8a9098e
Save image_processor while saving pipeline (ImageSegmentationPipeline) ( #25884 )
...
* Save image_processor while saving pipeline (ImageSegmentationPipeline)
* Fix black issues
2023-08-31 16:08:20 +02:00
Arthur
a39ebbf879
[`CodeLlama`] Fix CI ( #25890 )
...
* Fix codellama
* style
2023-08-31 16:06:56 +02:00
Arthur
3b39b90618
[`TokenizerFast`] `can_save_slow_tokenizer` as a property for when `vocab_file`'s folder was removed ( #25626 )
...
* pad token should be None by default
* fix tests
* nits
* check if isfile vocabfile
* add warning if sp model folder was deleted
* save SPM when missing folder for slow
* update the `can_save_slow_tokenizer` to be a property
* first batch
* second batch
* missing one
2023-08-31 14:17:26 +02:00
Vibhor Kumar
99fc3ac8ac
Modify efficient GPU training doc with now-available adamw_bnb_8bit optimizer ( #25807 )
...
* Modify single-GPU efficient training doc with now-available adamw_bnb_8bit optimizer
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2023-08-31 10:55:10 +01:00
Sourab Mangrulkar
e95bcaeef0
fix ds z3 checkpointing when `stage3_gather_16bit_weights_on_model_save=False` ( #25817 )
...
* fix ds z3 checkpointing when `stage3_gather_16bit_weights_on_model_save=False`
* refactoring
2023-08-31 15:17:53 +05:30
qihqi
f8468b4fac
For xla tensors, use an alternative way to get a unique id ( #25802 )
...
* For xla tensors, use an alternative way to get a unique id
Because xla tensors don't have storage.
* add is_torch_tpu_available check
2023-08-31 10:31:16 +01:00
NielsRogge
716bb2e391
[ViTDet] Fix doc tests ( #25880 )
...
Fix docstrings
2023-08-30 22:49:03 +02:00
Yih-Dar
1c6f072db0
Reduce CI output ( #25876 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-08-30 18:15:07 +02:00
Yih-Dar
9219d1427b
pin pandas==2.0.3 ( #25875 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-08-30 18:10:01 +02:00
Joao Gante
459bc6738c
Docs: fix example failing doctest in `generation_strategies.md` ( #25874 )
2023-08-30 16:23:44 +01:00
Marc Sun
72298178bc
fix max_memory for bnb ( #25842 )
2023-08-30 11:00:36 -04:00
Yih-Dar
f73c20970c
Fix imports ( #25869 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-08-30 16:11:54 +02:00
Lysandre Debut
ed290b0837
Remote tools are turned off ( #25867 )
2023-08-30 09:40:39 -04:00
Juan Pizarro
09dc99517f
Add Blip2 model in VQA pipeline ( #25532 )
...
* Add Blip2 model in VQA pipeline
* use require_torch_gpu for test_large_model_pt_blip2
* use can_generate in vqa pipeline
* test Blip2ForConditionalGeneration using float16
* remove custom can_generate from Blip2ForConditionalGeneration
2023-08-30 14:16:16 +01:00
Yih-Dar
62399d6f35
Add flax installation in daily doctest workflow ( #25860 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-08-30 15:13:50 +02:00
Aman Gupta Karmani
52574026b6
minor typo fix in PeftAdapterMixin docs ( #25829 )
...
fix minor documentation typo
2023-08-30 11:56:05 +01:00
Nino Risteski
1bf2f36daf
Update README.md ( #25832 )
...
deleted unnecessary comma in the Adding a new model section.
2023-08-30 10:52:41 +01:00
Joao Gante
07998ef399
Generate: models with custom `generate()` return `True` in `can_generate()` ( #25838 )
2023-08-29 20:10:46 +01:00
Nino Risteski
8c75cfdaee
Update README.md ( #25834 )
...
_toctree.yml file. broken link, now fixed.
2023-08-29 20:02:57 +01:00
Haylee Schäfer
dbc16f4404
Support loading base64 images in pipelines ( #25633 )
...
* support loading base64 images
* add test
* mention in docs
* remove the logging
* sort imports
* update error message
* Update tests/utils/test_image_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* restructure to catch base64 exception
* doesn't like the newline
* download files
* format
* optimize imports
* guess it needs a space?
* support loading base64 images
* add test
* remove the logging
* sort imports
* restructure to catch base64 exception
* doesn't like the newline
* download files
* optimize imports
* guess it needs a space?
---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-08-29 19:24:24 +01:00
amyeroberts
ce2d4bc6a1
MaskFormer,Mask2former - reduce memory load ( #25741 )
...
Allocate result array ahead of time
2023-08-29 18:49:15 +01:00
Sanchit Gandhi
0daeeb40a1
[AutoTokenizer] Add data2vec to mapping ( #25835 )
2023-08-29 18:26:41 +01:00
Susnato Dhar
0e59c93983
update remaining `Pop2Piano` checkpoints ( #25827 )
...
update checkpoints
2023-08-29 18:00:40 +01:00
Arthur
245dcc49ef
🤦 update warning to If you want to use the new behaviour, set `legacy=… ( #25833 )
...
🤦 update warning to If you want to use the new behaviour, set `legacy=False`. instead of True
2023-08-29 18:01:43 +02:00
Sohyun Sim
aade754b27
🌐 [i18n-KO] Translated `community.md` to Korean ( #25674 )
...
* docs: ko: community.md
* feat: deepl draft
* fix: manual edits
* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
---------
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
2023-08-29 11:47:24 -04:00
heuristicwave
d97fd871e5
🌐 [i18n-KO] Translated `add_new_pipeline.md` to Korean ( #25498 )
...
* docs: ko: add_new_pipeline.mdx
* feat: chatgpt draft
* fix: manual edits
* docs: ko: add_new_pipeline
Update _toctree
* Update docs/source/ko/add_new_pipeline.md
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* Update docs/source/ko/add_new_pipeline.md
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* Update docs/source/ko/add_new_pipeline.md
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* Update docs/source/ko/add_new_pipeline.md
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
* Update docs/source/ko/add_new_pipeline.md
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
* Update docs/source/ko/add_new_pipeline.md
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
* Update docs/source/ko/add_new_pipeline.md
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* Update docs/source/ko/add_new_pipeline.md
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* Update docs/source/ko/add_new_pipeline.md
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
* Update docs/source/ko/add_new_pipeline.md
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
* Update docs/source/ko/add_new_pipeline.md
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
2023-08-29 11:38:44 -04:00
Joao Gante
a35f889acc
Tests: detect lines removed from "utils/not_doctested.txt" and doctest ALL generation files ( #25763 )
2023-08-29 16:15:05 +01:00
Chau Nguyen
483861d52d
Error with checking args.eval_accumulation_steps to gather tensors ( #25819 )
...
* Update trainer.py (error with checking steps in args.eval_accumulation_steps to gather tensors)
While the deprecated code has the correct check (line 3772):
"if args.eval_accumulation_steps is not None and (step + 1) % args.eval_accumulation_steps == 0:"
The current code does not (line 3196):
"if args.eval_accumulation_steps is not None and self.accelerator.sync_gradients:"
We need to check "(step + 1) % args.eval_accumulation_steps == 0". Hence, the line 3196 should be modified to:
"if args.eval_accumulation_steps is not None and (step + 1) % args.eval_accumulation_steps == 0 and self.accelerator.sync_gradients:"
* Fix error with checking args.eval_accumulation_steps to gather tensors
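The condition described in this commit message can be sketched in isolation. The snippet below is only an illustration of the modulo check, not the actual `trainer.py` code: `should_gather` is a hypothetical helper, and `sync_gradients` is passed in as a plain boolean standing in for `self.accelerator.sync_gradients`.

```python
from typing import Optional


def should_gather(
    step: int,
    eval_accumulation_steps: Optional[int],
    sync_gradients: bool = True,
) -> bool:
    """Hypothetical helper: decide whether to gather tensors after `step` (0-indexed).

    Mirrors the condition quoted in the commit message: gather only when
    eval_accumulation_steps is set, (step + 1) is a multiple of it, and
    gradients are being synchronized.
    """
    return (
        eval_accumulation_steps is not None
        and (step + 1) % eval_accumulation_steps == 0
        and sync_gradients
    )


# With eval_accumulation_steps=3, gathering happens after every third step.
gather_steps = [s for s in range(8) if should_gather(s, 3)]
print(gather_steps)  # [2, 5]
```

Without the modulo term, the check would pass on every step where `sync_gradients` is true, which is the behaviour the commit message reports as incorrect.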
2023-08-29 15:06:41 +01:00
MinJae Kang
33aa0af70c
🌐 [i18n-KO] `model_memory_anatomy.md` to Korean ( #25755 )
...
* docs: ko-model_memory_anatomy.md
* feat: chatgpt draft
* feat: manual edits
* feat: change document title
* feat: manual edits
* fix: resolve suggestion
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
* fix: resolve suggestion
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
* fix: resolve suggestion
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
* fix: resolve suggestion
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
* fix: resolve suggestion
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
* fix: resolve suggestion
Co-authored-by: heuristicwave <31366038+heuristicwave@users.noreply.github.com>
* fix: resolve suggestion
Co-authored-by: heuristicwave <31366038+heuristicwave@users.noreply.github.com>
* fix: resolve suggestion
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
* fix: resolve suggestion
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
* fix: resolve suggestion
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
* fix: resolve suggestion
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
* fix: resolve suggestion
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
* fix: resolve suggestion
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
* fix: resolve suggestion
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
* fix: resolve suggestion
---------
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
Co-authored-by: heuristicwave <31366038+heuristicwave@users.noreply.github.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
2023-08-29 09:48:51 -04:00
SeongWooChoi
173fa7da9c
🌐 [i18n-KO] Translated peft.md to Korean ( #25706 )
...
* docs: ko: peft.mdx
* feat: chatgpt draft
* fix: manual edits
* fix: resolve suggestions
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: heuristicwave <31366038+heuristicwave@users.noreply.github.com>
* fix: resolve suggestions
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: heuristicwave <31366038+heuristicwave@users.noreply.github.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
2023-08-29 09:10:00 -04:00
Dongkeun Yoon
2ee60b757e
fix warning trigger for embed_positions when loading xglm ( #25798 )
...
* fix warning triggering for xglm.embed_positions
* Make TF variable a tf.constant to match (and fix some spelling)
---------
Co-authored-by: Matt <rocketknight1@gmail.com>
2023-08-29 14:09:07 +01:00
Arthur
5b5ee235f3
[`LlamaTokenizer`] `tokenize` nits. ( #25793 )
...
* return when length is zero
* Add tests
Co-authored-by: Avnish Narayan <38871737avnishn@users.noreply.github.com>
* Co-authored-by: avnishn <38871737+avnishn@users.noreply.github.com>
* codeLlama doc should not be on Main
* update test
---------
Co-authored-by: Avnish Narayan <38871737avnishn@users.noreply.github.com>
2023-08-29 15:08:14 +02:00
Omar Sanseviero
9525515cd4
Minor wording changes for Code Llama ( #25815 )
...
* Update code_llama.md
* Update code_llama.md
2023-08-29 15:02:57 +02:00