transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 10:12:23 +06:00

Author	SHA1	Message	Date
Yih-Dar	4603fe9b1f	use `accelerate@main` in CI (#22859 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-19 14:58:53 +02:00
Elabonga Atuo	337225ec1c	feat(model parallelism): move labels to the same device as logits for M2M100 (#22850 ) moved logits for m2m_100	2023-04-19 08:54:27 -04:00
Liu Chenyang	6bd8ae2640	move preprocess_logits_for_metrics before _nested_gather in trainer.e… (#22603 ) * move preprocess_logits_for_metrics before _nested_gather in trainer.evaluation_loop * fix * Update src/transformers/trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fix * fix --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-04-19 08:53:47 -04:00
Matthijs Hollemans	c582e8aad0	fix SpeechT5 doc comments (#22854 ) fix doc comments	2023-04-19 14:10:40 +02:00
Youssef Adarrab	84a6570e7b	Make ClipSeg compatible with model parallelism (#22844 )	2023-04-18 19:31:59 -04:00
Zachary Mueller	5bb4ec6233	Raise err if minimum Accelerate version isn't available (#22841 ) * Add warning about accelerate * Version block Accelerate * Include parse * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Check partial state * Update param --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-04-18 14:25:02 -04:00
Sylvain Gugger	5f09219400	Fix from_pretrained when model is instantiated on the meta device (#22837 )	2023-04-18 13:54:18 -04:00
Sylvain Gugger	5f9b825c89	Use code on the Hub from another repo (#22814 ) * initial work * Add other classes * Refactor code * Move warning and fix dynamic pipeline * Issue warning when necessary * Add test * Do not skip auto tests * Fix failing tests * Refactor and address review comments * Address review comments	2023-04-18 13:46:11 -04:00
Zachary Mueller	aec10d162f	Update accelerate version + warning check fix (#22833 )	2023-04-18 12:51:32 -04:00
Joao Gante	78cda46f17	Generate: Add assisted generation (#22211 ) * working mvp * remove breakpoint * fix commit * standardize outputs * tmp commit * tests almost ready * tmp commit * skip a few models * Add streaming; Docs and examples * document limitations * PR commits * Amy PR comments	2023-04-18 17:36:56 +01:00
Yih-Dar	90247d3e01	Fix `test_eos_token_id_int_and_list_top_k_top_sampling` (#22826 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-18 16:04:51 +02:00
Yih-Dar	1ebc1dee92	Fix Past CI not running against the latest `main` (#22823 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-18 15:41:41 +02:00
Gabriel Yang	42288269c3	🌐 [i18n-KO] Fix anchor links for docs `auto_tutorial`, `training` (#22796 ) docs: ko: fix anchor links for docs (auto_tutorial, training) Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by: Na Yeon Han <nayeon2.han@gmail.com> Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>	2023-04-18 09:11:30 -04:00
Matthijs Hollemans	ac2bc50a10	TTS fine-tuning for SpeechT5 (#21824 ) * wrong argument name * append eos_token_id * all tokenizers need mask and ctc_blank tokens * remove reduction factor from feature extractor * add proper TTS loss * did shifting the wrong way around * mask out padded portions * remove logits again (don't really need it) * fix unit tests * fixup * pad also returns the decoder attention mask, since that's useful to have * clean up feature extractor logic * pad can handle TTS task too * remove stop_labels from loss calculation * simplify logic * fixup * do -100 masking properly * small STFT optimization (calculate mel filterbanks only once) * replace torchaudio fbanks with audio_utils * remove torchaudio dependency * simplify & speed up the STFT * don't serialize window and mel filters * output cross attentions when generating speech * add guided attention loss * fix failing test * Update src/transformers/models/speecht5/feature_extraction_speecht5.py Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * Update src/transformers/models/speecht5/modeling_speecht5.py Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * change type annotation of attention_mask to LongTensor * extract loss into class * remove unused frame_signal_scale argument * use config object in loss class * fix type annotations in doc comments * change optional to just bool * implement missing tokenizer method * add deprecation warning * Update src/transformers/models/speecht5/feature_extraction_speecht5.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/speecht5/feature_extraction_speecht5.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * add deprecation warning for stop_labels --------- Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-04-18 10:12:30 +01:00
Sylvain Gugger	dacd34568d	Mark auto models as important (#22815 ) * Mark auto models as important * Annoying file with bad line endings	2023-04-17 15:33:01 -04:00
Zachary Mueller	03462875cc	Introduce `PartialState` as the device handler in the `Trainer` (#22752 ) * Use accelerate for device management * Add accelerate to setup Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-04-17 15:09:45 -04:00
Sylvain Gugger	50caa20628	Revert "Use code on the Hub from another repo" (#22813 ) Revert "Use code on the Hub from another repo (#22698)" This reverts commit `ea7b0a539a`.	2023-04-17 14:22:13 -04:00
Sylvain Gugger	e13d6ef7dc	Simplify update metadata job (#22811 ) * Simplify update metadata job * Match more branch names * Install all what is necessary * Install all what is necessary * Forgot the dev * Install less stuff * This syntax?	2023-04-17 13:54:20 -04:00
Zachary Mueller	cd3e0211a6	Remove accelerate from tf test reqs (#22777 ) Remove accelerate from tf	2023-04-17 12:31:21 -04:00
Kunhao ZHENG	f8c43c9425	Fix squeeze into torch 1.x compatible form in llama model (#22808 ) fix-squeeze-tuple	2023-04-17 17:28:48 +01:00
Yih-Dar	5269718cb7	Don't use `LayoutLMv2` and `LayoutLMv3` in some pipeline tests (#22774 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-17 17:45:20 +02:00
Sylvain Gugger	ea7b0a539a	Use code on the Hub from another repo (#22698 ) * initial work * Add other classes * Refactor code * Move warning and fix dynamic pipeline * Issue warning when necessary * Add test	2023-04-17 11:36:29 -04:00
Wonhyeong Seo	4d2c52e830	🌐 [i18n-KO] Translated `tasks/translation.mdx` to Korean (#22805 ) docs: ko: tasks/translation.mdx	2023-04-17 11:30:17 -04:00
Matt	2237127a6c	Fix sneaky torch dependency in TF example (#22804 )	2023-04-17 16:11:52 +01:00
fpgaminer	626c1b8af1	improve(llama): Faster apply_rotary_pos_emb (#22785 )	2023-04-17 15:18:38 +01:00
Jungnerd	abbc96a214	[i18n-KO] fix: docs: ko: sagemaker anchors and `_toctree.yml` (#22549 ) fix: docs: ko: sagemaker anchors and `_toctree.yml` Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com> Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by: Na Yeon Han <nayeon2.han@gmail.com> Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>	2023-04-17 07:41:52 -04:00
Na Yeon Han	18c894814e	🌐 [i18n-KO] Translated `custom_models.mdx` to Korean (#22534 ) docs: ko: translated `custom_models.mdx` Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com> Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>	2023-04-17 07:39:53 -04:00
Yih-Dar	76d24f1a83	Fix `test_word_time_stamp_integration` for `Wav2Vec2ProcessorWithLMTest` (#22800 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-17 12:41:55 +02:00
bcol	28f26c107b	Generate: add CJK support to TextStreamer (#22664 )	2023-04-15 10:35:08 +01:00
oscar-garzon	fb3aa06cb6	Move labels to the same device as logits for Whisper (#22779 )	2023-04-14 19:08:41 -04:00
amyeroberts	20e54e49fa	Indexing fix - CLIP checkpoint conversion (#22776 ) * Indexing fix - CLIP checkpoint conversion * Fix up	2023-04-14 19:12:47 +01:00
Joao Gante	895ae3b5c4	Seq2SeqTrainer: Evict decoder_input_ids only when it is created from labels (#22772 )	2023-04-14 17:45:14 +01:00
Mayank Agarwal	daf53241d6	Fix word_ids hyperlink (#22765 ) * Fix word_ids hyperlink * Add suggested fix	2023-04-14 16:18:15 +01:00
Matt	06e737fbaf	Tweak ESM tokenizer for Nucleotide Transformer (#22770 ) * If EOS is None, don't add it to sequences * If EOS is None, don't add it to sequences	2023-04-14 15:18:43 +01:00
Sohyun Sim	c8df3900c8	[WIP]🌐 [i18n-KO] Translated `tutorial/proprecssing.mdx` to Korean (#22578 ) * add ko preprocessing * translate preprocessing.mdx to korean * translate preprocessing.mdx * Update preprocessing.mdx Fixed the line 273 as below: 또한, 특징 추출기에 `sampling_rate` 인자를 추가하여 발생할 수 있는 조용한 오류(silent errors)를 더 잘 디버깅하는 것을 권장합니다. * translate Image part * translated preprocess.mdx * Update docs/source/ko/preprocessing.mdx Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> * Update docs/source/ko/preprocessing.mdx Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> * Update docs/source/ko/preprocessing.mdx Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> * Update docs/source/ko/preprocessing.mdx Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> * Update docs/source/ko/preprocessing.mdx Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> * Update docs/source/ko/preprocessing.mdx Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> * Update docs/source/ko/preprocessing.mdx Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> * Update docs/source/ko/preprocessing.mdx Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> * Update docs/source/ko/preprocessing.mdx * Update docs/source/ko/preprocessing.mdx * Update docs/source/ko/preprocessing.mdx * Update docs/source/ko/preprocessing.mdx * Update docs/source/ko/preprocessing.mdx * Update docs/source/ko/preprocessing.mdx * fixed translation --------- Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>	2023-04-14 07:26:44 -04:00
Yih-Dar	53c710d17b	Fix failing torchscript tests for `CpmAnt` model (#22766 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-14 12:53:45 +02:00
Alexander Ljungberg	d2ffc3fc48	Fix a mistake in Llama weight converter log output. (#22764 ) Fixed string format; better tokenizer message. Before: `Saving a {tokenizer_class} to {tokenizer_path}` After: `Saving a LlamaTokenizerFast to outdir.`	2023-04-14 10:26:45 +01:00
Joao Gante	9af845afc2	Generate: pin number of beams in BART test (#22763 )	2023-04-14 09:57:25 +01:00
Joao Gante	66b15efb20	Pix2struct: doctest fix (#22761 )	2023-04-14 09:40:39 +01:00
Sayak Paul	390e121fb5	[Examples] TPU-based training of a language model using TensorFlow (#21657 ) * add: tokenizer training script for TF TPU LM training. * add: script for preparing the TFRecord shards. * add: sequence of execution to readme. * remove limit from the tfrecord shard name. * Add initial train_model.py * Add basic training arguments and model init * Get up to the point of writing the data collator * Pushing progress so far! * Complete first draft of model training code * feat: grouping of texts efficiently. Co-authored-by: Matt <rocketknight1@gmail.com> * Add proper masking collator and get training loop working * fix: things. * Read sample counts from filenames * Read sample counts from filenames * Draft README * Improve TPU warning * Use distribute instead of distribute.experimental * Apply suggestions from code review Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Modularize loading and add MLM probability as arg * minor refactoring to better use the cli args. * readme fillup. * include tpu and inference sections in the readme. * table of contents. * parallelize maps. * polish readme. * change script name to run_mlm.py * address PR feedback (round I). --------- Co-authored-by: Matt <rocketknight1@gmail.com> Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>	2023-04-14 10:41:01 +05:30
Hyeonseo Yun	bfb3925fcb	🌐 [i18n-KO] Translated `sequence_classification.mdx` to Korean (#22655 ) * docs: ko: init: tasks/sequence_classification.mdx * docs: ko: revised: change voca in tasks/sequence_classification.mdx * docs: ko: revised: [RE] change voca in tasks/sequence_classification.mdx * docs: ko: revised: spell check and sentence naturally in tasks/sequence_classification.mdx * docs: ko: revised: spell check and consistent vocabulary in tasks/sequence_classification.mdx * docs: ko: revised: Add full stop and change voca in tasks/sequence_classification.mdx * docs: ko: revised: sync first section templates in tasks/sequence_classification.mdx Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> * fix: revert use of full-stops to colons * colons are used to emphasize the code block that follows * @0525hhgus @wonhyeongseo docs: ko: revised: sync second section templates in tasks/sequence_classification.mdx Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com> * docs: ko: revised: change 'train', 'finetuning' in tasks/sequence_classification.mdx --------- Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>	2023-04-13 21:40:36 -04:00
Yih-Dar	a6752a7d3c	Fix `serving_output` for TF composite models (encoder-decoder like models) (#22743 ) * fix * style * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-13 23:45:22 +02:00
Yih-Dar	410b61ad7e	Revert (for now) the change on `Deta` in #22437 (#22750 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-13 21:32:29 +02:00
Joao Gante	9dfd6a4baa	Generate: handle text conditioning with multimodal encoder-decoder models (#22748 )	2023-04-13 19:51:13 +01:00
Ruiyang Sun	90ce374d14	fix(llama): fix LlamaTokenzier (#22746 ) Bug in LlamaTokenizer when #22742	2023-04-13 18:19:38 +01:00
Stas Bekman	d85bf95436	[trainer] update url (#22747 ) * [trainer] update url * style	2023-04-13 09:23:55 -07:00
Yih-Dar	656d41ab4c	Remove `DS_BUILD_AIO=1` (#22741 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-13 18:08:22 +02:00
Yih-Dar	32b08742a5	`DocumentQuestionAnsweringPipeline` only for fast ⚡ tokenizers (#22745 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-13 17:22:59 +02:00
Gabriel Yang	4def2fe969	🌐 [i18n-KO] Translated `training.mdx` to Korean (#22670 ) translate training doc to Korean	2023-04-13 11:04:47 -04:00
Yih-Dar	7df1343292	Change `torch_dtype` to `str` when `saved_model=True` in `save_pretrained` for TF models (#22740 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-13 15:52:16 +02:00

1 2 3 4 5 ...

12628 Commits