* wrong argument name
* append eos_token_id
* all tokenizers need mask and ctc_blank tokens
* remove reduction factor from feature extractor
* add proper TTS loss
* fix shifting: it was done the wrong way around
* mask out padded portions
* remove logits again (don't really need them)
* fix unit tests
* fixup
* pad also returns the decoder attention mask, since that's useful to have
* clean up feature extractor logic
* pad can handle TTS task too
* remove stop_labels from loss calculation
* simplify logic
* fixup
* do -100 masking properly (a loss-masking sketch follows this commit list)
* small STFT optimization (calculate mel filterbanks only once)
* replace torchaudio fbanks with audio_utils
* remove torchaudio dependency
* simplify & speed up the STFT
* don't serialize window and mel filters
* output cross attentions when generating speech
* add guided attention loss (a sketch follows this commit list)
* fix failing test
* Update src/transformers/models/speecht5/feature_extraction_speecht5.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
* Update src/transformers/models/speecht5/modeling_speecht5.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
* change type annotation of attention_mask to LongTensor
* extract loss into class
* remove unused frame_signal_scale argument
* use config object in loss class
* fix type annotations in doc comments
* change optional to just bool
* implement missing tokenizer method
* add deprecation warning
* Update src/transformers/models/speecht5/feature_extraction_speecht5.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/models/speecht5/feature_extraction_speecht5.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* add deprecation warning for stop_labels
---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
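For context on the -100 masking mentioned above: padded label positions are filled with -100 (the usual Hugging Face ignore-index convention), and the loss is computed only over the real frames. A minimal sketch of the idea, assuming an L1 spectrogram loss; the names and the actual loss class extracted in this PR may differ:

```python
import torch
import torch.nn.functional as F

def masked_spectrogram_loss(predictions: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
    """L1 loss over non-padded frames only.

    Padded positions in `labels` are filled with -100.0, following the
    usual Hugging Face ignore-index convention.
    """
    mask = labels != -100.0  # True where the frame is real
    return F.l1_loss(predictions[mask], labels[mask])
```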
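The guided attention loss added above is, in the TTS literature (Tachibana et al., 2017), a soft diagonal penalty on the cross-attention weights that encourages a roughly monotonic text-to-speech alignment. A minimal sketch, not the exact implementation in modeling_speecht5.py (padding masking and the multi-head/multi-layer reduction are omitted):

```python
import torch

def guided_attention_loss(attn: torch.Tensor, sigma: float = 0.4) -> torch.Tensor:
    """attn: cross-attention weights of shape (batch, target_len, input_len)."""
    _, t_len, s_len = attn.shape
    # Normalized positions along the output (t) and input (s) axes.
    t_pos = torch.arange(t_len, device=attn.device).float() / t_len
    s_pos = torch.arange(s_len, device=attn.device).float() / s_len
    # Soft diagonal weight: ~0 on the diagonal, approaching 1 far from it.
    weight = 1.0 - torch.exp(-((t_pos[:, None] - s_pos[None, :]) ** 2) / (2 * sigma**2))
    # Penalize attention mass that falls far from the diagonal.
    return (attn * weight.unsqueeze(0)).mean()
```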
* Simplify update metadata job
* Match more branch names
* Install everything that is necessary
* Install everything that is necessary
* Forgot the dev
* Install less stuff
* This syntax?
fix: docs: ko: sagemaker anchors and `_toctree.yml`
Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Na Yeon Han <nayeon2.han@gmail.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
docs: ko: translated `custom_models.mdx`
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
Fixed string formatting; clearer tokenizer message.
Before: `Saving a {tokenizer_class} to {tokenizer_path}`
After: `Saving a LlamaTokenizerFast to outdir.`
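The bug behind this fix is the common one of interpolating into a plain string instead of an f-string, so the placeholders were printed literally. A minimal reproduction (variable names are illustrative, not the exact ones in the conversion script):

```python
tokenizer_class = "LlamaTokenizerFast"
tokenizer_path = "outdir"

# Before: a plain string, so the braces are printed literally.
print("Saving a {tokenizer_class} to {tokenizer_path}")
# -> Saving a {tokenizer_class} to {tokenizer_path}

# After: an f-string interpolates the variables.
print(f"Saving a {tokenizer_class} to {tokenizer_path}")
# -> Saving a LlamaTokenizerFast to outdir
```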
* add: tokenizer training script for TF TPU LM training.
* add: script for preparing the TFRecord shards.
* add: sequence of execution to readme.
* remove limit from the tfrecord shard name.
* Add initial train_model.py
* Add basic training arguments and model init
* Get up to the point of writing the data collator
* Pushing progress so far!
* Complete first draft of model training code
* feat: grouping of texts efficiently (a sketch follows this commit list).
Co-authored-by: Matt <rocketknight1@gmail.com>
* Add proper masking collator and get training loop working (a masking sketch follows this commit list)
* fix: things.
* Read sample counts from filenames
* Read sample counts from filenames
* Draft README
* Improve TPU warning
* Use distribute instead of distribute.experimental
* Apply suggestions from code review
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
* Modularize loading and add MLM probability as arg
* minor refactoring to better use the cli args.
* fill out the readme.
* include tpu and inference sections in the readme.
* table of contents.
* parallelize maps.
* polish readme.
* change script name to run_mlm.py
* address PR feedback (round I).
---------
Co-authored-by: Matt <rocketknight1@gmail.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
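The text grouping referred to above follows the standard LM preprocessing pattern: concatenate the tokenized examples and slice them into fixed-size blocks so no tokens are wasted on padding. A minimal sketch, assuming a Hugging Face Datasets batched map and a hypothetical `block_size` (the script's actual arguments may differ):

```python
from itertools import chain

block_size = 512  # hypothetical value; the script's argument may differ

def group_texts(examples):
    # Concatenate every column (input_ids, attention_mask, ...) across the batch.
    concatenated = {k: list(chain(*examples[k])) for k in examples.keys()}
    total_length = len(concatenated[next(iter(examples.keys()))])
    # Drop the tail so every block is exactly block_size tokens long.
    total_length = (total_length // block_size) * block_size
    # Slice into non-overlapping blocks of block_size.
    return {
        k: [t[i : i + block_size] for i in range(0, total_length, block_size)]
        for k, t in concatenated.items()
    }

# Applied with a batched, parallelized map ("parallelize maps" above):
# dataset = dataset.map(group_texts, batched=True, num_proc=4)
```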
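The masking collator implements the standard BERT-style corruption controlled by the `mlm_probability` argument added above: select positions with probability `mlm_probability`, replace 80% of them with the mask token, 10% with a random token, and leave 10% unchanged. A NumPy sketch of the rule, not the script's actual collator (special-token handling is omitted for brevity):

```python
import numpy as np

def mask_tokens(input_ids, mask_token_id, vocab_size, mlm_probability=0.15, rng=None):
    """BERT-style masking: of the selected positions, 80% -> [MASK],
    10% -> random token, 10% -> unchanged."""
    rng = rng or np.random.default_rng()
    input_ids = input_ids.copy()
    labels = input_ids.copy()

    # Select positions to corrupt; loss is computed only on those.
    selected = rng.random(input_ids.shape) < mlm_probability
    labels[~selected] = -100

    # 80% of selected positions -> mask token.
    masked = selected & (rng.random(input_ids.shape) < 0.8)
    input_ids[masked] = mask_token_id

    # 10% of selected positions -> random token (half of the remaining 20%).
    randomized = selected & ~masked & (rng.random(input_ids.shape) < 0.5)
    input_ids[randomized] = rng.integers(0, vocab_size, size=input_ids.shape)[randomized]
    # The remaining 10% of selected positions stay unchanged.
    return input_ids, labels
```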
* docs: ko: init: tasks/sequence_classification.mdx
* docs: ko: revised: change vocabulary in tasks/sequence_classification.mdx
* docs: ko: revised: [RE] change vocabulary in tasks/sequence_classification.mdx
* docs: ko: revised: spell check and make sentences read naturally in tasks/sequence_classification.mdx
* docs: ko: revised: spell check and consistent vocabulary in tasks/sequence_classification.mdx
* docs: ko: revised: add full stops and change vocabulary in tasks/sequence_classification.mdx
* docs: ko: revised: sync first section templates in tasks/sequence_classification.mdx
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* fix: revert full stops back to colons
* colons are used to emphasize the code block that follows
* @0525hhgus @wonhyeongseo docs: ko: revised: sync second section templates in tasks/sequence_classification.mdx
Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>
* docs: ko: revised: change 'train', 'finetuning' in tasks/sequence_classification.mdx
---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* Add model to doc tests
* Remove generate and replace by prepare_inputs_for_generation
* More fixes
* Remove print statements
* Update integration tests
* Fix generate
* Remove model from auto mapping
* Use auto processor
* Fix integration tests
* Fix test
* Add inference code snippet
* Remove is_encoder_decoder
* Update docs
* Remove notebook link
* Update modeling_vilt.py
Make ViLT compatible with model parallelism
* Update modeling_switch_transformers.py
Make switch_transformers compatible with model parallelism
* Fix docstrings for TFBLIP
* Fix missing line in TF port!
* Use values from torch tests now other bugs fixed
* Use values from torch tests now other bugs fixed
* Fix doctest string
generator(model="openai/whisper-large") always returns an error. As the error message says, the generator expects an input, just like the .flac file above, and the generator object has no parameter called model. There are parameters that can be passed when calling generator, such as 'batch_size', but to specify a model the argument has to be passed while instantiating the pipeline, not as a parameter to the instance.
I believe the correct call should be:
generator = pipeline(model="openai/whisper-large", device=0)
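For completeness, a minimal sketch of the corrected usage, assuming a local audio file and an available GPU (device=0 selects the first GPU):

```python
from transformers import pipeline

# The model is chosen when the pipeline is instantiated...
generator = pipeline(model="openai/whisper-large", device=0)

# ...and the instance is then called with the audio input, plus optional
# call-time parameters such as batch_size.
transcription = generator("audio.flac", batch_size=8)
print(transcription)
```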