transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-03 03:31:05 +06:00

Author	SHA1	Message	Date
Arthur	bb300ac686	Whisper Timestamp processor and prediction (#20620 ) * add draft logit processor * add template functions * update timesapmt processor parameters * draft script * simplify code * cleanup * fixup and clean * update pipeline * style * clean up previous idea * add tokenization utils * update tokenizer and asr output * fit whisper type * style and update test * clean test * style test * update tests * update error test * udpate code (not based on review yet) * update tokenization * update asr pipeline * update code * cleanup and update test * fmt * remove text verificatino * cleanup * cleanup * add model test * update tests * update code add docstring * update code and add docstring * fix pipeline tests * Small update. * Fixup. * Tmp. * More support. * Making `forced_decoder_ids` non mandatory for users to set. * update and fix first bug * properly process sequence right after merge if last * tofo * allow list inputs + compute begin index better * start adding tests * add the 3 edge cases * style * format sequences * fixup * update * update * style * test passes, edge cases should be good * update last value * remove Trie * update tests and expec ted values * handle bigger chunk_length * clean tests a bit * refactor chunk iter and clean pipeline * update tests * style * refactor chunk iter and clean pipeline * upade * resolve comments * Apply suggestions from code review Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> * take stride right into account * update test expected values * Update code based on review Co-authored-by: sgugger <sylvain.gugger@gmail.com> Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by: sgugger <sylvain.gugger@gmail.com>	2023-01-17 15:50:09 +01:00
Nicolas Patry	25ddd91b24	Fixing offline mode for pipeline (when inferring task). (#21113 ) * Fixing offline mode for pipeline (when inferring task). * Update src/transformers/pipelines/__init__.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Updating test to reflect change in exception. * Fixing offline mode. * Clean. Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-01-17 15:24:40 +01:00
Sherman Siu	8896ebb9a9	Clarify and add missing typical_p argument docstring. (#21095 ) * Clarify and add missing typical_p docstring. * Make the docstring easier to understand. * Clarify typical_p docstring Accept the suggestion by @stevhliu for paraphrasing the docstring. Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Use the same docstring as in GenerationConfig Follow the suggestion suggested by @stevhliu in the pull request conversation. * Fix docstring spacing. Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-01-17 09:23:47 -05:00
Sayak Paul	f30bcd5357	feat: add standalone guide on XLA support. (#21141 ) * feat: add standalone guide on XLA support. Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Empty commit to trigger CI * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * address PR comments. Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-01-17 15:07:59 +01:00
Nick Hill	3bbc2451b1	Small simplification to TopKLogitsWarper (#21130 ) The max of top_k and min_tokens_to_keep performed on every call can just be done once up-front.	2023-01-17 09:06:03 -05:00
amyeroberts	0dde58978a	Rename test_feature_extraction files (#21140 ) * Rename files * Update file names in tests	2023-01-17 14:04:07 +00:00
Joao Gante	7b5e943cb6	Generate: TF contrastive search must pop `use_cache` from `model_kwargs` (#21149 )	2023-01-17 13:42:52 +00:00
Joao Gante	7f3dab39b5	TF: serializable hubert (#20966 ) * serializable hubert	2023-01-17 13:07:37 +00:00
Matt	e5dcceb82c	Fixes to TF collators (#21143 ) * Add num_workers for prepare_tf_dataset * Bugfix in the default collator and change default tensor type * Remove the "num_workers" arg and move it to a new PR	2023-01-17 12:18:56 +00:00
Alara Dirik	2411f0e465	Add Mask2Former (#20792 ) * Adds Mask2Former to transformers Co-authored-by: Shivalika Singh <shivalikasingh95@gmail.com> Co-authored-by: Shivalika Singh <73357305+shivalikasingh95@users.noreply.github.com> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-01-16 20:37:07 +03:00
NielsRogge	9edf375834	[GIT] Fix training (#21133 ) * Fix training * Add test * Fix failing tests Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2023-01-16 15:37:38 +01:00
Yih-Dar	0fb27dc988	Update `TFTapasEmbeddings` (#21107 ) Update TFTapasEmbeddings Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-16 15:29:50 +01:00
Clémentine Fourrier	4bbbabcb2c	Added clefourrier as ref point for graph models in bug reports (#21139 ) * Added clefourrier as ref point for graph models in bug reports * Update PULL_REQUEST_TEMPLATE.md	2023-01-16 15:12:42 +01:00
Yih-Dar	a45914193a	Fix `RealmModelIntegrationTest.test_inference_open_qa` (#21136 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-16 15:09:52 +01:00
Susnato Dhar	a5327c6a9a	Fixed issue #21053 (#21065 ) Co-authored-by: susnato <susnato@tensorflow123456@gmail.com>	2023-01-16 15:06:35 +01:00
Nicolas Patry	488a179ce1	Fixing batching pipelines on single items for ChunkPipeline (#21132 ) * Fixing #20783 * Update src/transformers/pipelines/base.py * Fixing some tests. * Fixup. * Remove ffmpeg dep + a bit more relaxed for bigbird QA precision. * Better dataset. * Prevent failing on TF. * Better condition. We can't use `can_use_iterator` since we cannot use it directly.	2023-01-16 15:04:27 +01:00
Silver	fa906a264b	Add `min_new_tokens` argument in generate() (implementation based on `MinNewTokensLengthLogitsProcessor`) (#21044 ) add a new parameter min_new_tokens for generate()	2023-01-16 15:02:08 +01:00
guillaume-be	125f137562	[LongT5] Remove duplicate encoder_attention_mask default value check (#21124 ) - Remove duplicate encoder_attention_mask default value assignment	2023-01-16 14:26:56 +01:00
NielsRogge	05b8e25fff	[VideoMAE] Fix docstring (#21111 ) Fix docstring Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2023-01-16 09:39:35 +01:00
NielsRogge	4ed89d48ab	Add UperNet (#20648 ) * First draft * More improvements * Add convnext backbone * Add conversion script * Add more improvements * Comment out to_dict * Add to_dict method * Add default config * Fix config * Fix backbone * Fix backbone some more * Add docs, auto mapping, tests * Fix some tests * Fix more tests * Fix more tests * Add conversion script * Improve conversion script * Add support for getting reshaped undownsampled hidden states * Fix forward pass * Add print statements * Comment out set_shift_and_window_size * More improvements * Correct downsampling layers conversion * Fix style * First draft * Fix conversion script * Remove config attribute * Fix more tests * Update READMEs * Update ConvNextBackbone * Fix ConvNext tests * Align ConvNext with Swin * Remove files * Fix index * Improve docs * Add output_attentions to model forward * Add backbone mixin, improve tests * More improvements * Update init_weights * Fix interpolation of logits * Add UperNetImageProcessor * Improve image processor * Fix image processor * Remove print statements * Remove script * Update import * Add image processor tests * Remove print statements * Fix test * Add integration test * Add convnext integration test * Update docstring * Fix README * Simplify config * Apply suggestions * Improve docs * Rename class * Fix test_initialization * Fix import * Address review * Fix confg * Convert all checkpoints * Fix default backbone * Usage same processor as segformer * Apply suggestions * Fix init_weights, update conversion scripts * Improve config * Use Auto API instead of creating a new image processor * Fix docs * Add doctests * Remove ResNetConfig dependency * Add always_partition argument * Fix rebaseé * Improve docs * Convert checkpoints Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>	2023-01-16 09:39:13 +01:00
TK Buristrakul	5db9abde43	Fixed typo in docstring (#21115 ) Fixed typo	2023-01-15 11:03:30 +01:00
Yusuke Oda	15adc24208	Use raw string for regex in tokenization_t5_fast.py (#21125 ) Suppress deprecation warning	2023-01-15 10:56:31 +01:00
Arthur	056218dab1	[CI-doc-daily] Remove RobertaPreLayernorm random tests (#20992 ) * Remove random output * remove values * fix copy statements	2023-01-14 19:47:32 +01:00
Sylvain Gugger	c8f35a9ce3	Rework automatic code samples in docstrings (#20757 ) * Rework automatic code samples in docstrings * ImageProcessor->AutoImageProcessor * Add models to fix copies * Last typos * A couple more models * Fix copies	2023-01-14 09:49:36 +01:00
Shogo Hida	7f65d2366a	Add Spanish translation to community.mdx (#21055 ) * Add community to toctree Signed-off-by: Shogo Hida <shogo.hida@gmail.com> * Copy English content Signed-off-by: Shogo Hida <shogo.hida@gmail.com> * Add some translations Signed-off-by: Shogo Hida <shogo.hida@gmail.com> * Add some translations Signed-off-by: Shogo Hida <shogo.hida@gmail.com> * Add some translations Signed-off-by: Shogo Hida <shogo.hida@gmail.com> * Fix position of community Signed-off-by: Shogo Hida <shogo.hida@gmail.com> * Fix translation Signed-off-by: Shogo Hida <shogo.hida@gmail.com> * Add translation Signed-off-by: Shogo Hida <shogo.hida@gmail.com> * Add translation Signed-off-by: Shogo Hida <shogo.hida@gmail.com> * Add translation Signed-off-by: Shogo Hida <shogo.hida@gmail.com> * Add translation Signed-off-by: Shogo Hida <shogo.hida@gmail.com> Signed-off-by: Shogo Hida <shogo.hida@gmail.com>	2023-01-14 09:25:05 +01:00
Steven Liu	f58248b824	Update task summary part 1 (#21014 ) * first draft of new task summary * make style * review * apply feedback * apply feedbacks * final touches	2023-01-13 11:01:53 -08:00
Arthur	95f0dd2123	[Tokenizers] Fix a small typo (#21104 ) * typo * change name in `__repr__` * fix my mistake	2023-01-13 16:21:34 +01:00
Yih-Dar	b210c83a78	Fix `torchscript` tests for `AltCLIP` (#21102 ) fix torchscript tests for AltCLIP Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-13 10:03:19 +01:00
Yih-Dar	b3a0aad37d	Fix past CI (#20967 ) * Fix for Past CI * make style * clean up * unindent 2 blocks Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-12 18:04:21 +01:00
Stas Bekman	41b0564b35	[bnb optim] fixing test (#21030 ) * [bnb optim] fixing test * force 1 gpu * fix * fix * fix * finalize * improve commentary * fix * cleanup * more fixes	2023-01-12 08:52:54 -08:00
Yih-Dar	212829ade6	Remove more unused attributes in config classes (#21000 ) * Remove gradient_checkpointing from MarkupLMConfig * Remove predict_special_tokens from OpenAIGPTConfig * Remove enable_cls from RoCBertConfig * Remove batch_size from TrajectoryTransformerConfig * Remove searcher_seq_len from RealmConfig * Remove feat_quantizer_dropout from WavLMConfig * Remove position_biased_input from SEWDConfig * Remove max_source_positions from Speech2Text2Config Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-12 13:32:04 +01:00
Susnato Dhar	b5be744d3c	Fixed issue #21039 (#21062 ) Fixed issue #21039 and added test for low_cpu_mem_usage	2023-01-12 10:03:13 +01:00
Wang, Yi	e849e5bb4a	Optimize inference only mode memory if ipex is used (#21083 ) * Optimize inference only mode memory if ipex is used Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> * fix code style Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>	2023-01-12 10:01:17 +01:00
zzz	6767ce71d6	fix typo in comment (#21088 ) fix typo Signed-off-by: xiaoyang zhu <zhuxiaoyang1996@gmail.com> Signed-off-by: xiaoyang zhu <zhuxiaoyang1996@gmail.com>	2023-01-11 17:51:41 +01:00
Ying Zhang	64b6b2b273	Update docstring for CLIPConfig (#21066 ) Update doc for CLIPConfig	2023-01-11 14:22:26 +01:00
Steven Liu	8f796960f6	Fix header level (#21072 ) fix header level	2023-01-10 10:24:10 -08:00
Bharat Ramanathan	07cde58bdb	feature: update wandb callback to upload checkpoints (#21035 ) * docs: add wandb metrics and model checkpointing to callback docstrings * docs: update reference to wandb documentation * fix: change default of `"WANDB_WATCH"` from ``"gradients"` to ``"false"` * feature: add `on_save` method and update `"WANDB_LOG_MODEL` behaviour * fix: use default wandb run names instead of `output_dir` - removes duplicated run names from wandb workspace - models can be logged with corresponding run names * fix: edit deprecation warning based on review suggestions Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fix: change indentation of docstrings * fix: change indentation of docstrings and run fixup * fix: empty commit for circleci permissions issue * fix: format deprecation doc strings review suggestion Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * docs: Highlight WANDB_DISABLED arg in documentaion Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * fix: run fixup after updating docstrings Co-authored-by: Bharat Ramanathan <ramanathan.parameshwaran@gohuddl.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-01-10 18:43:22 +01:00
KarlFelixJoehnk	a3c37825cc	Make the attention_head_size in distilbert an object attribute (#20970 ) * [Fix] Make the attention head size in distilbert an object attribute * Fix code style Co-authored-by: Felix Joehnk <fjoehnk@N73GCH2NDH.corp.proofpoint.com>	2023-01-09 18:17:16 +01:00
Arthur	e3ecbaa4ab	Patch-past-refactor (#21050 ) * small patches, forgot a line * refactor PT * the actual fix	2023-01-09 18:12:13 +01:00
Yih-Dar	48d4e147d8	remove flax file from `documentation_tests.txt` (#21036 ) remove flax file from `documentation_tests.txt` Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-08 12:33:25 +01:00
Sylvain Gugger	d0f324f1e1	Fix warning for MCTC model (#21049 )	2023-01-08 10:55:23 +01:00
Sylvain Gugger	9a046cc14e	Skip failing test until Athur looks at it.	2023-01-08 04:53:20 -05:00
Arthur	f0577df6de	Replace `past` with `past_key_values` (#20944 ) * start cleanup * more updates * more models are affected * more updates * update generation utils * style * revert change that removed reorder cachce * update generation utils * style * style * remove reorder cache	2023-01-08 10:21:40 +01:00
SABA UL HAQUE	7cb596fa22	fix typo (#21048 ) Typo fix: Corrected the word metada --> metadata	2023-01-08 10:03:01 +01:00
Kaito Sugimoto	bd9d51263a	fix typo (#21042 )	2023-01-07 10:13:26 +01:00
Bartosz Szmelczynski	f93c90d217	fix levit timm conversion file (#20938 ) * fix levit timm conversion file * remove set_defaults	2023-01-06 13:27:30 +01:00
Ceyda Cinarel	c29bec485e	fix parameter name in docstring (#21032 )	2023-01-06 07:23:16 -05:00
Dudu Lasry	61e068e5a2	Support turning off the model uploading in ClearML (#20969 ) * Add support for turning off the model uploading in ClearML * Add documentation for the CLEARML_LOG_MODEL environment variable * Adjust new doc addition to the new style Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Dudu Lasry <dudu.lasry@viz.ai> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-01-06 07:22:19 -05:00
Observer46	ff8dcb5efa	Fix arguments passed to predict function in QA Seq2seq training script (#21026 ) fix args passed to predict function	2023-01-06 07:19:42 -05:00
Roy Hvaara	35a7052b61	[NumPy] Remove references to deprecated NumPy type aliases (#21022 ) [NumPy] Remove references to deprecated NumPy type aliases. This change replaces references to a number of deprecated NumPy type aliases (np.bool, np.int, np.float, np.complex, np.object, np.str) with their recommended replacement (bool, int, float, complex, object, str). NumPy 1.24 drops the deprecated aliases, so we must remove uses before updating NumPy. Co-authored-by: Peter Hawkins <phawkins@google.com> Co-authored-by: Peter Hawkins <phawkins@google.com>	2023-01-05 13:02:10 -05:00
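
The TopKLogitsWarper commit above (3bbc2451b1) notes that the max of `top_k` and `min_tokens_to_keep` can be computed once up-front instead of on every call. The sketch below shows that idea in plain Python. It is an illustration only and not the code from the commit: the class name `TopKFilter`, the list-based scores, and the clamp to the score count are assumptions made so the example runs without any dependencies.

```python
import math
from typing import List


class TopKFilter:
    """Hypothetical stand-in for a top-k logits warper (not the transformers class)."""

    def __init__(self, top_k: int, min_tokens_to_keep: int = 1,
                 filter_value: float = -math.inf) -> None:
        if not isinstance(top_k, int) or top_k <= 0:
            raise ValueError(f"top_k must be a positive integer, got {top_k!r}")
        # Resolve the effective k once here, rather than on every call.
        self.top_k = max(top_k, min_tokens_to_keep)
        self.filter_value = filter_value

    def __call__(self, scores: List[float]) -> List[float]:
        # Assumed safety check: never keep more entries than there are scores.
        k = min(self.top_k, len(scores))
        # The k-th largest score is the cut-off; ties with it are kept.
        threshold = sorted(scores, reverse=True)[k - 1]
        return [s if s >= threshold else self.filter_value for s in scores]


warper = TopKFilter(top_k=1, min_tokens_to_keep=2)
filtered = warper([0.1, 2.0, 1.5, -3.0])
print(warper.top_k, filtered)
```

With `top_k=1` and `min_tokens_to_keep=2`, the effective k is fixed at 2 in the constructor, so each call only pays for the sort and the comparison.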

11765 Commits