transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-21 13:38:31 +06:00

Author	SHA1	Message	Date
Sylvain Gugger	35920c9715	Trigger CI	2023-01-19 07:52:32 -05:00
Matthijs Hollemans	9b468a7cd7	workaround documentation rendering bug (#21189 )	2023-01-19 07:50:59 -05:00
Yih-Dar	464c86ac93	Update year 2020 to 2023 in one file (#21190 ) * update year Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-19 13:16:28 +01:00
Yih-Dar	1d33f55cb8	Fix `Mask2FormerForUniversalSegmentation` (#21175 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-19 10:15:08 +01:00
Jitesh Jain	5b949623c7	Add OneFormer Model (#20577 ) * Add Oneformer Model * Add OneFormer Tests * Add UNIVERSAL_SEGMENTATION_MAPPING * Fix config * 🐛 Fix error encountered while writing tests * 🔨 Fix instance segmentation post processing * Format Files and Add Documentation * Add Documentation mdx file * Run make fixup * Run make fix-copies * Remove unnecessary code * Format modeling_oneformer.py * Add OneFormer to ImageSegmentationPipeline * Format files * Add Demo link to Readme * Fix fomatting errors * Fix test failures * Update Table in index.mdx * Fix version * Fix style * Remove OneFormer from TF * Fix Imports * Fix dummy objects * Fix tests * Add newline * Remove OneFormerFeatureExtractor * Remove CUDA Kernels * Use AutoBackbone for Swin * Fix description * Use Image Processor * Fix copies * Fix formatting * Fix import order * Fix flake8 errors * Fix doc errors * Add Hindi Readme entry * Update supported backbones * Update supported backbones * Undo Changes * Fix type of config * Fix isort * Fix auto.mdx * Fix swin config * Replace DinatBackbone with AutoBackbone * Use SwinBackbone * Use SwinBackbone * Fix conversion script * Fix arguments * Add argument description * Fix style * Add OneFormerProcessor * Fix OneFormerProcessor Tests * Fix mapping * Fix imports * Fix inits * Fix style * Fix comment * Fix docstring * Move OneFormer to MultiModal * Fix Copies * Remove size divisor * Fix check_repo.py * Fix copies * Add Processor for Testing Pipeline * Fix padding for tokens * Fix variables * Fix formatting with correct black version * Add Image Processor Test * Apply suggestions * Revert common modeling * Add check for task * Fix conversion script * Fix initialization order * Fix tests * Undo Pipeline Changes * Fix layers in MLP * Fix copies * Update image paths * Fix copies * Apply suggestions	2023-01-19 09:31:07 +01:00
Stas Bekman	6d67664380	[issues template] update deepspeed owners (#21027 ) * [issues template] update deepspeed owners add the right contact for deepspeed@accelerate * pr-template	2023-01-18 17:23:36 -08:00
Matt	00ba7cadd8	Rewrite a couple of lines in the TF XLA doc (#21177 ) * Rewrite a couple of lines in the TF XLA doc to explain that jit_compile can be used in model.compile() too * Remove extra )	2023-01-18 17:53:05 +00:00
jeffhataws	c59d71b282	Add AWS Neuron torchrun support (#20806 ) * Add XLA torchrun support * Clarify that currently DDP doesn't work with torch.distributed XLA backend yet * Enable DDP with torchrun and XLA (now available in PT-XLA 1.13) * Add check for AWS Neuron availability and AWS Neuron specific compiler flag * Change the new test's name to TestTrainerDistributedNeuronCore * Remove "assert" and replace raised exception * Remove compiler flag as it is optional. If needed, will be another PR. * Use TORCHELASTIC_RUN_ID to determine whether torchrun is used	2023-01-18 11:21:19 -05:00
dependabot[bot]	f70ee51029	Bump future from 0.18.2 to 0.18.3 in /examples/research_projects/visual_bert (#21173 ) Bump future in /examples/research_projects/visual_bert Bumps [future](https://github.com/PythonCharmers/python-future) from 0.18.2 to 0.18.3. - [Release notes](https://github.com/PythonCharmers/python-future/releases) - [Changelog](https://github.com/PythonCharmers/python-future/blob/master/docs/changelog.rst) - [Commits](https://github.com/PythonCharmers/python-future/compare/v0.18.2...v0.18.3) --- updated-dependencies: - dependency-name: future dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-01-18 11:17:35 -05:00
dependabot[bot]	0194665c33	Bump future from 0.18.2 to 0.18.3 in /examples/research_projects/lxmert (#21169 ) Bumps [future](https://github.com/PythonCharmers/python-future) from 0.18.2 to 0.18.3. - [Release notes](https://github.com/PythonCharmers/python-future/releases) - [Changelog](https://github.com/PythonCharmers/python-future/blob/master/docs/changelog.rst) - [Commits](https://github.com/PythonCharmers/python-future/compare/v0.18.2...v0.18.3) --- updated-dependencies: - dependency-name: future dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-01-18 11:16:43 -05:00
Sylvain Gugger	05e72aa0c4	Adapt repository creation to latest hf_hub (#21158 ) * Adapt repository creation to latest hf_hub * Update all examples * Fix other tests, add Flax examples * Address review comments	2023-01-18 11:14:00 -05:00
Yih-Dar	32525428e1	Fix doctest CI (#21166 ) * fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-18 16:54:24 +01:00
Pengfei Liu	8ad06b7c13	using raw string for regex to search <extra_id> (#21162 ) * using raw string for regex to search <extra_id> * fix the same issue in test file:`tokenization_t5.py`	2023-01-18 09:43:54 -05:00
Wang, Yi	8a17da2f7f	fix the issue that the output dict of jit model could not get [:2] (#21146 ) "TypeError: unhashable type: 'slice'" Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>	2023-01-18 09:41:28 -05:00
Peter Lin	e1ad188641	Fix git model for generate with beam search. (#21071 ) * Fix git model for generate with beam search. * Update comment * Fix bug on multi batch * Add generate tests * Clean up tests * Fix style Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2023-01-18 09:40:24 -05:00
Joao Gante	e15f0d73db	OPT: Fix batched generation with FLAX (#21150 ) * Fix Flax OPT numerical masking * re-enable test * add fix to bart and reintroduce copied from in opt	2023-01-18 14:24:53 +00:00
Jordi Mas	f4786d7f39	Fix typos in documentation (#21160 ) * Fix typos in documentation * Small fix * Fix formatting	2023-01-18 09:05:25 -05:00
Samuel Xu	defdcd2862	Remove Roberta Dependencies from XLM Roberta Flax and Tensorflow models (#21047 ) * Added flax model code * Added tf changes * missed some * Added copy comments * Added style hints * Fixed copy statements * Added suggested fixes * Made some fixes * Style fixup * Added necessary copy statements * Fixing copy statements * Added more copies * Final copy fix * Some bugfixes * Adding imports to init * Fixed up all make fixup errors * Fixed doc errors * Auto model changes	2023-01-18 07:49:39 -05:00
Younes Belkada	023f51fe16	`blip` support for training (#21021 ) * `blip` support for training * remove labels creation * remove unneeded `decoder_input_ids` creation * final changes - add colab link to documentation - reduction = mean for loss * fix nits * update link * clearer error message	2023-01-18 11:24:37 +01:00
Yih-Dar	c8849583ad	Make `test_save_pretrained_signatures` slow test (#21105 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-18 10:43:05 +01:00
Shogo Hida	14154f7238	Add Japanese translation to multilingual.mdx (#21084 ) * Create toctree for Japanese translations Signed-off-by: Shogo Hida <shogo.hida@gmail.com> * Copy English version Signed-off-by: Shogo Hida <shogo.hida@gmail.com> * Add Japanese translations Signed-off-by: Shogo Hida <shogo.hida@gmail.com> * Add Japanese translations Signed-off-by: Shogo Hida <shogo.hida@gmail.com> Signed-off-by: Shogo Hida <shogo.hida@gmail.com>	2023-01-18 10:08:18 +01:00
Wonhyeong Seo	30c12301f8	🌐 [i18n-KO] Translated `installation.mdx` to Korean (#20948 ) docs: ko: installation.mdx	2023-01-18 10:05:23 +01:00
layjain	44caf4f6f4	Fixed num_channels!=3 normalization training (#20630 ) * Fixed num_channels!=3 normalization training * empty commit to trigger CI * Empty-Commit for CircleCI * Empty-Commit * Empty Commit try-3: https://discuss.circleci.com/t/github-code-checkout-suddenly-failing/31558 * Empty commit to trigger CI Co-authored-by: Lay Jain <layjain@basil.csail.mit.edu> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-17 13:06:20 -05:00
Sherman Siu	865da84abb	Add Epsilon- and Eta-Sampling (#21121 ) * Add epsilon- and eta-sampling. Add epsilon- and eta-sampling, following the official code from https://github.com/john-hewitt/truncation-sampling and adapting to be more configurable, as required by Huggingface transformers. * Add unit tests for epsilon- and eta-sampling. * Black: fix code formatting. * Fix docstring spacing. * Clean up newlines. * Fix implementation bugs and their associated tests. * Remove epsilon- and eta-sampling parameters from PretrainedConfig. * Clarify and clean up the documentation. * Remove parameters for PretrainedConfig test.	2023-01-17 13:04:32 -05:00
Maria Khalusova	0248810300	Refactoring of the text generate API docs (#21112 ) * initial commit, refactoring the text generation api reference * removed repetitive code examples * Refactoring the text generation docs to reduce repetition * make style	2023-01-17 12:23:48 -05:00
Maria Khalusova	d386fd646a	Add: An introductory guide for text generation (#21090 ) * Part of the "text generation" rework: adding a high-level overview of the text generation strategies * code samples update via make style * fixed a few formatting issues * Apply suggestions from review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fixed spaces, and switched two links to markdown * Apply Steven's suggestions from review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * new lines after headers to fix link rendering * review feedback addressed. added links to image captioning and audio transcription examples * minor capitalization fix * addressed the review feedback * Apply suggestions from review Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Applied review suggestions Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>	2023-01-17 12:23:22 -05:00
Maria Khalusova	868d37165f	Add: tensorflow example for image classification task guide (#21038 ) * Added TF example for image classification * Code style polishing * code style polishing * minor polishing * fixed a link in a tip, and a typo in the inference TF content * Apply Amy's suggestions from review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update docs/source/en/tasks/image_classification.mdx Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * review feedback addressed * make style * added PushToHubCallback with save_strategy="no" * minor polishing * added PushToHubCallback with save_strategy=no * minor polishing * Update docs/source/en/tasks/image_classification.mdx * added data augmentation Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * make style Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-01-17 12:20:08 -05:00
NielsRogge	3a9bd972e2	Add resources (#20872 ) * Add resources * Add more resources * Remove pipeline tag * Add more resources * Add more resources Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2023-01-17 17:42:33 +01:00
Joao Gante	d96098c641	CLI: update hub PR URL (#21154 )	2023-01-17 16:36:47 +00:00
Sayak Paul	f3feaf7f22	Change variable name to prevent shadowing (#21153 ) fix: input -> input_string.	2023-01-17 11:29:23 -05:00
NielsRogge	cf028d0c3d	Add batch of resources (#20647 ) * Add resources * Add more resources * Add more resources * Add TAPAS * Fix pipeline tag * Fix pipeline tags * Remove pipeline tag * Remove depth-estimation tag * Update docs/source/en/model_doc/segformer.mdx Co-authored-by: Maria Khalusova <kafooster@gmail.com> * Apply suggestion * Fix segformer Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> Co-authored-by: Maria Khalusova <kafooster@gmail.com>	2023-01-17 17:18:56 +01:00
Arthur	bb300ac686	Whisper Timestamp processor and prediction (#20620 ) * add draft logit processor * add template functions * update timesapmt processor parameters * draft script * simplify code * cleanup * fixup and clean * update pipeline * style * clean up previous idea * add tokenization utils * update tokenizer and asr output * fit whisper type * style and update test * clean test * style test * update tests * update error test * udpate code (not based on review yet) * update tokenization * update asr pipeline * update code * cleanup and update test * fmt * remove text verificatino * cleanup * cleanup * add model test * update tests * update code add docstring * update code and add docstring * fix pipeline tests * Small update. * Fixup. * Tmp. * More support. * Making `forced_decoder_ids` non mandatory for users to set. * update and fix first bug * properly process sequence right after merge if last * tofo * allow list inputs + compute begin index better * start adding tests * add the 3 edge cases * style * format sequences * fixup * update * update * style * test passes, edge cases should be good * update last value * remove Trie * update tests and expec ted values * handle bigger chunk_length * clean tests a bit * refactor chunk iter and clean pipeline * update tests * style * refactor chunk iter and clean pipeline * upade * resolve comments * Apply suggestions from code review Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> * take stride right into account * update test expected values * Update code based on review Co-authored-by: sgugger <sylvain.gugger@gmail.com> Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by: sgugger <sylvain.gugger@gmail.com>	2023-01-17 15:50:09 +01:00
Nicolas Patry	25ddd91b24	Fixing offline mode for pipeline (when inferring task). (#21113 ) * Fixing offline mode for pipeline (when inferring task). * Update src/transformers/pipelines/__init__.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Updating test to reflect change in exception. * Fixing offline mode. * Clean. Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-01-17 15:24:40 +01:00
Sherman Siu	8896ebb9a9	Clarify and add missing typical_p argument docstring. (#21095 ) * Clarify and add missing typical_p docstring. * Make the docstring easier to understand. * Clarify typical_p docstring Accept the suggestion by @stevhliu for paraphrasing the docstring. Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Use the same docstring as in GenerationConfig Follow the suggestion suggested by @stevhliu in the pull request conversation. * Fix docstring spacing. Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-01-17 09:23:47 -05:00
Sayak Paul	f30bcd5357	feat: add standalone guide on XLA support. (#21141 ) * feat: add standalone guide on XLA support. Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Empty commit to trigger CI * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * address PR comments. Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-01-17 15:07:59 +01:00
Nick Hill	3bbc2451b1	Small simplification to TopKLogitsWarper (#21130 ) The max of top_k and min_tokens_to_keep performed on every call can just be done once up-front.	2023-01-17 09:06:03 -05:00
amyeroberts	0dde58978a	Rename test_feature_extraction files (#21140 ) * Rename files * Update file names in tests	2023-01-17 14:04:07 +00:00
Joao Gante	7b5e943cb6	Generate: TF contrastive search must pop `use_cache` from `model_kwargs` (#21149 )	2023-01-17 13:42:52 +00:00
Joao Gante	7f3dab39b5	TF: serializable hubert (#20966 ) * serializable hubert	2023-01-17 13:07:37 +00:00
Matt	e5dcceb82c	Fixes to TF collators (#21143 ) * Add num_workers for prepare_tf_dataset * Bugfix in the default collator and change default tensor type * Remove the "num_workers" arg and move it to a new PR	2023-01-17 12:18:56 +00:00
Alara Dirik	2411f0e465	Add Mask2Former (#20792 ) * Adds Mask2Former to transformers Co-authored-by: Shivalika Singh <shivalikasingh95@gmail.com> Co-authored-by: Shivalika Singh <73357305+shivalikasingh95@users.noreply.github.com> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-01-16 20:37:07 +03:00
NielsRogge	9edf375834	[GIT] Fix training (#21133 ) * Fix training * Add test * Fix failing tests Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2023-01-16 15:37:38 +01:00
Yih-Dar	0fb27dc988	Update `TFTapasEmbeddings` (#21107 ) Update TFTapasEmbeddings Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-16 15:29:50 +01:00
Clémentine Fourrier	4bbbabcb2c	Added clefourrier as ref point for graph models in bug reports (#21139 ) * Added clefourrier as ref point for graph models in bug reports * Update PULL_REQUEST_TEMPLATE.md	2023-01-16 15:12:42 +01:00
Yih-Dar	a45914193a	Fix `RealmModelIntegrationTest.test_inference_open_qa` (#21136 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-16 15:09:52 +01:00
Susnato Dhar	a5327c6a9a	Fixed issue #21053 (#21065 ) Co-authored-by: susnato <susnato@tensorflow123456@gmail.com>	2023-01-16 15:06:35 +01:00
Nicolas Patry	488a179ce1	Fixing batching pipelines on single items for ChunkPipeline (#21132 ) * Fixing #20783 * Update src/transformers/pipelines/base.py * Fixing some tests. * Fixup. * Remove ffmpeg dep + a bit more relaxed for bigbird QA precision. * Better dataset. * Prevent failing on TF. * Better condition. We can't use `can_use_iterator` since we cannot use it directly.	2023-01-16 15:04:27 +01:00
Silver	fa906a264b	Add `min_new_tokens` argument in generate() (implementation based on `MinNewTokensLengthLogitsProcessor`) (#21044 ) add a new parameter min_new_tokens for generate()	2023-01-16 15:02:08 +01:00
guillaume-be	125f137562	[LongT5] Remove duplicate encoder_attention_mask default value check (#21124 ) - Remove duplicate encoder_attention_mask default value assignment	2023-01-16 14:26:56 +01:00
NielsRogge	05b8e25fff	[VideoMAE] Fix docstring (#21111 ) Fix docstring Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2023-01-16 09:39:35 +01:00
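
Note on commit 3bbc2451b1 (Small simplification to TopKLogitsWarper): the message says the max of top_k and min_tokens_to_keep, previously computed on every call, can be done once up-front. The sketch below illustrates that idea only. It is not the transformers implementation: `TopKFilter` is a hypothetical stand-in that works on a plain Python list of scores instead of a tensor, and its names and defaults are assumptions made for the example.

```python
class TopKFilter:
    """Hypothetical stand-in for a top-k logits warper (not the library class)."""

    def __init__(self, top_k, filter_value=float("-inf"), min_tokens_to_keep=1):
        if not isinstance(top_k, int) or top_k <= 0:
            raise ValueError(f"top_k must be a positive integer, got {top_k!r}")
        # The max is resolved once here, so __call__ no longer recomputes it.
        self.top_k = max(top_k, min_tokens_to_keep)
        self.filter_value = filter_value

    def __call__(self, scores):
        # Never ask for more entries than the score list actually has.
        k = min(self.top_k, len(scores))
        # Value of the k-th largest score; anything strictly below it is masked.
        threshold = sorted(scores, reverse=True)[k - 1]
        return [s if s >= threshold else self.filter_value for s in scores]


if __name__ == "__main__":
    warper = TopKFilter(top_k=2)
    print(warper([0.1, 3.0, 2.0, -1.0]))
```

Only the constructor changes relative to a per-call max: the call path reads `self.top_k` directly, which is the whole simplification the commit message describes.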

12196 Commits