amyeroberts
8e4ee28e34
Update TF whisper doc tests ( #19484 )
2022-10-11 16:05:31 +01:00
Younes Belkada
6c66c6c860
Add warning in generate & device_map=auto & half precision models ( #19468 )
...
* fix device mismatch
* make fixup
* added slow tests
- added slow tests on `bnb` models to make sure generate works correctly
* replace with `self.device`
* revert force device assign
* Update src/transformers/generation_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* set the warning in `generate` instead of `sample`
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-10-11 16:58:49 +02:00
Ankur Goyal
a3008c5a6d
Implement multiple span support for DocumentQuestionAnswering ( #19204 )
...
* Implement multiple span support
* Address comments
* Add tests + fix bugs
2022-10-11 10:47:55 -04:00
h
ab856f68df
Decouples XLMProphet model from Prophet ( #19406 )
...
* decouples xlm_prophet from prophet and adds copy patterns that pass the copy check
* adds copy patterns to copied docstrings too
* restores autodoc for XLMProphetNetModel
* removes all-casing in a bunch of places to ensure that the model is compatible with all checkpoints on the hub
* adds missing model to main init
* adds autodocs to make document checker happy
* adds missing pretrained model import
* adds missing pretrained model import to main init
* adds XLMProphetNetPreTrainedModel to the dummy pt objects
* removes examples from the source-doc file since docstrings contain them already
* adds a missing new line to make check_repo happy
2022-10-11 10:45:23 -04:00
Yih-Dar
c66466133a
Fix get_embedding dtype at init. time ( #19473 )
...
* cast positions dtype in XGLMModel
* Get the correct dtype at init time
* Get the correct dtype at init time
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-10-11 16:05:39 +02:00
Sofia Oliveira
e38cf93e7c
Make XLMRoberta model and config independent from Roberta ( #19359 )
...
* remove config dependence
* remove dependencies from xlm_roberta
* Fix style
* Fix comments
* various fixes
* Fix pre-trained model name
2022-10-11 09:56:42 -04:00
Arnaud Stiegler
8cb44aaf17
Make LayoutLM tokenizers independent from BertTokenizer ( #19351 )
...
* fixing tokenizer
* adding all missing classes
* fast tokenizer | fixing format
* revert to full class copy flag
* fixing different casing
2022-10-11 09:49:23 -04:00
Joao Gante
9ed80b0000
TF: TFBart embedding initialization ( #19460 )
...
* correct embedding init
2022-10-11 14:44:46 +01:00
lewtun
b651efe59e
[Swin] Replace hard-coded batch size to enable dynamic ONNX export ( #19475 )
...
* [Swin] Replace hard-coded batch size to enable dynamic ONNX export
2022-10-11 15:21:29 +02:00
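Replacing the hard-coded batch size means declaring that axis as symbolic at export time. A minimal sketch of how a dynamic batch axis is typically declared for `torch.onnx.export` (the tensor and axis names below are illustrative assumptions, not the actual Swin export config):

```python
# Sketch: declaring a dynamic batch dimension for ONNX export.
# Tensor names ("pixel_values", "last_hidden_state") and the axis label
# "batch" are assumptions for illustration, not the real Swin config.
dynamic_axes = {
    "pixel_values": {0: "batch"},        # model input: axis 0 varies at runtime
    "last_hidden_state": {0: "batch"},   # model output: axis 0 varies at runtime
}
# Passed as the `dynamic_axes` argument of torch.onnx.export, this makes the
# exported graph accept any batch size instead of the one used while tracing.
```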
Yih-Dar
440bbd44aa
Update WhisperModelIntegrationTests.test_large_batched_generation ( #19472 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-10-11 14:39:24 +02:00
Yih-Dar
e1a5cc338b
Fix doctests for DeiT and TFGroupViT ( #19466 )
...
* Fix some doctests
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-10-11 14:30:42 +02:00
Yih-Dar
d7dc774a79
Fix TFGroupViT CI ( #19461 )
...
* Fix TFGroupViT CI
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-10-11 14:29:15 +02:00
Joao Gante
a293a0e8a3
CLI: add import protection to datasets ( #19470 )
2022-10-11 13:19:32 +01:00
Darío Hereñú
ae710425d2
Syntax issues (lines 126, 203) ( #19444 )
2022-10-11 08:14:21 -04:00
Guillem Orellana Trullols
335f9bcd34
Extend nested_XXX functions to mappings/dicts. ( #19455 )
...
* Extend `nested_XXX` functions to mappings/dicts.
* Update src/transformers/trainer_pt_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/trainer_pt_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/trainer_pt_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Style updated file
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-10-11 08:13:21 -04:00
Arthur
b722a6be72
Fix whisper for pipeline ( #19482 )
...
* update feature extractor params
* update attention mask handling
* fix doc and pipeline test
* add warning when skipping test
* add whisper translation and transcription test
* fix build doc test
2022-10-11 07:17:53 -04:00
Dimitre Oliveira
df8faba4db
Enabling custom TF signature draft ( #19249 )
...
* Custom TF signature draft
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Adding tf signature tests
* Fixing signature check and adding asserts
* fixing model load path
* Adjusting signature tests
* Formatting file
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Dimitre Oliveira <dimitreoliveira@Dimitres-MacBook-Air.local>
2022-10-11 10:56:08 +01:00
Lysandre
10100979ed
Dev version
2022-10-10 17:25:40 -04:00
Partho
df2f28120d
wrap forward passes with torch.no_grad() ( #19412 )
2022-10-10 15:04:10 -04:00
Partho
5f5e264a12
wrap forward passes with torch.no_grad() ( #19413 )
2022-10-10 15:03:46 -04:00
Partho
c6a928cadb
wrap forward passes with torch.no_grad() ( #19414 )
2022-10-10 15:03:24 -04:00
Partho
d739a707d9
wrap forward passes with torch.no_grad() ( #19416 )
2022-10-10 15:03:09 -04:00
Partho
870a9542be
wrap forward passes with torch.no_grad() ( #19438 )
2022-10-10 14:54:54 -04:00
Partho
692c5be74e
wrap forward passes with torch.no_grad() ( #19439 )
2022-10-10 14:54:36 -04:00
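The six commits above apply the same mechanical change across test files. The pattern is simply this (a sketch with a toy module, not the actual test code):

```python
import torch

# Toy stand-in for a model under test.
model = torch.nn.Linear(4, 2)

# Wrapping inference in torch.no_grad() disables autograd bookkeeping,
# cutting memory use in tests and making the intent explicit:
# no backward pass will ever be run on this output.
with torch.no_grad():
    output = model(torch.ones(1, 4))
```

Outside the context manager, the same forward pass would record a graph and keep activations alive for a backward pass the tests never perform.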
Yih-Dar
a7bc4221c0
fix ( #19469 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-10-10 14:35:23 -04:00
Mikail Duzenli
25cfd911d0
Fixed a non-working hyperlink in the README.md file ( #19434 )
...
* Fixed a non-working hyperlink in the README.md file
The hyperlink to the community notebooks was outdated.
* Fixing missing double slash in hyperlink
2022-10-10 12:57:28 -04:00
Bartosz Szmelczynski
9df953a855
Fix misspelled word in docstring ( #19415 )
2022-10-10 17:33:57 +01:00
Shivang Mishra
d866b4858a
Generate: corrected exponential_decay_length_penalty type hint ( #19376 )
2022-10-10 17:32:03 +01:00
amyeroberts
4dd784c32f
Fix momentum and epsilon values ( #19454 )
...
The momentum value for PyTorch and TensorFlow batch normalization layers is not equivalent. The TensorFlow value should be (1 - pytorch_momentum) in order to ensure the correct updates are applied to the running mean and running variance calculations. We wouldn't observe a difference loading a pretrained model and performing inference, but evaluation outputs would change after some training steps.
2022-10-10 15:17:41 +01:00
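The note above can be checked numerically; this sketch reproduces the two update conventions with plain scalars (a simplification, not the framework internals):

```python
# PyTorch convention:    running = (1 - m_pt) * running + m_pt * batch_stat
# TensorFlow convention: moving  = m_tf * moving  + (1 - m_tf) * batch_stat
# The two updates coincide exactly when m_tf = 1 - m_pt.

def pt_update(running, batch_stat, m_pt):
    return (1 - m_pt) * running + m_pt * batch_stat

def tf_update(moving, batch_stat, m_tf):
    return m_tf * moving + (1 - m_tf) * batch_stat

m_pt = 0.1          # PyTorch BatchNorm default momentum
m_tf = 1 - m_pt     # equivalent TF BatchNormalization momentum

print(pt_update(0.5, 2.0, m_pt))  # 0.65
print(tf_update(0.5, 2.0, m_tf))  # 0.65 -- identical running-statistics update
```

Copying the PyTorch value into TF verbatim would weight the batch statistic by 0.9 instead of 0.1, which is why the mismatch only shows up after training steps, not at inference.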
Stefano Bosisio
b0b962ccca
Add Italian translation for add_new_model.mdx ( #18713 )
...
* fix conflicts
* start translating
* proof check
* add toc
* fix errors and typos
2022-10-10 10:12:40 -04:00
Kaiyu Yang
e150c4e2fe
Fix the error message in run_t5_mlm_flax.py ( #19282 )
2022-10-10 14:51:11 +01:00
amyeroberts
e3f028f3af
Add TF whisper ( #19378 )
...
* simplify loop
* add feature extractor
* add model
* start conversion
* add dropout
* initial commit of test files
* conversion for all models
* update processor for correct padding
* update feature extraction
* update integration test logits match
* fmt: off for the logits
* on the fly mel bank
* small nit
* update test
* update tokenizer
* nit feature extraction
* update
* update tokenizer test
* adds logit processor and update tokenizer to get suppress tokens
* style
* clean convert
* revert to original modeling tf utils
* Update
* update
* nit
* clean convert file
* update tests and nits
* quality
* slow generation test
* ffn_dim to allow customization
* update readme
* add to toctree
* start fixing integration tests
* update tests and code
* fix feature extractor
* fix config tests common
* update code to fix tests
* fix feature extractor
* nit feature extraction
* update test for new feature extractor
* style
* add abstract
* large logits with custom decoder input ids
* wrap around is_torch_available
* fix feature extractor
* correct logits for whisper small.en
* nit
* fix encoder_attention_mask
* some fixes
* remove unnecessary inputs
* nits
* add normalizer file
* update test tokenization
* fix attention mask not defined
* fix generate
* remove useless encoder attention mask
* update test modeling whisper
* update config to add second non-suppress tokens
* nits on feature extractor
* nit for test tokenizers
* update tests
* update tests
* update tokenization test
* fixup
* invalidated hf token. Clean convert openai to whisper
* fix logit tests
* fixup
* Add model to README
* Fix doc tests
* clean merge
* revert toc_tree changes
* remove useless LogitProcessor
* Update whisper .mdx
* update config file doc
* update configuration docstring
* update test tokenization
* update test tokenization
* update tokenization whisper
Added copied from where needed
* update feature extraction
* nit test name
* style
* quality
* remove get suppress tokens and update non_speech tokens global variables
* Update src/transformers/models/whisper/feature_extraction_whisper.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* clean modeling whisper and test
Removed the attention mask arguments that are deprecated
* fix large test
* Add multilingual audio test, and translate test
* style
* fix large multilingual test
* nits
* add copied from for attention layer
* remove attention masks in doc
* add english normalizer
* Update docs/source/en/model_doc/whisper.mdx
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* update tokenization test
* remove copied from in whisper attention : no bias in k_proj only
* wrap around dependencies in english normalizer
* style
* correct import generation logits
* for now, wrap feature extractor with torch
* remove torch depencies for feature extraction and style
* Update src/transformers/models/whisper/convert_openai_whisper_to_tfms.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/whisper/configuration_whisper.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update docs/source/en/model_doc/whisper.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* fixup
* nit
* update logits
* style
* nit
* nits and fix final tests
* add `is_more_itertools_available` to utils
* quality
* add begin suppress tokens, suppress tokens to generate args and config
* clean SuppressTokensLogitsProcessor in generation logits
* Nit naming
* add suppressTokensAtBegin
* update tests, suppress tokens to None or correct values
* nit and style
* update RAG to fit test and generate_logit
* add copy-pasted statement on english normalizer
* add arguments to config_common_kwargs
* Update src/transformers/generation_utils.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/generation_logits_process.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* revert changes based on reviews
* update doc and nits
* Update src/transformers/models/whisper/configuration_whisper.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* more nits
* last nits
* update test configuration common
* add BART name in decoder attention mask documentation
* Update src/transformers/models/whisper/modeling_whisper.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* style
* nit
* nit
* add english.json file to git
* nits on documentation
* nit
* nits
* last styling
* add main toctree file
* remove sentence piece dependency
* clean init file
* fix tokenizer that has no dependencies on sentencepiece
* update whisper init file, nit
* remove english.json file
* add get decoder prompt id
* All weights loading
* Remove hanging pdb
* Fixup and tidy up
* Use same copied from as PT model
* Remove whitespace changes
* Remove torch references
* Tie embeddings
* Remove logits processor input to generate
* Update logit values
* revert changes and add forced logit processor
* nit
* clean normalizer
* remove protected
* Add logit processors and update generation code & tests
* Some tidy up
* Update docstring
* update
* update based on review
* Update src/transformers/models/whisper/configuration_whisper.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/models/whisper/configuration_whisper.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update to reflect changes on the PT model branch
* Tidy up
* Remove extra whitespace
* Fix test - make input ids small enough we can append
* Include upstream changes on main
* PR comments - add batch tests, remove comments & defaults
* Fix model output imports
* Update src/transformers/models/whisper/modeling_tf_whisper.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Update src/transformers/generation_tf_logits_process.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Update src/transformers/models/whisper/modeling_tf_whisper.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Update src/transformers/models/whisper/modeling_tf_whisper.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Update tests/models/whisper/test_modeling_tf_whisper.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Update src/transformers/models/whisper/modeling_tf_whisper.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Update src/transformers/models/whisper/modeling_tf_whisper.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Update docstring example
* Update src/transformers/models/whisper/modeling_tf_whisper.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
* Remove changes to adjust_logits_during_generation function
* Update src/transformers/models/whisper/modeling_tf_whisper.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Tidy up imports that don't require TF
* Update tests - skip and no more skip
* Update tests/generation/test_generation_tf_logits_process.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Update src/transformers/models/whisper/modeling_tf_whisper.py
* Update src/transformers/models/whisper/modeling_tf_whisper.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
* Add training flags
* Add (skipped) XLA generation tests
* Add embedding correctness test
* Add constant ids for generation tests
* Make logits finding a bit tidier
* Remove unused args
* xla generation enabled
* Don't skip XLA tests anymore
* Fix tests - add position ids to expected signature and update rag generation
* Undo method reorder
* Remove added whitespace
* Remove copy-paste gradient checkpoint ref
* Remove
* Trigger CI - (issue with refs when pulling)
Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: NielsRogge <niels.rogge1@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
Co-authored-by: Joao Gante <joao@huggingface.co>
2022-10-10 14:48:17 +01:00
APAVOU Clément
af69360bf9
Add OPTForQuestionAnswering ( #19402 )
...
* Add `OPTForQuestionAnswering`
- added `OPTForQuestionAnswering` class based on `BloomForQuestionAnswering`
- added `OPTForQuestionAnswering` in common tests
- all common tests pass
- make fixup done
* added docstrings for OPTForQuestionAnswering
* Fix docstrings for OPTForQuestionAnswering
2022-10-10 09:30:59 -04:00
Aritra Roy Gosthipaty
ba71bf4cae
fix: renamed variable name ( #18850 )
...
The sequence_masked variable is actually the part of the sequence that is kept unmasked for the encoder. This commit renames the variable.
2022-10-10 09:26:36 -04:00
Ryan Chan
4824741c4c
Remove dependency of Roberta in Blenderbot ( #19411 )
...
* Remove dependency of Roberta in Blenderbot
* Move Copied from statements to each method of the Roberta classes
* Remove copied from line for mask_token.setter
* update output from example in docs
2022-10-10 09:25:22 -04:00
Mohit Sharma
3080bb4754
Add onnx support for VisionEncoderDecoder ( #19254 )
...
* Add onnx support for VisionEncoderDecoder
* Add onnx support for VisionEncoderDecoder
* Removed unused import
* Rename encoder hidden state
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
* Update docstrings and removed redundant code
* Added test function for enc-dec models
* Update doc string text
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
* fixed code style
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
2022-10-10 09:20:19 -04:00
Lysandre Debut
298f6a98c2
Stop relying on huggingface_hub's private methods ( #19392 )
...
* Leverage hfh for move cache
* Style
2022-10-10 15:19:33 +02:00
wei zhao
7d5ce6802e
Fix typo in image-classification/README.md ( #19424 )
...
Fix link typo of the following content.
PyTorch version, Trainer
PyTorch version, no Trainer
2022-10-10 09:16:58 -04:00
Rak Alexey
c523a86929
fix marianMT conversion to onnx ( #19287 )
...
* fix marianMT conversion to onnx
* Update src/transformers/onnx/convert.py
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
* Update src/transformers/onnx/convert.py
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
2022-10-10 09:11:29 -04:00
Darío Hereñú
3410705730
Fixed duplicated line (paragraph #83 ) Documentation: @sgugger ( #19436 )
...
* Fixed duplicated line (paragraph #83 ) @omarespejel @sgugger
* Datasets map denomination fixed (paragraph 42)
2022-10-10 09:08:34 -04:00
Darío Hereñú
83dc49b69b
Backtick fixed (paragraph 68) ( #19440 )
2022-10-10 08:47:14 -04:00
Druhin Abrol
1241a4993b
remove RobertaConfig inheritance from MarkupLMConfig ( #19404 )
...
* remove RobertaConfig inheritance from MarkupLMConfig
* Update src/transformers/models/markuplm/configuration_markuplm.py
fixed typo in docstring
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-10-10 08:44:59 -04:00
Matt
4107445a0f
Fix repo names for ESM tests ( #19451 )
2022-10-10 13:20:00 +01:00
Yih-Dar
cbb8a37929
Skip BloomEmbeddingTest.test_embeddings for PyTorch < 1.10 ( #19261 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-10-10 10:05:30 +02:00
Yih-Dar
8b6bba54a7
Fix ViTMSNForImageClassification doctest ( #19275 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-10-10 09:51:30 +02:00
Sylvain Gugger
d92e22d1f2
Remove ref to is_pipeline_test
2022-10-07 21:38:07 -04:00
Sylvain Gugger
9ac586b3c8
Rework pipeline tests ( #19366 )
...
* Rework pipeline tests
* Try to fix Flax tests
* Try to put it before
* Use a new decorator instead
* Remove ignore marker since it doesn't work
* Filter pipeline tests
* Woopsie
* Use the filtered list
* Clean up and fake modif
* Remove init
* Revert fake modif
2022-10-07 18:01:58 -04:00
Alara Dirik
983451a13e
Improve and fix ImageSegmentationPipeline ( #19367 )
...
- Fixes the image segmentation pipeline test failures caused by changes to the postprocessing methods of supported models
- Updates the ImageSegmentationPipeline tests
- Improves docs, adds 'task' argument to optionally perform semantic, instance or panoptic segmentation
2022-10-07 23:34:41 +03:00
Vishwas
de4d71ea07
Removed Bert dependency from BertGeneration code base. ( #19370 )
...
* Copied all the code required from transformers.models.bert.modeling_bert to here
* Fixed styling issues
* Reformatted copied names with Model specific name.
* Reverted BertEncoder part as there is already a class called BertGenerationEncoder
* Added prefixes in missing places.
Co-authored-by: vishwaspai <vishwas.pai@emplay.net>
2022-10-07 13:45:24 -04:00
mustapha ajeghrir
34e0cc6d86
Make Camembert TF version independent from Roberta ( #19364 )
...
* camembert tf version independent
* fixup
* fixup, all working
* remove comments
* Adding copied from roberta
Co-authored-by: Mustapha AJEGHRIR <mustapha.ajeghrir@kleegroup.com>
2022-10-07 13:42:24 -04:00