transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-03 21:00:08 +06:00

Author	SHA1	Message	Date
Yih-Dar	857bad6e53	check paths in `utils/documentation_tests.txt` (#21315 ) * check paths in utils/documentation_tests.txt * check paths in utils/documentation_tests.txt Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-26 15:33:47 +01:00
Nicolas Patry	fd0ef8b66d	Small QoL for qa. (#21316 )	2023-01-26 14:50:09 +01:00
Wonhyeong Seo	a01dd3818f	[i18n-KO] Translated quicktour page to Korean (#20946 ) docs: ko: quicktour page review by @ArthurZucker docs: fix: remove duplicate Co-Authored-By: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-01-26 14:10:02 +01:00
Yih-Dar	31336dcf3f	Fix 2 paths in the doctest list (#21314 ) fix the list Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-26 12:07:08 +01:00
Yih-Dar	4e41b87e3d	Use `model_class.__name__` and compare against `XXX_MAPPING_NAMES` (#21304 ) * update * update all * clean up * make quality * clean up Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-26 11:31:31 +01:00
amyeroberts	d18a1cba24	Accept batched tensor of images as input to image processor (#21144 ) * Accept a batched tensor of images as input * Add to all image processors * Update oneformer	2023-01-26 10:15:26 +00:00
Arthur	6f3faf3863	[WHISPER] Small patch (#21307 ) * add small patch * update tests, forced decoder ids is not prioritary against generation config * fix two new tests	2023-01-25 22:49:23 +01:00
Nick Hill	140c6edeb9	Small fix to ExponentialDecayLengthPenalty docstring (#21308 ) Currently, it incorrectly states that the exponential_decay_length_penalty tuple parameter is optional. Also changed the corresponding type hint to be more specific.	2023-01-25 14:46:08 -05:00
Anahita Bhiwandiwalla	3a6e4a221c	Add BridgeTower model (#20775 ) * Commit with BTModel and latest HF code * Placeholder classes for BTForMLM and BTForITR * Importing Bert classes from transformers * Removed objectives.py and dist_utils.py * Removed swin_transformer.py * Add image normalization, BridgeTowerForImageAndTextRetrieval * Add center_crop * Removing bert tokenizer and LCI references * Tested config loading from HF transformers hub * Removed state_dict updates and added path to hub * Enable center crop * Getting image_size from config, renaming num_heads and num_layers * Handling max_length in BridgeTowerProcessor * Add BridgeTowerForMaskedLM * Add doc string for BridgeTowerConfig * Add doc strings for BT config, processor, image processor * Adding docs, removed swin * Removed convert_bridgetower_original_to_pytorch.py * Added doc files for bridgetower, removed is_vision * Add support attention_mask=None and BridgeTowerModelOutput * Fix formatting * Fixes with 'make style', 'make quality', 'make fixup' * Remove downstream tasks from BridgeTowerModel * Formatting fixes, add return_dict to BT models * Clean up after doc_test * Update BTModelOutput return type, fix todo in doc * Remove loss_names from init * implement tests and update tuples returned by models * Add image reference to bridgetower.mdx * after make fix-copies, make fixup, make style, make quality, make repo-consistency * Rename class names with BridgeTower prefix * Fix for image_size in BTImageProcessor * implement feature extraction bridgetower tests * Update image_mean and image_std to be list * remove unused import * Removed old comments * Rework CLIP * update config in tests followed config update * Formatting fixes * Add copied from for BridgeTowerPredictionHeadTransform * Update bridgetower.mdx * Update test_feature_extraction_bridgetower.py * Update bridgetower.mdx * BridgeTowerForMaskedLM is conditioned on image too * Add BridgeTowerForMaskedLM * Fixes * Call post_init to init weights * 
Move freeze layers into method * Remove BTFeatureExtractor, add BT under multimodal models * Remove BTFeatureExtractor, add BT under multimodal models * Code review feedback - cleanup * Rename variables * Formatting and style to PR review feedback * Move center crop after resize * Use named parameters * Style fix for modeling_bridgetower.py * Update docs/source/en/model_doc/bridgetower.mdx Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update docs/source/en/model_doc/bridgetower.mdx Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update docs/source/en/model_doc/bridgetower.mdx Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/bridgetower/modeling_bridgetower.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/bridgetower/modeling_bridgetower.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update docs/source/en/model_doc/bridgetower.mdx Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update src/transformers/models/bridgetower/modeling_bridgetower.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Rename config params, copy BERT classes, clean comments * Cleanup irtr * Replace Roberta imports, add BTTextConfig and Model * Update docs, add visionconfig, consistent arg names * make fixup * Comments for forward in BTModel and make fixup * correct tests * Remove inconsistent roberta copied from * Add BridgeTowerTextModel to dummy_pt_objects.py * Add BridgeTowerTextModel to IGNORE_NON_TESTED * Update docs for BT Text and Vision Configs * Treat BridgeTowerTextModel as a private model * BridgeTowerTextModel as private * Run make fix-copies * Adding BTTextModel to PRIVATE_MODELS * Fix for issue with BT Text and Image configs * make style changes * Update README_ja.md Add から to BridgeTower's description 
* Clean up config, .mdx and arg names * Fix init_weights. Remove nn.Sequential * Formatting and style fixes * Re-add tie_word_embeddings in config * update test implementation * update style * remove commented out * fix style * Update README with abs for BridgeTower * fix style * fix mdx file * Update bridgetower.mdx * Update img src in bridgetower.mdx * Update README.md * Update README.md * resolve style failed * Update _toctree.yml * Update README_ja.md * Removed mlp_ratio, rename feats, rename BTCLIPModel * Replace BTCLIP with BTVisionModel,pass in vision_config to BTVisionModel * Add test_initialization support * Add support for output_hidden_states * Update support for output_hidden_states * Add support for output_attentions * Add docstring for output_hidden_states * update tests * add bridgetowervisionmodel as private model * rerun the PR test * Remove model_type, pass configs to classes, renames * Change self.device to use weight device * Remove image_size * Style check fixes * Add hidden_size and num_hidden_layers to BridgeTowerTransformer * Update device setting * cosmetic update * trigger test again * trigger tests again * Update test_modeling_bridgetower.py trigger tests again * Update test_modeling_bridgetower.py * minor update * re-trigger tests * Update docs/source/en/model_doc/bridgetower.mdx Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Remove pad, update max_text_len, doc cleanup, pass eps to LayerNorm * Added copied to, some more review feedback * make fixup * Use BridgeTowerVisionEmbeddings * Code cleanup * Fixes for BridgeTowerVisionEmbeddings * style checks * re-tests * fix embedding * address comment on init file * retrigger tests * update import prepare_image_inputs * update test_image_processing_bridgetower.py to reflect test_image_processing_common.py * retrigger tests Co-authored-by: Shaoyen Tseng <shao-yen.tseng@intel.com> Co-authored-by: Tiep Le <tiep.le@intel.com> Co-authored-by: amyeroberts 
<22614925+amyeroberts@users.noreply.github.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: Tiep Le <97980157+tileintel@users.noreply.github.com>	2023-01-25 14:04:32 -05:00
Arthur	39799fbf85	[CI-Daily] replace `past` in prepare inputs for generation (#21296 ) replace `past` in prepare inputs for generation	2023-01-25 18:25:59 +01:00
Maria Khalusova	238449414f	Documentation code sample fixes (#21302 ) * Fixed the following: pipe -> pipeline out in pipe(data()) is a list of dict, not a dict * Fixed the TypeError: __init__() missing 1 required positional argument: 'key' * Added a tip: code sample requires additional libraries to run * Fixed custom config's name * added seqeval to the required libraries * fixed a missing dependency, fixed metric naming, added checkpoint to fix the datacollator * added checkpoint to fix the datacollator, added missing dependency	2023-01-25 11:33:39 -05:00
Younes Belkada	015443f42b	[Doctest] Fix `Blenderbot` doctest (#21297 ) fix blenderbot doctest - add correct expected value	2023-01-25 17:28:29 +01:00
Yih-Dar	cc714d74c4	Update `OneFormerModelIntegrationTest` expected values (#21295 ) * update values * update values * update values * Update tests/models/oneformer/test_modeling_oneformer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-01-25 17:27:02 +01:00
Younes Belkada	63b204eadd	[Hubert] Fix Hubert processing auto (#21299 ) * fix Hubert processing auto * remove unneeded space	2023-01-25 16:36:31 +01:00
Yih-Dar	de2d793e83	Fix `EfficientFormer` (#21294 ) * fix * fix checkpoint * fix style * tiny update Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-25 16:09:15 +01:00
Nicolas Patry	8788fd0ceb	Moving to cleaner tokenizer version for `oneformer`. (#21292 ) Moving to cleaner tokenizer version.	2023-01-25 15:46:10 +01:00
Arthur	255257f3ea	[Whisper] Refactor whisper (#21252 ) * update whisper logit processor * add generate for whisper * remove part of the whisper specific code from pipeline * update logit processes * major update * enforce first timestamp * update generate * add more tests * update new decoding strategy * Apply suggestions from code review * update docstring * fixup * default config will not have multilingual ar * update expected tokenizer size, see pull on the hub for whisper-tiny	2023-01-25 13:09:43 +01:00
NielsRogge	f83135eb76	[Mask2Former] Add doc tests (#21232 ) * Add doc tests * Add OneFormer resources * Fix merge * Fix style Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2023-01-25 12:34:43 +01:00
Nicolas Patry	99e7905422	Supporting `ImageProcessor` in place of `FeatureExtractor` for pipelines (#20851 ) * Fixing the pipeline with image processor. * Update the slow test. * Using only the first image processor. * Include exclusion mechanism for Image processor. * Do not handle Gitconfig, deemed as a bug. * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Remove `conversational` changes. They are not supposed to be here. * Address first row of comments. * Remove OneFormer modifications. Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-01-25 10:16:31 +01:00
NielsRogge	efdbad56ab	[GIT] Add test for batched generation (#21282 ) * Add test * Apply suggestions Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2023-01-25 10:14:18 +01:00
Steven Liu	de1ca3a0c5	Update expected values for doctest (#21284 ) update expected values	2023-01-24 13:32:31 -08:00
Frederico Tommasi Caroli	1f981215dd	Fix `TrainingArguments.label_names` docs to reflect the correct default value behaviour (#21288 ) * Update TrainingArguments.label_names docs * Change wording * Change wording	2023-01-24 14:48:24 -05:00
Sanchit Gandhi	14d058b940	[W2V2 with LM] Fix decoder test with params (#21277 )	2023-01-24 19:27:56 +01:00
Arthur	94a7edd938	[GenerationConfig] add additional kwargs handling (#21269 ) * add additional kwargs handling * fix issue when serializing * correct order of kwargs removal for serialization in from dict * add `dict_torch_dtype_to_str` in case a dtype is needed for generation * add condition when adding the kwargs : not from config * Add comment based on review Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * add test function * default None when popping arg Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>	2023-01-24 19:04:42 +01:00
Stas Bekman	9286039c2a	[examples/deepspeed] fix renamed api (#21283 )	2023-01-24 09:54:33 -08:00
Younes Belkada	e2e393c6f2	[`t5`] Fix T5 inference in `float16` + `bnb` error (#21281 ) * attempts to fix: - upcast input for `T5DenseActDense` - add the condition `self.wo.weight.dtype != torch.int8` - added tests on `test/mixed_int8` - `make fixup` * fix ci test	2023-01-24 18:14:38 +01:00
Alara Dirik	f424b09410	Fix MaskFormerImageProcessor.post_process_instance_segmentation (#21256 ) * fix instance segmentation post processing * add Mask2FormerImageProcessor	2023-01-24 18:49:29 +03:00
Hirokazu Kiyomaru	767939af52	Use `logger.info` instead of `print` to emit a logging message in `hub.py` (#21273 ) use logger.info() instead of print() to emit a debug message	2023-01-24 10:37:10 -05:00
Nicolas Patry	67316444b0	Hotfix remove tuple for git config image processor. (#21278 )	2023-01-24 16:07:50 +01:00
Matt	071529bd54	Use return_tensors="np" instead of "tf" (#21266 ) Return NP instead of TF tensors for our data loading pipeline	2023-01-24 13:37:49 +00:00
Younes Belkada	f0fc791298	[Doc] fix broken link (#21276 ) fix broken link	2023-01-24 11:18:48 +01:00
Yih-Dar	bde7378bf0	Skip `test_multi_gpu_data_parallel_forward` for `UperNetModelTest` (#21216 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-24 10:41:16 +01:00
Sylvain Gugger	7119bb052a	v4.27.0.dev0	2023-01-23 16:52:35 -05:00
Sylvain Gugger	fd5cdaeea6	Models docstring (#21225 ) * Clean all models * Style * Last to remove * address review comments * Address review comments	2023-01-23 14:33:18 -05:00
Maria Khalusova	9e86c4e193	Supported pipeline tasks update (#21268 ) * added tasks from SUPPORTED_TASKS to docstrings * make style * sorted the tasks in the docstrings in alphabetical order	2023-01-23 14:23:20 -05:00
Arthur	d8415ba42e	[Whisper] fix all issues with unk token (#21250 ) * fix all issues with unk token * fixup	2023-01-23 20:19:57 +01:00
amyeroberts	c18b4fbe9f	Add class properties with warnings (#21195 ) * Replace reduce_labels with do_reduce_labels * Replace only for __init__ and preprocess * Add class properties with warnings * Update tests	2023-01-23 18:45:27 +00:00
Arthur	b80b2218b5	[ci-daily] Fix pipeline tests (#21257 ) * use streaming dataset * fix whisper's test * add rescale argument to chunk_iter	2023-01-23 19:32:49 +01:00
Maria Khalusova	275ad9d80a	Add: TensorFlow example for semantic segmentation task guide (#21223 ) * wip: adding tf example for semantic segmentation guide * completed the working example in tf * make style * Update docs/source/en/tasks/semantic_segmentation.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/tasks/semantic_segmentation.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * fixed a callback doc links Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-01-23 13:32:15 -05:00
Maria Khalusova	2218dac5d2	Notebook examples grouping and update (#21265 ) * Split the examples by modality, added missing examples * fixed a link	2023-01-23 12:51:24 -05:00
amyeroberts	e2bd7f80d0	Update tests: replace feature extractor tests with image processor (#20768 ) * Update imports and test fetcher * Revert but keep test fetcher update * Fix imports * Fix all imports * Replace fe with ip names * Add generate kwargs to `AutomaticSpeechRecognitionPipeline` (#20952) * Add generate kwargs to AutomaticSpeechRecognitionPipeline * Add test for generation kwargs * Update image processor parameters if creating with kwargs (#20866) * Update parameters if creating with kwargs * Shallow copy to prevent mutating input * Pass all args in constructor dict - warnings in init * Fix typo * Rename tester class * Rebase and tidy up * Fixup * Use ImageProcessingSavingTestMixin * Update property ref in tests * Update property ref in tests * Update recently merged in models * Small fix Co-authored-by: bofeng huang <bofenghuang7@gmail.com>	2023-01-23 17:25:41 +00:00
amyeroberts	354ea44340	Replace reduce_labels with do_reduce_labels (#21218 ) * Replace reduce_labels with do_reduce_labels * Replace only for __init__ and preprocess * Update tests	2023-01-23 17:21:33 +00:00
Joao Gante	1eda4a4102	Generate: save generation config with the models' `.save_pretrained()` (#21264 )	2023-01-23 16:21:44 +00:00
amyeroberts	cf1a1eed70	Add missing checkpoint for doctest (#21258 )	2023-01-23 15:27:25 +00:00
Mostafa Elhoushi	5603f78fc4	Add scikit-learn dependency to train language-modeling (#21229 )	2023-01-23 09:54:45 -05:00
Kambe Hiroyuki	929111698c	Add Japanese translation installation.mdx (#21241 ) * Add Japanese translation installation.mdx * Fixed for consistency with english version	2023-01-23 15:38:30 +01:00
Yih-Dar	cb6b56859a	Fix reformer CI (#21254 ) * fix ReformerForSequenceClassification doc example * fix ReformerForMaskedLM doc example Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-23 15:34:14 +01:00
raghavanone	eaace0c668	Optimize by not computing gradients for parameters set to requires_grad=False (#21236 ) * Optimize by not computing gradients for parameters set to requires_grad=False * Make change to retrigger the build * Fix isort issue * Fix issue	2023-01-23 09:27:59 -05:00
NielsRogge	6e4d3f0859	[GIT] Convert more checkpoints (#21245 ) * Extend conversion script * Remove print statement Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2023-01-23 15:19:27 +01:00
amyeroberts	66459ce319	Add test_image_processing_common.py (#20785 ) * Add test_image_processing_common.py * Fix typo * Update imports and test fetcher * Revert but keep test fetcher update * Fix imports * Fix all imports * Formatting fix * Update tests/test_image_processing_common.py	2023-01-23 13:48:30 +00:00

11881 Commits