transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-25 07:18:58 +06:00

Author	SHA1	Message	Date
Arthur	92cfe8b074	Remove `authorized_missing_keys`in favor of _keys_to_ignore_on_load_missing (#20228 )	2022-11-15 15:12:41 +01:00
Yong woo Song	2d92001076	Typo on doctring in ElectraTokenizer (#20192 ) * chore: typo on docstring in tokenization_electra * chore: typo on docstring in tokenization_electra * update for check copies	2022-11-15 09:10:20 -05:00
amyeroberts	4c7e8d0900	Add object detection + segmentation transforms (#20003 ) * Add transforms for object detection * Update src/transformers/image_transforms.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Better var names & docstring * Remove unused var desc in docstring * Update src/transformers/image_transforms.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-11-15 12:50:03 +00:00
Younes Belkada	163ac3d3ee	Add Switch transformers (#19323 ) * first commit * add more comments * add router v1 * clean up - remove `tf` modeling files * clean up - remove `tf` modeling files * clean up * v0 routers * added more router - Implemented `ExpertsChooseMaskedRouter` - added tests - 2 more routers to implement * last router * improved docstring - completed the docstring in `router.py` - added more args in the config * v0 sparse mlp * replace wrong naming * forward pass run * update MOE layer * small router update * fixup * consistency * remove scatter router * remove abstract layer * update test and model for integration testing * v1 conversion * update * hardcode hack * all keys match * add gin conversion, without additional libraries * update conversion sctipy * delete router file * update tests wrt router deletion * fix router issues * update expert code * update, logits match, code needsREFACTORING * Refactor code Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com> * add generate tests Co-authored-by: younesbelkada <younesbelkada@gmail.com> * add support for router loss Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com> * fix forward error * refactor a bit * remove `FlaxSwitchTransformers` modules * more tests pass * Update code Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com> * fixup * fix tests * fix doc * fix doc + tokenization * fix tokenizer test * fix test * fix loss output * update code for backward pass * add loss support * update documentation * fix documentation, clean tokenizer * more doc fix, cleanup example_switch * fix failing test * fix test * fix test * fix loss issue * move layer * update doc and fix router capacity usage * fixup * add sparse mlp index for documentation on hub * fixup * test sparse mix architecture * Apply suggestions from code review * Update docs/source/en/model_doc/switch_transformers.mdx * fixup on update * fix tests * fix another test * attempt fix * Update src/transformers/models/switch_transformers/configuration_switch_transformers.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/switch_transformers/convert_switch_transformers_original_flax_checkpoint_to_pytorch.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * try * all tests pass * fix jitter noise * Apply suggestions from code review * doc tests pass * Update src/transformers/models/switch_transformers/modeling_switch_transformers.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/switch_transformers/modeling_switch_transformers.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * remove assert * change config order * fix readme japanese * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * remove parallelizable tests + add one liners * remove ONNX config * fix nits - add `T5Tokenizer` in auto mapping - remove `Switch Transformers` from ONNX supported models * remove `_get_router` * remove asserts * add check in test for `router_dtype` * add `SwitchTransformersConfig` in `run_pipeline_test` * Update tests/pipelines/test_pipelines_summarization.py * add huge model conversion script * fix slow tests - add better casting for `Linear8bitLt` - remove `torchscript` tests * add make dir * style on new script * fix nits - doctest - remove `_keys_to_ignore_on_load_unexpected` * Update src/transformers/models/switch_transformers/configuration_switch_transformers.py * add google as authors * fix year * remove last `assert` statements * standardize vertical spaces * fix failing import * fix another failing test * Remove strange àuthorized_keys` * removing todo and padding that is never used Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com> Co-authored-by: ybelkada <younes@huggingface.co> Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Arthur Zucker <arthur@huggingface.co>	2022-11-15 13:06:45 +01:00
amyeroberts	55ba31908a	Add param_name to size_dict logs & tidy (#20205 )	2022-11-15 10:52:58 +00:00
Younes Belkada	f1e8c48c5e	Add `accelerate` support for `ViT` family (#20174 ) * add `accelerate` support for `ViT` family - add `_no_split_modules` - manually cast to the right `dtype`: to change * enable `float16` for `deit` * fix `make fixup` * add `slow` test for `fp16` inference * another safety check * Update src/transformers/models/deit/modeling_deit.py	2022-11-15 11:06:01 +01:00
Arthur	11b2e45ccc	[WHISPER] Update modeling tests (#20162 ) * Update modeling tests * update tokenization test * typo * nit * fix expected attention outputs * Apply suggestions from code review Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * Update tests from review Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> * remove problematics kwargs passed to the padding function Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-15 11:04:58 +01:00
Arthur	f60eec4003	update relative positional embedding (#20203 ) * update relative positional embedding * make fix copies * add `use_cache` to list of arguments * fixup * 1line fucntion * add `test_decoder_model_past_with_large_inputs_relative_pos_emb` * add relative pos embedding test for more models * style	2022-11-15 10:46:34 +01:00
Yih-Dar	f9909fbf85	Make `ImageSegmentationPipelineTests` less flaky (#20147 ) * Fix ImageSegmentationPipelineTests * Use 0.9 * no zip * links to show images * links to show images * rebase Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-15 09:14:55 +01:00
bofeng huang	9625924c60	Update tokenizer_summary.mdx (#20135 )	2022-11-15 01:18:13 +01:00
Wonhyeong Seo	8fadfd5035	[docs] set overflowing image width to auto-scale (#20197 ) * docs: fix: set overflowing image width to auto-scale * docs: fix: new language Korean is also affected * docs: fix: unnecessary line break in index page	2022-11-15 01:13:40 +01:00
Nicolas Patry	25c451e5a0	Adding chunking for whisper (all seq2seq actually). Very crude matching algorithm. (#20104 ) * Very crude matching algorithm. * Fixing tests. * Removing comments * Adding warning + fix short matches. * Cleanup tests. * Quality. * Less noisy. * Fixup.	2022-11-14 22:32:50 +01:00
Joao Gante	938cb04789	Generate: add Bloom fixes for contrastive search (#20213 )	2022-11-14 18:34:11 +00:00
amyeroberts	fda125638f	Downgrade log warning -> info (#20202 )	2022-11-14 17:56:52 +00:00
Ming Liu	36b063ed4f	Update README.md (#20188 ) There is typo in the original hyperlink. Below is the original version: Based on the script [`run_translation_no_trainer.py`](https://github.com/huggingface/transformers/blob/main/examples/pytorch/translation/run_translationn_no_trainer.py).	2022-11-14 12:53:02 -05:00
Yih-Dar	536e60d2c7	mark `test_save_load_fast_init_from_base` as `is_flaky` (#20200 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-14 18:51:33 +01:00
Sanchit Gandhi	af1a7c8ca3	[Examples] Generalise Seq2Seq ASR to handle Whisper (#19519 ) * merge conflicts * bos and eos in datacollator * (temp) hardcode removal of attention mask * freeze encoder * actually freeze encoder * set max length / num beams according to gen kwargs * (temp) fix tests * don't pop attn mask * override return attention mask config from Hub * Hub configs updated 🤗 * final fixes * update type annotations * backward comp	2022-11-14 17:45:46 +00:00
Wonhyeong Seo	7ecb039176	feat: add i18n issue template (#20199 ) Part of #20183 docs: add relevant labels to i18n issue template fix: typo on completion count	2022-11-14 12:36:58 -05:00
Wonhyeong Seo	07d8d6e2f7	docs: translated index page to korean (#20180 ) docs: i18n: first draft of index page docs: fix: first revision of index page docs: i18n: missed section - supported frameworks docs: fix: second revision of index page review by @ArthurZucker refactor: remove untranslated files from korean docs: fix: remove untranslated references from toctree.yml feat: enable korean docs in gh actions docs: feat: add in_translation page as placeholder docs: bug: testing if internal toc need alphabet chars docs: fix: custom english anchor for non-alphanumeric headings review by @sgugger docs: i18n: translate comments on install methods in _config.py docs: refactor: more concise wording for translations	2022-11-14 12:09:21 -05:00
Arthur	c149d366bb	add _keys_to_ignore_on_load_unexpected = [r"pooler"] (#20210 )	2022-11-14 18:05:19 +01:00
Younes Belkada	8dcf494ef1	[ROC_BERT] Make CI happy (#20175 ) * fix slow test * Update tests/models/roc_bert/test_modeling_roc_bert.py	2022-11-14 18:04:25 +01:00
Joao Gante	7b55bb4540	Generate: TF sample doctest result update (#20208 )	2022-11-14 15:42:48 +00:00
IMvision12	d24e84d9ed	Pytorch type hints (#20112 ) * initial commit * Update modeling_whisper.py * Fixing Tests * modeling_vision_text_dual_encoder * modeling_vision_encoder_decoder * Update modeling_vit.py * Update modeling_vit_msn.py * Update modeling_trajectory_transformer.py * style * Update modeling_time_series_transformer.py * Update modeling_time_series_transformer.py * Update modeling_segformer.py * Update modeling_plbart.py * Update modeling_dpt.py * Update modeling_deit.py * Update modeling_dpt.py * Update modeling_esm.py * Update modeling_fnet.py * Update modeling_fnet.py * Update modeling_fnet.py * Update modeling_flava.py * Update modeling_flava.py * Update modeling_layoutlmv3.py * Update modeling_levit.py	2022-11-14 12:39:18 +00:00
Nicolas Patry	03bc6ece1b	Proposal Remove the weird `inspect` in ASR pipeline and make WhisperEncoder just nice to use. (#19571 ) * Proposal Remove the weird `inspect` in ASR pipeline and make WhisperEncoder just nice to use. It seems that accepting `attention_mask` is kind of an invariant of our models. For Seq2Seq ASR models, we had a special comment on how it actually was important to send it. `inspecting` seems pretty brittle way to handle this case. My suggestion is to simply add it as an kwarg that and just ignoring it with the docstring explaining why it's ignored. * Fixup. * Update src/transformers/models/whisper/modeling_whisper.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Doc fixing . Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2022-11-14 09:34:30 +01:00
code-with-rajeev	2308f3d42c	Update README.md (#19530 ) Fixed a grammatical error.	2022-11-14 01:36:38 -05:00
Bartosz Szmelczynski	78a471ff71	Fix tapas scatter (#20149 ) * First draft * Remove scatter dependency * Add require_torch * update vectorized sum test, add clone call * remove artifacts * fix style * fix style v2 * remove "scatter" mentions from the code base * fix isort error Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-14 01:04:26 -05:00
Matthijs Hollemans	f711d683b5	add MobileNetV2 model (#17845 ) * add model files etc for MobileNetV2 * rename files for MobileNetV1 * initial implementation of MobileNetV1 * fix conversion script * cleanup * write docs * tweaks * fix conversion script * extract hidden states * fix test cases * make fixup * fixup it all * rename V1 to V2 * fix checkpoints * fixup * implement first block + weight conversion * add remaining layers * add output stride and dilation * fixup * add tests * add deeplabv3+ head * a bit of fixup * finish deeplab conversion * add link to doc * fix issue with JIT trace in_height and in_width would be Tensor objects during JIT trace, which caused Core ML conversion to fail on the remainder op. By making them ints, the result of the padding calculation becomes a constant value. * cleanup * fix order of models * fix rebase error * remove main from doc link * add image processor * remove old feature extractor * fix converter + other issues * fixup * fix unit test * add to onnx tests (but these appear broken now) * add post_process_semantic_segmentation * use google org * remove unused imports * move args * replace weird assert	2022-11-14 01:00:10 -05:00
amyeroberts	6cc06d1739	Fix type - update any PIL.Image.Resampling (#20172 )	2022-11-11 16:55:59 +00:00
NielsRogge	cbbeca3d17	[OWL-ViT] Make model consistent with CLIP (#20144 ) * Apply fix * Fix test * Remove another argument which is not used * Fix pipeline test * Add argument back, add deprecation warning * Add warning add other location * Use warnings instead * Add num_channels to config Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>	2022-11-11 11:36:17 +01:00
Nicolas Patry	d3c0566679	Fix object-detection bug (height, width inversion). (#20167 )	2022-11-11 10:14:48 +01:00
Arthur	61a51f5f23	Add Jukebox model (replaces #16875 ) (#17826 )	2022-11-10 21:05:27 +01:00
Sylvain Gugger	9740a03f61	Skip broken test	2022-11-10 14:59:32 -05:00
Sanchit Gandhi	905e5773a3	[processor] Add 'model input names' property (#20117 ) * [processor] Add 'model input names' property * add test * no f string * add generic property method to mixin * copy to multimodal * copy to vision * tests for all audio * remove ad-hoc tests * style * fix flava test * fix test * fix processor code	2022-11-10 19:29:20 +00:00
Matt	68187c4642	Fix arg names for our models (#20166 ) * Fix arg names for our models * Clean out the other uses of "residx" in infer() * make fixup	2022-11-10 16:47:58 +00:00
Joao Gante	6dda14dc47	Generate: fix TF doctests (#20159 )	2022-11-10 15:30:39 +00:00
Yih-Dar	e0d7c831c7	Update `OnnxConfig.generate_dummy_inputs` to check `ImageProcessingMixin` (#20157 ) * Check ImageProcessingMixin in OnnxConfig.generate_dummy_inputs * Check ImageProcessingMixin in OnnxConfig.generate_dummy_inputs * Add back Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-10 16:04:51 +01:00
Matthijs Hollemans	daf4436e07	doc comment fix: Args was in wrong place (#20164 )	2022-11-10 10:02:24 -05:00
NielsRogge	9f0c72f93b	Add doc tests (#20158 ) Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>	2022-11-10 15:25:30 +01:00
Nicolas Patry	d066c3731b	Adding support for LayoutLMvX variants for `object-detection`. (#20143 ) * Adding support for LayoutLMvX variants for `object-detection`. * Revert bogs `layoutlm` feature extractor which does not exist (it was a V2 model) . * Updated condition. * Handling the comments.	2022-11-10 11:33:38 +01:00
Yih-Dar	7ec1dc8817	Add `RoCBertTokenizer` to `TOKENIZER_MAPPING_NAMES` (#20141 ) * Add RoCBertTokenizer to TOKENIZER_MAPPING_NAMES * fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-09 20:58:56 +01:00
Mario Šaško	67b3789133	Make DummyObject more robust (#20146 )	2022-11-09 12:57:27 -05:00
NielsRogge	93e14486d6	[CLIPSeg] Add resources (#20118 ) * Add resource * Add tag Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-11-09 18:31:22 +01:00
amyeroberts	f3d99e49d4	Update VisionEncoderDecoder to use an image processor (#20137 ) * TrOCR processor uses an image processor * Update VisionEncoderDecoder * Add feature_extractor_class property	2022-11-09 16:31:05 +00:00
Steven Liu	a44985b41c	add cv + audio labels (#20114 )	2022-11-09 07:40:15 -08:00
Joao Gante	f270b960d6	Generate: move generation_.py src files into generation/.py (#20096 ) * move generation_.py src files into generation/.py * populate generation.__init__ with lazy loading * move imports and references from generation.xxx.object to generation.object	2022-11-09 15:34:08 +00:00
Nicolas Patry	bac2d29a80	Attempting to test automatically the `_keys_to_ignore`. (#20042 ) * Attempting to test automatically the `_keys_to_ignore`. * Style. * First fix pass. * Moving test on its own. * Another batch. * Second round removing BatchNorm * Fixing layoutlmv{2,3} + support older Python. * Disable miss missing warning. * Removing dodgy additions. * Big pass. * mbart. * More corrections. * Fixup. * Updating test_correct_missing_keys * Add escape hatch for when the head has no extra params so doesn't need the missing keys check. * Fixing test. * Greener. * Green ! (except for weird splinter bug). * Adding a test about `named_parameters` usage. * Shorten message. * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * After rebase modifications. * More explicit condition checking. * Fixing slow tests issues. * Remove extra pdb. * Remove print. * Attempt to make failure consistent + fixing roc_bert. * Removing the seed (all tests passing with it). Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-11-09 16:03:36 +01:00
amyeroberts	d606d566ab	Update SwinForMaskedImageModeling doctest values (#20139 ) * Update doctest values * Update copy statement	2022-11-09 14:53:01 +00:00
Yih-Dar	c4cad8e301	Update `CLIPSegModelTester` (#20134 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-09 15:21:52 +01:00
Saad Mahmud	0946ed94fd	Remove BertConfig inheritance from RobertaConfig (#20124 ) * Remove BertConfig inheritance from RobertaConfig * Fix Typo: BERT to RoBERTa	2022-11-09 08:51:12 -05:00
Yih-Dar	316bf04d3d	Improve tiny model creation script (#20119 ) * Improve tiny model creation script * sort the list of models to upload Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-09 11:34:35 +01:00

... 74 75 76 77 78 ...

15053 Commits