transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-30 17:52:35 +06:00

Author	SHA1	Message	Date
Yang An	721764028e	Add Chinese-CLIP implementation (#20368 ) * init chinese-clip model from clip * init model tests and docs * implement chinese-clip into hf * implement chinese-clip into hf * implement chinese-clip into hf * implement chinese-clip into hf * implement chinese-clip into hf * update usecase example in model implementation * fix codestyle * fix model_type typo in readme * add placeholder in doc * add placeholder in doc * update the init script * update usecase * fix codestyle * update testcase * update testcase * update testcase * update testcase * update testcase * update testcase * update testcase * update testcase * update testcase * update testcase * update testcase * update testcase * forward the convert_rgb * update testcase * update testcase * update testcase * merge the recent update from clip about model_input_name property * update the doc * update the doc * update the doc * update the doc * remove unused imports * reformat code style * update the doc * fix isort style * bypass a weird failed unit test which is unrelated with my PR * update the doc * implement independent vision config class * implement independent vision model class * fix refactor bug * fix refactor bug * fix refactor bug * make style * fix refactor bug * make style * fix refactor bug * fix refactor bug * make style * fix refactor bug * fix refactor bug * doc-build restyle * implement independent text config class * implement independent text model class * implement independent text model class * make style * make fix-copies * fix refactor bug * fix refactor bug * fix refactor bug * fix refactor bug * fix refactor bug * fix refactor bug * fix refactor bug * fix refactor bug * fix refactor bug * fix refactor bug * make style * update doc * black and isort * update doc * Update src/transformers/models/chinese_clip/configuration_chinese_clip.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/auto/tokenization_auto.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * modify the model type from chinese-clip to chinese_clip * format the example comment of ChineseCLIPVisionConfig * correct the copyright comment * fix the tokenizer specification * add copied from for loss function * remove unused class * update CHINESE_CLIP_TEXT_INPUTS_DOCSTRING * update CHINESE_CLIP_INPUTS_DOCSTRING * update doc * update doc * update code comment in config * update copied from statement * make style * rename the doc file * add copied statement * remove unused attention_mask, causal_attention_mask in ChineseCLIPVisionEncoder * remove ChineseCLIPTextPreTrainedModel * fix bug * fix bug * fix bug * update doc * make style * Update src/transformers/models/chinese_clip/configuration_chinese_clip.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/chinese_clip/configuration_chinese_clip.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * update ChineseCLIPImageProcessor in image_processing_auto * fix config_class of chinesecliptextmodel * fix the test case * update the docs * remove the copied from comment for ChineseCLIPTextModel, since it has diverged from BertModel with customed config_class * update the testcase * final fix Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-30 19:22:23 +01:00
Sylvain Gugger	08b4621899	Repurpose torchdynamo training args towards torch._dynamo (#20498 ) * Repurpose torchdynamo training args towards torch._dynamo * Add doc	2022-11-30 11:10:45 -05:00
Matt	afad0c18d9	Fix TF nightly tests (#20507 ) * Fixed test_saved_model_extended * Fix TFGPT2 tests * make fixup * Make sure keras-nlp utils are available for type hinting too * Update src/transformers/testing_utils.py Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * make fixup Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>	2022-11-30 14:47:54 +00:00
Arthur	761b3fad92	Expected output for the test changed (#20493 )	2022-11-30 15:07:28 +01:00
amyeroberts	de6d19ea92	Add segmentation + object detection image processors (#20160 ) * Add transforms for object detection * DETR models + Yolos * Scrappy additions * Maskformer image processor * Fix up; MaskFormer tests * Update owlvit processor * Add to docs * OwlViT tests * Update pad logic * Remove changes to transforms * Import fn directly * Update to include pad transformation * Remove uninstended changes * Add new owlvit post processing function * Tidy up * Fix copies * Fix some copies * Include device fix * Fix scipy imports * Update _pad_image * Update padding functionality * Fix bug * Properly handle ignore index * Fix up * Remove defaults to None in docstrings * Fix docstrings & docs * Fix sizes bug * Resolve conflicts in init * Cast to float after resizing * Tidy & add size if missing * Allow kwards when processing for owlvit * Update test values	2022-11-30 10:24:03 +00:00
Pi Esposito	fb2b45e562	add in layer gpt2 tokenizer (#20421 ) * add minimal working gpt2 tokenizer * graph mode and output equivalence tests working * not today tensorflow. serialization test passing! * fix style, documentation, docstrings and all that jazz * passing consistency checks * move keras nlp to tf dependencies * fix tf modeling utils and gpt2 attention to enable compiling * fix (I hope) keras nlp dependencies * rever changes on generation * remove debug prints * remove redundant tf dummy objects * add from config, get config and max length settings to address review * let flake ignore the error on distillation you are welcome * test from config * add padding test * address sgugger review	2022-11-29 10:02:40 -05:00
amyeroberts	ae1cffaf3c	Add Donut image processor (#20425 ) * Add Donut image processor * Update src/transformers/image_transforms.py Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com> * Fix docstrings * Full var names in docstring Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com>	2022-11-29 10:38:01 +00:00
NielsRogge	6dc884abc8	[Maskformer] Add MaskFormerSwin backbone (#20344 ) * First draft * Fix backwards compatibility * More fixes * More fixes * Make backbone more general * Improve backbone * Improve test * Fix config checkpoint * Address comments * Use model_type * Address more comments * Fix special model names * Remove MaskFormerSwinModel and MaskFormerSwinPreTrainedModel from main init * Fix typo * Update backbone * Apply suggestion Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-11-28 20:33:49 +01:00
amyeroberts	321ef388fe	Include image processor in add-new-model-like (#20439 )	2022-11-28 16:46:02 +00:00
NielsRogge	0bae286de9	[AutoBackbone] Improve API (#20407 ) * Add hidden states and attentions to backbone outputs * Update ResNet * Fix more tests * Debug test * Fix test_determinism * Fix test_save_load * Remove file * Disable fx tests * Test * Add fx support for backbones Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-11-28 17:20:24 +01:00
Arthur	39a72125e7	fix both failing RoCBert tests (#20469 )	2022-11-28 17:08:57 +01:00
Lianmin Zheng	ac2f6674a3	[FLAX] Add dtype to embedding for bert/bart/opt/t5 (#20340 ) * [FLAX] Add dtype to embedding for bert/bart/opt/t5 * Fix all copies * Add a test case	2022-11-28 10:21:42 -05:00
Matt	de4159a318	More TF int dtype fixes (#20384 ) * Add a test to ensure int dummy inputs are int64 * Move the test into the existing int64 test and update a lot of existing dummies * Fix remaining dummies * Fix remaining dummies * Test for int64 serving sigs as well * Update core tests to use tf.int64 * Add better messages to the assertions * Update all serving sigs to int64 * More sneaky hiding tf.int32s * Add an optional int32 signature in save_pretrained * make fixup * Add Amy's suggestions * Switch all serving sigs back to tf.int32 * Switch all dummies to tf.int32 * Adjust tests to check for tf.int32 instead of tf.int64 * Fix base dummy_inputs dtype * Start casting to tf.int32 in input_processing * Change dtype for unpack_inputs test * Add proper tf.int32 test * Make the alternate serving signature int64	2022-11-28 13:24:44 +00:00
Matt	72b19ca680	Fix ESM checkpoints for tests (#20436 ) * Re-enable TF ESM tests, make sure we use facebook checkpoints * make fixup	2022-11-28 13:19:28 +00:00
Yih-Dar	2cdac665b0	Fix device issues in `CLIPSegModelIntegrationTest` (#20467 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-28 10:41:28 +01:00
Stas Bekman	a547d5bda5	[AnyPrecisionAdamW] test fix (#20454 )	2022-11-25 09:02:10 -08:00
Younes Belkada	a1d4563f7a	`accelerate` support for `OwlViT` (#20411 ) * `accelerate` support for `OwlViT` - added `accelerate` support - added slow `fp16` tests * apply suggestions	2022-11-25 11:20:44 +01:00
Sylvain Gugger	afce73bd9d	Fix ModelOutput instantiation when there is only one tuple (#20416 )	2022-11-23 15:09:21 -05:00
Younes Belkada	ad654e4484	[BNB] Throw `ValueError` when trying to cast or assign (#20409 ) * `bnb` ValueError when tries to cast or assign * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * remove docstrings * change error log Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-11-23 15:51:50 +01:00
raghavanone	03ae1f060b	change the way sentinel tokens can retrived (#20373 ) * change the way sentinel tokens can retrived * Fix line length for doc string * Fix line length for doc string * Add more stronger test for t5 tokenization * Format file changes * Make a stronger test for filtering sentinel tokens * fix file format issues	2022-11-23 09:35:44 -05:00
Patrick von Platen	81c46679bd	[Image Transformers] to_pil fix float edge cases (#20406 ) * Correct type checking * up	2022-11-23 13:47:59 +01:00
Joao Gante	e53331c905	Generate: fix plbart generation tests (#20391 )	2022-11-22 17:56:04 +00:00
lewtun	d2357a0133	Use tiny models for ONNX tests - text modality (#20333 ) * Use tiny ONNX models * Fix broken tests * Add tiny perceiver * Add tiny convbert	2022-11-22 17:11:17 +01:00
NielsRogge	9ef46659da	Improve backbone (#20380 ) Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-11-22 17:00:08 +01:00
Michael Nation	dfc3deafa3	Optimizes DonutProcessor token2json method for speed (#20283 ) * Optimizes DonutProcessor token2json method for speed * Applies black formatting * Updates Donut pretrained model name in test file * remaining pytorch type hints (#20217) * Update modeling_flava.py * Update modeling_markuplm.py * Update modeling_glpn.py * Update modeling_roc_bert.py * Update modeling_segformer.py * Update modeling_tapas.py * Update modeling_tapas.py * Update modeling_tapas.py * Update modeling_tapas.py * Update modeling_trocr.py * Update modeling_videomae.py * Update modeling_videomae.py * Update modeling_videomae.py * Update modeling_yolos.py * Update modeling_wav2vec2.py * Update modeling_jukebox.py * Update modeling_jukebox.py * Update modeling_jukebox.py * Update modeling_jukebox.py * Data collator for token classification pads labels column when receives pytorch tensors (#20244) * token cls data_collator pads labels column * remove walrus operator for code quality * remove redundat space * remove comment that was fixed * PR comments fix Co-authored-by: Alexander Markov <amarkov.me@gmail.com> * [Doctest] Add configuration_deformable_detr.py (#20273) * Update configuration_deformable_detr.py comment * Add DeformableDetrConfig to documentation_tests.txt * Fix summarization script (#20286) * [DOCTEST] Fix the documentation of RoCBert (#20142) * update part of the doc * add temp values, fix part of the doc * add template outputs * add correct models and outputss * style * fixup * [bnb] Let's warn users when saving 8-bit models (#20282) * add warning on 8-bit models - added tests - added wrapper * move to a private attribute - remove wrapper - changed `save_pretrained` method * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fix suggestions Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Adding `zero-shot-object-detection` pipeline doctest. (#20274) * Adding `zero-shot-object-detection` pipeline doctest. * Remove nested_simplify. * Adding doctest for `object-detection` pipeline. (#20258) * Adding doctest for `object-detection` pipeline. * Removed nested_simplify. * Image transforms functionality used instead (#20278) * Image transforms functionality used instead * Import torch * Import rather than copy * Update src/transformers/models/conditional_detr/feature_extraction_conditional_detr.py * TF: add test for `PushToHubCallback` (#20231) * test hub tf callback * create repo before cloning it * Generate: general TF XLA constrastive search are now slow tests (#20277) * move contrastive search test to slow * Fixing the doctests failures. (#20294) * Fixing the doctests failures. * Fixup. * set the default cache_enable to True, aligned with the default value in pytorch cpu/cuda amp autocast (#20289) Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> * Add docstrings for canine model (#19457) * Add docstrings for canine model * Update CanineForTokenClassification Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> * Add AutoBackbone + ResNetBackbone (#20229) * Add ResNetBackbone * Define channels and strides as property * Remove file * Add test for backbone * Update BackboneOutput class * Remove strides property * Fix docstring * Add backbones to SHOULD_HAVE_THEIR_OWN_PAGE * Fix auto mapping name * Add sanity check for out_features * Set stage names based on depths * Update to tuple Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> * Add missing report button for Example test (#20293) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> * refactor test (#20300) - simplifies the devce checking test * [Tiny model creation] deal with `ImageProcessor` (#20298) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> * Fix blender bot missleading doc (#20301) * fix the doc to specify that add_prefix_space = False * add correct expected output * remove two tokens that should not be suppressed (#20302) * [ASR Examples] Update README for Whisper (#20230) * [ASR Examples] Update README for seq2seq * add language info * add training results * re-word * Add padding image transformation (#19838) * Add padding transformation * Add in upstream changes * Update tests & docs * Code formatting tuples in docstring * Pin TensorFlow (#20313) * Pin to the right version... * Also pin TensorFlow CPU * Add AnyPrecisionAdamW optimizer (#18961) * Add AnyPrecisionAdamW optimizer * Add optim_args argument to TrainingArgs * Add tests for AnyPrecisionOptimizer * Change AnyPrecisionAdam default params to float32 * Move default_anyprecision_kwargs in trainer test * Rename AnyPrecisionAdamW * [Proposal] Breaking change `zero-shot-object-detection` for improved consistency. (#20280) * [Proposal] Breaking change `zero-shot-object-detection` for improved consistency. This is a proposal to modify the output of `zero-shot-object-detection` to provide better alignment with other pipelines. The output is now strictly the same as `object-detection` whereas before it would output lists of lists. The name `candidate_labels` is used throughout for consistency with other `zero-shot` pipelines. The pipeline is changed to `ChunkPipeline` to support batching cleanly. This removes all the lists and list of lists shenanigans, it's now a matter of the base pipeline handling all this not this specific one. Breaking change: It did remove complex calls potentials `pipe(images = [image1, image2], text_queries=[candidates1, candidates2])` to support only `pipe([{"image": image1, "candidate_labels": candidates1}, {"image": image2, "candidate_labels": candidates2}])` when dealing with lists and/or datasets. We could keep them, but it will add a lot of complexity to the code base, since the pipeline is rather young, I'd rather break to keep the code simpler, but we can revert this. Breaking change: The name of the argument is now `image` instead of `images` since it expects by default only 1 image. This is revertable like the previous one. Breaking change: The types is now simplified and flattened: `pipe(inputs) == [{object1}, {object2}]` instead of the previous `pipe(inputs) == [[{object1}, {object1}], [{object2}]]` Where the different instances would be grouped by candidate labels within lists. IMHO this is not really desirable, since it would output empty lists and is only adding superflous indirection compared to `zero-shot-object-detection`. It is relatively change free in terms of how the results, it does change computation however since now the batching is handled by the pipeline itself. It did** change the results for the small models so there seems to be a real difference in how the models handle this. * Fixing the doctests. * Behind is_torch_available. * Fix flakey test with seed (#20318) * Pin TF 2.10.1 for Push CI (#20319) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> * Remove double brackets (#20307) * remove double brackets * oops get other bracket * TF: future proof our keras imports (#20317) * future proof our tf code * parse tf versions * Add Neighborhood Attention Transformer (NAT) and Dilated NAT (DiNAT) models (#20219) * Add DiNAT * Adds DiNAT + tests * Minor fixes * Added HF model * Add natten to dependencies. * Cleanup * Minor fixup * Reformat * Optional NATTEN import. * Reformat & add doc to _toctree * Reformat (finally) * Dummy objects for DiNAT * Add NAT + minor changes Adds NAT as its own independent model + docs, tests Adds NATTEN to ext deps to ensure ci picks it up. * Remove natten from `all` and `dev-torch` deps, add manual pip install to ci tests * Minor fixes. * Fix READMEs. * Requested changes to docs + minor fixes. * Requested changes. * Add NAT/DiNAT tests to layoutlm_job * Correction to Dinat doc. * Requested changes. * organize pipelines by modality (#20306) * Fix torch device issues (#20304) * fix device issue Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> * Generate: add generation config class (#20218) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * translate zh quicktour(#20095) (#20181) * zh quicktour(#20095) * add zh to doc workflow * remove untranslation from toctree Co-authored-by: BeifangSusu <BeifangSusu@bfss.com> * Add Spanish translation of serialization.mdx (#20245) * Update _toctree and clone original content * Translate first three sections * Add more translated chapters. Only 3 more left. * Finish translation * Run style from doc-builder * Address recommended changes from reviewer * Add LayerScale to NAT/DiNAT (#20325) * Add LayerScale to NAT/DiNAT. Completely dropped the ball on LayerScale in the original PR (#20219). This is just an optional argument in both models, and is only activated for larger variants in order to provide training stability. * Add LayerScale to NAT/DiNAT. Minor error fixed. Co-authored-by: Ali Hassani <ahassanijr@gmail.com> * [Switch Transformers] Fix failing slow test (#20346) * run slow test on GPU * remove unnecessary device assignment * use `torch_device` instead * fix: "BigSicence" typo in docs (#20331) * add MobileNetV1 model (#17799) * add model files etc for MobileNetV2 rename files for MobileNetV1 initial implementation of MobileNetV1 fix conversion script cleanup write docs tweaks fix conversion script extract hidden states fix test cases make fixup fixup it all remove main from doc link fixes fix tests fix up use google org fix weird assert * fixup * use google organization for checkpoints * Generate: `model_kwargs` can also be an input to `prepare_inputs_for_generation` (#20353) * Update Special Language Tokens for PLBART (#19980) * Update Special Language Tokens for PLBART * fix format * making mapping for language codes and updating tests: * fix format * fix consistency * add assert to both tokenizer tests. * fix format * Update src/transformers/models/plbart/tokenization_plbart.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * improvin readability, setting self.tgt_lang * fixing * readability Co-authored-by: jordiclive <jordiclive19@imperial.ac.uk> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Add resources (#20296) Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> * Enhance HfArgumentParser functionality and ease of use (#20323) * Enhance HfArgumentParser * Fix type hints for older python versions * Fix and add tests (+formatting) * Add changes * doc-builder formatting * Remove unused import "Call" * Add Audio Spectogram Transformer (#19981) * First draft * Make conversion script work * Add id2label mapping, run code quality * Fix copies * Add first draft of feature extractor * Update conversion script to use feature extractor * Make more tests pass * Add docs * update input_features to input_values + pad by default to max length * Fix doc tests * Add feature extractor tests * Add proper padding/truncation to feature extractor * Add support for conversion of all audioset checkpoints * Improve docs and extend conversion script * Fix README * Rename spectogram to spectrogram * Fix copies * Add integration test * Remove dummy conv * Update to ast * Update organization * Fix init * Rename model to AST * Add require_torchaudio annotator * Move import of ASTFeatureExtractor under a is_speech_available * Fix rebase * Add pipeline config * Update name of classifier head * Rename time_dimension and frequency_dimension for clarity * Remove print statement * Fix pipeline test * Fix pipeline test * Fix index table * Fix init * Fix conversion script * Rename to ForAudioClassification * Fix index table Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> * Add inference section to task guides (#18781) * 📝 start adding inference section to task guides * ✨ make style * 📝 add multiple choice * add rest of inference sections * make style * add compute_metric, push_to_hub, pipeline * make style * add updated sequence and token classification * make style * make edits in token classification * add audio classification * make style * add asr * make style * add image classification * make style * add summarization * make style * add translation * make style * add multiple choice * add language modeling * add qa * make style * review and edits * apply reviews * make style * fix call to processor * apply audio reviews * update to better asr model * make style * Fix toctree for Section 3 in Spanish Documentation (#20360) * Order and group topics in the right section * Translate "Computer Vision" Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> Co-authored-by: IMvision12 <88665786+IMvision12@users.noreply.github.com> Co-authored-by: Alexander Markov <almarkv@yandex.ru> Co-authored-by: Alexander Markov <amarkov.me@gmail.com> Co-authored-by: Saad Mahmud <shuvro.mahmud79@gmail.com> Co-authored-by: Zachary Mueller <muellerzr@gmail.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by: Wang, Yi <yi.a.wang@intel.com> Co-authored-by: raghavanone <115454562+raghavanone@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com> Co-authored-by: atturaioe <76523524+atturaioe@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Ali Hassani <68103095+alihassanijr@users.noreply.github.com> Co-authored-by: BFSS <31245245+bfss@users.noreply.github.com> Co-authored-by: BeifangSusu <BeifangSusu@bfss.com> Co-authored-by: Ian C <7807897+donelianc@users.noreply.github.com> Co-authored-by: Ali Hassani <ahassanijr@gmail.com> Co-authored-by: Raj Rajhans <me@rajrajhans.com> Co-authored-by: Matthijs Hollemans <mail@hollance.com> Co-authored-by: Jordan Clive <jordan.clive19@imperial.ac.uk> Co-authored-by: jordiclive <jordiclive19@imperial.ac.uk> Co-authored-by: Konstantin Dobler <konstantin.j.dobler@gmail.com>	2022-11-22 10:40:59 -05:00
Sylvain Gugger	f3a1efd1cf	Skip failing test	2022-11-22 09:53:56 -05:00
Joao Gante	f2e7d270ec	Generate: shorter XLA contrastive search tests (#20354 )	2022-11-22 11:47:12 +00:00
NielsRogge	4973d2a04c	Add Audio Spectogram Transformer (#19981 ) * First draft * Make conversion script work * Add id2label mapping, run code quality * Fix copies * Add first draft of feature extractor * Update conversion script to use feature extractor * Make more tests pass * Add docs * update input_features to input_values + pad by default to max length * Fix doc tests * Add feature extractor tests * Add proper padding/truncation to feature extractor * Add support for conversion of all audioset checkpoints * Improve docs and extend conversion script * Fix README * Rename spectogram to spectrogram * Fix copies * Add integration test * Remove dummy conv * Update to ast * Update organization * Fix init * Rename model to AST * Add require_torchaudio annotator * Move import of ASTFeatureExtractor under a is_speech_available * Fix rebase * Add pipeline config * Update name of classifier head * Rename time_dimension and frequency_dimension for clarity * Remove print statement * Fix pipeline test * Fix pipeline test * Fix index table * Fix init * Fix conversion script * Rename to ForAudioClassification * Fix index table Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-11-21 18:58:54 +01:00
Konstantin Dobler	1e3f17b5ab	Enhance HfArgumentParser functionality and ease of use (#20323 ) * Enhance HfArgumentParser * Fix type hints for older python versions * Fix and add tests (+formatting) * Add changes * doc-builder formatting * Remove unused import "Call"	2022-11-21 12:33:37 -05:00
Jordan Clive	149483b252	Update Special Language Tokens for PLBART (#19980 ) * Update Special Language Tokens for PLBART * fix format * making mapping for language codes and updating tests: * fix format * fix consistency * add assert to both tokenizer tests. * fix format * Update src/transformers/models/plbart/tokenization_plbart.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * improvin readability, setting self.tgt_lang * fixing * readability Co-authored-by: jordiclive <jordiclive19@imperial.ac.uk> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2022-11-21 11:53:08 -05:00
Joao Gante	4cf38148dc	Generate: `model_kwargs` can also be an input to `prepare_inputs_for_generation` (#20353 )	2022-11-21 16:20:27 +00:00
Matthijs Hollemans	d21c97cc0f	add MobileNetV1 model (#17799 ) * add model files etc for MobileNetV2 rename files for MobileNetV1 initial implementation of MobileNetV1 fix conversion script cleanup write docs tweaks fix conversion script extract hidden states fix test cases make fixup fixup it all remove main from doc link fixes fix tests fix up use google org fix weird assert * fixup * use google organization for checkpoints	2022-11-21 10:21:28 -05:00
Younes Belkada	74297d0a55	[Switch Transformers] Fix failing slow test (#20346 ) * run slow test on GPU * remove unnecessary device assignment * use `torch_device` instead	2022-11-21 15:36:49 +01:00
Joao Gante	3de07473da	Generate: add generation config class (#20218 ) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-11-21 13:30:15 +00:00
Yih-Dar	8503cc7550	Fix torch device issues (#20304 ) * fix device issue Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-21 10:12:25 +01:00
Ali Hassani	fc4a993e1b	Add Neighborhood Attention Transformer (NAT) and Dilated NAT (DiNAT) models (#20219 ) * Add DiNAT * Adds DiNAT + tests * Minor fixes * Added HF model * Add natten to dependencies. * Cleanup * Minor fixup * Reformat * Optional NATTEN import. * Reformat & add doc to _toctree * Reformat (finally) * Dummy objects for DiNAT * Add NAT + minor changes Adds NAT as its own independent model + docs, tests Adds NATTEN to ext deps to ensure ci picks it up. * Remove natten from `all` and `dev-torch` deps, add manual pip install to ci tests * Minor fixes. * Fix READMEs. * Requested changes to docs + minor fixes. * Requested changes. * Add NAT/DiNAT tests to layoutlm_job * Correction to Dinat doc. * Requested changes.	2022-11-18 13:08:26 -05:00
Nicolas Patry	8e777b3ba4	[Proposal] Breaking change `zero-shot-object-detection` for improved consistency. (#20280 ) * [Proposal] Breaking change `zero-shot-object-detection` for improved consistency. This is a proposal to modify the output of `zero-shot-object-detection` to provide better alignment with other pipelines. The output is now strictly the same as `object-detection` whereas before it would output lists of lists. The name `candidate_labels` is used throughout for consistency with other `zero-shot` pipelines. The pipeline is changed to `ChunkPipeline` to support batching cleanly. This removes all the lists and list of lists shenanigans, it's now a matter of the base pipeline handling all this not this specific one. Breaking change: It did remove complex calls potentials `pipe(images = [image1, image2], text_queries=[candidates1, candidates2])` to support only `pipe([{"image": image1, "candidate_labels": candidates1}, {"image": image2, "candidate_labels": candidates2}])` when dealing with lists and/or datasets. We could keep them, but it will add a lot of complexity to the code base, since the pipeline is rather young, I'd rather break to keep the code simpler, but we can revert this. Breaking change: The name of the argument is now `image` instead of `images` since it expects by default only 1 image. This is revertable like the previous one. Breaking change: The types is now simplified and flattened: `pipe(inputs) == [{object1}, {object2}]` instead of the previous `pipe(inputs) == [[{object1}, {object1}], [{object2}]]` Where the different instances would be grouped by candidate labels within lists. IMHO this is not really desirable, since it would output empty lists and is only adding superflous indirection compared to `zero-shot-object-detection`. It is relatively change free in terms of how the results, it does change computation however since now the batching is handled by the pipeline itself. It did** change the results for the small models so there seems to be a real difference in how the models handle this. * Fixing the doctests. * Behind is_torch_available.	2022-11-18 15:57:28 +01:00
atturaioe	84c9cc6d15	Add AnyPrecisionAdamW optimizer (#18961 ) * Add AnyPrecisionAdamW optimizer * Add optim_args argument to TrainingArgs * Add tests for AnyPrecisionOptimizer * Change AnyPrecisionAdam default params to float32 * Move default_anyprecision_kwargs in trainer test * Rename AnyPrecisionAdamW	2022-11-18 09:27:08 -05:00
amyeroberts	b98269425e	Add padding image transformation (#19838 ) * Add padding transformation * Add in upstream changes * Update tests & docs * Code formatting tuples in docstring	2022-11-18 11:27:21 +00:00
Younes Belkada	4bb0764750	refactor test (#20300 ) - simplifies the devce checking test	2022-11-17 15:59:22 +01:00
NielsRogge	6b217c52e6	Add AutoBackbone + ResNetBackbone (#20229 ) * Add ResNetBackbone * Define channels and strides as property * Remove file * Add test for backbone * Update BackboneOutput class * Remove strides property * Fix docstring * Add backbones to SHOULD_HAVE_THEIR_OWN_PAGE * Fix auto mapping name * Add sanity check for out_features * Set stage names based on depths * Update to tuple Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-11-17 15:43:20 +01:00
Joao Gante	0f78529f98	Generate: general TF XLA constrastive search are now slow tests (#20277 ) * move contrastive search test to slow	2022-11-17 12:34:46 +00:00
Joao Gante	2062c28552	TF: add test for `PushToHubCallback` (#20231 ) * test hub tf callback * create repo before cloning it	2022-11-17 12:33:44 +00:00
Younes Belkada	7d65efec29	[bnb] Let's warn users when saving 8-bit models (#20282 ) * add warning on 8-bit models - added tests - added wrapper * move to a private attribute - remove wrapper - changed `save_pretrained` method * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fix suggestions Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-11-17 08:16:36 +01:00
Alexander Markov	610acc5ae9	Data collator for token classification pads labels column when receives pytorch tensors (#20244 ) * token cls data_collator pads labels column * remove walrus operator for code quality * remove redundat space * remove comment that was fixed * PR comments fix Co-authored-by: Alexander Markov <amarkov.me@gmail.com>	2022-11-16 12:18:46 -05:00
Alara Dirik	a00b7e85ea	Adds image-guided object detection support to OWL-ViT (#20136 ) Adds image-guided object detection method to OwlViTForObjectDetection class as described in the original paper. One-shot/ image-guided object detection enables users to use a query image to search for similar objects in the input image. Co-Authored-By: Dhruv Karan k4r4n.dhruv@gmail.com	2022-11-16 09:07:46 +03:00
Matt	26ec7928d0	Slightly alter Keras dummy loss (#20232 ) * Slightly alter Keras dummy loss * Slightly alter Keras dummy loss * Add sample weight to test_keras_fit * Fix test_keras_fit for datasets * Skip the sample_weight stuff for models where the model tester has no batch_size	2022-11-15 16:58:43 +00:00
Suraj Patil	7f74433814	[CLIP] allow loading projection layer in vision and text model (#18962 ) * allow loading projection in text and vision model * begin tests * finish test for CLIPTextModelTest * style * add slow tests * add new classes for projection heads * remove with_projection * add in init * add in doc * fix tests * fix some more tests * fix copies * fix docs * remove leftover from fix-copies * add the head models in IGNORE_NON_AUTO_CONFIGURED * fix docstr * fix tests * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * add docstr for models Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-11-15 17:50:07 +01:00
Sylvain Gugger	9643ecf8ca	Enable PyTorch 1.13 (#20168 ) * Try PT1.13 by removing torch scatter * Skip failing tests * Style * Remvoe testing extras for repo utils * Try with all decorators * Try to wipe the cache * Fix all tests? * Try this way * Fix comma * Update to main * Try with less deps * Quality	2022-11-15 11:33:09 -05:00
NielsRogge	b4997382da	Fix MaskformerFeatureExtractor (#20100 ) * Fix bug * Add another fix * Add print statement * Apply fix * Fix feature extractor * Fix feature extractor * Add print statements * Add print statements * Remove print statements * Add instance segmentation integration test * Add integration test for semantic segmentation * Add draft for panoptic segmentation integration test * Fix integration test for panoptic segmentation * Remove slow annotator Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-11-15 16:00:37 +01:00

1 2 3 4 5 ...

2297 Commits