transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-06 14:20:04 +06:00

Author	SHA1	Message	Date
Alara Dirik	a00b7e85ea	Adds image-guided object detection support to OWL-ViT (#20136 ) Adds image-guided object detection method to OwlViTForObjectDetection class as described in the original paper. One-shot/ image-guided object detection enables users to use a query image to search for similar objects in the input image. Co-Authored-By: Dhruv Karan k4r4n.dhruv@gmail.com	2022-11-16 09:07:46 +03:00
Ambuj Pawar	c19aa7acce	Add clip resources to the transformers documentation (#20190 ) * WIP: Added CLIP resources from HuggingFace blog * ADD: Notebooks documentation to clip * Add link straight to notebook Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Change notebook links to colab Co-authored-by: Ambuj Pawar <your_email@abc.example> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2022-11-15 13:26:46 -05:00
Saad Mahmud	5b62f8ea2b	Add to DeBERTa resources (#20155 ) * Add to DeBERTa resources * Fix mistakes with chapter number * Add fill-mask pipeline * Add sequence, token and QA pipeline * Change token classification pipeline order * Remove flax script and notebook links	2022-11-15 13:26:07 -05:00
Suraj Patil	7f74433814	[CLIP] allow loading projection layer in vision and text model (#18962 ) * allow loading projection in text and vision model * begin tests * finish test for CLIPTextModelTest * style * add slow tests * add new classes for projection heads * remove with_projection * add in init * add in doc * fix tests * fix some more tests * fix copies * fix docs * remove leftover from fix-copies * add the head models in IGNORE_NON_AUTO_CONFIGURED * fix docstr * fix tests * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * add docstr for models Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-11-15 17:50:07 +01:00
Muhammad Sakib Khan Inan	777b1bfe62	New logging support to "Trainer" Class (ClearML Logger) (#20184 ) * Init Update * ClearML Callbacks integration * update corrections * args reporting updated * {'tensorboard': False, 'pytorch': False} * ClearML Tests added * add clearml * output_uri=True in Task.init * reformatted integrations.py * reformatted and fixed * IF-ELSE statement issue on "has_clearml" resolved * Add clearml in main callback docs * Add additional clearml documentation * Update src/transformers/integrations.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Accept suggestion Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Accept suggestion Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Small change in comments * Make style clearml * Accept suggestion Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Victor Sonck <victor.sonck@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-11-15 10:08:59 -05:00
Kendall	683cbc4c34	fixed spelling error in testing.mdx (#20220 )	2022-11-15 09:40:06 -05:00
amyeroberts	4c7e8d0900	Add object detection + segmentation transforms (#20003 ) * Add transforms for object detection * Update src/transformers/image_transforms.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Better var names & docstring * Remove unused var desc in docstring * Update src/transformers/image_transforms.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-11-15 12:50:03 +00:00
Younes Belkada	163ac3d3ee	Add Switch transformers (#19323 ) * first commit * add more comments * add router v1 * clean up - remove `tf` modeling files * clean up - remove `tf` modeling files * clean up * v0 routers * added more router - Implemented `ExpertsChooseMaskedRouter` - added tests - 2 more routers to implement * last router * improved docstring - completed the docstring in `router.py` - added more args in the config * v0 sparse mlp * replace wrong naming * forward pass run * update MOE layer * small router update * fixup * consistency * remove scatter router * remove abstract layer * update test and model for integration testing * v1 conversion * update * hardcode hack * all keys match * add gin conversion, without additional libraries * update conversion sctipy * delete router file * update tests wrt router deletion * fix router issues * update expert code * update, logits match, code needsREFACTORING * Refactor code Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com> * add generate tests Co-authored-by: younesbelkada <younesbelkada@gmail.com> * add support for router loss Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com> * fix forward error * refactor a bit * remove `FlaxSwitchTransformers` modules * more tests pass * Update code Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com> * fixup * fix tests * fix doc * fix doc + tokenization * fix tokenizer test * fix test * fix loss output * update code for backward pass * add loss support * update documentation * fix documentation, clean tokenizer * more doc fix, cleanup example_switch * fix failing test * fix test * fix test * fix loss issue * move layer * update doc and fix router capacity usage * fixup * add sparse mlp index for documentation on hub * fixup * test sparse mix architecture * Apply suggestions from code review * Update docs/source/en/model_doc/switch_transformers.mdx * fixup on update * fix tests * fix another test * attempt fix * Update src/transformers/models/switch_transformers/configuration_switch_transformers.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/switch_transformers/convert_switch_transformers_original_flax_checkpoint_to_pytorch.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * try * all tests pass * fix jitter noise * Apply suggestions from code review * doc tests pass * Update src/transformers/models/switch_transformers/modeling_switch_transformers.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/switch_transformers/modeling_switch_transformers.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * remove assert * change config order * fix readme japanese * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * remove parallelizable tests + add one liners * remove ONNX config * fix nits - add `T5Tokenizer` in auto mapping - remove `Switch Transformers` from ONNX supported models * remove `_get_router` * remove asserts * add check in test for `router_dtype` * add `SwitchTransformersConfig` in `run_pipeline_test` * Update tests/pipelines/test_pipelines_summarization.py * add huge model conversion script * fix slow tests - add better casting for `Linear8bitLt` - remove `torchscript` tests * add make dir * style on new script * fix nits - doctest - remove `_keys_to_ignore_on_load_unexpected` * Update src/transformers/models/switch_transformers/configuration_switch_transformers.py * add google as authors * fix year * remove last `assert` statements * standardize vertical spaces * fix failing import * fix another failing test * Remove strange àuthorized_keys` * removing todo and padding that is never used Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com> Co-authored-by: ybelkada <younes@huggingface.co> Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Arthur Zucker <arthur@huggingface.co>	2022-11-15 13:06:45 +01:00
bofeng huang	9625924c60	Update tokenizer_summary.mdx (#20135 )	2022-11-15 01:18:13 +01:00
Wonhyeong Seo	8fadfd5035	[docs] set overflowing image width to auto-scale (#20197 ) * docs: fix: set overflowing image width to auto-scale * docs: fix: new language Korean is also affected * docs: fix: unnecessary line break in index page	2022-11-15 01:13:40 +01:00
Wonhyeong Seo	07d8d6e2f7	docs: translated index page to korean (#20180 ) docs: i18n: first draft of index page docs: fix: first revision of index page docs: i18n: missed section - supported frameworks docs: fix: second revision of index page review by @ArthurZucker refactor: remove untranslated files from korean docs: fix: remove untranslated references from toctree.yml feat: enable korean docs in gh actions docs: feat: add in_translation page as placeholder docs: bug: testing if internal toc need alphabet chars docs: fix: custom english anchor for non-alphanumeric headings review by @sgugger docs: i18n: translate comments on install methods in _config.py docs: refactor: more concise wording for translations	2022-11-14 12:09:21 -05:00
Bartosz Szmelczynski	78a471ff71	Fix tapas scatter (#20149 ) * First draft * Remove scatter dependency * Add require_torch * update vectorized sum test, add clone call * remove artifacts * fix style * fix style v2 * remove "scatter" mentions from the code base * fix isort error Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-14 01:04:26 -05:00
Matthijs Hollemans	f711d683b5	add MobileNetV2 model (#17845 ) * add model files etc for MobileNetV2 * rename files for MobileNetV1 * initial implementation of MobileNetV1 * fix conversion script * cleanup * write docs * tweaks * fix conversion script * extract hidden states * fix test cases * make fixup * fixup it all * rename V1 to V2 * fix checkpoints * fixup * implement first block + weight conversion * add remaining layers * add output stride and dilation * fixup * add tests * add deeplabv3+ head * a bit of fixup * finish deeplab conversion * add link to doc * fix issue with JIT trace in_height and in_width would be Tensor objects during JIT trace, which caused Core ML conversion to fail on the remainder op. By making them ints, the result of the padding calculation becomes a constant value. * cleanup * fix order of models * fix rebase error * remove main from doc link * add image processor * remove old feature extractor * fix converter + other issues * fixup * fix unit test * add to onnx tests (but these appear broken now) * add post_process_semantic_segmentation * use google org * remove unused imports * move args * replace weird assert	2022-11-14 01:00:10 -05:00
Arthur	61a51f5f23	Add Jukebox model (replaces #16875 ) (#17826 )	2022-11-10 21:05:27 +01:00
NielsRogge	9f0c72f93b	Add doc tests (#20158 ) Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>	2022-11-10 15:25:30 +01:00
NielsRogge	93e14486d6	[CLIPSeg] Add resources (#20118 ) * Add resource * Add tag Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-11-09 18:31:22 +01:00
Steven Liu	a44985b41c	add cv + audio labels (#20114 )	2022-11-09 07:40:15 -08:00
Joao Gante	f270b960d6	Generate: move generation_.py src files into generation/.py (#20096 ) * move generation_.py src files into generation/.py * populate generation.__init__ with lazy loading * move imports and references from generation.xxx.object to generation.object	2022-11-09 15:34:08 +00:00
amyeroberts	4eb918e656	AutoImageProcessor (#20111 ) * AutoImageProcessor skeleton * Update references * Add mapping in init * Add model image processors to __init__ for importing * Add AutoImageProcessor tests * Fix up * Image Processor documentation * Remove pdb * Update docs/source/en/model_doc/mobilevit.mdx * Update docs * Don't add whitespace on json files * Remove fixtures * Move checking model config down * Fix up * Add check for image processor * Remove FeatureExtractorMixin in docstrings * Rename model_tmpfile to config_tmpfile * Don't make None if not in image processor map	2022-11-08 19:54:41 +00:00
Weiwe Shi	efa889d2e4	Add RocBert (#20013 ) * add roc_bert * update roc_bert readme * code style * change name and delete unuse file * udpate model file * delete unuse log file * delete tokenizer fast * reformat code and change model file path * add RocBertForPreTraining * update docs * delete wrong notes * fix copies * fix make repo-consistency error * fix files are not present in the table of contents error * change RocBert -> RoCBert * add doc, add detail test Co-authored-by: weiweishi <weiweishi@tencent.com>	2022-11-08 10:03:43 -05:00
NielsRogge	258963062b	Add CLIPSeg (#20066 ) * Add first draft * Update conversion script * Improve conversion script * Improve conversion script some more * Add conditional embeddings * Add initial decoder * Fix activation function of decoder * Make decoder outputs match original implementation * Make decoder outputs match original implementation * Add more copied from statements * Improve model outputs * Fix auto tokenizer file * Fix more tests * Add test * Improve README and docs, improve conditional embeddings * Fix more tests * Remove print statements * Remove initial embeddings * Improve conversion script * Add interpolation of position embeddings * Finish addition of interpolation of position embeddings * Add support for refined checkpoint * Fix refined checkpoint * Remove unused parameter * Improve conversion script * Add support for training * Fix conversion script * Add CLIPSegFeatureExtractor * Fix processor * Fix CLIPSegProcessor * Fix conversion script * Fix most tests * Fix equivalence test * Fix README * Add model to doc tests * Use better variable name * Convert other checkpoint as well * Update config, add link to paper * Add docs * Update organization * Replace base_model_prefix with clip * Fix base_model_prefix * Fix checkpoint of config * Fix config checkpoint * Remove file * Use logits for output * Fix tests Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-11-08 10:55:47 +01:00
Tom Aarsen	6156bffa2b	Replace awkward timm link with the expected one (#20109 )	2022-11-07 13:57:39 -05:00
Steven Liu	71f772ebd0	Add new terms to the glossary (#20051 ) * add new terms * apply review	2022-11-07 10:45:27 -08:00
Tom Aarsen	d44ac47bac	docs: Fixed variables in f-strings (#20087 ) * docs: Fixed variables in f-strings * Replace unknown `block` with known `block_type` in ValueError Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Add missing torch import in docs code block Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-11-07 13:18:09 -05:00
Tom Aarsen	3222fc645b	docs: Resolve many typos in the English docs (#20088 ) * docs: Fix typo in ONNX parser help: 'tolerence' => 'tolerance' * docs: Resolve many typos in the English docs Typos found via 'codespell ./docs/source/en'	2022-11-07 09:19:04 -05:00
Tom Aarsen	b8112eddec	Replace unsupported facebookresearch/bitsandbytes (#20093 ) With https://github.com/TimDettmers/bitsandbytes, which is by the same author and is still being updated	2022-11-07 08:52:03 -05:00
Jordan Clive	3bd0007e87	Update documentation on seq2seq models with absolute positional embeddings, to be in line with Tips section for BERT and GPT2 (#20068 ) Co-authored-by: jordiclive <jordiclive19@imperial.ac.uk>	2022-11-04 11:32:44 -04:00
Matt	6e1c5786dc	Update READMEs for ESMFold and add notebooks (#20067 ) * Update READMEs for ESMFold and add notebooks * Fix PyCharm formatting * make fix-copies	2022-11-04 15:10:13 +00:00
Wang, Yi	2564f0c21d	fix jit trace error for model forward sequence is not aligned with jit.trace tuple input sequence, update related doc (#19891 ) * fix jit trace error for classification usecase, update related doc Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> * add implementation in torch 1.14.0 Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> * update_doc Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> * update_doc Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>	2022-11-03 10:50:03 -04:00
Sanchit Gandhi	06d488061f	[Whisper Tokenizer] Make more user-friendly (#19921 ) * [Whisper Tokenizer] Make more user-friendly * use property * make indexing rigorous * small clean-up * tests * skip seq2seq tests * remove multilingual arg * reorder args * collapse to one function Co-authored-by: ArthurZucker <arthur@huggingface.co> * option to override attributes Co-authored-by: ArthurZucker <arthur@huggingface.co> * add to docs * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * make comment more clear Co-authored-by: sgugger <sylvain@huggingface.co> * don't add special tokens in get_decoder_prompt_ids * add test for set_prefix_tokens Co-authored-by: ArthurZucker <arthur@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: sgugger <sylvain@huggingface.co>	2022-11-03 14:22:40 +00:00
Yih-Dar	9ccea7acb1	Fix some doctests after PR 15775 (#20036 ) * Add skip_special_tokens=True in some doctest * For T5 * Fix for speech_to_text.mdx Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-03 14:18:45 +01:00
Steven Liu	aa39967b28	reorganize glossary (#20010 )	2022-11-02 16:58:17 -07:00
Yih-Dar	fb7cbe236b	Fix doctest (#20023 ) * Fix doctest Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-02 19:37:25 +01:00
amyeroberts	a6b7759880	Add Image Processors (#19796 ) * Add CLIP image processor * Crop size as dict too * Update warning * Actually use logger this time * Normalize doesn't change dtype of input * Add perceiver image processor * Tidy up * Add DPT image processor * Add Vilt image processor * Tidy up * Add poolformer image processor * Tidy up * Add LayoutLM v2 and v3 imsge processors * Tidy up * Add Flava image processor * Tidy up * Add deit image processor * Tidy up * Add ConvNext image processor * Tidy up * Add levit image processor * Add segformer image processor * Add in post processing * Fix up * Add ImageGPT image processor * Fixup * Add mobilevit image processor * Tidy up * Add postprocessing * Fixup * Add VideoMAE image processor * Tidy up * Add ImageGPT image processor * Fixup * Add ViT image processor * Tidy up * Add beit image processor * Add mobilevit image processor * Tidy up * Add postprocessing * Fixup * Fix up * Fix flava and remove tree module * Fix image classification pipeline failing tests * Update feature extractor in trainer scripts * Update pad_if_smaller to accept tuple and int size * Update for image segmentation pipeline * Update src/transformers/models/perceiver/image_processing_perceiver.py Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com> * Update src/transformers/image_processing_utils.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/beit/image_processing_beit.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * PR comments - docstrings; remove accidentally added resize; var names * Update docstrings * Add exception if size is not in the right format * Fix exception check * Fix up * Use shortest_edge in tuple in script Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>	2022-11-02 11:57:36 +00:00
Steven Liu	79c720c062	fix typo (#20006 )	2022-11-01 11:30:36 -07:00
Steven Liu	ab74ac11e4	Add LayoutLMv3 resource (#19932 ) * add layoutlmv3 resource * add layoutlmv2 resources * fix button	2022-11-01 11:10:46 -07:00
Steven Liu	dec8578e70	Add BERT resources (#19852 ) * add resources for bert * add course chapters * apply reviews * add pipeline icons and community resource * fix buttons	2022-11-01 11:09:53 -07:00
Steven Liu	1f6885bad0	add dataset (#20005 )	2022-11-01 10:37:20 -07:00
Sayak Paul	c87ae86a8f	Update image_classification.mdx (#19996 )	2022-11-01 07:54:41 -04:00
Mohit Sharma	c796b6dea6	Added onnx config whisper (#19525 ) * Added onnx config whisper * added whisper support onnx * add audio input data * added whisper support onnx * fixed the seqlength value * Updated the whisper onnx ocnfig * restore files to old version * removed attention mask from inputs * Updated get_dummy_input_onnxruntime docstring * Updated relative imports and token generation * update docstring	2022-11-01 07:50:42 -04:00
Matt	7f9b7b3f0e	Add ESMFold (#19977 ) * initial commit * First draft that gets outputs without crashing! * Add all the ported openfold dependencies * testing * Restructure config files for ESMFold * Debugging to find output discrepancies * Mainly style * Make model runnable without extra deps * Remove utils and merge them to the modeling file * Use correct gelu and remove some debug prints * More cleanup * Update esm docs * Update conversion script to support ESMFold properly * Port some top-level changes from ESMFold repo * Expand EsmFold docstrings * Make attention_mask optional (default to all 1s) * Add inference test for ESMFold * Use config and not n kwargs * Add modeling output class * Remove einops * Remove chunking in ESM FFN * Update tests for ESMFold * Quality * REpo consistency * Remove tree dependency from ESMFold * make fixup * Add an error in case my structure map function breaks later * Remove needless code * Stop auto-casting the LM to float16 so CPU tests pass * Stop auto-casting the LM to float16 so CPU tests pass * Final test updates * Split test file * Copyright and quality * Unpin PyTorch to see built doc * Fix config file to_dict() method * Add some docstrings to the output * Skip TF checkpoint tests for ESM until we reupload those * make fixup * More docstrings * Unpin to get even with main * Flag example to write Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>	2022-10-31 21:32:58 -04:00
Jean Charles Kouame	6aede2d602	Tranformers documentation translation to Italian #17459 (#19988 )	2022-10-31 13:19:15 -04:00
NielsRogge	0b294c2334	[Conditional, Deformable DETR] Add postprocessing methods (#19709 ) * Add postprocessing methods * Update docs * Add fix * Add test * Add test for deformable detr postprocessing * Add post processing methods for segmentation * Update code examples * Add post_process to make the pipeline work * Apply updates Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-10-31 08:28:44 +01:00
Steven Liu	2e35bac4e7	Add wav2vec2 resources (#19931 ) * add wav2vec2 resources * apply review Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>	2022-10-28 13:28:18 -07:00
Steven Liu	9d2788b46b	add resources for distilbert (#19930 )	2022-10-28 13:16:07 -07:00
Steven Liu	b0a2c3a2d6	add resources for bart (#19928 )	2022-10-28 13:15:43 -07:00
Raghav Prabhakar	0d4c45c585	Add Onnx Config for ImageGPT (#19868 ) * add Onnx Config for ImageGPT * add generate_dummy_inputs for onnx config * add TYPE_CHECKING clause * Update doc for generate_dummy_inputs Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-10-28 09:39:53 -04:00
Steven Liu	e4132952a1	Add GPT2 resources (#19879 ) * add resources for gpt2 * add pipeline icons and community resources	2022-10-27 11:34:00 -07:00
Steven Liu	d818dd3a41	Add BLOOM resources (#19881 ) * add bloom resources * add pipeline icon	2022-10-27 11:33:52 -07:00
Steven Liu	50f5266b2c	Add T5 resources (#19878 ) * add resources for t5 * add pipeline icons and community resources	2022-10-27 11:33:37 -07:00

1 2 3 4 5 ...

1565 Commits