transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-21 21:49:06 +06:00

Author	SHA1	Message	Date
Arthur	a081f292ca	[RobertaPreLayernom] Fixes the CI daily test (#20886 ) get correct checkpoint	2022-12-23 19:55:17 +01:00
Younes Belkada	cab7799f7b	Add japanese translation of template (#20870 ) * add japanese translation of template * fix japanese translation - fix special cases - fix typos - manually translate special cases Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2022-12-23 14:39:42 +01:00
Jasmijn Bastings	efed8a2794	Add script to convert T5X T5 (v1.0 and v1.1) checkpoints to PyTorch (#20801 ) * Add script to convert T5X T5 (v1.0 and v1.1) checkpoints to PyTorch * Remove unnecessary check and update docstring * Format docstring * Fix whitespace in docstring	2022-12-23 14:36:46 +01:00
Nicolas Patry	f7f0ec2f54	Adding support for `fp16` for asr pipeline. (#20864 ) * Supporting `fp16` for asr pipeline * Adding test. * Style. * Oops. * Flake8 update ? * Fixing flake8 ? * Revert "Flake8 update ?" This reverts commit `0b917fcb52`. * Style (accidentally deleted flake8 F401.) * Move to a bigger test (no small whisper model, and s2t doesn't seem to accept torch_dtype=fp16). Also we need to use a GPU to actually compute on fp16. * Using BatchFeature capability.	2022-12-23 10:18:45 +01:00
Syed Abdul Gaffar Shakhadri	15bc776fec	Add Onnx Config for PoolFormer (#20868 ) poolformer onnx Co-authored-by: syed <syed.abdul@sandlogic.com>	2022-12-23 01:30:57 -05:00
Sourab Mangrulkar	4a4cd6cd02	having new model entries in Hindi for Hindi README (#20869 )	2022-12-23 12:00:48 +05:30
Younes Belkada	52dd2b61bf	[`MobileNet-v2`] Fix ONNX typo (#20860 ) * fix typo `onnx` * fix test	2022-12-22 18:52:54 +01:00
Younes Belkada	4d10ffd506	[`FSMT`] Make it compatible with `xxxForConditionalGeneration` models (#20825 ) * add `get_encoder` and `get_decoder` * add additional kwargs support * fix condition * add better checks * better checks * fix embed positions * better test to consider padding * fix debug statement * Apply suggestions from code review Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * add arguments on docstring Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2022-12-22 11:11:19 +01:00
dhansmair	2222740f50	change strings to f-strings in image_processing_utils.py (#20865 ) change strings to f-strings	2022-12-22 02:06:50 -05:00
Joao Gante	829e889418	Generate: post-generate config doctest fix (#20804 ) * fix doctests * revert unwanted change	2022-12-21 19:18:45 +00:00
Yih-Dar	39e620c134	Update `HubertModelIntegrationTest.test_inference_keyword_spotting` (#20863 ) fix ci Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-21 18:40:14 +01:00
Arthur	4a433e321f	Add-warning-tokenizer (#20826 ) * add fast not use warning * update	2022-12-21 18:18:34 +01:00
Arthur	76d02feadb	Fix doctest (#20843 ) * fix doc for generation, dinat, nat and prelayernorm * style * update * fix copies * use auto config and auto tokenizer Co-authored-by: sgugger <sylvain.gugger@gmail.com> * also modify roberta and the depending models Co-authored-by: sgugger <sylvain.gugger@gmail.com>	2022-12-21 16:34:31 +01:00
Mohit Sharma	aaa6296de2	Fix whisper export (#20800 ) * fix_whisper_export * update input * update input	2022-12-21 16:28:42 +01:00
Yih-Dar	3090e70857	Fix past CI by skipping `LevitModelTest.test_problem_types` (#20859 ) * Fix past CI * Fix past CI Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-21 14:29:13 +01:00
Maria Khalusova	04c560225b	Adding `evaluate` to the list of libraries required in generated notebooks (#20850 ) Adding `evaluate` to the list of libraries to be installed for every generated notebook in transformers	2022-12-21 14:04:08 +01:00
İdil Sülo	0ae58204c6	Add visual prompt to processor of CLIPSeg model (#20816 ) Adds visual_prompt argument to CLIPSegProcessor to enable image-guided segmentation	2022-12-21 15:23:45 +03:00
ValeKnappich	2da82bb4a7	fix past_key_values in GPTNeoXForCausalLM.prepare_inputs_for_generation (#20621 ) * fix past_key_values in GPTNeoXForCausalLM.prepare_inputs_for_generation * fix formatting	2022-12-21 11:46:04 +00:00
Yih-Dar	852e7ebaa2	Use `config.num_channels` in CLIP-like modeling files (#20857 ) Use config.num_channels in CLIP-like modeling files Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-21 11:51:23 +01:00
NielsRogge	d87e381f93	[Examples] Update big table (#20845 ) Update big table Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>	2022-12-21 11:34:31 +01:00
NielsRogge	9efad4efed	[Swin2SR] Add doc tests (#20829 ) * Fix doc tests * Use Auto API * Apply suggestion * Revert "Apply suggestion" This reverts commit `cd9507a866`. Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>	2022-12-21 10:09:50 +01:00
Younes Belkada	0d284bd574	Add BLIP (#20716 ) * add new model like * add v1 * v1 * v1 * vision encoder logits match * v2 * fix * add docstring * CI tests pass * fix tests * make fixup * add to `toctree` * fix processors * fix processors * fix doc * fill title * add content doc * remove from tokenization auto * fix config * change order * add `# Copied from` * few fixes - add correct license on modeling text - remove dummy argument * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * replace name * refactor a bit * more refactor * remove unused arg * make fixup + remove some `# Adapted from ...` * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * more `# Copied from` * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * now `generate` supports no prefix * remove `FeatureExtractor` * fix path * correct dependency * fix tests * few fixes * add integration tests * add correct conversion script * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * add `blip` to tokenization auto * fix docstrings * fix test + add image * remove processor from uncorrect place * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * clean up a bit * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * clean pixel mask * clean pixel mask * fix `F` * Update src/transformers/models/blip/modeling_blip.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * fix output * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * fix pad token id * remove 
`token_type_ids` * make fixup * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * make fixup * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * add comments * Update src/transformers/models/blip/modeling_blip.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * remove `token_type_ids` * make fixup * better name * replace with `image_attention_mask` * refactor * make fixup * better docstring * replace `answer_xx` * remove ununsed args * add `labels` * add `labels` * fix processing tests * make fixup * make fixup * put correct repo * remove `pad` * remove `crop` and `center_crop` * Update src/transformers/models/blip/image_processing_blip.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * fix * remove `size_divisor` * fix weights `init` * remove unneeded functions * add suggestions * minor changes - change slow test output for PT 1.13 - docstring order * replace `feature_extractor` by `image_processor` * fix doctests * fix weight init order + add fp16 slow test * add `blip` to doctest * add correct repo name and fix test * Update src/transformers/models/blip/processing_blip.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * fix tests * use `convert_to_rgb` from `image_transforms` * make fixup * fix large loading issue Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-12-21 09:39:10 +01:00
Steven Liu	3be028bc9d	Embed circle packing chart for model summary (#20791 ) * embed circle packing chart * trim whitespace from bottom * explain bubble sizes	2022-12-20 10:26:52 -08:00
Sanchit Gandhi	bd1a43b699	[S2T, Whisper] Add copied from statements (#20787 ) * [S2T, Whisper] Add copied from statements * rebase and fix-copies	2022-12-20 18:13:56 +00:00
Steven Liu	5eecf3ff17	Clarify `use_fast` parameter in docstring (#20840 ) * clarify use_fast parameter * make style * remove check frameworks, apply review	2022-12-20 08:42:26 -08:00
NielsRogge	2875fa971c	[SegFormer] Add support for segmentation masks with one label (#20279 ) * Add support for binary segmentation * Fix loss calculation and add test * Remove space * use fstring Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>	2022-12-20 16:46:50 +01:00
Yih-Dar	2280880cb7	remove unused `use_cache` in config classes (#20844 ) remove unused use_cache in config classes Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-20 16:46:43 +01:00
Matt	d0bfdd20f4	TF AdamWeightDecay fix for 2.11 (#20848 ) * Fix incorrect import for the base optimizer for AdamWeightDecay * Fix incorrect import for the base optimizer for AdamWeightDecay	2022-12-20 13:40:45 +00:00
Sanchit Gandhi	d1d3ac9403	[mBART] fix erroneous italics in docstring (#20835 ) * [mBART] fix erroneous italics in docstring * fix-copies	2022-12-20 10:23:36 +00:00
Yih-Dar	244dd0f150	Remove unused `max_position_embeddings` in config classes (#20836 ) Removed unused max_position_embeddings in config classes Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-20 10:09:34 +01:00
fzyzcjy	ae3cbbcaf6	Fix tiny typo (#20841 ) * Fix typo * Update README.md * Update run_mlm_flax_stream.py * Update README.md	2022-12-20 03:17:59 -05:00
Thomas-MMJ	7ef3f19c3c	fix typo output not ouput in bitsandbytes trainer test (#20839 ) fix typo output not ouput typo was causing an error on pytest collection	2022-12-20 03:16:26 -05:00
stanleycai95	bdb84e2bad	Add model resources for ViT (#20723 ) * Set up overall resources documentation structure * Update vit.mdx * Removing irrelevant sections on text models * Update vit.mdx * Update vit.mdx * Update vit.mdx * Update vit.mdx * Update vit.mdx * Update vit.mdx * Update vit.mdx * Update vit.mdx * Update vit.mdx * Update vit.mdx * Update vit.mdx * Update vit.mdx * Update vit.mdx * Update vit.mdx	2022-12-19 10:59:34 -08:00
Stas Bekman	f76518e56a	[clip] fix error message (#20818 ) * [clip] fix error message * sync	2022-12-19 08:25:16 -08:00
amyeroberts	76924384af	Vilt - use image_transforms pad (#20780 ) Use image_transforms pad	2022-12-19 11:43:07 +00:00
Younes Belkada	ecd7de3dff	[`Vision`] [Refactor] Initialize weights on the correct place (#20803 ) * fix nit - initialization on `_init_weights` - fix copies * add copied from	2022-12-19 10:37:14 +01:00
daquexian	6b5a8f83ce	lazy import torch._softmax_backward_data for better compatibility (#20796 ) lazy import torch._softmax_backward_data Signed-off-by: daquexian <daquexian566@gmail.com> Signed-off-by: daquexian <daquexian566@gmail.com>	2022-12-19 03:37:20 -05:00
Andreas Madsen	b4b613b102	Implement Roberta PreLayerNorm (#20305 ) * Copy RoBERTa * formatting * implement RoBERTa with prelayer normalization * update test expectations * add documentation * add convertion script for DinkyTrain weights * update checkpoint repo Unfortunately the original checkpoints assumes a hacked roberta model * add to RoBERTa-PreLayerNorm docs to toc * run utils/check_copies.py * lint files * remove unused import * fix check_repo reporting wrongly a test is missing * fix import error, caused by rebase * run make fix-copies * add RobertaPreLayerNormConfig to ROBERTA_EMBEDDING_ADJUSMENT_CONFIGS * Fix documentation <Facebook> -> Facebook Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * fixup: Fix documentation <Facebook> -> Facebook Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Add missing Flax header Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * expected_slice -> EXPECTED_SLICE Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * update copies after rebase * add missing copied from statements * make fix-copies * make prelayernorm explicit in code * fix checkpoint path for the original implementation * add flax integration tests * improve docs * update utils/documentation_tests.txt * lint files * Remove Copyright notice Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * make fix-copies * Remove EXPECTED_SLICE calculation comments Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-12-19 09:30:17 +01:00
Yih-Dar	7032e02032	Install `sentencepiece` in `DeepSpeed` CI image (#20795 ) * Install sentencepiece in DS CI image * update Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-16 18:23:46 +01:00
NielsRogge	26dd041c6e	Add Swin2SR (#19784 ) * First draft * Add more improvements * Improve forward pass * Fix layernorm * Add upscaler * More improvements * More improvements * More improvements * Improve conversion script * Add preprocessing * Make output match original implementation * Add additional attributes * Add support for more models * Support more models * Add support for real world sr * Add initial Swin2SRFeatureExtractor * Add ImageSuperResolutionOutput * Make more tests pass * Use BaseModelOutput * Fix one more test * Fix more tests * Fix another test * Fix all tests * Rename to Swin2SRImageProcessor * Fix toctree * Fix toctree * Fix rebase * Improve Swin2SRImageProcessor * Remove feature extractor file * Improve model * Improve conversion script * Fix integration test * Fix init * Fix conversion script * Address comments * Improve upsampler * Add NearestConvUpsampler * Improve pixel shuffle upsampler * Improve auxiliary upsampler * Improve conversion script * Rename conv_last to final_convolution * Fix rebase * Improve upsample module * Add padding to image processor * Fix bug * Update padding * Remove print statement and fix integration test * Improve docs * Add image processor tests * Convert all checkpoints, fix tests * Remove print statements * Fix import Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-12-16 16:24:01 +01:00
NielsRogge	7f99861218	Add Universal Segmentation class + mapping (#20766 ) * Add mapping * Add mapping to pipeline * Apply suggestions * Fix feature extractor tests * Use ForInstance, add model to universal mapping * More fixes * Remove model from deprecated objects Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-12-16 14:22:46 +01:00
Matt	e65445b4d6	Stop calling expand_1d on newer TF versions (#20786 )	2022-12-16 13:10:07 +00:00
Nicolas Patry	3ee958207a	Fix object detection2 (#20798 ) * Revert "Fixing object detection with `layoutlm` (#20776)" This reverts commit `fca66abe2a`. * Better fix for layoutlm object detection. * Style.	2022-12-16 13:25:36 +01:00
Younes Belkada	4341f4e224	[Pipeline] skip feature extraction test if in `IMAGE_PROCESSOR_MAPPING` (#20790 ) skip feature extraction test if in `IMAGE_PROCESSOR_MAPPING`	2022-12-16 12:46:58 +01:00
Yih-Dar	1543cee7c8	Recompile `apex` in `DeepSpeed` CI image (#20788 ) Recompile apex in DeepSpeed CI image Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-15 21:35:27 +01:00
amyeroberts	491e951875	Move convert_to_rgb to image_transforms module (#20784 ) * Move convert_to_rgb to image_transforms module * Fix tests	2022-12-15 18:47:04 +00:00
Joao Gante	4bc723f87d	Generate: use `GenerationConfig` as the basis for `.generate()` parametrization (#20388 ) * generate from config mvp * fix failing tests * max_time test * Load default gen config at model load time; Update docs * further documentation; add tests * adapt rag to the new structure * handle models not instantiated with from_pretained (like in tests) * better default generation config * add can_generate fn * handle legacy use case of ad hoc model config changes * initialize gen config from config in individual methods, if gen config is none * fix _get_decoder_start_token_id when called outside GenerationMixin * correct model config load order (set attr > model config > decoder config) * update rag to match latest changes * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * load gen config from model config in model.from_pretrained * fix can_generate fn * handle generate calls without a previous from_pretrained (e.g. tests) * add legacy behavior (and a warning) * lower logger severity Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-12-15 18:27:20 +00:00
Yih-Dar	b1706f6908	Install video dependency for pipeline CI (#20777 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-15 18:47:05 +01:00
Nicolas Patry	fca66abe2a	Fixing object detection with `layoutlm` (#20776 ) * Fixing object detection with layoutlm. * Fixup.	2022-12-15 18:46:43 +01:00
Younes Belkada	8891193e83	[Pipeline] fix failing bloom `pipeline` test (#20778 ) fix failing `pipeline` test	2022-12-15 18:46:00 +01:00

15053 Commits