transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-06 06:10:04 +06:00

Author	SHA1	Message	Date
Eugene Zapolsky	129011c20b	adding a tip for deepspeed integration in multi-node environment (#21459 ) * adding note concerning use_node_local_storage * overriding checkpoint.use_node_local_storage if save_on_each_node == True * add more content * add more content * improve * style --------- Co-authored-by: Stas Bekman <stas@stason.org>	2023-02-10 09:12:56 -05:00
Younes Belkada	f83942684d	[`pipeline`] A simple fix for half-precision & 8bit models (#21479 ) * v1 fix * adapt from suggestions * make style * fix tests * add gpu tests * update docs * fix other tests * Apply suggestions from code review Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> * better fix * make fixup * better example * revert changes * proposal * more elegant solution * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-02-10 10:26:17 +01:00
Sylvain Gugger	04b2f13c37	🚨🚨🚨 Enforce single model initialization (#21431 ) * Enforce single model initialization * Add OneFormer example for problem 3 * Do it the Stas way * Actually rename the uses... * Rewrite test * Try to change the test this way * Fix all init slow/fast tests * Break connection * Fix more tests * Fix test for initialization * Remove custom test * Quality * Fix last failing tests * The end?	2023-02-09 15:46:26 -05:00
NielsRogge	d7f1e7c009	Add BLIP-2 (#21441 ) * First draft * More improvements * More improvements * Improve conversion script * Convert all weights * Make forward pass work * Make logits match * More improvements * More improvements * More improvements * Use get_input_embeddings * Improve some more * Improve model tests * Improve model tests * More improvements * Fix processor * Update files * Update prepare_inputs_for_generation * More improvements * Fix copies * More fixes * Make fixup * More improvements * Add support for seq2seq language model * More improvements * Fix test * More improvements * Improve conversion script * Remove some todo's * Fix README's * Improve conversion script * Fix generation * Fix style and remove Blip2Model * Fix model outputs * More improvements * Set eos_token_id in config * Fix quality * Small improvements * Add processor tests * More improvements * Apply suggestions * Apply suggestions * Add integration test * Update image URL * Add integration test * Fix model_type * Update style * Improve docs * Add doc tests * Fix copies * Remove tests which are passing * Improve some more * Add tests for seq2seq language models * Minor fix * Convert more checkpoints * finalize CI * Fix blip and blip2 processors * add `accelerate` support for `blip2` * clean up * make style * Update conversion script * Update conversion script some more * Update organization * revert toc file * add blip-2 to toc file * Some more improvements * Fix docstring * Improve docs --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: younesbelkada <younesbelkada@gmail.com>	2023-02-09 16:52:11 +01:00
Joao Gante	1d9c26a4b8	Generate: TF `compute_transition_scores` (#21341 )	2023-02-08 16:36:43 +00:00
Stefan Schweter	7e51a441e4	Add XLM-V to Model Doc (#21498 ) * doc: introduce new section for XLM-V model * doc: mention more details for XLM-V integration * docs: paper abstract in italics, model identifier for base model added * doc: mention new XLM-V support * auto: add XLM-V mapping * doc: run make fix-copies ;)	2023-02-07 16:43:19 -05:00
Adrian Sager La Ganga	a3034c7004	Add inverse sqrt learning rate scheduler (#21495 ) * added inverse sqrt lr scheduler * Updated get_scheduler in src/transformers/optimization.py * Updated src/transformers/__init__.py * Added inverse sqrt lr scheduler test * Updated docs/source/en/main_classes/optimizer_schedules.mdx * Ran style and quality scripts * Fix get_inverse_sqrt_schedule docstring * Comment implementation URL	2023-02-07 15:00:50 -05:00
Sylvain Gugger	67d074874d	Cleanup quality (#21493 ) * Remove mentions of flake8/isort * Clean up inits * Deall with all other inits * Last special rule for dummy files	2023-02-07 12:27:31 -05:00
Yih-Dar	479322bfaa	A new test to check config attributes being used (#21453 ) * Add a new test to check config attributes being used * Add a new test to check config attributes being used * Add a new test to check config attributes being used * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions * Update allowed cases - part 1 * Update allowed cases - part 2 * final --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-02-07 17:49:30 +01:00
Matt	28ec07d8ad	Typos/fixes to link syntax (#21450 ) * Typos/fixes to link syntax * Trying section headers * Add header formatting for Rule #3	2023-02-07 15:19:19 +00:00
Younes Belkada	fa0ae17958	[`Doc`] Fix int8 docs (#21487 ) fix int8 docs	2023-02-07 15:09:27 +01:00
Sylvain Gugger	5b49376202	Deprecate parallelize API (#21448 ) * Deprecate parallelize API * Add documentation * Fix copies	2023-02-06 19:39:13 -05:00
Sylvain Gugger	6f79d26442	Update quality tooling for formatting (#21480 ) * Result of black 23.1 * Update target to Python 3.7 * Switch flake8 to ruff * Configure isort * Configure isort * Apply isort with line limit * Put the right black version * adapt black in check copies * Fix copies	2023-02-06 18:10:56 -05:00
lewtun	b7bb2b59f7	Add tips for generation with Int8 models (#21424 ) * Add tips for generation with Int8 models * Empty commit to trigger CI * Apply suggestions from code review Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update docs/source/en/perf_infer_gpu_one.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-02-06 20:25:40 +01:00
Nolwenn Bernard	baf4bacb1f	[i18n-fr] Translate index page to French (#21458 ) * Translate index page to French * Fix indent * Fix toctree * Replace missing file by in_translation * Add index * Update docs/source/fr/index.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-02-06 12:25:49 -05:00
Irene López	7dbee87e09	Fix `PushToHubCallback` import in Share a model docs (#21457 ) docs: update PushToHubCallback import in docs	2023-02-06 09:26:22 -05:00
Jinen Setpal	5ac1c7ea85	Added documentation for DagsHubCallback (#21452 ) updated documentation	2023-02-06 09:24:18 -05:00
jianan-gu	ae31831879	Add perf numbers for perf_train_cpu (#20974 ) * Update perf_train_cpu.mdx * Update perf_train_cpu.mdx * Update perf_train_cpu.mdx * Update docs/source/en/perf_train_cpu.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update perf_train_cpu.mdx * Update perf_train_cpu.mdx * Update perf_train_cpu.mdx * Update perf_train_cpu.mdx --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-02-06 09:20:43 -05:00
Matt	833174c929	Add tutorial doc for TF + TPU (#21429 ) * Add tutorial doc for TF + TPU * Fix all those extra asterisks in the markdown * Use the actual Tip formatting * Remove unnecessary spaces * Reformat checklist * Fix checklist and reformat tips slightly * Update docs/source/en/perf_train_tpu_tf.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/en/perf_train_tpu_tf.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/en/perf_train_tpu_tf.mdx Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update docs/source/en/perf_train_tpu_tf.mdx Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Add link to TPU notebook in the notebooks list * Add links to the TPU notebook in the tutorial doc * Make the markdown table a bit less wild * Fix notebook link * More notebook links * More fixes to wild tables --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-02-03 19:07:42 +00:00
Matthijs Hollemans	e4bacf6614	[WIP] add SpeechT5 model (#18922 ) * make SpeechT5 model by copying Wav2Vec2 * add paper to docs * whoops added docs in wrong file * remove SpeechT5Tokenizer + put CTC back in the name * remove deprecated class * remove unused docstring * delete SpeechT5FeatureExtractor, use Wav2Vec2FeatureExtractor instead * remove classes we don't need right now * initial stab at speech encoder prenet * add more speech encoder prenet stuff * improve SpeechEncoderPrenet * add encoder (not finished yet) * add relative position bias to self-attention * add encoder CTC layers * fix formatting * add decoder from BART, doesn't work yet * make it work with generate loop * wrap the encoder into a speech encoder class * wrap the decoder in a text decoder class * changed my mind * changed my mind again ;-) * load decoder weights, make it work * add weights for text decoder postnet * add SpeechT5ForCTC model that uses only the encoder * clean up EncoderLayer and DecoderLayer * implement _init_weights in SpeechT5PreTrainedModel * cleanup config + Encoder and Decoder * add head + cross attention masks * improve doc comments * fixup * more cleanup * more fixup * TextDecoderPrenet works now, thanks Kendall * add CTC loss * add placeholders for other pre/postnets * add type annotation * fix freeze_feature_encoder * set padding tokens to 0 in decoder attention mask * encoder attention mask downsampling * remove features_pen calculation * disable the padding tokens thing again * fixup * more fixup * code review fixes * rename encoder/decoder wrapper classes * allow checkpoints to be loaded into SpeechT5Model * put encoder into wrapper for CTC model * clean up conversion script * add encoder for TTS model * add speech decoder prenet * add speech decoder post-net * attempt to reconstruct the generation loop * add speech generation loop * clean up generate_speech * small tweaks * fix forward pass * enable always dropout on speech decoder prenet * sort declaration * rename models * fixup * fix copies * more fixup * make consistency checker happy * add Seq2SeqSpectrogramOutput class * doc comments * quick note about loss and labels * add HiFi-GAN implementation (from Speech2Speech PR) * rename file * add vocoder to TTS model * improve vocoder * working on tokenizer * more better tokenizer * add CTC tokenizer * fix decode and batch_code in CTC tokenizer * fix processor * two processors and feature extractors * use SpeechT5WaveformFeatureExtractor instead of Wav2Vec2 * cleanup * more cleanup * even more fixup * notebooks * fix log-mel spectrograms * support reduction factor * fixup * shift spectrograms to right to create decoder inputs * return correct labels * add labels for stop token prediction * fix doc comments * fixup * remove SpeechT5ForPreTraining * more fixup * update copyright headers * add usage examples * add SpeechT5ProcessorForCTC * fixup * push unofficial checkpoints to hub * initial version of tokenizer unit tests * add slow test * fix failing tests * tests for CTC tokenizer * finish CTC tokenizer tests * processor tests * initial test for feature extractors * tests for spectrogram feature extractor * fixup * more fixup * add decorators * require speech for tests * modeling tests * more tests for ASR model * fix imports * add fake tests for the other models * fixup * remove jupyter notebooks * add missing SpeechT5Model tests * add missing tests for SpeechT5ForCTC * add missing tests for SpeechT5ForTextToSpeech * sort tests by name * fix Hi-Fi GAN tests * fixup * add speech-to-speech model * refactor duplicate speech generation code * add processor for SpeechToSpeech model * add usage example * add tests for speech-to-speech model * fixup * enable gradient checkpointing for SpeechT5FeatureEncoder * code review * push_to_hub now takes repo_id * improve doc comments for HiFi-GAN config * add missing test * add integration tests * make number of layers in speech decoder prenet configurable * rename variable * rename variables * add auto classes for TTS and S2S * REMOVE CTC!!! * S2S processor does not support save/load_pretrained * fixup * these models are now in an auto mapping * fix doc links * rename HiFiGAN to HifiGan, remove separate config file * REMOVE auto classes * there can be only one * fixup * replace assert * reformat * feature extractor can process input and target at same time * update checkpoint names * fix commit hash	2023-02-03 12:43:46 -05:00
Avi Singhal	0df802822c	Added model resources for LayoutLM Issue#19848 (#21377 ) * updated resources for LayoutLM * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * fixed formatting, removed extra section --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-02-03 08:53:16 -05:00
Steven Liu	fbee82951f	Update task summary (#21067 ) * first draft of audio section * make style * first draft of computer vision section * add convnext and encoder tasks * finish up nlp tasks * minor edits * add arch images, more edits * fix image links * apply sanchit feedback * model naming convention * apply niels vit feedback * replace detr for segmentation with mask2former * apply feedback * apply feedback	2023-02-02 11:41:27 -08:00
Steven Liu	0a75717602	Fix task guide formatting (#21409 ) fix formatting	2023-02-02 10:06:26 -08:00
Maria Khalusova	65b5035a1d	Moved LiLT under multimodal models in TOC (#21393 ) moved LiLT under multimodal models	2023-02-01 08:03:00 -05:00
NielsRogge	c21298a69b	[Docs] Minor fixes (#21383 ) * Improve docs * Add DETA resources --------- Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2023-01-31 15:13:12 +01:00
NielsRogge	5451f8896c	Add DETA (#20983 ) * First draft * Add initial draft of conversion script * Convert all weights * Fix config * Add image processor * Fix DetaImageProcessor * Run make fix copies * Remove timm dependency * Fix dummy objects * Improve loss function * Remove conv_encoder attribute * Update conversion scripts * Improve postprocessing + docs * Fix copied from statements * Add tests * Improve postprocessing * Improve postprocessing * Update READMEs * More improvements * Fix rebase * Add is_torchvision_available * Add torchvision dependency * Fix typo and README * Fix bug * Add copied from * Fix style * Apply suggestions * Fix thanks to @ydshieh * Fix another dependency check * Simplify image processor * Add scipy * Improve code * Add threshold argument * Fix bug * Set default threshold * Improve integration test * Add another integration test * Update setup.py * Address review * Improve deformable attention function * Improve copied from * Use relative imports * Address review * Replace assertions * Address review * Update dummies * Remove dummies * Address comments, update READMEs * Remove custom kernel code * Add image processor tests * Add requires_backends * Add minor comment * Update scripts * Update organization name * Fix defaults, add doc tests * Add id2label for object 365 * Fix tests * Update task guide	2023-01-31 10:43:10 +01:00
BFSS	95be242adc	translate index to zh(#20095 ) (#21351 ) translate index to zh Co-authored-by: bfss <bfss@bfss.com>	2023-01-30 16:50:57 -05:00
Adit Krishnan	914e5009fa	Adding resource section to GPT-J docs (#21270 ) * Added resource section to GPT-J docs * Added most of the links found * Addressing review comments * Fixing formatting * Update docs/source/en/model_doc/gptj.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Fixing one of the labels --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-01-30 16:48:04 -05:00
Maria Khalusova	73a2ff6974	Automated compatible models list for task guides (#21338 ) * initial commit. added tip placeholders and a script * removed unused imports, fixed paths * fixed generated links * make style * split language modeling doc into two: causal language modeling and masked language modeling * added check_task_guides.py to make fix-copies * review feedback addressed	2023-01-27 13:19:28 -05:00
Wonhyeong Seo	a01dd3818f	[i18n-KO] Translated quicktour page to Korean (#20946 ) docs: ko: quicktour page review by @ArthurZucker docs: fix: remove duplicate Co-Authored-By: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-01-26 14:10:02 +01:00
Anahita Bhiwandiwalla	3a6e4a221c	Add BridgeTower model (#20775 ) * Commit with BTModel and latest HF code * Placeholder classes for BTForMLM and BTForITR * Importing Bert classes from transformers * Removed objectives.py and dist_utils.py * Removed swin_transformer.py * Add image normalization, BridgeTowerForImageAndTextRetrieval * Add center_crop * Removing bert tokenizer and LCI references * Tested config loading from HF transformers hub * Removed state_dict updates and added path to hub * Enable center crop * Getting image_size from config, renaming num_heads and num_layers * Handling max_length in BridgeTowerProcessor * Add BridgeTowerForMaskedLM * Add doc string for BridgeTowerConfig * Add doc strings for BT config, processor, image processor * Adding docs, removed swin * Removed convert_bridgetower_original_to_pytorch.py * Added doc files for bridgetower, removed is_vision * Add support attention_mask=None and BridgeTowerModelOutput * Fix formatting * Fixes with 'make style', 'make quality', 'make fixup' * Remove downstream tasks from BridgeTowerModel * Formatting fixes, add return_dict to BT models * Clean up after doc_test * Update BTModelOutput return type, fix todo in doc * Remove loss_names from init * implement tests and update tuples returned by models * Add image reference to bridgetower.mdx * after make fix-copies, make fixup, make style, make quality, make repo-consistency * Rename class names with BridgeTower prefix * Fix for image_size in BTImageProcessor * implement feature extraction bridgetower tests * Update image_mean and image_std to be list * remove unused import * Removed old comments * Rework CLIP * update config in tests followed config update * Formatting fixes * Add copied from for BridgeTowerPredictionHeadTransform * Update bridgetower.mdx * Update test_feature_extraction_bridgetower.py * Update bridgetower.mdx * BridgeTowerForMaskedLM is conditioned on image too * Add BridgeTowerForMaskedLM * Fixes * Call post_init to init weights * Move freeze layers into method * Remove BTFeatureExtractor, add BT under multimodal models * Remove BTFeatureExtractor, add BT under multimodal models * Code review feedback - cleanup * Rename variables * Formatting and style to PR review feedback * Move center crop after resize * Use named parameters * Style fix for modeling_bridgetower.py * Update docs/source/en/model_doc/bridgetower.mdx Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update docs/source/en/model_doc/bridgetower.mdx Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update docs/source/en/model_doc/bridgetower.mdx Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/bridgetower/modeling_bridgetower.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/bridgetower/modeling_bridgetower.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update docs/source/en/model_doc/bridgetower.mdx Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update src/transformers/models/bridgetower/modeling_bridgetower.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Rename config params, copy BERT classes, clean comments * Cleanup irtr * Replace Roberta imports, add BTTextConfig and Model * Update docs, add visionconfig, consistent arg names * make fixup * Comments for forward in BTModel and make fixup * correct tests * Remove inconsistent roberta copied from * Add BridgeTowerTextModel to dummy_pt_objects.py * Add BridgeTowerTextModel to IGNORE_NON_TESTED * Update docs for BT Text and Vision Configs * Treat BridgeTowerTextModel as a private model * BridgeTowerTextModel as private * Run make fix-copies * Adding BTTextModel to PRIVATE_MODELS * Fix for issue with BT Text and Image configs * make style changes * Update README_ja.md Add から to BridgeTower's description * Clean up config, .mdx and arg names * Fix init_weights. Remove nn.Sequential * Formatting and style fixes * Re-add tie_word_embeddings in config * update test implementation * update style * remove commented out * fix style * Update README with abs for BridgeTower * fix style * fix mdx file * Update bridgetower.mdx * Update img src in bridgetower.mdx * Update README.md * Update README.md * resolve style failed * Update _toctree.yml * Update README_ja.md * Removed mlp_ratio, rename feats, rename BTCLIPModel * Replace BTCLIP with BTVisionModel,pass in vision_config to BTVisionModel * Add test_initialization support * Add support for output_hidden_states * Update support for output_hidden_states * Add support for output_attentions * Add docstring for output_hidden_states * update tests * add bridgetowervisionmodel as private model * rerun the PR test * Remove model_type, pass configs to classes, renames * Change self.device to use weight device * Remove image_size * Style check fixes * Add hidden_size and num_hidden_layers to BridgeTowerTransformer * Update device setting * cosmetic update * trigger test again * trigger tests again * Update test_modeling_bridgetower.py trigger tests again * Update test_modeling_bridgetower.py * minor update * re-trigger tests * Update docs/source/en/model_doc/bridgetower.mdx Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Remove pad, update max_text_len, doc cleanup, pass eps to LayerNorm * Added copied to, some more review feedback * make fixup * Use BridgeTowerVisionEmbeddings * Code cleanup * Fixes for BridgeTowerVisionEmbeddings * style checks * re-tests * fix embedding * address comment on init file * retrigger tests * update import prepare_image_inputs * update test_image_processing_bridgetower.py to reflect test_image_processing_common.py * retrigger tests Co-authored-by: Shaoyen Tseng <shao-yen.tseng@intel.com> Co-authored-by: Tiep Le <tiep.le@intel.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: Tiep Le <97980157+tileintel@users.noreply.github.com>	2023-01-25 14:04:32 -05:00
Maria Khalusova	238449414f	Documentation code sample fixes (#21302 ) * Fixed the following: pipe -> pipeline out in pipe(data()) is a list of dict, not a dict * Fixed the TypeError: __init__() missing 1 required positional argument: 'key' * Added a tip: code sample requires additional libraries to run * Fixed custom config's name * added seqeval to the required libraries * fixed a missing dependency, fixed metric naming, added checkpoint to fix the datacollator * added checkpoint to fix the datacollator, added missing dependency	2023-01-25 11:33:39 -05:00
NielsRogge	f83135eb76	[Mask2Former] Add doc tests (#21232 ) * Add doc tests * Add OneFormer resourcesé * Fix merge * Fix style Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2023-01-25 12:34:43 +01:00
Steven Liu	de1ca3a0c5	Update expected values for doctest (#21284 ) update expected values	2023-01-24 13:32:31 -08:00
Alara Dirik	f424b09410	Fix MaskFormerImageProcessor.post_process_instance_segmentation (#21256 ) * fix instance segmentation post processing * add Mask2FormerImageProcessor	2023-01-24 18:49:29 +03:00
Younes Belkada	f0fc791298	[Doc] fix broken link (#21276 ) fix broken link	2023-01-24 11:18:48 +01:00
Maria Khalusova	275ad9d80a	Add: TensorFlow example for semantic segmentation task guide (#21223 ) * wip: adding tf example for semantic segmentation guide * completed the working example in tf * make style * Update docs/source/en/tasks/semantic_segmentation.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/tasks/semantic_segmentation.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * fixed a callback doc links Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-01-23 13:32:15 -05:00
Kambe Hiroyuki	929111698c	Add Japanese translation installation.mdx (#21241 ) * Add Japanese translation installation.mdx * Fixed for consistency with english version	2023-01-23 15:38:30 +01:00
Steven Liu	142ad1a1cc	Fix task summary doctest (#21200 ) * add outputs to code snippets * fix example text * apply feedback * style changes * make style	2023-01-20 09:58:07 -08:00
Steven Liu	2553363826	Fix code example in training tutorial (#21201 ) change text to sentence	2023-01-20 07:38:15 -08:00
Joao Gante	af37d183b3	Generate: documented function to compute the transition scores (#21191 ) Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-01-20 12:50:01 +00:00
Bartosz Szmelczynski	1b37fb5e17	Efficientformer (#20459 ) - Adds EfficientFormer V1 to transformers - PR co-authored by @novice03 and @Bearnardd Co-authored-by: novice <pranavpulijala@gmail.com> Co-authored-by: novice <44259234+novice03@users.noreply.github.com>	2023-01-20 11:35:42 +03:00
Clémentine Fourrier	87208a05af	Graphormer model for Graph Classification (#20968 ) * [FT] First commit for graphormer architecture. The model has no tokenizer, as it uses a collator and preprocessing function for its input management. Architecture to be tested against original one. The arch might need to be changed to fit the checkpoint, but a revert to the original arch will make the code less nice to read. TODO: doc * [FIX] removed test model * [FIX] import error * [FIX] black and flake * [DOC] added paper refs * [FIX] [DOC] * [FIX] black * [DOC] Updated READMEs * [FIX] Order of imports + rm Tokenizer calls * [FIX] Moved assert in class to prevent doc build failure * [FIX] make fix-copies * [Doc] update from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * [FIX] Removed Graphormer from Sequence classification model list * [DOC] Added HF copyright to Cython file * [DOC] Fixed comments * [FIX] typos in class doc + removed config classes. Todo: update doc from paper definitions * [FIX] Removed dependency to fairseq, and replaced all asserts with Exception management * [FIX] Homogeneized initialization of weights to pretrained constructor * [FIX] [CP] Updated multi_hop parameter to get same results as in original implementation * [DOC] Relevant parameter description in the configuration file * [DOC] Updated doc and comments in main graphormer file * [FIX] make style and quality checks * [DOC] Fix doc format * [FIX] [WIP] Updated part of the tests, though still a wip * [FIX] [WIP] * [FIX] repo consistency * [FIX] Changed input names for more understandability * [FIX] [BUG] updated num_classes params for propagation in the model * simplified collator * [FIX] Updated tests to follow new naming pattern * [TESTS] Updated test suite along with model * \|FIX] rm tokenizer import * [DOC] add link to graphormerdoc * Changed section in doc from text model to graph model * Apply suggestions from code review Spacing, inits Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * [DOC] Explain algos_graphormer functions * Cython soft import protection * Rm call to Callable in configuration graphormer * [FIX] replaced asserts with Exceptions * Add org to graphormer checkpoints * Prefixed classes with Graphormer * Management of init functions * format * fixes * fix length file * update indent * relaunching ci * Errors for missing cython imports * fix style * fix style doc Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-01-19 13:05:59 -05:00
Kambe Hiroyuki	705e332b46	Add Japanese translation index.mdx (#21186 ) * Add Japanese translation index.mdx * Fix the year of the license * Change the models list to Japanese	2023-01-19 17:53:28 +01:00
Maria Khalusova	0359e2e15f	Updates to computer vision section of the Preprocess doc (#21181 ) * Extended the CV preprocessing section with more details and refactored the example * added padding to the CV section, though it is a special case * Added a tip about post processing methods * make style * link update * Apply suggestions from review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * review feedback Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-01-19 08:43:36 -05:00
Jitesh Jain	5b949623c7	Add OneFormer Model (#20577 ) * Add Oneformer Model * Add OneFormer Tests * Add UNIVERSAL_SEGMENTATION_MAPPING * Fix config * 🐛 Fix error encountered while writing tests * 🔨 Fix instance segmentation post processing * Format Files and Add Documentation * Add Documentation mdx file * Run make fixup * Run make fix-copies * Remove unnecessary code * Format modeling_oneformer.py * Add OneFormer to ImageSegmentationPipeline * Format files * Add Demo link to Readme * Fix fomatting errors * Fix test failures * Update Table in index.mdx * Fix version * Fix style * Remove OneFormer from TF * Fix Imports * Fix dummy objects * Fix tests * Add newline * Remove OneFormerFeatureExtractor * Remove CUDA Kernels * Use AutoBackbone for Swin * Fix description * Use Image Processor * Fix copies * Fix formatting * Fix import order * Fix flake8 errors * Fix doc errors * Add Hindi Readme entry * Update supported backbones * Update supported backbones * Undo Changes * Fix type of config * Fix isort * Fix auto.mdx * Fix swin config * Replace DinatBackbone with AutoBackbone * Use SwinBackbone * Use SwinBackbone * Fix conversion script * Fix arguments * Add argument description * Fix style * Add OneFormerProcessor * Fix OneFormerProcessor Tests * Fix mapping * Fix imports * Fix inits * Fix style * Fix comment * Fix docstring * Move OneFormer to MultiModal * Fix Copies * Remove size divisor * Fix check_repo.py * Fix copies * Add Processor for Testing Pipeline * Fix padding for tokens * Fix variables * Fix formatting with correct black version * Add Image Processor Test * Apply suggestions * Revert common modeling * Add check for task * Fix conversion script * Fix initialization order * Fix tests * Undo Pipeline Changes * Fix layers in MLP * Fix copies * Update image paths * Fix copies * Apply suggestions	2023-01-19 09:31:07 +01:00
Matt	00ba7cadd8	Rewrite a couple of lines in the TF XLA doc (#21177 ) * Rewrite a couple of lines in the TF XLA doc to explain that jit_compile can be used in model.compile() too * Remove extra )	2023-01-18 17:53:05 +00:00
Samuel Xu	defdcd2862	Remove Roberta Dependencies from XLM Roberta Flax and Tensorflow models (#21047 ) * Added flax model code * Added tf changes * missed some * Added copy comments * Added style hints * Fixed copy statements * Added suggested fixes * Made some fixes * Style fixup * Added necessary copy statements * Fixing copy statements * Added more copies * Final copy fix * Some bugfixes * Adding imports to init * Fixed up all make fixup errors * Fixed doc errors * Auto model changes	2023-01-18 07:49:39 -05:00
Younes Belkada	023f51fe16	`blip` support for training (#21021 ) * `blip` support for training * remove labels creation * remove unneeded `decoder_input_ids` creation * final changes - add colab link to documentation - reduction = mean for loss * fix nits * update link * clearer error message	2023-01-18 11:24:37 +01:00
Shogo Hida	14154f7238	Add Japanese translation to multilingual.mdx (#21084 ) * Create toctree for Japanese translations Signed-off-by: Shogo Hida <shogo.hida@gmail.com> * Copy English version Signed-off-by: Shogo Hida <shogo.hida@gmail.com> * Add Japanese translations Signed-off-by: Shogo Hida <shogo.hida@gmail.com> * Add Japanese translations Signed-off-by: Shogo Hida <shogo.hida@gmail.com> Signed-off-by: Shogo Hida <shogo.hida@gmail.com>	2023-01-18 10:08:18 +01:00

1 2 3 4 5 ...

1696 Commits