transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

History

Sayak Paul 2d9853b226 MSN (Masked Siamese Networks) for ViT (#18815 ) * feat: modeling and conversion scripts for msn. * chore: change license year. * chore: remove unneeded modules. * feat: direct loading of state_dict from remote url. * fix: import paths. * add: rest of the files. * add and fix rest of the files. Co-authored-by: Niels <niels.rogge1@gmail.com> * chore: formatting. * code quality fix. * chore: remove pooler. * feat: add classification top. * fix: configuration object. * add: initial test cases (one failing). * fix: basemodeloutput. * add: caution on using the classification head. * add: rest of the model related files. * add: vit msn readme. * fix: copied from statement. * fix: dummy objects. * add: ViTMSNPreTrainedModel to inits. * fix: repo consistency. * minor change in the model doc. * fix: tests. * Empty-Commit * Update src/transformers/models/vit_msn/configuration_vit_msn.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * address PR comments. * Update src/transformers/models/vit_msn/modeling_vit_msn.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * chore: put model in no_grad() and formatting. Co-authored-by: Niels <niels.rogge1@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>		2022-09-22 07:15:03 -04:00
..
albert	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
auto	Make AutoProcessor a magic loading class for all modalities (#18963 )	2022-09-14 07:36:12 -04:00
bart	TF: TF 2.10 unpin + related onnx test skips (#18995 )	2022-09-12 19:30:27 +01:00
barthez	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
bartpho	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
beit	Fix BeitFeatureExtractor postprocessing (#19119 )	2022-09-20 18:53:40 +03:00
bert	TF: TF 2.10 unpin + related onnx test skips (#18995 )	2022-09-12 19:30:27 +01:00
bert_generation	Black preview (#17217 )	2022-05-12 16:25:55 -04:00
bert_japanese	Black preview (#17217 )	2022-05-12 16:25:55 -04:00
bertweet	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
big_bird	Use higher value for hidden_size in Flax BigBird test (#17822 )	2022-06-24 19:31:30 +02:00
bigbird_pegasus	Black preview (#17217 )	2022-05-12 16:25:55 -04:00
blenderbot	Update serving code to enable `saved_model=True` (#18153 )	2022-07-22 18:05:38 +01:00
blenderbot_small	Update serving code to enable `saved_model=True` (#18153 )	2022-07-22 18:05:38 +01:00
bloom	fix arg name in BLOOM testing and remove unused arg document (#18843 )	2022-09-15 20:25:32 +02:00
bort	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
byt5	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
camembert	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
canine	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
clip	Update serving code to enable `saved_model=True` (#18153 )	2022-07-22 18:05:38 +01:00
codegen	Update expected values in CodeGen tests (#17888 )	2022-07-01 15:33:36 +02:00
conditional_detr	Add support for conditional detr (#18948 )	2022-09-22 09:45:04 +02:00
convbert	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
convnext	FX support for ConvNext, Wav2Vec2 and ResNet (#19053 )	2022-09-16 10:57:41 +02:00
cpm	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
ctrl	Fix CTRL tests (#17508 )	2022-06-01 16:27:23 +02:00
cvt	has_attentions - consistent test skipping logic and tf tests (#17495 )	2022-06-09 09:50:03 +02:00
data2vec	skip some test_multi_gpu_data_parallel_forward (#18188 )	2022-07-20 15:54:44 +02:00
deberta	fix `train_new_from_iterator` in the case of byte-level tokenizers (#17549 )	2022-06-08 15:30:41 +02:00
deberta_v2	Fx support for Deberta-v[1-2], Hubert and LXMERT (#17539 )	2022-06-07 18:05:20 +02:00
decision_transformer	Update expected values in DecisionTransformerModelIntegrationTest (#18016 )	2022-07-05 14:53:43 +02:00
deformable_detr	[fix] Add DeformableDetrFeatureExtractor (#19140 )	2022-09-22 09:45:24 +02:00
deit	Add TF DeiT implementation (#17806 )	2022-07-13 18:04:08 +01:00
detr	[fix] Add DeformableDetrFeatureExtractor (#19140 )	2022-09-22 09:45:24 +02:00
distilbert	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
dit	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
donut	remvoe `_create_and_check_torch_fx_tracing` in specific test files (#18667 )	2022-09-07 16:22:09 +02:00
dpr	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
dpt	Add post_process_semantic_segmentation method to DPTFeatureExtractor (#19107 )	2022-09-21 15:15:26 +03:00
electra	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
encoder_decoder	TF: use the correct config with `(...)EncoderDecoder` models (#18097 )	2022-07-22 13:31:45 +01:00
ernie	add task_type_id to BERT to support ERNIE-2.0 and ERNIE-3.0 models (#18686 )	2022-09-09 07:36:46 -04:00
flaubert	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
flava	has_attentions - consistent test skipping logic and tf tests (#17495 )	2022-06-09 09:50:03 +02:00
fnet	Black preview (#17217 )	2022-05-12 16:25:55 -04:00
fsmt	Not use -1e4 as attn mask (#17306 )	2022-06-20 16:16:16 +02:00
funnel	Update serving code to enable `saved_model=True` (#18153 )	2022-07-22 18:05:38 +01:00
glpn	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
gpt_neo	fix `train_new_from_iterator` in the case of byte-level tokenizers (#17549 )	2022-06-08 15:30:41 +02:00
gpt_neox	skip some gpt_neox tests that require 80G RAM (#17923 )	2022-07-01 09:04:38 -04:00
gpt_neox_japanese	Add support for Japanese GPT-NeoX-based model by ABEJA, Inc. (#18814 )	2022-09-14 10:17:40 -04:00
gpt2	Fixing OPT fast tokenizer option. (#18753 )	2022-09-15 17:12:58 +02:00
gptj	TF: GPT-J compatible with XLA generation (#17986 )	2022-07-06 15:02:07 +01:00
groupvit	Adding GroupViT Models (#17313 )	2022-06-28 20:51:47 +02:00
herbert	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
hubert	Fix train_step, test_step and tests for CLIP (#18684 )	2022-09-09 20:01:02 +01:00
ibert	fix `train_new_from_iterator` in the case of byte-level tokenizers (#17549 )	2022-06-08 15:30:41 +02:00
imagegpt	Enabling `imageGPT` auto feature extractor. (#16871 )	2022-05-24 12:30:46 +02:00
layoutlm	TF: TF 2.10 unpin + related onnx test skips (#18995 )	2022-09-12 19:30:27 +01:00
layoutlmv2	add warning to let the user know that the `__call__` method is faster than `encode` + `pad` for a fast tokenizer (#18693 )	2022-08-24 06:27:56 -04:00
layoutlmv3	Fix `test_tf_encode_plus_sent_to_model` for `LayoutLMv3` (#18898 )	2022-09-06 14:51:03 +02:00
layoutxlm	LayoutXLMProcessor: ensure 1-to-1 mapping between samples and images, and add test for it (#18774 )	2022-08-30 14:43:14 +02:00
led	Fix tflongformer int dtype (#18907 )	2022-09-12 17:51:10 +01:00
levit	Add skip logic for attentions test - Levit (#17633 )	2022-06-10 12:46:30 +02:00
longformer	Fix tflongformer int dtype (#18907 )	2022-09-12 17:51:10 +01:00
longt5	Skip `test_export_to_onnx` for `LongT5` if `torch` < 1.11 (#19122 )	2022-09-20 21:52:18 +02:00
luke	Adding fine-tuning models to LUKE (#18353 )	2022-08-01 11:09:47 -04:00
lxmert	Update serving code to enable `saved_model=True` (#18153 )	2022-07-22 18:05:38 +01:00
m2m_100	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
marian	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
maskformer	Fix `test_inference_instance_segmentation_head` (#17872 )	2022-06-24 19:36:45 +02:00
mbart	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
mbart50	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
mctct	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
megatron_bert	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
megatron_gpt2	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
mluke	Black preview (#17217 )	2022-05-12 16:25:55 -04:00
mobilebert	Fix train_step, test_step and tests for CLIP (#18684 )	2022-09-09 20:01:02 +01:00
mobilevit	TensorFlow MobileViT (#18555 )	2022-09-01 10:35:15 -04:00
mpnet	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
mt5	Fix expected loss values in some (m)T5 tests (#18177 )	2022-07-18 15:26:21 +02:00
mvp	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
nezha	speed up test (#18106 )	2022-07-12 04:28:28 -04:00
nllb	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
nystromformer	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
openai	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
opt	add missing `require_tf` for `TFOPTGenerationTest` (#19010 )	2022-09-13 18:10:11 +02:00
owlvit	fix owlvit tests, update docstring examples (#18586 )	2022-08-11 19:10:25 +03:00
pegasus	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
pegasus_x	Fix CI for `PegasusX` (#19025 )	2022-09-14 14:45:00 +02:00
perceiver	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
phobert	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
plbart	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
poolformer	has_attentions - consistent test skipping logic and tf tests (#17495 )	2022-06-09 09:50:03 +02:00
prophetnet	Black preview (#17217 )	2022-05-12 16:25:55 -04:00
qdqbert	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
rag	Avoid GPU OOM for a TF Rag test (#17638 )	2022-06-10 18:50:29 +02:00
realm	Black preview (#17217 )	2022-05-12 16:25:55 -04:00
reformer	Black preview (#17217 )	2022-05-12 16:25:55 -04:00
regnet	Run tests if skip condition not met (#18764 )	2022-08-30 14:03:28 +02:00
rembert	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
resnet	FX support for ConvNext, Wav2Vec2 and ResNet (#19053 )	2022-09-16 10:57:41 +02:00
retribert	fix retribert's `test_torch_encode_plus_sent_to_model` (#17231 )	2022-05-17 14:33:13 +02:00
roberta	fix `train_new_from_iterator` in the case of byte-level tokenizers (#17549 )	2022-06-08 15:30:41 +02:00
roformer	Skip RoFormer ONNX test if rjieba not installed (#16981 )	2022-05-04 10:04:10 +02:00
segformer	Add post_process_semantic_segmentation method to SegFormer (#19072 )	2022-09-21 11:40:35 +03:00
sew	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
sew_d	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
speech_encoder_decoder	send model to the correct device (#18800 )	2022-08-29 18:46:30 +02:00
speech_to_text	remvoe `_create_and_check_torch_fx_tracing` in specific test files (#18667 )	2022-09-07 16:22:09 +02:00
speech_to_text_2	Fx support for multiple model architectures (#17393 )	2022-05-31 10:02:55 +02:00
splinter	Fix Splinter test (#17854 )	2022-06-24 16:26:14 +02:00
squeezebert	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
swin	remvoe `_create_and_check_torch_fx_tracing` in specific test files (#18667 )	2022-09-07 16:22:09 +02:00
swinv2	Add swin transformer v2 (#17469 )	2022-07-27 11:14:47 -04:00
t5	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
tapas	Fix train_step, test_step and tests for CLIP (#18684 )	2022-09-09 20:01:02 +01:00
tapex	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
trajectory_transformer	Add trajectory transformer (#17141 )	2022-05-17 19:07:43 -04:00
transfo_xl	Add magic method to our TF models to convert datasets with column inference (#17160 )	2022-06-06 15:53:49 +01:00
trocr	Fx support for multiple model architectures (#17393 )	2022-05-31 10:02:55 +02:00
unispeech	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
unispeech_sat	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
van	has_attentions - consistent test skipping logic and tf tests (#17495 )	2022-06-09 09:50:03 +02:00
videomae	Add VideoMAE (#17821 )	2022-08-04 18:02:55 +02:00
vilt	Add `accelerate` support for ViLT (#18683 )	2022-09-22 13:14:39 +02:00
vision_encoder_decoder	send model to the correct device (#18800 )	2022-08-29 18:46:30 +02:00
vision_text_dual_encoder	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
visual_bert	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
vit	Improve vision models (#17731 )	2022-06-24 11:34:51 +02:00
vit_mae	TF: tests for (de)serializable models with resized tokens (#19013 )	2022-09-16 16:38:08 +01:00
vit_msn	MSN (Masked Siamese Networks) for ViT (#18815 )	2022-09-22 07:15:03 -04:00
wav2vec2	FX support for ConvNext, Wav2Vec2 and ResNet (#19053 )	2022-09-16 10:57:41 +02:00
wav2vec2_conformer	[Test] Fix W2V-Conformer integration test (#17303 )	2022-05-17 18:20:36 +02:00
wav2vec2_phoneme	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
wav2vec2_with_lm	[Wav2vec2 + LM Test] Improve wav2vec2 with lm tests and make torch version dependent for now (#18749 )	2022-08-26 14:11:55 +02:00
wavlm	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
x_clip	Add X-CLIP (#18852 )	2022-09-08 14:50:30 +02:00
xglm	remvoe `_create_and_check_torch_fx_tracing` in specific test files (#18667 )	2022-09-07 16:22:09 +02:00
xlm	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
xlm_prophetnet	Black preview (#17217 )	2022-05-12 16:25:55 -04:00
xlm_roberta	Black preview (#17217 )	2022-05-12 16:25:55 -04:00
xlm_roberta_xl	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
xlnet	Return scalar losses instead of per-sample means (#18013 )	2022-07-04 17:26:19 +01:00
yolos	[fix] Add DeformableDetrFeatureExtractor (#19140 )	2022-09-22 09:45:24 +02:00
yoso	fix `train_new_from_iterator` in the case of byte-level tokenizers (#17549 )	2022-06-08 15:30:41 +02:00
__init__.py	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00