transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-02 19:21:31 +06:00

History

Tom Aarsen 79444f370f Deprecate unused OpenLlama architecture (#24922 ) * Resolve typo in check_repo.py * Specify encoding when opening modeling files * Deprecate the OpenLlama architecture * Add disclaimer pointing to Llama I'm open to different wordings here * Match the capitalisation of LLaMA		2023-07-20 07:03:24 -04:00
..
albert	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
align	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
altclip	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
audio_spectrogram_transformer	is_batched fix for remaining 2-D numpy arrays (#23309 )	2023-05-23 14:37:35 -04:00
auto	Remote code improvements (#23959 )	2023-06-06 14:31:14 -04:00
autoformer	Compute `dropout_probability` only in training mode (#24486 )	2023-06-26 18:36:47 +02:00
bark	Add bark (#24086 )	2023-07-17 17:53:24 +01:00
bart	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
barthez
bartpho
beit	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
bert	fix: Text splitting in the BasicTokenizer (#22280 )	2023-07-11 11:07:58 -04:00
bert_generation
bert_japanese
bertweet
big_bird	Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )	2023-06-22 16:11:27 +02:00
bigbird_pegasus	Generate: skip left-padding tests on old models (#23437 )	2023-05-18 11:04:51 +01:00
biogpt	Update tiny models and pipeline tests (#23446 )	2023-05-18 17:29:04 +02:00
bit	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
blenderbot	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
blenderbot_small	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
blip	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
blip_2	Add InstructBLIP (#23460 )	2023-06-26 11:23:57 +02:00
bloom	Byebye pytorch 1.9 (#24080 )	2023-06-16 16:38:23 +02:00
bridgetower	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
byt5
camembert	Better TF docstring types (#23477 )	2023-05-24 13:52:52 +01:00
canine	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
chinese_clip	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
clap	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
clip	fix: Text splitting in the BasicTokenizer (#22280 )	2023-07-11 11:07:58 -04:00
clipseg	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
codegen	Fix TypeError: Object of type int64 is not JSON serializable (#24340 )	2023-06-27 12:15:49 +01:00
conditional_detr	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
convbert	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
convnext	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
convnextv2	Update `ConvNextV2ModelIntegrationTest::test_inference_image_classification_head` (#23402 )	2023-05-16 23:35:11 +02:00
cpm
cpmant	Update tiny models and pipeline tests (#23446 )	2023-05-18 17:29:04 +02:00
ctrl	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
cvt	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
data2vec	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
deberta	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
deberta_v2	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
decision_transformer
deformable_detr	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
deit	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
deta	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
detr	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
dinat	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
dinov2	Add DINOv2 (#24016 )	2023-07-18 15:34:06 +01:00
distilbert	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
dit	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
donut
dpr	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
dpt	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
efficientformer	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
efficientnet	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
electra	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
encodec	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
encoder_decoder	Move TF building to an actual build() method (#23760 )	2023-06-06 18:30:51 +01:00
ernie
ernie_m
esm	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
falcon	Falcon port (#24523 )	2023-07-11 13:36:31 +01:00
flaubert	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
flava	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
fnet	Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )	2023-06-22 16:11:27 +02:00
focalnet	Update tiny models and pipeline tests (#23446 )	2023-05-18 17:29:04 +02:00
fsmt
funnel	Big TF test cleanup (#24282 )	2023-06-16 15:40:49 +01:00
git	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
glpn	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
gpt_bigcode	Add `torch >=1.12` requirement for `Tapas` (#24251 )	2023-06-13 19:19:40 +02:00
gpt_neo	Update tiny models and pipeline tests (#23446 )	2023-05-18 17:29:04 +02:00
gpt_neox	Llama/GPTNeoX: add RoPE scaling (#24653 )	2023-07-13 16:47:30 +01:00
gpt_neox_japanese
gpt_sw3
gpt2	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
gptj	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
gptsan_japanese	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
graphormer	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
groupvit	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
herbert
hubert	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
ibert
imagegpt	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
informer	Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )	2023-06-22 16:11:27 +02:00
instructblip	[InstructBLIP] Fix bos token of LLaMa checkpoints (#24492 )	2023-07-11 20:43:01 +01:00
jukebox
layoutlm	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
layoutlmv2	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
layoutlmv3	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
layoutxlm	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
led	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
levit	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
lilt	Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )	2023-06-22 16:11:27 +02:00
llama	[`Llama2`] Add support for Llama 2 (#24891 )	2023-07-18 15:18:31 -04:00
longformer	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
longt5
luke	Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )	2023-06-22 16:11:27 +02:00
lxmert	Big TF test cleanup (#24282 )	2023-06-16 15:40:49 +01:00
m2m_100
marian	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
markuplm
mask2former	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
maskformer	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
mbart	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
mbart50
mega
megatron_bert
megatron_gpt2
mgp_str	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
mluke
mobilebert	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
mobilenet_v1	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
mobilenet_v2	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
mobilevit	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
mobilevitv2	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
mpnet	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
mra	Add Multi Resolution Analysis (MRA) (New PR) (#24513 )	2023-07-10 10:50:43 +01:00
mt5	Better TF docstring types (#23477 )	2023-05-24 13:52:52 +01:00
musicgen	Skip torchscript tests for `MusicgenForConditionalGeneration` (#24782 )	2023-07-13 15:54:18 +02:00
mvp	Generate: skip left-padding tests on old models (#23437 )	2023-05-18 11:04:51 +01:00
nat	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
nezha
nllb
nllb_moe
nystromformer
oneformer	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
openai	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
opt	Big TF test cleanup (#24282 )	2023-06-16 15:40:49 +01:00
owlvit	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
pegasus	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
pegasus_x
perceiver	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
phobert
pix2struct	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
plbart	Generate: skip left-padding tests on old models (#23437 )	2023-05-18 11:04:51 +01:00
poolformer	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
prophetnet	Generate: skip left-padding tests on old models (#23437 )	2023-05-18 11:04:51 +01:00
qdqbert
rag	Big TF test cleanup (#24282 )	2023-06-16 15:40:49 +01:00
realm	Unpin numba (#23162 )	2023-05-31 14:59:30 +01:00
reformer	Generate: skip left-padding tests on old models (#23437 )	2023-05-18 11:04:51 +01:00
regnet	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
rembert	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
resnet	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
roberta	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
roberta_prelayernorm	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
roc_bert
roformer	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
rwkv	Fix TypeError: Object of type int64 is not JSON serializable (#24340 )	2023-06-27 12:15:49 +01:00
sam	Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )	2023-06-22 16:11:27 +02:00
segformer	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
sew	Fix TypeError: Object of type int64 is not JSON serializable (#24340 )	2023-06-27 12:15:49 +01:00
sew_d	Fix TypeError: Object of type int64 is not JSON serializable (#24340 )	2023-06-27 12:15:49 +01:00
speech_encoder_decoder
speech_to_text	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
speech_to_text_2
speecht5	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
splinter
squeezebert
swiftformer	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
swin	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
swin2sr
swinv2	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
switch_transformers	[Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words (#24622 )	2023-07-11 15:02:18 +02:00
t5	[Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words (#24622 )	2023-07-11 15:02:18 +02:00
table_transformer	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
tapas	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
time_series_transformer	Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )	2023-06-22 16:11:27 +02:00
timesformer	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
timm_backbone	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
transfo_xl	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
trocr	Generate: skip left-padding tests on old models (#23437 )	2023-05-18 11:04:51 +01:00
tvlt	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
umt5	[Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words (#24622 )	2023-07-11 15:02:18 +02:00
unispeech	Fix TypeError: Object of type int64 is not JSON serializable (#24340 )	2023-06-27 12:15:49 +01:00
unispeech_sat	Fix TypeError: Object of type int64 is not JSON serializable (#24340 )	2023-06-27 12:15:49 +01:00
upernet	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
videomae	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
vilt	Removal of deprecated vision methods and specify deprecation versions (#24570 )	2023-06-29 15:09:51 +01:00
vision_encoder_decoder	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
vision_text_dual_encoder	Fix `VisionTextDualEncoderIntegrationTest` (#24661 )	2023-07-05 13:44:30 +02:00
visual_bert	Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )	2023-06-22 16:11:27 +02:00
vit	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
vit_hybrid	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
vit_mae	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
vit_msn	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
vivit	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
wav2vec2	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
wav2vec2_conformer	Fix TypeError: Object of type int64 is not JSON serializable (#24340 )	2023-06-27 12:15:49 +01:00
wav2vec2_phoneme
wav2vec2_with_lm
wavlm	Fix TypeError: Object of type int64 is not JSON serializable (#24340 )	2023-06-27 12:15:49 +01:00
whisper	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
x_clip	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
xglm	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
xlm	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
xlm_prophetnet
xlm_roberta	Better TF docstring types (#23477 )	2023-05-24 13:52:52 +01:00
xlm_roberta_xl
xlnet	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
xmod
yolos	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
yoso
__init__.py