transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-24 23:08:57 +06:00

History

Andy Ehrenberg 2840272c5f add flax whisper implementation (#20479 ) * add flax whisper implementation * rever change to setup * remove unused imports * revert generation changes * flax whisper docs * docs * import order * import sorting * isort * add dummy objects * doc formatting * formatting * remove trailing whitespaces * fix flax whisper docs * add generation logic to unlock flax whisper * remove scans * give credits to Flax Bart implementation * remove unused imports * add license * remove assert * more credits to Bart * fix style * formatting * support left padding * add flax whisper generation test * remove copied from comments whenever not a full copy * fix docstrings for logits processors * revert change to FlaxForceTokensLogitsProcessor * revert doc changes * improve generation docs * reorganize * formatting * cleanup docs * add tests * handle empty list case * fix forced decoder ids in flax tests * add flax whisper to inits * upate dummy objects * docs for FlaxAutoModelForSpeechSeq2Seq * fix decoder_position_ids computation in pretrained model decode/__call__ fns * add Copied from statements as necessary * compute position_ids only in __call__ and decode methods of pretrained model subclasses * improve readabilityof compute positional embeddings * check dimensionality of input_features instead of hidden_states * copied from statement for init_cache * formatting * fix copies * fix copies * pass attention mask to encoder layers * fix decoder module outputs * set dtype Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * smaller flax model for whisper test * Update src/transformers/generation/flax_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/whisper/modeling_flax_whisper.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update tests/models/whisper/test_modeling_flax_whisper.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * cleanup Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/whisper/modeling_flax_whisper.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * bias cleanup * doc fix * align style for force tokens processor * readability * fix input shape in tests * revert FlaxGenerationMixin docstring * formatting * fix tests * fix imports * consistent encoder hidden states * consistent hidden states * input shapes * typo * partial class trick * partial class for input shape * base_class with correct input shape * partial base classes * match by name * set main_input_name * compare on names * formatting * remove unused import * safer position ids computation * safer position id computation * Update src/transformers/models/whisper/modeling_flax_whisper.py Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * Update src/transformers/models/whisper/modeling_flax_whisper.py Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * remove identical inherited tests * fix prompt ids in tests * use generation config * use jnp array * better var names * more explicit bias use * import transformers * formatting * test formatting * remove unused imports * remove unused imports * formatting * isort * docs * fix ln orders for encoder hidden states * whisper unique generation stuff * flake * use finfo for attention bias * docs * Update src/transformers/generation/flax_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * docs * add timestamp flax test * jit for timestamps * formatting * clean up timestamps processor * formatting * remove if_true * cleanup --------- Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>		2023-02-20 09:17:40 +01:00
..
albert.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
altclip.mdx	Add AltCLIP (#20446 )	2023-01-04 09:18:57 +01:00
audio-spectrogram-transformer.mdx	Add resources (#20872 )	2023-01-17 17:42:33 +01:00
auto.mdx	add flax whisper implementation (#20479 )	2023-02-20 09:17:40 +01:00
bart.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
barthez.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
bartpho.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
beit.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
bert-generation.mdx	Result of new doc style with fixes (#17015 )	2022-04-29 17:42:15 -04:00
bert-japanese.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
bert.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
bertweet.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
big_bird.mdx	Update documentation on seq2seq models with absolute positional embeddings, to be in line with Tips section for BERT and GPT2 (#20068 )	2022-11-04 11:32:44 -04:00
bigbird_pegasus.mdx	Update documentation on seq2seq models with absolute positional embeddings, to be in line with Tips section for BERT and GPT2 (#20068 )	2022-11-04 11:32:44 -04:00
biogpt.mdx	Add BioGPT (#20420 )	2022-12-05 10:12:03 -05:00
bit.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
blenderbot-small.mdx	Update documentation on seq2seq models with absolute positional embeddings, to be in line with Tips section for BERT and GPT2 (#20068 )	2022-11-04 11:32:44 -04:00
blenderbot.mdx	Update documentation on seq2seq models with absolute positional embeddings, to be in line with Tips section for BERT and GPT2 (#20068 )	2022-11-04 11:32:44 -04:00
blip-2.mdx	Add BLIP-2 (#21441 )	2023-02-09 16:52:11 +01:00
blip.mdx	`blip` support for training (#21021 )	2023-01-18 11:24:37 +01:00
bloom.mdx	docs: Resolve many typos in the English docs (#20088 )	2022-11-07 09:19:04 -05:00
bort.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
bridgetower.mdx	Add BridgeTower model (#20775 )	2023-01-25 14:04:32 -05:00
byt5.mdx	[Doctests] Fix all T5 doc tests (#16646 )	2022-04-13 11:36:54 +02:00
camembert.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
canine.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
chinese_clip.mdx	Add Chinese-CLIP implementation (#20368 )	2022-11-30 19:22:23 +01:00
clap.mdx	[CLAP] Add CLAP to the library (#21370 )	2023-02-16 20:59:27 +01:00
clip.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
clipseg.mdx	[CLIPSeg] Add resources (#20118 )	2022-11-09 18:31:22 +01:00
codegen.mdx	Add CodeGen model (#17443 )	2022-06-24 17:10:38 +02:00
conditional_detr.mdx	Add segmentation + object detection image processors (#20160 )	2022-11-30 10:24:03 +00:00
convbert.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
convnext.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
cpm.mdx	Allow all imports from transformers (#17050 )	2022-05-02 12:47:39 -04:00
ctrl.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
cvt.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
data2vec.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
deberta-v2.mdx	Add DebertaV2ForMultipleChoice (#17135 )	2022-05-10 16:21:44 -04:00
deberta.mdx	Add to DeBERTa resources (#20155 )	2022-11-15 13:26:07 -05:00
decision_transformer.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
deformable_detr.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
deit.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
deta.mdx	[Docs] Minor fixes (#21383 )	2023-01-31 15:13:12 +01:00
detr.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
dialogpt.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
dinat.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
distilbert.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
dit.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
donut.mdx	Add Donut image processor (#20425 )	2022-11-29 10:38:01 +00:00
dpr.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
dpt.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
efficientformer.mdx	Efficientformer (#20459 )	2023-01-20 11:35:42 +03:00
electra.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
encoder-decoder.mdx	[EncoderDecoder] Improve docs (#18271 )	2022-07-27 10:08:59 +02:00
ernie_m.mdx	Add Ernie-M Model to huggingface (#21349 )	2023-02-15 09:24:56 -05:00
ernie.mdx	add task_type_id to BERT to support ERNIE-2.0 and ERNIE-3.0 models (#18686 )	2022-09-09 07:36:46 -04:00
esm.mdx	Add ESMFold (#19977 )	2022-10-31 21:32:58 -04:00
flan-t5.mdx	Update flan-t5 original model link (#20897 )	2022-12-27 02:26:14 -05:00
flaubert.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
flava.mdx	AutoImageProcessor (#20111 )	2022-11-08 19:54:41 +00:00
fnet.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
fsmt.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
funnel.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
git.mdx	Add resources (#20872 )	2023-01-17 17:42:33 +01:00
glpn.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
gpt_neo.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
gpt_neox_japanese.mdx	Add support for Japanese GPT-NeoX-based model by ABEJA, Inc. (#18814 )	2022-09-14 10:17:40 -04:00
gpt_neox.mdx	[WIP] Adding GPT-NeoX-20B (#16659 )	2022-05-24 09:31:10 -04:00
gpt-sw3.mdx	Add gpt-sw3 model to transformers (#20209 )	2022-12-12 13:12:13 -05:00
gpt2.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
gptj.mdx	Adding resource section to GPT-J docs (#21270 )	2023-01-30 16:48:04 -05:00
graphormer.mdx	Graphormer model for Graph Classification (#20968 )	2023-01-19 13:05:59 -05:00
groupvit.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
herbert.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
hubert.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
ibert.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
imagegpt.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
jukebox.mdx	Add Jukebox model (replaces #16875 ) (#17826 )	2022-11-10 21:05:27 +01:00
layoutlm.mdx	Added model resources for LayoutLM Issue#19848 (#21377 )	2023-02-03 08:53:16 -05:00
layoutlmv2.mdx	AutoImageProcessor (#20111 )	2022-11-08 19:54:41 +00:00
layoutlmv3.mdx	AutoImageProcessor (#20111 )	2022-11-08 19:54:41 +00:00
layoutxlm.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
led.mdx	Update documentation on seq2seq models with absolute positional embeddings, to be in line with Tips section for BERT and GPT2 (#20068 )	2022-11-04 11:32:44 -04:00
levit.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
lilt.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
longformer.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
longt5.mdx	Update longt5.mdx (#18634 )	2022-08-16 10:20:46 -05:00
luke.mdx	Adding fine-tuning models to LUKE (#18353 )	2022-08-01 11:09:47 -04:00
lxmert.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
m2m_100.mdx	Fix `m2m_100.mdx` doc example missing `labels` (#19149 )	2022-09-29 13:27:58 +02:00
marian.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
markuplm.mdx	Fix doctest for `MarkupLM` (#19845 )	2022-10-24 17:54:23 +02:00
mask2former.mdx	[Mask2Former] Add doc tests (#21232 )	2023-01-25 12:34:43 +01:00
maskformer.mdx	Add Mask2Former (#20792 )	2023-01-16 20:37:07 +03:00
mbart.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
mctct.mdx	[Past CI] 🔥 Leave Past CI failures in the past 🔥 (#20861 )	2022-12-27 18:37:25 +01:00
megatron_gpt2.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
megatron-bert.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
mluke.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
mobilebert.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
mobilenet_v1.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
mobilenet_v2.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
mobilevit.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
mpnet.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
mt5.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
mvp.mdx	Add MVP model (#17787 )	2022-06-29 09:30:55 -04:00
nat.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
nezha.mdx	Nezha Pytorch implementation (#17776 )	2022-06-23 12:36:22 -04:00
nllb.mdx	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
nystromformer.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
oneformer.mdx	[Mask2Former] Add doc tests (#21232 )	2023-01-25 12:34:43 +01:00
openai-gpt.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
opt.mdx	Add OPT resources to the transformers documentation (#21625 )	2023-02-16 12:44:28 -05:00
owlvit.mdx	Improve OWL-ViT postprocessing (#20980 )	2023-01-03 19:25:09 +03:00
pegasus_x.mdx	PEGASUS-X (#18551 )	2022-09-02 19:54:02 +02:00
pegasus.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
perceiver.mdx	AutoImageProcessor (#20111 )	2022-11-08 19:54:41 +00:00
phobert.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
plbart.mdx	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
poolformer.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
prophetnet.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
qdqbert.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
rag.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
realm.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
reformer.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
regnet.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
rembert.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
resnet.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
retribert.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
roberta-prelayernorm.mdx	Implement Roberta PreLayerNorm (#20305 )	2022-12-19 09:30:17 +01:00
roberta.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
roc_bert.mdx	Add RocBert (#20013 )	2022-11-08 10:03:43 -05:00
roformer.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
segformer.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
sew-d.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
sew.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
speech_to_text_2.mdx	Generate: move generation_.py src files into generation/.py (#20096 )	2022-11-09 15:34:08 +00:00
speech_to_text.mdx	Fix some doctests after PR 15775 (#20036 )	2022-11-03 14:18:45 +01:00
speech-encoder-decoder.mdx	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
speecht5.mdx	[WIP] add SpeechT5 model (#18922 )	2023-02-03 12:43:46 -05:00
splinter.mdx	docs: Resolve many typos in the English docs (#20088 )	2022-11-07 09:19:04 -05:00
squeezebert.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
swin.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
swin2sr.mdx	Add Swin2SR (#19784 )	2022-12-16 16:24:01 +01:00
swinv2.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
switch_transformers.mdx	Add Switch transformers (#19323 )	2022-11-15 13:06:45 +01:00
t5.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
t5v1.1.mdx	docs: Resolve many typos in the English docs (#20088 )	2022-11-07 09:19:04 -05:00
table-transformer.mdx	Update doc examples feature extractor -> image processor (#20501 )	2022-11-30 14:50:55 +00:00
tapas.mdx	Fix tapas scatter (#20149 )	2022-11-14 01:04:26 -05:00
tapex.mdx	Add TAPEX (#16473 )	2022-04-08 10:57:51 +02:00
time_series_transformer.mdx	[MINOR] Fix link in timeseries transformer docs (#21602 )	2023-02-13 10:11:16 -05:00
timesformer.mdx	[New Model] Add TimeSformer model (#18908 )	2022-12-02 09:13:25 +01:00
trajectory_transformer.mdx	Add trajectory transformer (#17141 )	2022-05-17 19:07:43 -04:00
transfo-xl.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
trocr.mdx	Update doc examples feature extractor -> image processor (#20501 )	2022-11-30 14:50:55 +00:00
tvlt.mdx	Add TVLT (#20725 )	2023-02-15 18:10:30 +00:00
ul2.mdx	Add UL2 (just docs) (#17740 )	2022-06-21 10:24:50 +02:00
unispeech-sat.mdx	Add Wav2Vec2Conformer (#16812 )	2022-05-17 00:43:16 +02:00
unispeech.mdx	Add Wav2Vec2Conformer (#16812 )	2022-05-17 00:43:16 +02:00
upernet.mdx	[Docs] Minor fixes (#21383 )	2023-01-31 15:13:12 +01:00
van.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
videomae.mdx	Update doc examples feature extractor -> image processor (#20501 )	2022-11-30 14:50:55 +00:00
vilt.mdx	AutoImageProcessor (#20111 )	2022-11-08 19:54:41 +00:00
vision-encoder-decoder.mdx	Update doc examples feature extractor -> image processor (#20501 )	2022-11-30 14:50:55 +00:00
vision-text-dual-encoder.mdx	docs: Resolve many typos in the English docs (#20088 )	2022-11-07 09:19:04 -05:00
visual_bert.mdx	Update doc examples feature extractor -> image processor (#20501 )	2022-11-30 14:50:55 +00:00
vit_hybrid.mdx	Add BiT + ViT hybrid (#20550 )	2022-12-07 11:03:39 +01:00
vit_mae.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
vit_msn.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
vit.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
wav2vec2_phoneme.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
wav2vec2-conformer.mdx	[Wav2Vec2Conformer] Official release (#17709 )	2022-06-15 18:34:15 +02:00
wav2vec2.mdx	Add wav2vec2 resources (#19931 )	2022-10-28 13:28:18 -07:00
wavlm.mdx	Add Wav2Vec2Conformer (#16812 )	2022-05-17 00:43:16 +02:00
whisper.mdx	add flax whisper implementation (#20479 )	2023-02-20 09:17:40 +01:00
xclip.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
xglm.mdx	Add TF implementation of `XGLMModel` (#16543 )	2022-08-24 10:51:05 +01:00
xlm-prophetnet.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
xlm-roberta-xl.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
xlm-roberta.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
xlm-v.mdx	Add XLM-V to Model Doc (#21498 )	2023-02-07 16:43:19 -05:00
xlm.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
xlnet.mdx	Refactor model summary (#21408 )	2023-02-15 10:35:14 -08:00
xls_r.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
xlsr_wav2vec2.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
xmod.mdx	[WIP] Move X-MOD models to facebook organization (#21640 )	2023-02-16 09:18:25 -05:00
yolos.mdx	Add batch of resources (#20647 )	2023-01-17 17:18:56 +01:00
yoso.mdx	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00