transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-02 19:21:31 +06:00

Author	SHA1	Message	Date
Patrick von Platen	5a9957358c	Add Wav2Vec2Conformer (#16812 ) * save intermediate * add wav2vec2 conformer * add more code * more * first test passes * make all checkpoints work * update * up * more clean ups * save clean-up * save clean-up * save more * remove bogus * finalize design conformer * remove vision * finish all tests * more changes * finish code * add doc tests * add slow tests * fix autoconfig test * up * correct docstring * up * update * fix * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * Update docs/source/en/model_doc/wav2vec2-conformer.mdx * upload * save copied from * correct configs * fix model outputs * add to docs * fix imports * finish * finish code * correct copied from * correct again * correct make fix * improve make fix copies * save * correct fix copy from * correct init structure * correct * fix import * apply suggestions Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>	2022-05-17 00:43:16 +02:00
Kyungmin Lee	f0395cf58e	Fix test_model_parallelization (#17249 ) * Fix test_model_parallelization * Modify	2022-05-16 23:30:49 +02:00
Patrick von Platen	e705e1267c	[Tests] Fix slow opt tests (#17282 ) * fix opt tests * remove unused tok * make style * make flake8 happy * Update tests/models/opt/test_modeling_opt.py	2022-05-16 23:24:20 +02:00
amyeroberts	f6a6388972	Add Tensorflow Swin model (#16988 ) Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-05-16 22:19:53 +01:00
Yih-Dar	3fb82f74fd	Fix FlavaForPreTrainingIntegrationTest CI test (#17232 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-05-16 21:14:25 +02:00
Yih-Dar	66b3e106a1	Make TrainerHyperParameterSigOptIntegrationTest slow test (#17288 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-05-16 14:18:09 -04:00
Sylvain Gugger	ddb1a47ec8	Automatically sort auto mappings (#17250 ) * Automatically sort auto mappings * Better class extraction * Some auto class magic * Adapt test and underlying behavior * Remove re-used config * Quality	2022-05-16 13:24:20 -04:00
Yih-Dar	38043d8453	Update self-push workflow (#17177 ) * update push ci * install git-python * update comment * update deepspeed jobs * fix report * skip 2 more tests that require fairscale * Fix changes in test_fetcher.py (to deal with `setup.py` is changed) * set RUN_PT_TF_CROSS_TESTS=1 and final clean-up * remove SIGOPT_API_TOKEN * remove echo "$matrix_folders" Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-05-13 16:28:00 +02:00
Patrick von Platen	18d6b356c5	OPT - fix docstring and improve tests slighly (#17228 ) * correct some stuff * fix doc tests * make style	2022-05-13 15:14:50 +02:00
Younes Belkada	dfc76018c1	OPT-fix (#17229 ) * try fixes * Revert "try fixes" This reverts commit `a8ad75ef69`. * add correct shape * add correct path	2022-05-13 15:14:23 +02:00
Sylvain Gugger	afe5d42d8d	Black preview (#17217 ) * Black preview * Fixup too! * Fix check copies * Use the same version as the CI * Bump black	2022-05-12 16:25:55 -04:00
Matt	f04257fdbc	Add test to ensure models can take int64 inputs (#17210 ) * Add test to ensure models can take int64 inputs * is_integer is an attribute, not a method * Fix test when some inputs aren't tensors * Add casts to blenderbot and blenderbot-small * Add casts to the other failing models	2022-05-12 16:09:25 +01:00
Younes Belkada	b971c769e8	Add OPT (#17088 ) * First version - OPT model * Final changes - putting use cache to False * few changes - remove commented block * few changes - remove unecessary files * fix style issues * few changes - remove a test file - added the logits test * Update src/transformers/models/auto/tokenization_auto.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * add gen tests * few changes - rm mask filling example on docstring * few changes - remove useless args * some changes - more tests should pass now - needs to clean more - documentation still needs to be done * fix code quality * major changes - change attention architecture to BART-like - modify some tests - style fix * rm useless classes - remove opt for: - QA - cond generation - seq classif * Removed autodoc calls to non-existant classes TOkenizers are not implemented * Update src/transformers/__init__.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/__init__.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/auto/modeling_tf_auto.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Replaced OPTTokeniser with GPT2 tokenizer * added GPT2Tokenizer.from_pretrained("patrickvonplaten/opt_gpt2_tokenizer") * Removed OPTTokenizer * make style * Make style replaces ``` ...).unsqueeze(``` by ``` >>>).unsqueeze(``` * make repo consistency * Removed PretrainedOPTModel * fix opt.mdx removed other heads * fix init, removed 3 heads * removed heads * finished cleaning head * removed seauence classif and question answering * removed unused imports * removed useless dummy object for QA, SC and CG * removed tests for removed useless dummy object for QA, SC and CG * Removed head_mask using encoder layers which don't exist * fixed test * fix line * added OPT to toctree * Updated model path with pushed weigths * fix model path * fixed code quality * fixed embeddings and generation tests * update paths * clean comments * removed OPTClassificationHead for sentence classification * renamed hidden layer * renamed num layers to standard num_hidden_layers * num_attention_heads fix * changes for 125m * add first version for 125m * add first version - flax * add new version * causal LM output * replace output type with BaseModelOutputWithPastAndCrossAttentions * revert working config from 150m to 350m * clean * removed decoder input ids * fixed embed dim * more embed_dim issues * make style + removed enc_dec test * update falx model * removed troublesome copy * added is_encoder_decoder=False to config * added set_input emb fuinction to model class * requires torch on embed test * use head mask instead of decoder head mask input param solves a test * 8 test remaining, update * Updated create_and_check_decoder_model_past_large_inputs * Make style * update op tokenizer with condition * make style * See if I can push * some clean up * remove linear head hack * save intermediate * save correct attention * add copied from from bart * Update src/transformers/models/opt/modeling_opt.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fix part of the reviewss Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * same changes in naming / conversion * correct mask * more fixes * delete FlaxOPT and TfOPT * clean traces of Flax and Tf * fix mask * fixed positionnal embedding length when past key value is provoded * get 125m, 6.7b to work * Added do_layer_norm * solved mismatch in load dictionnary * clean up preapre opt input dict * fixed past key value as bool * fix previus * fixed return dict False tuple issue * All tests are passing * Make style * Ignore OPTDecoder non tested * make fix-copies * make repo consistency * small fix * removed uselss @torch.no_grad decorator * make styl;e * fix previous opt test * style * make style * added opt documentation * update OPT_PRETRAINED_MODEL_ARCHIVE_LIST * up * more fixes * model & config work * Update src/transformers/models/opt/modeling_opt.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/opt/modeling_opt.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/opt/modeling_opt.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * added comment on padding hack (+2) * cleaup * review update * docstring for missing arg * Update docs/source/en/model_doc/opt.mdx Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update docs/source/en/model_doc/opt.mdx Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update docs/source/en/model_doc/opt.mdx Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/opt/__init__.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * update pretrained map * update path and tests * make style * styling * make consistency * add gpt2 tok new * more tok fixes * Update src/transformers/models/auto/tokenization_auto.py * Update docs/source/en/model_doc/opt.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/en/model_doc/opt.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/en/model_doc/opt.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/opt/modeling_opt.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update tests/models/opt/test_modeling_opt.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/opt/modeling_opt.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/opt/modeling_opt.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/opt/modeling_opt.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/opt/modeling_opt.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/opt/modeling_opt.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update based on reviews * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * make style * make tokenizer auto tests pass * apply Lysandre suggestion * finish tests * add some good tokenizer tests * improve docs slighly Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: ArthurZucker <arthur.zucker@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2022-05-12 12:24:35 +02:00
Michael Benayoun	8c7481f35c	ViT and Swin symbolic tracing with torch.fx (#17182 ) * Support tracing for ViT * Swin support * Fix copies * Fix type annotation issue * Removed unused import	2022-05-12 10:42:27 +02:00
Amanpreet Singh	a10f61834d	[feat] Add FLAVA model (#16654 ) * [WIP] Add FLAVA model This PR aims to add [FLAVA](ihttps://arxiv.org/abs/2112.04482) model to the transformers repo. Following checklist delineates the list of things to be done for this PR to be complete: [x] Flava init [x] Flava base models [x] Flava layers [x] Flava Configs [x] Flava encoders [x] Flava pretraining models [ ] Flava classification/retrieval models (To be added in a separate PR) [x] Documentation updates [x] Imports updates [x] Argstring updates [x] Flava pretrained checkpoints [x] Flava tests [x] Flava processors [x] Sanity check [x] Lint	2022-05-11 14:56:48 -07:00
Antoni Baum	47412c7d43	Ensure tensors are at least 1d for pad and concat (#17179 ) * Ensure tensors are at least 1d for pad and concat * Compatibility * Fix * Fix * Add test * Retrigger CI * Consistency with master * Retrigger CI	2022-05-11 13:19:08 -04:00
Antoni Baum	edcc66d27c	Remove unnecessary columns for all dataset types in `Trainer` (#17166 ) * Remove unneeded columns for IterableDataset * Add test * Update trainer tests * Edit docstring * Lint * Apply feedback * Apply feedback	2022-05-11 11:11:26 -04:00
Martin Pömsl	5229744b26	Add missing RetriBERT tokenizer tests (#17017 ) * Create RetriBERT tests folder * Add missing RetriBERT tokenizer test file * Apply style corrections * Add non-english filter * Update tests/retribert/test_tokenization_retribert.py Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com> * Update tests/retribert/test_tokenization_retribert.py Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com> * Move test files to new directory * Update import path for testing utils to new test file structure Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>	2022-05-11 15:04:07 +02:00
Heng Kuan Wee	6bc6797e04	Convert image to rgb for clip model (#17101 ) Co-authored-by: kuanwee.heng <kuanwee.heng@aaqua.live>	2022-05-11 13:09:54 +01:00
Leon Derczynski	4a419d4995	MobileBERT tokenizer tests (#16896 ) * unhardcode pretrained model path, make it a class var * add tests for mobilebert tokenizer * allow tempfiles for vocab & merge similarity test to autodelete * add explanatory comments * remove unused imports, let make style do its.. thing * remove inheritance and use BERT tok tests for MobileBERT * Update tests/mobilebert/test_tokenization_mobilebert.py Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com> * amend class names, remove unused import, add fix for mobilebert's hub pathname * unhardcode pretrained model path, make it a class var * add tests for mobilebert tokenizer * allow tempfiles for vocab & merge similarity test to autodelete * add explanatory comments * remove unused imports, let make style do its.. thing * remove inheritance and use BERT tok tests for MobileBERT * Update tests/mobilebert/test_tokenization_mobilebert.py Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com> * amend class names, remove unused import, add fix for mobilebert's hub pathname * amend paths for model tests being in models/ subdir of /tests * explicitly rm test from prev path Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>	2022-05-10 16:39:58 -04:00
Jason Phang	48a8f3daa1	Add DebertaV2ForMultipleChoice (#17135 )	2022-05-10 16:21:44 -04:00
Nicolas Brousse	e99f0efedc	Add MLFLOW_FLATTEN_PARAMS support in MLflowCallback (#17148 ) * add support for MLFLOW_FLATTEN_PARAMS * ensure key is str * fix style and update warning msg * Empty commit to trigger CI * fix bug in check_inits.py * add unittest for flatten_dict utils * fix 'NoneType' object is not callable on __del__ * add generic flatten_dict unittest to SPECIAL_MODULE_TO_TEST_MAP * fix style	2022-05-10 14:29:18 -04:00
Stas Bekman	976835d515	missing file (#17164 )	2022-05-10 10:19:50 -07:00
Stas Bekman	f861504466	[Deepspeed] add many more models to the model zoo test (#12695 ) * model zoo take 2 * add deberta * new param for zero2 * doc update * doc update * add layoutlm * bump deepspeed * add deberta-v2, funnel, longformer * new models * style * add t5_v1 * update TAPAS status * reorg problematic models * move doc to another PR * style * fix checkpoint check test * making progress on more models running * cleanup * new version * cleanup	2022-05-10 08:22:42 -07:00
Nicolas Patry	6d80c92c77	LogSumExp trick `question_answering` pipeline. (#17143 ) * LogSumExp trick `question_answering` pipeline. * Adding a failing test.	2022-05-10 10:03:55 +02:00
Zachary Mueller	2fbb237967	Add the auto_find_batch_size capability from Accelerate into Trainer (#17068 ) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> - Adds auto_batch_size finder - Moves training loop to an inner training loop	2022-05-09 12:29:18 -04:00
Manan Dey	dc3645dc9c	add `mobilebert` onnx configs (#17029 ) * update docs of length_penalty * Revert "update docs of length_penalty" This reverts commit `466bf4800b`. * add mobilebert onnx config * address suggestions * Update auto.mdx * Update __init__.py * Update features.py	2022-05-09 10:36:53 -04:00
ghlai9665	e9fd583ce0	LayoutLMv2Processor: ensure 1-to-1 mapping between images and samples in case of overflowing tokens (#17092 ) * add get_overflowing_images function to ensure 1-to-1 mapping between samples and images in LayoutLMv2Processor * make style * add test for overflowing_tokens, change assert to ValueError, avoiding unrelated formatting changes * change line length by passing --preview into black	2022-05-09 07:39:08 -04:00
Ritik Nandwal	215e0681e4	Added BigBirdPegasus onnx config (#17104 ) * Add onnx configuration for bigbird-pegasus * Modify docs	2022-05-06 17:31:00 +02:00
Yih-Dar	a59eb349c5	fix missing "models" in pipeline test module (#17090 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-05-05 16:12:01 +02:00
Yih-Dar	6dc4c36acb	minor change on TF Data2Vec test (#17085 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-05-04 18:39:30 +02:00
Patrick Deutschmann	870e6f29a6	Fix DeBERTa `token_type_ids` (#17082 )	2022-05-04 18:23:37 +02:00
Sean Moriarity	279bc5849b	Allow saved_model export of TFCLIPModel in save_pretrained (#16886 ) * CLIP Serving * Add type hints per code review * Use black, flake8, and isort * Update src/transformers/models/clip/modeling_tf_clip.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Rollback serving_output and add TODO * Remove irrelevant portions of failing tests * Revert "Rollback serving_output and add TODO" This reverts commit a4abfa6ba3b7875a13538dbc2ddc4eb17dfcca8d. * Rollback to original test/serving_output * Fix unused var * Apply suggestions from code review * Update formatting with black * Fix style again from rebase * Update tests/models/clip/test_modeling_tf_clip.py Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by: Sean Moriarity <sean.l.moriarity.mil@army.mil> Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>	2022-05-04 16:37:58 +02:00
Sayak Paul	049e791758	Add Data2Vec for Vision in TF (#17008 ) * add utilities till TFData2VecVisionLayer. * chore: pass window_size to attention layer. * feat: add TFData2VecVisionRelativePositionBias. * feat: initial implementation ready for tf data2vec. * fix: relative position bias index, table to be fixed. * chore: implementation added, tests remaining. * add: tests, other PR files. * fix: code quality. * fix: import structure in init. * chore: run make fix-copies. * chore: address PR feedback (round I). * chore: styling nit. * fix: tests due to removal of to_2tuple(). * chore: rebase with upstream main and move the test. * Update src/transformers/models/auto/modeling_tf_auto.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/auto/modeling_tf_auto.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fix: layer call. * chore: remove from_pt=True and rerun test. * chore: remove cast and tf.divide. * chore: minor edits to the test script. * Update src/transformers/models/data2vec/modeling_tf_data2vec_vision.py Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * fix: expand() on TF tensors with broadcast_to(). * fix: test import. Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>	2022-05-04 08:08:25 -04:00
Sylvain Gugger	d76d2a2af7	Make sure telemetry arguments are not returned as unused kwargs (#17063 ) * Make sure telemetry arguments are not returned as unused kwargs * Fix test	2022-05-04 07:47:57 -04:00
lewtun	4bb1d0ec84	Skip RoFormer ONNX test if rjieba not installed (#16981 ) * Skip RoFormer ONNX test if rjieba not installed * Update deps table * Skip RoFormer serialization test * Fix RoFormer vocab * Add rjieba to CircleCI	2022-05-04 10:04:10 +02:00
Sylvain Gugger	1c9fcd0e04	Fix RNG reload in resume training from epoch checkpoint (#17055 ) * Fix RNG reload in resume training from epoch checkpoint * Fix test	2022-05-03 10:31:24 -04:00
Sylvain Gugger	a8fa2f91f4	Make Trainer compatible with sharded checkpoints (#17053 ) * Make Trainer compatible with sharded checkpoints * Add doc	2022-05-03 09:55:10 -04:00
Yih-Dar	19420fd99e	Move test model folders (#17034 ) * move test model folders (TODO: fix imports and others) * fix (potentially partially) imports (in model test modules) * fix (potentially partially) imports (in tokenization test modules) * fix (potentially partially) imports (in feature extraction test modules) * fix import utils.test_modeling_tf_core * fix path ../fixtures/ * fix imports about generation.test_generation_flax_utils * fix more imports * fix fixture path * fix get_test_dir * update module_to_test_file * fix get_tests_dir from wrong transformers.utils * update config.yml (CircleCI) * fix style * remove missing imports * update new model script * update check_repo * update SPECIAL_MODULE_TO_TEST_MAP * fix style * add __init__ * update self-scheduled * fix add_new_model scripts * check one way to get location back * python setup.py build install * fix import in test auto * update self-scheduled.yml * update slack notification script * Add comments about artifact names * fix for yolos Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-05-03 14:42:02 +02:00
Sanchit Gandhi	cd9274d010	[FlaxBert] Add ForCausalLM (#16995 ) * [FlaxBert] Add ForCausalLM * make style * fix output attentions * Add RobertaForCausalLM * remove comment * fix fx-to-pt model loading * remove comment * add modeling tests * add enc-dec model tests * add big_bird * add electra * make style * make repo-consitency * add to docs * remove roberta test * quality * amend cookiecutter * fix attention_mask bug in flax bert model tester * tighten pt-fx thresholds to 1e-5 * add 'copied from' statements * amend 'copied from' statements * amend 'copied from' statements * quality	2022-05-03 11:26:19 +02:00
Patrick von Platen	31616b8d61	[T5 Tokenizer] Model has no fixed position ids - there is no hardcode… (#16990 ) * [T5 Tokenizer] Model has no fixed position ids - there is no hardcoded max length * [T5 Tokenizer] Model has no fixed position ids - there is no hardcoded max length * correct t5 tokenizer * correct t5 tokenizer * fix test * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * finish Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-05-02 21:27:34 +02:00
NielsRogge	1ac698744c	Add YOLOS (#16848 ) * First draft * Add YolosForObjectDetection * Make forward pass work * Add mid position embeddings * Add interpolation of position encodings * Add expected values * Add YOLOS to tests * Add integration test * Support tiny model as well * Support all models in conversion script * Remove mid_pe_size attribute * Make more tests pass * Add model to README and fix config * Add copied from statements * Rename base_model_prefix to vit * Add missing YOLOS_PRETRAINED_CONFIG_ARCHIVE_MAP * Apply suggestions from code review * Apply more suggestions from code review * Convert remaining checkpoints * Improve docstrings * Add YolosFeatureExtractor * Add feature extractor to docs * Add corresponding tests * Fix style * Fix docs * Apply suggestion from code review * Fix bad rebase * Fix some more bad rebase * Fix missing character * Improve docs and variable names Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-05-02 18:30:55 +02:00
NielsRogge	2de2c9ecca	Clean up vision tests (#17024 ) * Clean up tests * Make fixup Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-05-02 16:28:58 +02:00
Joao Gante	fb0ae12947	TF: XLA bad words logits processor and list of processors (#16974 )	2022-04-29 15:54:58 +01:00
Yih-Dar	e952e049b4	use scale=1.0 in floats_tensor called in speech model testers (#17007 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-04-29 14:41:33 +02:00
Yih-Dar	5af5735f62	set eos_token_id to None to generate until max length (#16989 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-04-28 19:47:38 +02:00
Yih-Dar	49d5bcb0f3	Fix HubertRobustTest PT/TF equivalence test on GPU (#16943 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-04-27 10:50:03 +02:00
Krishna Sirumalla	aaee4038c3	Add onnx config for RoFormer (#16861 ) * add roformer onnx config	2022-04-26 16:51:15 +02:00
code-review-doctor	6568752039	Fix issue probably-meant-fstring found at https://codereview.doctor (#16913 )	2022-04-25 15:15:00 -04:00
Joao Gante	e03966e404	TF: XLA stable softmax (#16892 ) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-04-25 20:10:51 +01:00

1 2 3 4 5 ...

1802 Commits