transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

History

Matt 19a8a3036d Add magic method to our TF models to convert datasets with column inference (#17160 ) * Add method to call to_tf_dataset() with column inference * Add test for dataset creation * Add a default arg for data collator * Fix test * Fix call with non-dev version of datasets * Test correct column removal too * make fixup * More tests to make sure we remove unwanted columns * Fix test to avoid predicting on unbuilt models * Fix test to avoid predicting on unbuilt models * Fix test to remove unwanted head mask columns from inputs * Stop pushing your debug breakpoints to the main repo of the $2bn company you work for * Skip the test in convnext because no grouped conv support * Drop bools from the dataset dict * Make style * Skip the training test for models whose input dicts don't give us labels * Skip transformerXL in the test because it doesn't return a simple loss * Skip TFTapas because of some odd NaN losses * make style * make fixup * Add docstring * fixup * Update src/transformers/modeling_tf_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_tf_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_tf_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_tf_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_tf_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Remove breakpoint from tests * Fix assert, add requires_backends * Protect tokenizer import with if TYPE_CHECKING * make fixup * Add noqa, more fixup * More rearranging for ~* aesthetics ~ Adding defaults for shuffle and batch_size to match to_tf_dataset() * Update src/transformers/modeling_tf_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>		2022-06-06 15:53:49 +01:00
..
benchmark	[Test refactor 1/5] Per-folder tests reorganization (#15725 )	2022-02-23 15:46:28 -05:00
deepspeed	[deepspeed / testing] reset global state (#17553 )	2022-06-06 07:49:25 -07:00
extended	Update self-push workflow (#17177 )	2022-05-13 16:28:00 +02:00
fixtures	add a warning in `SpmConverter` for sentencepiece's model using the byte fallback feature (#16629 )	2022-04-11 11:06:10 +02:00
generation	[Generation] Fix Transition probs (#17311 )	2022-05-19 22:17:02 +02:00
models	Add magic method to our TF models to convert datasets with column inference (#17160 )	2022-06-06 15:53:49 +01:00
onnx	Add support for Perceiver ONNX export (#17213 )	2022-06-03 07:40:22 -04:00
optimization	[Test refactor 1/5] Per-folder tests reorganization (#15725 )	2022-02-23 15:46:28 -05:00
pipelines	Adding `batch_size` test to QA pipeline. (#17330 )	2022-05-19 14:28:12 -04:00
sagemaker	Black preview (#17217 )	2022-05-12 16:25:55 -04:00
tokenization	[Test refactor 1/5] Per-folder tests reorganization (#15725 )	2022-02-23 15:46:28 -05:00
trainer	Support compilation via Torchdynamo, AOT Autograd, NVFuser (#17308 )	2022-05-25 11:16:09 -04:00
utils	CLI: tool to convert PT into TF weights and open hub PR (#17497 )	2022-06-01 18:52:07 +01:00
__init__.py	GPU text generation: mMoved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
test_configuration_common.py	Black preview (#17217 )	2022-05-12 16:25:55 -04:00
test_feature_extraction_common.py	[Json configs] Make json prettier for all saved tokenizer files & ensure same json format for all processors (tok + feat_extract) (#17457 )	2022-05-31 17:07:30 +02:00
test_modeling_common.py	Fix all offload and MP tests (#17533 )	2022-06-03 09:59:13 -04:00
test_modeling_flax_common.py	[Flax] improve large model init and loading (#16148 )	2022-04-19 14:19:55 +02:00
test_modeling_tf_common.py	Add magic method to our TF models to convert datasets with column inference (#17160 )	2022-06-06 15:53:49 +01:00
test_sequence_feature_extraction_common.py	Some tests misusing assertTrue for comparisons fix (#16771 )	2022-04-19 14:44:08 +02:00
test_tokenization_common.py	[Json configs] Make json prettier for all saved tokenizer files & ensure same json format for all processors (tok + feat_extract) (#17457 )	2022-05-31 17:07:30 +02:00