transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-18 03:58:25 +06:00

History

Kamal Raj d329b63369 Deberta tf (#12972 ) * TFDeberta moved weights to build and fixed name scope added missing , bug fixes to enable graph mode execution updated setup.py fixing typo fix imports embedding mask fix added layer names avoid autmatic incremental names +XSoftmax cleanup added names to layer disable keras_serializable Distangled attention output shape hidden_size==None using symbolic inputs test for Deberta tf make style Update src/transformers/models/deberta/modeling_tf_deberta.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Update src/transformers/models/deberta/modeling_tf_deberta.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Update src/transformers/models/deberta/modeling_tf_deberta.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Update src/transformers/models/deberta/modeling_tf_deberta.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Update src/transformers/models/deberta/modeling_tf_deberta.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Update src/transformers/models/deberta/modeling_tf_deberta.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Update src/transformers/models/deberta/modeling_tf_deberta.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> removed tensorflow-probability removed blank line * removed tf experimental api +torch_gather tf implementation from @Rocketknight1 * layername DeBERTa --> deberta * copyright fix * added docs for TFDeberta & make style * layer_name change to fix load from pt model * layer_name change as pt model * SequenceClassification layername change, to same as pt model * switched to keras built-in LayerNormalization * added `TFDeberta` prefix most layer classes * updated to tf.Tensor in the docstring		2021-08-12 05:01:26 -04:00
..
_static	Documentation for patch v4.9.2	2021-08-09 16:14:17 +02:00
imgs	[doc] DP/PP/TP/etc parallelism (#12524 )	2021-07-09 17:39:09 -07:00
internal	Init pickle (#12567 )	2021-07-08 07:20:46 -04:00
main_classes	[Flax] Correct flax docs (#12782 )	2021-08-04 16:31:23 +02:00
model_doc	Deberta tf (#12972 )	2021-08-12 05:01:26 -04:00
add_new_model.rst	consistent nn. and nn.functional: part 5 docs (#12161 )	2021-06-14 13:34:32 -07:00
benchmarks.rst	[Docs] fixed broken link (#12205 )	2021-06-16 15:14:53 -04:00
bertology.rst	Fix documentation links always pointing to master. (#9217 )	2021-01-05 06:18:48 -05:00
community.md	docs: add HuggingArtists to community notebooks (#13050 )	2021-08-10 09:36:44 +02:00
conf.py	Add multilingual documentation support (#12952 )	2021-07-30 20:56:14 +08:00
contributing.md	Update installation page and add contributing to the doc (#5084 )	2020-06-17 14:01:10 -04:00
converting_tensorflow_models.rst	Examples reorg (#11350 )	2021-04-21 11:11:20 -04:00
custom_datasets.rst	Rename NLP library to Datasets library (#10920 )	2021-03-26 08:07:59 -04:00
debugging.rst	[debug] DebugUnderflowOverflow doesn't work with DP (#12816 )	2021-07-21 09:36:02 -07:00
examples.md	per_device instead of per_gpu/error thrown when argument unknown (#4618 )	2020-05-27 11:36:55 -04:00
fast_tokenizers.rst	Documentation about loading a fast tokenizer within Transformers (#11029 )	2021-04-05 10:51:16 -04:00
favicon.ico	Adding usage examples for common tasks (#2850 )	2020-02-25 13:48:24 -05:00
glossary.rst	Add video links to the documentation (#12162 )	2021-06-15 06:37:37 -04:00
index.rst	Deberta tf (#12972 )	2021-08-12 05:01:26 -04:00
installation.md	Add mention of the huggingface_hub methods for offline mode (#12320 )	2021-06-23 09:45:30 -04:00
migration.md	consistent nn. and nn.functional: part 5 docs (#12161 )	2021-06-14 13:34:32 -07:00
model_sharing.rst	Add video links to the documentation (#12162 )	2021-06-15 06:37:37 -04:00
model_summary.rst	Add video links to the documentation (#12162 )	2021-06-15 06:37:37 -04:00
multilingual.rst	Examples reorg (#11350 )	2021-04-21 11:11:20 -04:00
notebooks.md	Update notebooks (#3620 )	2020-04-06 14:32:39 -04:00
parallelism.md	[parallelism doc] document Deepspeed-Inference and parallelformers (#12836 )	2021-07-21 15:11:02 -07:00
performance.md	[doc] performance: batch sizes (#12725 )	2021-07-15 09:39:34 -07:00
perplexity.rst	Create perplexity.rst (#13004 )	2021-08-05 02:56:13 -04:00
philosophy.rst	Minor documentation revisions from copyediting (#9266 )	2020-12-23 10:15:49 -05:00
preprocessing.rst	Add video links to the documentation (#12162 )	2021-06-15 06:37:37 -04:00
pretrained_models.rst	GPT Neo few fixes (#10968 )	2021-03-30 11:15:55 -04:00
quicktour.rst	Doctests job (#13088 )	2021-08-12 03:42:25 -04:00
sagemaker.md	remove documentation (#12657 )	2021-07-12 18:02:51 +02:00
serialization.rst	Add to ONNX docs (#13048 )	2021-08-09 09:51:49 -04:00
task_summary.rst	Doctests job (#13088 )	2021-08-12 03:42:25 -04:00
testing.rst	[doc] testing: how to trigger a self-push workflow (#12724 )	2021-07-15 16:18:56 -07:00
tokenizer_summary.rst	Add video links to the documentation (#12162 )	2021-06-15 06:37:37 -04:00
training.rst	fixed docs (#12646 )	2021-07-12 12:03:13 -04:00
troubleshooting.md	[troubleshooting] add 2 points of reference to the offline mode (#11236 )	2021-04-14 08:39:23 -07:00