transformers/docs/source
Sayak Paul 84eaa6acf5
Add TFConvNextModel (#15750)
* feat: initial implementation of convnext in tensorflow.

* fix: sample code for the classification model.

* chore: added checked for  from the classification model.

* chore: set bias initializer in the classification head.

* chore: updated license terms.

* chore: removed ununsed imports

* feat: enabled  argument during using drop_path.

* chore: replaced tf.identity with layers.Activation(linear).

* chore: edited default checkpoint.

* fix: minor bugs in the initializations.

* partial-fix: tf model errors for loading pretrained pt weights.

* partial-fix: call method updated

* partial-fix: cross loading of weights (4x3 variables to be matched)

* chore: removed unneeded comment.

* removed playground.py

* rebasing

* rebasing and removing playground.py.

* fix: renaming TFConvNextStage conv and layer norm layers

* chore: added initializers and other minor additions.

* chore: added initializers and other minor additions.

* add: tests for convnext.

* fix: integration tester class.

* fix: issues mentioned in pr feedback (round 1).

* fix: how output_hidden_states arg is propoagated inside the network.

* feat: handling of  arg for pure cnn models.

* chore: added a note on equal contribution in model docs.

* rebasing

* rebasing and removing playground.py.

* feat: encapsulation for the convnext trunk.

* Fix variable naming; Test-related corrections; Run make fixup

* chore: added Joao as a contributor to convnext.

* rebasing

* rebasing and removing playground.py.

* rebasing

* rebasing and removing playground.py.

* chore: corrected copyright year and added comment on NHWC.

* chore: fixed the black version and ran formatting.

* chore: ran make style.

* chore: removed from_pt argument from test, ran make style.

* rebasing

* rebasing and removing playground.py.

* rebasing

* rebasing and removing playground.py.

* fix: tests in the convnext subclass, ran make style.

* rebasing

* rebasing and removing playground.py.

* rebasing

* rebasing and removing playground.py.

* chore: moved convnext test to the correct location

* fix: locations for the test file of convnext.

* fix: convnext tests.

* chore: applied  sgugger's suggestion for dealing w/ output_attentions.

* chore: added comments.

* chore: applied updated quality enviornment style.

* chore: applied formatting with quality enviornment.

* chore: revert to the previous tests/test_modeling_common.py.

* chore: revert to the original test_modeling_common.py

* chore: revert to previous states for test_modeling_tf_common.py and modeling_tf_utils.py

* fix: tests for convnext.

* chore: removed output_attentions argument from convnext config.

* chore: revert to the earlier tf utils.

* fix: output shapes of the hidden states

* chore: removed unnecessary comment

* chore: reverting to the right test_modeling_tf_common.py.

* Styling nits

Co-authored-by: ariG23498 <aritra.born2fly@gmail.com>
Co-authored-by: Joao Gante <joao@huggingface.co>
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>
2022-02-25 18:19:16 +01:00
..
internal TF generate refactor - Greedy Search (#15562) 2022-02-15 17:54:43 +01:00
main_classes Adding ZeroShotImageClassificationPipeline (#12119) 2022-02-23 09:41:42 +01:00
model_doc Add TFConvNextModel (#15750) 2022-02-25 18:19:16 +01:00
tasks 🧼 NLP task guides (#15731) 2022-02-23 13:58:33 -06:00
_config.py Prevent style_doc from tempering the config file 2021-12-10 15:31:43 -05:00
_toctree.yml 🧼 NLP task guides (#15731) 2022-02-23 13:58:33 -06:00
accelerate.mdx Fix code format for Accelerate doc (#15335) 2022-01-27 13:49:04 -06:00
add_new_model.mdx added link to our writing-doc document (#15756) 2022-02-22 09:57:28 +01:00
add_new_pipeline.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
autoclass_tutorial.mdx Update tutorial docs (#15165) 2022-02-01 18:31:35 -06:00
benchmarks.mdx [doc] normalize HF Transformers string (#15023) 2022-01-10 08:44:33 -08:00
bertology.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
community.mdx add t5 ner finetuning (#15432) 2022-01-31 17:03:06 +01:00
contributing.md Update installation page and add contributing to the doc (#5084) 2020-06-17 14:01:10 -04:00
converting_tensorflow_models.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
create_a_model.mdx Create a custom model guide (#15489) 2022-02-07 12:34:56 -06:00
custom_datasets.mdx Added missing code in exemplary notebook - custom datasets fine-tuning (#15300) 2022-01-25 17:26:17 -05:00
custom_models.mdx [doc] custom_models: mention security features of the Hub (#15768) 2022-02-23 11:40:06 -05:00
debugging.mdx add a network debug script and document it (#15652) 2022-02-15 08:48:00 -08:00
examples.md per_device instead of per_gpu/error thrown when argument unknown (#4618) 2020-05-27 11:36:55 -04:00
fast_tokenizers.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
glossary.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
index.mdx Add TFConvNextModel (#15750) 2022-02-25 18:19:16 +01:00
installation.mdx Get started docs (#15098) 2022-01-28 19:01:37 -06:00
migration.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
model_sharing.mdx Update model share tutorial (#15288) 2022-01-28 18:49:26 -06:00
model_summary.mdx Add "open in hf spaces" gradio button issue #73 (#15106) 2022-01-14 10:12:30 -05:00
multilingual.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
notebooks.md Update notebooks (#3620) 2020-04-06 14:32:39 -04:00
parallelism.mdx [deepspeed docs] Megatron-Deepspeed info (#15488) 2022-02-04 11:15:13 -08:00
performance.mdx add model scaling section (#15119) 2022-02-09 15:27:30 +01:00
perplexity.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
philosophy.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
pipeline_tutorial.mdx Update tutorial docs (#15165) 2022-02-01 18:31:35 -06:00
pr_checks.mdx [docs] fix wrong file name in pr_check (#15380) 2022-01-28 07:52:01 -05:00
preprocessing.mdx Update tutorial docs (#15165) 2022-02-01 18:31:35 -06:00
quicktour.mdx Re-enable doctests for the quicktour (#15828) 2022-02-25 17:46:38 +01:00
sagemaker.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
serialization.mdx Add PLBart (#13269) 2022-02-18 14:17:09 +01:00
task_summary.mdx Re-enable doctests for the quicktour (#15828) 2022-02-25 17:46:38 +01:00
testing.mdx [doc] normalize HF Transformers string (#15023) 2022-01-10 08:44:33 -08:00
tokenizer_summary.mdx Fix grammar in tokenizer_summary (#15614) 2022-02-11 16:51:30 -05:00
training.mdx Update fine-tune docs (#15259) 2022-02-01 18:28:12 -06:00
troubleshooting.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00