transformers/docs/source
Andreas Madsen b4b613b102
Implement Roberta PreLayerNorm (#20305)
* Copy RoBERTa

* formatting

* implement RoBERTa with prelayer normalization

* update test expectations

* add documentation

* add convertion script for DinkyTrain weights

* update checkpoint repo

Unfortunately the original checkpoints assumes a hacked roberta model

* add to RoBERTa-PreLayerNorm docs to toc

* run utils/check_copies.py

* lint files

* remove unused import

* fix check_repo reporting wrongly a test is missing

* fix import error, caused by rebase

* run make fix-copies

* add RobertaPreLayerNormConfig to ROBERTA_EMBEDDING_ADJUSMENT_CONFIGS

* Fix documentation <Facebook> -> Facebook

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fixup: Fix documentation <Facebook> -> Facebook

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Add missing Flax header

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* expected_slice -> EXPECTED_SLICE

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* update copies after rebase

* add missing copied from statements

* make fix-copies

* make prelayernorm explicit in code

* fix checkpoint path for the original implementation

* add flax integration tests

* improve docs

* update utils/documentation_tests.txt

* lint files

* Remove Copyright notice

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* make fix-copies

* Remove EXPECTED_SLICE calculation comments

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-12-19 09:30:17 +01:00
..
de Convert tokenizer outputs for Keras in doc example (#20732) 2022-12-12 16:14:04 +00:00
en Implement Roberta PreLayerNorm (#20305) 2022-12-19 09:30:17 +01:00
es Spanish translation of the file debugging.mdx (#20566) 2022-12-12 10:38:56 -05:00
it [docs] set overflowing image width to auto-scale (#20197) 2022-11-15 01:13:40 +01:00
ko fix: "BigSicence" typo in docs (#20331) 2022-11-21 09:44:54 -05:00
pt [docs] set overflowing image width to auto-scale (#20197) 2022-11-15 01:13:40 +01:00
zh translate zh quicktour(#20095) (#20181) 2022-11-21 08:44:18 -05:00
_config.py Use doc builder styler (#16412) 2022-03-28 07:45:18 -04:00