transformers/templates
Sanchit Gandhi cd9274d010
[FlaxBert] Add ForCausalLM (#16995)
* [FlaxBert] Add ForCausalLM

* make style

* fix output attentions

* Add RobertaForCausalLM

* remove comment

* fix fx-to-pt model loading

* remove comment

* add modeling tests

* add enc-dec model tests

* add big_bird

* add electra

* make style

* make repo-consitency

* add to docs

* remove roberta test

* quality

* amend cookiecutter

* fix attention_mask bug in flax bert model tester

* tighten pt-fx thresholds to 1e-5

* add 'copied from' statements

* amend 'copied from' statements

* amend 'copied from' statements

* quality
2022-05-03 11:26:19 +02:00
..
adding_a_missing_tokenization_test add a template to add missing tokenization test (#16553) 2022-04-05 10:50:22 +02:00
adding_a_new_example_script add torch.no_grad when in eval mode (#17020) 2022-05-02 07:49:19 -04:00
adding_a_new_model [FlaxBert] Add ForCausalLM (#16995) 2022-05-03 11:26:19 +02:00