transformers/utils
Yih-Dar 2e20c0f34a
Make Flax GPT2 working with cross attention (#13008)
* make flax gpt2 working with cross attention

* Remove encoder->decoder projection layer

* A draft (incomplete) for FlaxEncoderDecoderModel

* Add the method from_encoder_decoder_pretrained + the docstrings

* Fix the mistakes of using EncoderDecoderModel

* Fix style

* Add FlaxEncoderDecoderModel to the library

* Fix cyclic imports

* Add FlaxEncoderDecoderModel to modeling_flax_auto.py

* Remove question comments

* add tests for FlaxEncoderDecoderModel

* add flax_encoder_decoder to the lists of ignored entries in check_repo.py

* fix missing required positional arguments

* Remove **kwargs when creating FlaxEncoderDecoderModel in from_encoder_decoder_pretrained()

Also fix generation eos/pad tokens issue

* Fix: Use sequences from the generated_output

* Change a check from assert to raise ValueError

* Fix examples and token ids issues

* Fix missing all_cross_attentions when outputting tuple in modeling_gpt2

* Remove the changes in configuration docstrings.

* allow for bert 2 gpt2

* make fix-copies

* Apply suggestions from code review

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Change remaining examples to bert2gpt2

* Change the test to Bert2GPT2

* Fix examples

* Fix import

* Fix unpack bug

* Rename to FlaxEncoderDecoderModelTest and change the test to bert2gpt2

* Apply suggestions from code review

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Fix: NotImplentedError -> NotImplementedError

* Apply suggestions from code review

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* up

* finalize

Co-authored-by: ydshieh <ydshieh@user.noreply>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2021-08-23 17:57:29 +02:00
Name                     Last commit                                                     Date
tf_ops                   Check TF ops for ONNX compliance (#10025)                       2021-02-15 07:55:10 -05:00
check_copies.py          Add possibility to maintain full copies of files (#12312)       2021-06-28 10:02:53 -04:00
check_dummies.py         Have dummy processors have a from_pretrained method (#12145)    2021-06-15 08:39:05 -04:00
check_inits.py           Import check_inits handling of duplicate definitions. (#12467)  2021-07-01 12:52:00 -04:00
check_repo.py            Make Flax GPT2 working with cross attention (#13008)            2021-08-23 17:57:29 +02:00
check_table.py           Fix frameworks table so it's alphabetical (#13118)              2021-08-16 15:45:19 +02:00
check_tf_ops.py          Check TF ops for ONNX compliance (#10025)                       2021-02-15 07:55:10 -05:00
custom_init_isort.py     Sort init import (#10801)                                       2021-03-19 16:17:13 -04:00
documentation_tests.txt  Doctests job (#13088)                                           2021-08-12 03:42:25 -04:00
download_glue_data.py    Enforce string-formatting with f-strings (#10980)               2021-03-31 10:00:27 -04:00
get_modified_files.py    a more reliable version of branching point discovery (#9449)    2021-01-07 04:47:50 -05:00
link_tester.py           Enforce string-formatting with f-strings (#10980)               2021-03-31 10:00:27 -04:00
notification_service.py  Add timeout to CI. (#12684)                                     2021-07-13 15:13:18 -04:00
release.py               Fix pattern in conf.py (#11784)                                 2021-05-20 09:30:31 -04:00
style_doc.py             Only test the files impacted by changes in the diff (#12644)    2021-07-14 10:56:55 -04:00
tests_fetcher.py         Rely on huggingface_hub for common tools (#13100)               2021-08-12 14:59:02 +02:00