Patrick von Platen
|
996a315e76
|
Flax Generate (#11777)
* fix_torch_device_generate_test
* remove @
* add
* indexing
* correct a couple of tests
* fix tests
* add logits processor
* finish top_k, top_p, temp
* add docs
* correct flax prng key default
* improve generate
* add generation docs
* add docs
* make style
* revert model outputs change
* make style
* correct typo
* fix tests
* fix slow test
* add raise
* finish generation
Co-authored-by: Patrick von Platen <patrick@huggingface.co>
|
2021-05-27 00:18:17 +01:00 |
|
Suraj Patil
|
ca33278fdb
|
FlaxGPT2 (#11556)
* flax gpt2
* combine masks
* handle shared embeds
* add causal LM sample
* style
* add tests
* style
* fix imports, docs, quality
* don't use cache
* add cache
* add cache 1st version
* make use cache work
* start adding test for generation
* finish generation loop compilation
* rewrite test
* finish
* update
* update
* apply sylvains suggestions
* update
* refactor
* fix typo
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
|
2021-05-18 22:50:51 +01:00 |
|