transformers

riaz.somc/transformers

Fork 0

mirror of https://github.com/huggingface/transformers.git synced 2025-07-08 07:10:06 +06:00

Commit Graph

Author	SHA1	Message	Date
Guido Novati	ecd6efe7cb	Fix megatron_gpt2 attention block's causal mask (#12007 ) * Fix megatron_gpt2 attention block's causal mask. * compatibility with checkpoints created with recent versions of Megatron-LM * added integration test for the released Megatron-GPT2 model * code style changes * added option to megatron conversion script to read from config file Co-authored-by: Guido Novati <gnovati@nvidia.com>	2021-06-14 04:57:55 -04:00

Author

SHA1

Message

Date

Guido Novati

ecd6efe7cb

Fix megatron_gpt2 attention block's causal mask (#12007 )

* Fix megatron_gpt2 attention block's causal mask.

* compatibility with checkpoints created with recent versions of Megatron-LM

* added integration test for the released Megatron-GPT2 model

* code style changes

* added option to megatron conversion script to read from config file

Co-authored-by: Guido Novati <gnovati@nvidia.com>

2021-06-14 04:57:55 -04:00

1 Commits