transformers/tests/models/nemotron
Longjie Zheng 0d1692a49b
Fix attn mask ignore logic in training-time trace (#32613)
* fix attn mask logic for training-time trace

* add test

* fix

* fix

* fix

* fix

* fix

* format

* [run-slow] llama

* avoid accelearate

* [run-slow] llama
2024-10-04 19:00:45 +02:00
..
__init__.py Add Nemotron HF Support (#31699) 2024-08-06 15:42:05 +02:00
test_modeling_nemotron.py Fix attn mask ignore logic in training-time trace (#32613) 2024-10-04 19:00:45 +02:00