transformers/tests/models/deepseek_v3
2025-06-24 20:16:56 +02:00
__init__.py                   [WIP] add deepseek-v3 (#35926)                                                2025-03-28 15:56:59 +01:00
test_modeling_deepseek_v3.py  Skip sdpa dispatch on flash test due to unsupported head dims (#39010)  2025-06-24 20:16:56 +02:00