cyyever
|
1e6b546ea6
|
Use Python 3.9 syntax in tests (#37343)
Signed-off-by: cyy <cyyever@outlook.com>
|
2025-04-08 14:12:08 +02:00 |
|
Mohamed Mekkouri
|
92429057d9
|
Skip FP8 linear tests For device capability < 9.0(#37008)
* skip fp8 linear
* add capability check
* format
|
2025-03-27 12:38:37 +01:00 |
|
湛露先生
|
ebd2029483
|
Change GPUS to GPUs (#36945)
Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
|
2025-03-25 17:25:39 +01:00 |
|
Mohamed Mekkouri
|
cb586a3999
|
Add require_read_token to fp8 tests (#36189)
fix
|
2025-02-14 12:27:35 +01:00 |
|
Mohamed Mekkouri
|
efe72fe21f
|
Adding FP8 Quantization to transformers (#36026)
* first commit
* adding kernels
* fix create_quantized_param
* fix quantization logic
* end2end
* fix style
* fix imports
* fix consistency
* update
* fix style
* update
* udpate after review
* make style
* update
* update
* fix
* update
* fix docstring
* update
* update after review
* update
* fix scheme
* update
* update
* fix
* update
* fix docstring
* add source
* fix test
---------
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
|
2025-02-13 13:01:19 +01:00 |
|