Yih-Dar
|
05de764e9c
|
Aurevoir PyTorch 1 (#35358)
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
|
2024-12-20 14:36:31 +01:00 |
|
Joao Gante
|
a7734238ff
|
Generation tests: update imagegpt input name, remove unused functions (#33663)
|
2024-09-24 16:40:48 +01:00 |
|
Younes Belkada
|
47b096412d
|
Fix: Fix FalconMamba training issues due to incompatible kernels (#33195)
* fix FM training kernels
* fix copies
* fix copies
* propagate to slow path
* make it BC
* add comment
* fix test
|
2024-09-05 11:55:08 +02:00 |
|
Younes Belkada
|
93e538ae2e
|
Mamba / FalconMamba: Fix mamba left padding (#32677)
* fix mamba left padding
* Apply suggestions from code review
Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>
* fix copies
* test with `inputs_embeds`
* Update src/transformers/models/falcon_mamba/modeling_falcon_mamba.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* copies
* clarify
* fix last comments
* remove
---------
Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
|
2024-08-19 16:01:35 +02:00 |
|
Younes Belkada
|
7c11491208
|
Add new model (#32615)
* v1 - working version
* fix
* fix
* fix
* fix
* rename to correct name
* fix title
* fixup
* rename files
* fix
* add copied from on tests
* rename to `FalconMamba` everywhere and fix bugs
* fix quantization + accelerate
* fix copies
* add `torch.compile` support
* fix tests
* fix tests and add slow tests
* copies on config
* merge the latest changes
* fix tests
* add few lines about instruct
* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fix
* fix tests
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
|
2024-08-12 08:22:47 +02:00 |
|