transformers/examples/research_projects/codeparrot/scripts
Loubna Ben Allal d91841315a
New features for CodeParrot training script (#16851)
* add tflops logging and fix grad accumulation

* add accelerate tracking and checkpointing

* scale loss of last batch correctly

* fix typo

* compress loss computation

Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* add resume from checkpoint argument

* add load_state accelerate from checkpoint, register lr scheduler and add tflops function

* reformat code

* reformat code

* add condition on path for resume checkpoint

* combine if conditions

Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* add source for tflops formula

Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
2022-04-21 18:43:46 +02:00
arguments.py            New features for CodeParrot training script (#16851)       2022-04-21 18:43:46 +02:00
bpe_training.py         fix: switch from slow to generic tokenizer class (#15122)  2022-01-12 09:12:43 -05:00
codeparrot_training.py  New features for CodeParrot training script (#16851)       2022-04-21 18:43:46 +02:00
human_eval.py           Jia multi gpu eval (#16428)                                2022-04-11 11:24:32 +02:00
initialize_model.py     Add CodeParrot 🦜 codebase (#14536)                        2021-12-02 10:41:35 +01:00
preprocessing.py        Add CodeParrot 🦜 codebase (#14536)                        2021-12-02 10:41:35 +01:00
validation_loss.py      Add CodeParrot 🦜 codebase (#14536)                        2021-12-02 10:41:35 +01:00