transformers/examples/research_projects
Leandro von Werra 43f953cc2e
Add CodeParrot 🦜 codebase (#14536)
* add readme skeleton

* update readme

* add initialization script

* add deduplication script

* add codeparrot training script

* add code generation evaluation

* add validation loss script

* add requirements

* update readme

* tweak readme

* make style

* add highlights to readme

* add CLIs to scripts

* add tokenizer training script

* add docstring to constant length dataset

* fix defaults in arguments

* update readme with cli

* move image to hub

* tweaks of readme

* fix cli commands

* add author

* explain env variables

* fix formatting

* Update examples/research_projects/codeparrot/README.md

Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

* Apply suggestions from code review

Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

* replace generic with gpt2 tokenizer

Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
2021-12-02 10:41:35 +01:00
..
adversarial Update namespaces inside torch.utils.data to the latest. (#13167) 2021-08-19 14:29:51 +02:00
bert-loses-patience remove extra white space from log format (#12360) 2021-06-25 13:20:14 -07:00
bertabs make style (#11442) 2021-04-26 13:50:34 +02:00
bertology [style] consistent nn. and nn.functional: part 4 examples (#12156) 2021-06-14 12:28:24 -07:00
codeparrot Add CodeParrot 🦜 codebase (#14536) 2021-12-02 10:41:35 +01:00
deebert remove extra white space from log format (#12360) 2021-06-25 13:20:14 -07:00
distillation Remove n_ctx from configs (#14165) 2021-10-29 11:50:25 +02:00
fsner Update FSNER code in examples->research_projects->fsner (#13864) 2021-10-05 22:47:11 -04:00
jax-projects Switch from using sum for flattening lists of lists in group_texts (#14472) 2021-11-22 16:17:26 -05:00
longform-qa [style] consistent nn. and nn.functional: part 4 examples (#12156) 2021-06-14 12:28:24 -07:00
lxmert upgrade sentencepiece version (#13564) 2021-09-15 15:25:03 +02:00
mlm_wwm fix research_projects/mlm_wwm readme.md examples (#13646) 2021-09-20 15:01:35 -04:00
mm-imdb remove extra white space from log format (#12360) 2021-06-25 13:20:14 -07:00
movement-pruning use functional interface for softmax in attention (#14198) 2021-11-30 11:47:33 -05:00
performer remove extra white space from log format (#12360) 2021-06-25 13:20:14 -07:00
pplm Fix execution PATH for PPLM Example (#14287) 2021-11-06 10:33:47 -04:00
quantization-qdqbert Add QDQBert model and quantization examples of SQUAD task (#14066) 2021-11-19 13:33:39 -05:00
rag minor fixes in original RAG training (#12395) 2021-06-29 13:39:48 +01:00
rag-end2end-retriever rm require_version_examples (#12088) 2021-06-09 11:02:52 -07:00
seq2seq-distillation Update Transformers to huggingface_hub >= 0.1.0 (#14251) 2021-11-02 18:58:42 -04:00
visual_bert upgrade sentencepiece version (#13564) 2021-09-15 15:25:03 +02:00
wav2vec2 fix --gradient_checkpointing (#13964) 2021-11-11 17:50:21 +01:00
zero-shot-distillation remove extra white space from log format (#12360) 2021-06-25 13:20:14 -07:00
README.md Reorganize examples (#9010) 2020-12-11 10:07:02 -05:00

Research projects

This folder contains various research projects using 🤗 Transformers. They are not maintained and require a specific version of 🤗 Transformers that is indicated in the requirements file of each folder. Updating them to the most recent version of the library will require some work.

To use any of them, just run the command

pip install -r requirements.txt

inside the folder of your choice.

If you need help with any of those, contact the author(s), indicated at the top of the README of each folder.