transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-30 17:52:35 +06:00

Author	SHA1	Message	Date
Sam Shleifer	297233fa92	[s2s] Switch README urls to cdn (#7670 )	2020-10-08 21:22:22 -04:00
Sam Shleifer	a1ecc90d6b	[pseudo] Switch URLS to CDN (#7661 )	2020-10-08 14:12:39 -04:00
Suraj Patil	06a973fd2a	[s2s] configure lr_scheduler from command line (#7641 )	2020-10-08 13:06:35 -04:00
Sam Shleifer	aba4e22944	[pseudolabels] cleanup markdown table (#7653 )	2020-10-07 23:04:18 -04:00
Sam Shleifer	e2bb9abb6a	[s2s] release pseudolabel links and instructions (#7639 )	2020-10-07 11:20:44 -04:00
Sylvain Gugger	08ba4b4902	Trainer callbacks (#7596 ) * Initial callback proposal * Finish various callbacks * Post-rebase conflicts * Fix tests * Don't use something that's not set * Documentation * Remove unwanted print. * Document all models can work * Add tests + small fixes * Update docs/source/internal/trainer_utils.rst Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address review comments * Fix TF tests * Real fix this time * This one should work * Fix typo * Really fix typo Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-10-07 10:50:21 -04:00
Sam Shleifer	500be01c5d	[s2s] save first batch to json for debugging purposes (#6810 )	2020-10-06 16:11:56 -04:00
Sam Shleifer	d5d2744aa7	Support T5 Distillation w/hidden state supervision (#7599 )	2020-10-05 21:31:48 -04:00
Suraj Patil	99cb924bfb	[s2s] add config params like Dropout in Seq2SeqTrainingArguments (#7532 )	2020-10-04 12:42:30 -04:00
Sam Shleifer	9bdce3a4f9	[s2s] fix lockfile and peg distillation constants (#7545 )	2020-10-02 15:58:14 -04:00
Sam Shleifer	de4d7b004a	[s2s] Adafactor support for builtin trainer (#7522 )	2020-10-01 17:27:45 -04:00
Sam Shleifer	d3a9601a11	[s2s] trainer scripts: Remove --run_name, thanks sylvain! (#7521 )	2020-10-01 17:18:47 -04:00
Sylvain Gugger	bdcc4b78a2	Fix seq2seq example test (#7518 ) * Fix seq2seq example test * Fix bad copy-paste * Also save the state	2020-10-01 14:13:29 -04:00
Sam Shleifer	2a358f45ef	[s2s] fix nltk pytest race condition with FileLock (#7515 )	2020-10-01 12:51:09 -04:00
Suraj Patil	72d363d979	[examples/s2s] clean up finetune_trainer (#7509 )	2020-10-01 12:19:29 -04:00
Sam Shleifer	48f23f92a8	[s2sTrainer] test + code cleanup (#7467 )	2020-10-01 00:33:01 -04:00
Julien Chaumond	0acd1ffa09	[doc] rm Azure buttons as not implemented yet	2020-09-30 17:31:08 -04:00
Sam Shleifer	03e46c1de3	[s2s] fix kwargs style (#7488 )	2020-09-30 17:00:06 -04:00
Sam Shleifer	6fe8a693eb	[s2s] Fix t5 warning for distributed eval (#7487 )	2020-09-30 16:58:03 -04:00
Amanpreet Singh	c031d01023	Seq2SeqDataset: avoid passing src_lang everywhere (#7470 ) Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-09-30 13:27:48 -04:00
Suraj Patil	08939cfdf7	[s2strainer] fix eval dataset loading (#7477 )	2020-09-30 12:39:13 -04:00
Sam Shleifer	74d8d69bd4	[s2s] consistent output format across eval scripts (#7435 )	2020-09-28 23:20:03 -04:00
Ola Piktus	ae3e84f3ba	[RAG] Clean Rag readme in examples (#7413 ) * Improve README + consolidation script * Reformat README * Reformat README Co-authored-by: Your Name <you@example.com>	2020-09-28 10:06:39 +02:00
Sam Shleifer	748425d47d	[T5] allow config.decoder_layers to control decoder size (#7409 ) * Working assymmetrical T5 * rename decoder_layers -> num_decoder_layers * Fix docstring * Allow creation of asymmetric t5 students	2020-09-28 03:08:04 -04:00
Sam Shleifer	7296fea1d6	[s2s] rougeLSum expects \n between sentences (#7410 ) Co-authored-by: Swetha Mandava <smandava@nvidia.com>	2020-09-27 16:27:19 -04:00
Suraj Patil	eab5f59682	[s2s] add create student script (#7290 ) Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-09-27 15:10:46 -04:00
Ola Piktus	fe326bd5cf	Remove dependency on examples/seq2seq from rag (#7395 ) Co-authored-by: Your Name <you@example.com>	2020-09-25 18:20:49 +02:00
Quentin Lhoest	cf1c88e092	[RAG] Fix retrieval offset in RAG's HfIndex and better integration tests (#7372 ) * Fix retrieval offset in RAG's HfIndex * update slow tests * style * fix new test * style * add better tests Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-09-25 16:12:46 +02:00
Suraj Patil	415071b4c2	doc changes (#7385 )	2020-09-25 08:00:36 -04:00
Patrick von Platen	2dd652d757	[RAG] Add missing doc and attention_mask to rag (#7382 ) * add docs * add missing docs and attention_mask in fine-tune	2020-09-25 11:23:55 +02:00
Suraj Patil	9e68d075a4	Seq2SeqTrainer (#6769 ) Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-09-24 18:46:58 -04:00
Sam Shleifer	d9d0f1140b	[s2s] distributed eval allows num_return_sequences > 1 (#7254 )	2020-09-24 17:30:09 -04:00
Patrick von Platen	0804d077c6	correct attention mask (#7373 )	2020-09-24 23:22:04 +02:00
Stas Bekman	eadd870b2f	[seq2seq] make it easier to run the scripts (#7274 )	2020-09-24 15:23:48 -04:00
Felipe Curti	d266613635	[Benchmarks] Change all args to from `no_...` to their positive form (#7075 ) * Changed name to all no_... arguments and all references to them, inverting the boolean condition * Change benchmark tests to use new Benchmark Args * Update src/transformers/benchmark/benchmark_args_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/benchmark/benchmark.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Fix Style. Add --no options in help * fix some part of tests * Update src/transformers/benchmark/benchmark_args_utils.py * Update src/transformers/benchmark/benchmark_args_utils.py * Update src/transformers/benchmark/benchmark_args_utils.py * fix all tests * make style * add backwards compability * make backwards compatible Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: fmcurti <fcurti@DESKTOP-RRQURBM.localdomain>	2020-09-23 13:25:24 -04:00
Sam Shleifer	78387cc63e	[s2s] only save metrics.json from rank zero (#7331 )	2020-09-22 18:27:28 -04:00
Sam Shleifer	e53138a1b9	[s2s] add src_lang kwarg for distributed eval (#7300 )	2020-09-22 18:26:37 -04:00
Sam Shleifer	25b0463d0b	[s2s] add supported architecures to MD (#7252 )	2020-09-22 13:09:35 -04:00
Ola Piktus	c754c41c61	RAG (#6813 ) * added rag WIP * path fix * Formatting / renaming prior to actual work * added rag WIP * path fix * Formatting / renaming prior to actual work * added rag WIP * path fix * Formatting / renaming prior to actual work * added rag WIP * Formatting / renaming prior to actual work * First commit * improve comments * Retrieval evaluation scripts * refactor to include modeling outputs + MPI retriever * Fix rag-token model + refactor * Various fixes + finetuning logic * use_bos fix * Retrieval refactor * Finetuning refactoring and cleanup * Add documentation and cleanup * Remove set_up_rag_env.sh file * Fix retrieval wit HF index * Fix import errors * Fix quality errors * Refactor as per suggestions in https://github.com/huggingface/transformers/pull/6813#issuecomment-687208867 * fix quality * Fix RAG Sequence generation * minor cleanup plus initial tests * fix test * fix tests 2 * Comments fix * post-merge fixes * Improve readme + post-rebase refactor * Extra dependencied for tests * Fix tests * Fix tests 2 * Refactor test requirements * Fix tests 3 * Post-rebase refactor * rename nlp->datasets * RAG integration tests * add tokenizer to slow integration test and allow retriever to run on cpu * add tests; fix position ids warning * change structure * change structure * add from encoder generator * save working solution * make all integration tests pass * add RagTokenizer.save/from_pretrained and RagRetriever.save/from_pretrained * don't save paths * delete unnecessary imports * pass config to AutoTokenizer.from_pretrained for Rag tokenizers * init wiki_dpr only once * hardcode legacy index and passages paths (todo: add the right urls) * finalize config * finalize retriver api and config api * LegacyIndex index download refactor * add dpr to autotokenizer * make from pretrained more flexible * fix ragfortokengeneration * small name changes in tokenizer * add labels to models * change default index name * add retrieval tests * finish token generate * align test with previous version and make all tests pass * add tests * finalize tests * implement thoms suggestions * add first version of test * make first tests work * make retriever platform agnostic * naming * style * add legacy index URL * docstrings + simple retrieval test for distributed * clean model api * add doc_ids to retriever's outputs * fix retrieval tests * finish model outputs * finalize model api * fix generate problem for rag * fix generate for other modles * fix some tests * save intermediate * set generate to default * big refactor generate * delete rag_api * correct pip faiss install * fix auto tokenization test * fix faiss install * fix test * move the distributed logic to examples * model page * docs * finish tests * fix dependencies * fix import in __init__ * Refactor eval_rag and finetune scripts * start docstring * add psutil to test * fix tf test * move require torch to top * fix retrieval test * align naming * finish automodel * fix repo consistency * test ragtokenizer save/load * add rag model output docs * fix ragtokenizer save/load from pretrained * fix tokenizer dir * remove torch in retrieval * fix docs * fixe finetune scripts * finish model docs * finish docs * remove auto model for now * add require torch * remove solved todos * integrate sylvains suggestions * sams comments * correct mistake on purpose * improve README * Add generation test cases * fix rag token * clean token generate * fix test * add note to test * fix attention mask * add t5 test for rag * Fix handling prefix in finetune.py * don't overwrite index_name Co-authored-by: Patrick Lewis <plewis@fb.com> Co-authored-by: Aleksandra Piktus <piktus@devfair0141.h2.fair> Co-authored-by: Aleksandra Piktus <piktus@learnfair5102.h2.fair> Co-authored-by: Aleksandra Piktus <piktus@learnfair5067.h2.fair> Co-authored-by: Your Name <you@example.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>	2020-09-22 18:29:58 +02:00
Julien Plu	585217c87f	Add generic text classification example in TF (#5716 ) * Add new example with nlp * Update README * replace nlp by datasets * Update examples/text-classification/README.md Add Lysandre's suggestion. Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-09-22 12:05:05 -04:00
Sam Shleifer	656c27c3a3	[s2s] save hostname with repo info (#7301 ) * save hostname	2020-09-21 17:26:24 -04:00
Stas Bekman	af4b98ed97	[s2s] adjust finetune + test to work with fsmt (#7263 )	2020-09-21 15:13:19 -04:00
Stas Bekman	8d562a2d1a	[s2s] s/alpha_loss_encoder/alpha_encoder_loss/ (#7298 ) fix to match `distillation.py: self.alpha_encoder_loss`	2020-09-21 14:14:26 -04:00
Stas Bekman	cbb2f75a16	[s2s tests] fix test_run_eval_search (#7297 )	2020-09-21 14:00:40 -04:00
Lysandre	aae4edb5f0	Addressing review comment	2020-09-21 11:37:00 +02:00
Suraj Patil	43b9d93875	[example/glue] fix compute_metrics_fn for bart like models (#7248 ) * fix compute_metrics_fn * p.predictions -> preds * apply suggestions	2020-09-21 05:34:20 -04:00
Stas Bekman	7cbf0f722d	examples/seq2seq/__init__.py mutates sys.path (#7194 )	2020-09-20 16:54:42 -04:00
Sam Shleifer	83dba10b8f	[s2s] distributed_eval.py saves better speed info (#7242 )	2020-09-18 15:46:01 -04:00
Stefan Schweter	ee9eae4e06	token-classification: update url of GermEval 2014 dataset (#6571 )	2020-09-18 06:18:06 -04:00
Sam Shleifer	67d9fc50d9	[s2s] remove double assert (#7223 )	2020-09-17 18:32:31 -04:00

1233 Commits