transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-24 23:08:57 +06:00

Author	SHA1	Message	Date
Suraj Patil	e49393c361	[examples] Add trainer support for question-answering (#4829 ) * add SquadDataset * add DataCollatorForQuestionAnswering * update __init__ * add run_squad with trainer * add DataCollatorForQuestionAnswering in __init__ * pass data_collator to trainer * doc tweak * Update run_squad_trainer.py * Update __init__.py * Update __init__.py Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-07-07 08:57:08 -04:00
Shashank Gupta	3dcb748e31	Added data collator for permutation (XLNet) language modeling and related calls (#5522 ) * Added data collator for XLNet language modeling and related calls Added DataCollatorForXLNetLanguageModeling in data/data_collator.py to generate necessary inputs for language modeling training with XLNetLMHeadModel. Also added related arguments, logic and calls in examples/language-modeling/run_language_modeling.py. Resolves: #4739, #2008 (partially) * Changed name to `DataCollatorForPermutationLanguageModeling` Changed the name of `DataCollatorForXLNetLanguageModeling` to the more general `DataCollatorForPermutationLanguageModelling`. Removed the `--mlm` flag requirement for the new collator and defined a separate `--plm_probability` flag for its use. CTRL uses a CLM loss just like GPT and GPT-2, so should work out of the box with this script (provided `past` is taken care of similar to `mems` for XLNet). Changed calls and imports appropriately. * Added detailed comments, changed variable names Added more detailed comments to `DataCollatorForPermutationLanguageModeling` in `data/data_collator.py` to explain working. Also cleaned up variable names and made them more informative. * Added tests for new data collator Added tests in `tests/test_trainer.py` for DataCollatorForPermutationLanguageModeling based on those in DataCollatorForLanguageModeling. A specific test has been added to check for odd-length sequences. * Fixed styling issues	2020-07-07 10:17:37 +02:00
Lysandre Debut	9d9b872b66	The `add_space_before_punct_symbol` is only for TransfoXL (#5549 )	2020-07-06 12:17:05 -04:00
Sylvain Gugger	734a28a767	Clean up diffs in Trainer/TFTrainer (#5417 ) * Cleanup and unify Trainer/TFTrainer * Forgot to adapt TFTrainingArgs * In tf scripts n_gpu -> n_replicas * Update src/transformers/training_args.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address review comments * Formatting * Fix typo Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-07-01 11:00:20 -04:00
Sam Shleifer	13deb95a40	Move tests/utils.py -> transformers/testing_utils.py (#5350 )	2020-07-01 10:31:17 -04:00
Sylvain Gugger	4ade7491f4	Fix examples titles and optimization doc page (#5408 )	2020-07-01 08:11:25 -04:00
Hong Xu	501040fd30	In the run_ner.py example, give the optional label arg a default value (#5326 ) Otherwise, if label is not specified, the following error occurs: Traceback (most recent call last): File "run_ner.py", line 303, in <module> main() File "run_ner.py", line 101, in main model_args, data_args, training_args = parser.parse_json_file(json_file=os.path.abspath(sys.argv[1])) File "/home/user/anaconda3/envs/bert/lib/python3.7/site-packages/transformers/hf_argparser.py", line 159, in parse_json_file obj = dtype(**inputs) TypeError: __init__() missing 1 required positional argument: 'labels'	2020-06-30 19:45:35 -04:00
Sam Shleifer	27a7fe7a8d	examples/seq2seq: never override $WANDB_PROJECT (#5407 )	2020-06-30 15:29:13 -04:00
Kevin Canwen Xu	331d8d2936	Upload DistilBART artwork (#5394 )	2020-06-30 18:11:11 +08:00
MichaelJanz	9a473f1e43	Update Bertabs example to work again (#5355 ) * Fix the bug 'Attempted relative import with no known parent package' when using the bertabs example. Also change the used model from bertabs-finetuned-cnndm, since it seems not be accessible anymore * Update run_summarization.py Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>	2020-06-30 14:05:01 +08:00
Sam Shleifer	a316a6aaa8	[seq2seq docs] Move evaluation down, fix typo (#5365 )	2020-06-29 10:36:04 -04:00
Patrick von Platen	4bcc35cd69	[Docs] Benchmark docs (#5360 ) * first doc version * add benchmark docs * fix typos * improve README * Update docs/source/benchmarks.rst Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * fix naming and docs Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-06-29 16:08:57 +02:00
Sam Shleifer	45e26125de	save_pretrained: mkdir(exist_ok=True) (#5258 ) * all save_pretrained methods mkdir if not os.path.exists	2020-06-28 14:53:47 -04:00
Suraj Patil	12dfbd4f7a	[examples] fix example links (#5344 )	2020-06-28 12:54:54 -04:00
Sam Shleifer	393b8dc09a	examples/seq2seq/run_eval.py fixes and docs (#5322 )	2020-06-26 19:20:43 -04:00
Sam Shleifer	5543b30aa6	[pl_examples] default warmup steps=0 (#5316 )	2020-06-26 15:03:41 -04:00
Thomas Wolf	601d4d699c	[tokenizers] Updates data processors, docstring, examples and model cards to the new API (#5308 ) * remove references to old API in docstring - update data processors * style * fix tests - better type checking error messages * better type checking * include awesome fix by @LysandreJik for #5310 * updated doc and examples	2020-06-26 19:48:14 +02:00
Patrick von Platen	79a82cc06a	[Benchmarks] improve Example Plotter (#5245 ) * improve plotting * better labels * fix time plot	2020-06-26 15:00:14 +02:00
Lysandre Debut	7cc15bdd96	Closes #5218	2020-06-25 18:19:21 -04:00
Sam Shleifer	e008d520bb	[examples/seq2seq] more README improvements (#5274 )	2020-06-25 10:13:01 -04:00
Sam Shleifer	40457bcebb	examples/seq2seq supports translation (#5202 )	2020-06-24 23:58:11 -04:00
Victor SANH	4965aee064	[HANS] Fix label_list for RoBERTa/BART (class flipping) (#5196 ) * fix weirdness in roberta/bart for mnli trained checkpoints * black compliance * isort code check	2020-06-24 14:38:15 -04:00
Patrick von Platen	9fe09cec76	[Benchmark] Extend Benchmark to all model type extensions (#5241 ) * add benchmark for all kinds of models * improved import * delete bogus files * make style	2020-06-24 15:11:42 +02:00
Sylvain Gugger	7c41057d50	Add hugs (#5225 )	2020-06-24 07:56:14 -04:00
Sylvain Gugger	5e85b324ec	Use the script in utils (#5224 )	2020-06-24 07:55:58 -04:00
Kevin Canwen Xu	54e9ce785d	Fix PABEE division by zero error (#5233 ) * Fix PABEE division by zero error * patience=0 by default	2020-06-24 16:10:36 +08:00
Sam Shleifer	76e5af4cfd	[pl_examples] revert deletion of optimizer_step (#5227 )	2020-06-23 16:40:45 -04:00
Sam Shleifer	f5c2a122e3	Upgrade examples to pl=0.8.1(#5146 )	2020-06-22 20:40:10 -04:00
Patrick von Platen	fa0be6d761	Benchmarks (#4912 ) * finish benchmark * fix isort * fix setup cfg * retab * fix time measuring of tf graph mode * fix tf cuda * clean code * better error message	2020-06-22 12:06:56 +02:00
Ilya Boytsov	bc3a0c0607	[examples] fixes arguments for summarization finetune scripts (#5157 ) Authored-by: i.boytsov <i.boytsov@MAC867.local>	2020-06-21 11:51:21 -04:00
Kevin Canwen Xu	c0c577cf8f	Fix PABEE's result table (#5158 )	2020-06-20 22:56:39 +08:00
Kevin Canwen Xu	2fd28d4363	Add BERT Loses Patience (Patience-based Early Exit) (#5078 ) * Add BERT Loses Patience (Patience-based Early Exit) * update model archive * update format * sort import * flake8 * Add results * full results * align the table * refactor to inherit * default per gpu eval = 1 * Formatting * Formatting * isort * modify readme * Add check * Fix format * Fix format * Doc strings * ALBERT & BERT for sequence classification don't inherit from the original anymore * Remove incorrect comments * Remove incorrect comments * Remove incorrect comments * Sync up with new code * Sync up with new code * Add a test * Add a test * Add a test * Add a test * Add a test * Add a test * Finishing up!	2020-06-20 13:41:46 +08:00
Sam Shleifer	2db1e2f415	[cleanup] remove redundant code in SummarizationDataset (#5119 )	2020-06-18 20:34:48 -04:00
Lysandre	efeb75b805	Remove misleading comment closes #4958	2020-06-17 18:24:35 -04:00
Sam Shleifer	f1a3d03741	add pandas to setup.cfg (#5093 )	2020-06-17 16:39:17 -04:00
Pranav Dayanand Pawar	049e14f0e3	very minor spelling correction in script command (#5090 ) actual script name - counts_parameters.py	2020-06-17 16:08:43 -04:00
Sam Shleifer	043f9f51f9	[examples] SummarizationModule improvements (#4951 )	2020-06-17 13:51:34 -04:00
Sylvain Gugger	cd40f6564e	Add header and fix command (#5082 )	2020-06-17 11:45:05 -04:00
flozi00	af497b5672	Typo (#5069 )	2020-06-16 16:46:20 -04:00
Yacine Jernite	49c5202522	Eli5 examples (#4968 ) * add eli5 examples * add dense query script * query_di * merging * merging * add_utils * adds nearest neighbor wikipedia * batch queries * training_retriever * new notebooks * moved retriever traiing script * finished wiki40b * max_len_fix * train_s2s * retriever_batch_checkpointing * cleanup * merge * dim_fix * fix_indexer * fix_wiki40b_snippets * fix_embed_for_r * fp32 index * fix_sparse_q * joint_training * remove obsolete datasets * add_passage_nn_results * add_passage_nn_results * add_batch_nn * add_batch_nn * add_data_scripts * notebook * notebook * notebook * fix_multi_gpu * add_app * full_caching * full_caching * notebook * sparse_done * images * notebook * add_image_gif * with_Gif * add_contr_image * notebook * notebook * notebook * train_functions * notebook * min_retrieval_length * pandas_option * notebook * min_retrieval_length * notebook * notebook * eval_Retriever * notebook * images * notebook * add_example * add_example * notebook * fireworks * notebook * notebook * joe's notebook comments * app_update * notebook * notebook_link * captions * notebook * assing RetriBert model * add RetriBert to Auto * change AutoLMHead to AutoSeq2Seq * notebook downloads from hf models * style_black * style_black * app_update * app_update * fix_app_update * style * style * isort * Delete WikiELI5training.ipynb * Delete evaluate_eli5.py * Delete WikiELI5explore.ipynb * Delete ExploreWikiELI5Support.html * Delete explainlikeimfive.py * Delete wiki_snippets.py * children before parent * children before parent * style_black * style_black_only * isort * isort_new * Update src/transformers/modeling_retribert.py Co-authored-by: Julien Chaumond <chaumond@gmail.com> * typo fixes * app_without_asset * cleanup * Delete ELI5animation.gif * Delete ELI5contrastive.svg * Delete ELI5wiki_index.svg * Delete choco_bis.svg * Delete fireworks.gif * Delete huggingface_logo.jpg * Delete huggingface_logo.svg * Delete Long_Form_Question_Answering_with_ELI5_and_Wikipedia.ipynb * Delete eli5_app.py * Delete eli5_utils.py * readme * Update README.md * unused imports * moved_info * default_beam * ftuned model * disclaimer * Update src/transformers/modeling_retribert.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * black * add_doc * names * isort_Examples * isort_Examples * Add doc to index Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2020-06-16 16:36:58 -04:00
Sam Shleifer	c3e607496c	[cleanup] examples test_run_squad uses tiny model (#5059 )	2020-06-16 14:06:45 -04:00
Sylvain Gugger	d5477baf7d	Convert hans to Trainer (#5025 ) * Convert hans to Trainer * Tick box	2020-06-16 08:06:31 -04:00
Anthony MOI	36434220fc	[HUGE] Refactoring tokenizers backend - padding - truncation - pre-tokenized pipeline - fast tokenizers - tests (#4510 ) * Use tokenizers pre-tokenized pipeline * failing pretrokenized test * Fix is_pretokenized in python * add pretokenized tests * style and quality * better tests for batched pretokenized inputs * tokenizers clean up - new padding_strategy - split the files * [HUGE] refactoring tokenizers - padding - truncation - tests * style and quality * bump up requied tokenizers version to 0.8.0-rc1 * switched padding/truncation API - simpler better backward compat * updating tests for custom tokenizers * style and quality - tests on pad * fix QA pipeline * fix backward compatibility for max_length only * style and quality * Various cleans up - add verbose * fix tests * update docstrings * Fix tests * Docs reformatted * __call__ method documented Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2020-06-15 17:12:51 -04:00
Sylvain Gugger	1affde2f10	Make DataCollator a callable (#5015 ) * Make DataCollator a callable * Update src/transformers/data/data_collator.py Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-06-15 11:58:33 -04:00
Stefan Schweter	d812e6d76e	NER: fix construction of input examples for RoBERTa (#4943 ) * utils_ner: do not add extra sep token for RoBERTa model * run_pl_ner: do not add extra sep token for RoBERTa model	2020-06-15 08:30:40 -04:00
Sylvain Gugger	403d309857	Hans data (#4854 ) * Update hans data to be able to use Trainer * Fixes * Deal with tokenizer that don't have token_ids * Clean up things * Simplify data use * Fix the input dict * Formatting + proper path in README	2020-06-13 09:35:13 -04:00
VictorSanh	473808da0d	update `mvmt-pruning/saving_prunebert` (updating torch to 1.5)	2020-06-11 19:42:45 +00:00
Sylvain Gugger	e8db8b845a	Remove unused arguments in Multiple Choice example (#4853 ) * Remove unused arguments * Formatting * Remove second todo comment	2020-06-09 20:05:09 -04:00
songyouwei	29c36e9f36	run_pplm.py bug fix (#4867 ) `is_leaf` may become `False` after `.to(device=device)` function call.	2020-06-09 19:14:27 -04:00
Sam Shleifer	f90bc44d9a	[examples] Cleanup summarization docs (#4876 )	2020-06-09 17:38:28 -04:00

1 2 3 4 5 ...

1052 Commits