transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-15 02:28:24 +06:00

Author	SHA1	Message	Date
Sylvain Gugger	7acfa95afb	Add missing new line	2021-01-20 14:13:16 -05:00
Darigov Research	5a307ece82	Adds flashcards to Glossary & makes small corrections (#8949 ) * fix: Makes small typo corrections & standardises glossary * feat: Adds introduction & links to transformer flashcards * feat: Adds attribution & adjustments requested in #8949 * feat: Adds flashcards to community.md * refactor: Removes flashcards from glossary	2021-01-20 13:28:40 -05:00
NielsRogge	88583d4958	Add notebook (#9696 )	2021-01-20 10:19:26 -05:00
NielsRogge	d1370d29b1	Add DeBERTa head models (#9691 ) * Add DebertaForMaskedLM, DebertaForTokenClassification, DebertaForQuestionAnswering * Add docs and fix quality * Fix Deberta not having pooler	2021-01-20 10:18:50 -05:00
acul3	8940c7662d	Add t5 convert to transformers-cli (#9654 ) * Update run_mlm.py * add t5 model to transformers-cli convert * update rum_mlm.py same as master * update converting model docs * update converting model docs * Update convert.py * Trigger notification * update import sorted * fix typo t5	2021-01-20 09:34:27 -05:00
Sylvain Gugger	76f36e183a	Add a community page to the docs (#9682 )	2021-01-20 04:54:36 -05:00
Stas Bekman	82498cbc37	[deepspeed doc] install issues + 1-gpu deployment (#9582 ) * [doc] install + 1-gpu deployment * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * improvements Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-01-14 11:05:04 -08:00
Lysandre	e43f3b6190	v4.2.1 in docs	2021-01-14 14:25:30 +01:00
Lysandre	33a8497db8	v4.2.0 documentation	2021-01-13 16:15:40 +01:00
Lysandre	7d9a9d0c72	Release: v4.2.0	2021-01-13 16:01:51 +01:00
Julien Chaumond	247a7b2029	Doc: Update pretrained_models wording (#9545 ) * Update pretrained_models.rst To clarify things cf. this tweet for instance https://twitter.com/RTomMcCoy/status/1349094111505211395 * format	2021-01-13 05:58:05 -05:00
Stas Bekman	2df34f4aba	[trainer] deepspeed integration (#9211 ) * deepspeed integration * style * add test * ds wants to do its own backward * fp16 assert * Update src/transformers/training_args.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * style * for clarity extract what args are being passed to deepspeed * introduce the concept of self.wrapped_model * s/self.wrapped_model/self.model_wrapped/ * complete transition to self.wrapped_model / self.model * fix * doc * give ds its own init * add custom overrides, handle bs correctly * fix test * clean up model_init logic, fix small bug * complete fix * collapse --deepspeed_config into --deepspeed * style * start adding doc notes * style * implement hf2ds optimizer and scheduler configuration remapping * oops * call get_num_training_steps absolutely when needed * workaround broken auto-formatter * deepspeed_config arg is no longer needed - fixed in deepspeed master * use hf's fp16 args in config * clean * start on the docs * rebase cleanup * finish up --fp16 * clarify the supported stages * big refactor thanks to discovering deepspeed.init_distributed * cleanup * revert fp16 part * add checkpoint-support * more init ds into integrations * extend docs * cleanup * unfix docs * clean up old code * imports * move docs * fix logic * make it clear which file it's referring to * document nodes/gpus * style * wrong format * style * deepspeed handles gradient clipping * easier to read * major doc rewrite * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * docs * switch to AdamW optimizer * style * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * clarify doc Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-01-12 19:05:18 -08:00
NielsRogge	e45eba3b1c	Improve LayoutLM (#9476 ) * Add LayoutLMForSequenceClassification and integration tests Improve docs Add LayoutLM notebook to list of community notebooks * Make style & quality * Address comments by @sgugger, @patrickvonplaten and @LysandreJik * Fix rebase with master * Reformat in one line * Improve code examples as requested by @patrickvonplaten Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-01-12 09:26:32 -05:00
Patrick von Platen	7f28613213	[TFBart] Split TF-Bart (#9497 ) * make templates ready * make add_new_model_command_ready * finish tf bart * prepare tf mbart * finish tf bart * add tf mbart * add marian * prep pegasus * add tf pegasus * push blenderbot tf * add blenderbot * add blenderbot small * clean-up * make fix copy * define blend bot tok * fix * up * make style * add to docs * add copy statements * overwrite changes * improve * fix docs * finish * fix last slow test * fix missing git conflict line * fix blenderbot * up * fix blenderbot small * load changes * finish copied from * upload fix	2021-01-12 02:06:32 +01:00
Sylvain Gugger	8d25df2c7a	Make doc styler detect lists on rst (#9488 )	2021-01-11 08:53:41 -05:00
Patrick von Platen	9e1ea846bc	[README] Add new models (#9465 ) * add new models * make fix-copies	2021-01-08 05:49:43 -05:00
Patrick von Platen	ae5a32bb0d	up (#9454 )	2021-01-07 11:51:02 +01:00
Simon Brandeis	c89f1bc92e	Add flags to return scores, hidden states and / or attention weights in GenerationMixin (#9150 ) * Define new output dataclasses for greedy generation * Add output_[...] flags in greedy generation methods Added output_attentions, output_hidden_states, output_scores flags in generate and greedy_search methods in GenerationMixin. * [WIP] Implement logic and tests for output flags in generation * Update GreedySearchOutput classes & docstring * Implement greedy search output accumulation logic Update greedy_search unittests Fix generate method return value docstring Properly init flags with the default config * Update configuration to add output_scores flag * Fix test_generation_utils Sort imports and fix isinstance tests for GreedySearchOutputs * Fix typo in generation_utils * Add return_dict_in_generate for backwards compatibility * Add return_dict_in_generate flag in config * Fix tyPo in configuration * Fix handling of attentions and hidden_states flags * Make style & quality * first attempt attentions * some corrections * improve tests * special models requires special test * disable xlm test for now * clean tests * fix for tf * isort * Add output dataclasses for other generation methods * Add logic to return dict in sample generation * Complete test for sample generation - Pass output_attentions and output_hidden_states flags to encoder in encoder-decoder models - Fix import satements order in test_generation_utils file * Add logic to return dict in sample generation - Refactor tests to avoid using self.assertTrue, which provides scarce information when the test fails - Add tests for the three beam_search methods: vanilla, sample and grouped * Style doc * Fix copy-paste error in generation tests * Rename logits to scores and refactor * Refactor group_beam_search for consistency * make style * add sequences_scores * fix all tests * add docs * fix beam search finalize test * correct docstring * clean some files * Made suggested changes to the documentation * Style doc ? * Style doc using the Python util * Update src/transformers/generation_utils.py * fix empty lines * fix all test Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-01-06 17:11:42 +01:00
Qbiwan	ecfcac223c	Improve documentation coverage for Phobert (#9427 ) * first commit * change phobert to phoBERT as per author in overview * v3 and v4 both runs on same code hence there is no need to differentiate them Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-01-06 10:04:32 -05:00
Qbiwan	be898998bb	Improve documentation coverage for Herbert (#9428 ) * first commit * changed XLMTokenizer to HerbertTokenizer in code example	2021-01-06 09:13:43 -05:00
Patrick von Platen	b972c1bfb0	finalize (#9431 )	2021-01-06 14:36:55 +01:00
Sylvain Gugger	bcb55d33ce	Upgrade styler to better handle lists (#9423 ) * Add missing lines before a new list. * Update doc styler and restyle some files. * Fix docstrings of LED and Longformer	2021-01-06 07:46:17 -05:00
NielsRogge	b7e548976f	Fix URLs to TAPAS notebooks (#9435 )	2021-01-06 07:20:41 -05:00
Stas Bekman	d64372fdfc	[docs] outline sharded ddp doc (#9208 ) * outline sharded dpp doc * fix link * add example * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * narrow the command and remove non-essentials Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-01-05 17:34:15 -08:00
Patrick von Platen	eef66035a2	[PyTorch Bart] Split Bart into different models (#9343 ) * first try * remove old template * finish bart * finish mbart * delete unnecessary line * init pegasus * save intermediate * correct pegasus * finish pegasus * remove cookie cutter leftover * add marian * finish blenderbot * replace in file * correctly split blenderbot * delete "old" folder * correct "add statement" * adapt config for tf comp * correct configs for tf * remove ipdb * fix more stuff * fix mbart * push pegasus fix * fix mbart * more fixes * fix research projects code * finish docs for bart, mbart, and marian * delete unnecessary file * correct attn typo * correct configs * remove pegasus for seq class * correct peg docs * correct peg docs * finish configs * further improve docs * add copied from statements to mbart * fix copied from in mbart * add copy statements to marian * add copied from to marian * add pegasus copied from * finish pegasus * finish copied from * Apply suggestions from code review * make style * backward comp blenderbot * apply lysandres and sylvains suggestions * apply suggestions * push last fixes * fix docs * fix tok tests * fix imports code style * fix doc	2021-01-05 22:00:05 +01:00
Patrick von Platen	189387e9b2	LED (#9278 ) * create model * add integration * save current state * make integration tests pass * add one more test * add explanation to tests * remove from bart * add padding * remove unnecessary test * make all tests pass * re-add cookie cutter tests * finish PyTorch * fix attention test * Update tests/test_modeling_common.py * revert change * remove unused file * add string to doc * save intermediate * make tf integration tests pass * finish tf * fix doc * fix docs again * add led to doctree * add to auto tokenizer * added tips for led * make style * apply jplus statements * correct tf longformer * apply lysandres suggestions * apply sylvains suggestions * Apply suggestions from code review	2021-01-05 13:14:30 +01:00
Sugeeth	314cca2842	Fix documentation links always pointing to master. (#9217 ) * Use extlinks to point hyperlink with the version of code * Point to version on release and master until then * Apply style * Correct links * Add missing backtick * Simple missing backtick after all. Co-authored-by: Raghavendra Sugeeth P S <raghav-5305@raghav-5305.csez.zohocorpin.com> Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2021-01-05 06:18:48 -05:00
Qbiwan	086718ac6e	Improve documentation coverage for Bertweet (#9379 ) * bertweet docs coverage * style doc max len 119 * maxlen style rst * run main() from style_doc * changed according to comments	2021-01-04 13:12:59 -05:00
Patrick von Platen	75ff530551	correct docs (#9378 )	2021-01-04 17:27:29 +01:00
Patrick von Platen	52b3a05e83	[Bart doc] Fix outdated statement (#9299 ) * fix bart doc * fix docs	2020-12-24 14:47:53 +01:00
Suraj Patil	88ef8893cd	Add caching mechanism to BERT, RoBERTa (#9183 ) * add past_key_values * add use_cache option * make mask before cutting ids * adjust position_ids according to past_key_values * flatten past_key_values * fix positional embeds * fix _reorder_cache * set use_cache to false when not decoder, fix attention mask init * add test for caching * add past_key_values for Roberta * fix position embeds * add caching test for roberta * add doc * make style * doc, fix attention mask, test * small fixes * adress patrick's comments * input_ids shouldn't start with pad token * use_cache only when decoder * make consistent with bert * make copies consistent * add use_cache to encoder * add past_key_values to tapas attention * apply suggestions from code review * make coppies consistent * add attn mask in tests * remove copied from longformer * apply suggestions from code review * fix bart test * nit * simplify model outputs * fix doc * fix output ordering	2020-12-23 23:01:32 +05:30
Connor Brinton	bcc87c639f	Minor documentation revisions from copyediting (#9266 ) * typo: Revise "checkout" to "check out" * typo: Change "seemlessly" to "seamlessly" * typo: Close parentheses in "Using the tokenizer" * typo: Add closing parenthesis to supported models aside * docs: Treat ``position_ids`` as plural Alternatively, the word "argument" could be added to make the subject singular. * docs: Remove comma, making subordinate clause * docs: Remove comma separating verb and direct object * docs: Fix typo ("next" -> "text") * docs: Reverse phrase order to simplify sentence * docs: "quicktour" -> "quick tour" * docs: "to throw" -> "from throwing" * docs: Remove disruptive newline in padding/truncation section * docs: "show exemplary" -> "show examples of" * docs: "much harder as" -> "much harder than" * docs: Fix typo "seach" -> "search" * docs: Fix subject-verb disagreement in WordPiece description * docs: Fix style in preprocessing.rst	2020-12-23 10:15:49 -05:00
Sylvain Gugger	490b39e614	Seq2seq trainer (#9241 ) * Add label smoothing in Trainer * Add options for scheduler and Adafactor in Trainer * Put Seq2SeqTrainer in the main lib * Apply suggestions from code review Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Address review comments and adapt scripts * Documentation * Move test not using script to tests folder Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-12-22 11:33:44 -05:00
Sylvain Gugger	1fc7119181	Fix script that check objects are documented (#9259 )	2020-12-22 11:12:58 -05:00
Suraj Patil	f4432b7e01	add base model classes to bart subclassed models (#9230 ) * add base model classes to bart subclassed models * add doc	2020-12-21 19:56:46 +05:30
Stas Bekman	3ff5e8955a	[t5 doc] typos (#9199 ) * [t5 doc] typos a few run away backticks @sgugger * style	2020-12-18 16:03:26 -08:00
Sylvain Gugger	3e56e2ce04	Fix typo	2020-12-18 10:11:07 -05:00
sandip	467e9158b4	Added TF CTRL Sequence Classification (#9151 ) * Added TF CTRL Sequence Classification * code refactor	2020-12-17 18:10:57 -05:00
Lysandre	bd40345d3e	v4.1.1 docs	2020-12-17 11:28:38 -05:00
Lysandre	bfa4ccf77d	Release: v4.1.1	2020-12-17 11:25:49 -05:00
Lysandre	e0790cca78	Fix TAPAS doc	2020-12-17 11:25:05 -05:00
Sylvain Gugger	6d2e864db7	Put all models in the constants (#9170 ) * Put all models in the constants * Add Google AI mention in the main README	2020-12-17 11:23:21 -05:00
Lysandre	f83d9c8da7	v4.1.0 docs	2020-12-17 10:16:07 -05:00
Lysandre	f5438ab8a2	Release: v4.1.0	2020-12-17 10:04:55 -05:00
Lysandre	ac2c7e398f	Remove erroneous character	2020-12-17 09:47:19 -05:00
Lysandre Debut	1aca3d6afa	Add disclaimer to TAPAS rst file (#9167 ) Co-authored-by: sgugger <sylvain.gugger@gmail.com> Co-authored-by: sgugger <sylvain.gugger@gmail.com>	2020-12-17 09:34:06 -05:00
Lysandre Debut	1c1a2ffbff	TableQuestionAnsweringPipeline (#9145 ) * AutoModelForTableQuestionAnswering * TableQuestionAnsweringPipeline * Apply suggestions from Patrick's code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Sylvain and Patrick comments * Better PyTorch/TF error message * Add integration tests * Argument Handler naming Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com> * Fix docs to appease the documentation gods Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-12-16 12:31:50 -05:00
Lysandre Debut	07384baf7a	AutoModelForTableQuestionAnswering (#9154 ) * AutoModelForTableQuestionAnswering * Update src/transformers/models/auto/modeling_auto.py * Style	2020-12-16 12:14:33 -05:00
Hayden Housen	34334662df	Add message to documentation that longformer doesn't support token_type_ids (#9152 ) * Add message to documentation that longformer doesn't support token_type_ids * Format changes	2020-12-16 11:06:14 -05:00
Patrick von Platen	640e6fe190	[Flax] Align FlaxBertForMaskedLM with BertForMaskedLM, implement from_pretrained, init (#9054 ) * save intermediate * save intermediate * save intermediate * correct flax bert model file * new module / model naming * make style * almost finish BERT * finish roberta * make fix-copies * delete keys file * last refactor * fixes in run_mlm_flax.py * remove pooled from run_mlm_flax.py` * fix gelu \| gelu_new * remove Module from inits * splits * dirty print * preventing warmup_steps == 0 * smaller splits * make fix-copies * dirty print * dirty print * initial_evaluation argument * declaration order fix * proper model initialization/loading * proper initialization * run_mlm_flax improvements: improper model inputs bugfix + automatic dataset splitting + tokenizers parallelism warning + avoiding warmup_steps=0 bug * removed tokenizers warning hack, fixed model re-initialization * reverted training_args.py changes * fix flax from pretrained * improve test in flax * apply sylvains tips * update init * make 0.3.0 compatible * revert tevens changes * revert tevens changes 2 * finalize revert * fix bug * add docs * add pretrained to init * Update src/transformers/modeling_flax_utils.py * fix copies * final improvements Co-authored-by: TevenLeScao <teven.lescao@gmail.com>	2020-12-16 13:03:32 +01:00

578 Commits