transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-19 20:48:22 +06:00

Author	SHA1	Message	Date
Stas Bekman	49d8076fa2	[doc] Summary of the models fixes (#6511 ) * [doc] Summary of the models fixes * correction	2020-08-17 16:04:53 +08:00
Cahya Wirawan	72911c893a	Create model cards for indonesian models (#6522 ) * added model cards for indonesian gpt2-small, bert-base and roberta-base models * removed bibtex entries	2020-08-17 15:42:25 +08:00
Masatoshi Suzuki	48c6c6139f	Support additional dictionaries for BERT Japanese tokenizers (#6515 ) * Update BERT Japanese tokenizers * Update CircleCI config to download unidic * Specify to use the latest dictionary packages	2020-08-17 12:00:23 +08:00
Stas Bekman	423eb5b1d7	[doc] fix invalid env vars (#6504 ) - remove invalid `ENV_` prefix. - add a few ':' while at it	2020-08-17 11:11:40 +08:00
Philip May	3c72f5584b	Add Model Card for electra-base-german-uncased (#6496 ) * Add Model Card for electra-base-german-uncased * Update README.md Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>	2020-08-17 11:02:32 +08:00
Stas Bekman	df15c7c226	typos (#6505 )	2020-08-17 10:57:36 +08:00
fabiocapsouza	6d38ab1cc3	Update bert-base-portuguese-cased and bert-large-portuguese-cased model cards (#6527 ) Co-authored-by: Fabio Souza <fabiosouza@neuralmind.ai>	2020-08-17 10:49:49 +08:00
Sam Shleifer	84c265ffcc	[lightning_base] fix s2s logging, only make train_loader once (#6404 )	2020-08-16 22:49:41 -04:00
Sam Shleifer	72add6c98f	[s2s] docs, document desired filenames nicely (#6525 )	2020-08-16 20:31:22 -04:00
Kyle Piira	2060181126	Fixes paths with spaces in seq2seq example (#6493 )	2020-08-16 13:36:38 -04:00
Kevin Canwen Xu	fe61c05b85	Add examples/bert-loses-patience who can help (#6499 )	2020-08-16 16:30:16 +08:00
Jin Young (Daniel) Sohn	24107c2c83	Fix TPU Convergence bug introduced by PR#6151 (#6488 ) Currently with the bug introduced we're taking two optimizer steps per batch: one global one, where `xm.optimizer_step` injects a CRS between all cores in training, and one without. This has been affecting training accuracy (for example, XLNet GLUE on MNLI is not converging, etc.).	2020-08-14 12:47:37 -04:00
Sylvain Gugger	895ed8f451	Generation doc (#6470 ) * Generation doc * MBartForConditionalGeneration (#6441) * add MBartForConditionalGeneration * style * rebase and fixes * add mbart test in TEST_FILES_WITH_NO_COMMON_TESTS * fix docs * don't ignore mbart * doc * fix mbart fairseq link * put mbart before bart * apply doc suggestions * Use hash to clean the test dirs (#6475) * Use hash to clean the test dirs * Use hash to clean the test dirs * Use hash to clean the test dirs * fix * [EncoderDecoder] Add Cross Attention for GPT2 (#6415) * add cross attention layers for gpt2 * make gpt2 cross attention work * finish bert2gpt2 * add explicit comments * remove attention mask since not yet supported * revert attn mask in pipeline * Update src/transformers/modeling_gpt2.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_encoder_decoder.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Sort unique_no_split_tokens to make it deterministic (#6461) * change unique_no_split_tokens's type to set * use sorted list instead of set * style * Import accuracy_score (#6480) * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address comments * Styling * Generation doc * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address comments * Styling Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Kevin Canwen Xu <canwenxu@126.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com> Co-authored-by: gijswijnholds <gijswijnholds@gmail.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-08-14 09:46:39 -04:00
gijswijnholds	b5ba758ba9	Import accuracy_score (#6480 )	2020-08-14 08:16:16 -04:00
Quentin Lhoest	9a8c168f56	Sort unique_no_split_tokens to make it deterministic (#6461 ) * change unique_no_split_tokens's type to set * use sorted list instead of set * style	2020-08-14 10:36:58 +02:00
Patrick von Platen	1d6e71e116	[EncoderDecoder] Add Cross Attention for GPT2 (#6415 ) * add cross attention layers for gpt2 * make gpt2 cross attention work * finish bert2gpt2 * add explicit comments * remove attention mask since not yet supported * revert attn mask in pipeline * Update src/transformers/modeling_gpt2.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_encoder_decoder.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-08-14 09:43:29 +02:00
Kevin Canwen Xu	eb613b566a	Use hash to clean the test dirs (#6475 ) * Use hash to clean the test dirs * Use hash to clean the test dirs * Use hash to clean the test dirs * fix	2020-08-14 15:34:39 +08:00
Suraj Patil	680f1337c3	MBartForConditionalGeneration (#6441 ) * add MBartForConditionalGeneration * style * rebase and fixes * add mbart test in TEST_FILES_WITH_NO_COMMON_TESTS * fix docs * don't ignore mbart * doc * fix mbart fairseq link * put mbart before bart * apply doc suggestions	2020-08-14 03:21:16 -04:00
Manuel Romero	05810cd80a	Fix typo (#6469 )	2020-08-13 15:01:08 -04:00
Kevin Canwen Xu	7bc00569df	Clean directory after script testing (#6453 ) * Clean Dir after testing * remove pabee ignore	2020-08-14 00:34:03 +08:00
Sam Shleifer	e92efcf728	Mult rouge by 100: standard units (#6359 )	2020-08-13 12:15:54 -04:00
vblagoje	eda07efaa5	Add POS tagging and Phrase chunking token classification examples (#6457 ) * Add more token classification examples * POS tagging example * Phrase chunking example * PR review fixes * Add conllu to third party list (used in token classification examples)	2020-08-13 12:09:51 -04:00
Suraj Patil	f51161e230	add BartTokenizerFast in AutoTokenizer (#6464 ) Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-08-13 12:08:11 -04:00
Suraj Patil	a442f87adc	add LongformerTokenizerFast in AutoTokenizer (#6463 )	2020-08-13 12:06:43 -04:00
Lysandre Debut	f7cbc13db7	Test model outputs equivalence (#6445 ) * Test model outputs equivalence * Fix failing tests * From dict to kwargs * DistilBERT * Addressing @sgugger and @patrickvonplaten's comments	2020-08-13 11:59:35 -04:00
Prajjwal Bhargava	54c687e97c	typo fix (#6462 )	2020-08-13 09:36:48 -04:00
Zhu Baohe	9d94aecd51	Fix docs and bad word tokens generation_utils.py (#6387 ) * fix * fix2 * fix3	2020-08-13 13:12:16 +02:00
cedspam	0ed7c00ba6	Update README.md (#6435 ) * Update README.md * Update README.md * Update README.md	2020-08-13 11:01:17 +02:00
Stas Bekman	e983da0e7d	cleanup tf unittests: part 2 (#6260 ) * cleanup torch unittests: part 2 * remove trailing comma added by isort, and which breaks flake * one more comma * revert odd balls * part 3: odd cases * more ["key"] -> .key refactoring * .numpy() is not needed * more unncessary .numpy() removed * more simplification	2020-08-13 04:29:06 -04:00
Joe Davison	bc820476a5	add targets arg to fill-mask pipeline (#6239 ) * add targets arg to fill-mask pipeline * add tests and more error handling * quality * update docstring	2020-08-12 12:48:29 -04:00
Patrick von Platen	0735def8e1	[EncoderDecoder] Add encoder-decoder for roberta/ vanilla longformer (#6411 ) * add encoder-decoder for roberta * fix headmask * apply Sylvains suggestions * fix typo * Apply suggestions from code review	2020-08-12 18:23:30 +02:00
zcain117	fd3de2000f	Get GKE logs via kubectl logs instead of gcloud logging read. (#6446 )	2020-08-12 11:46:24 -04:00
Sam Shleifer	f94a52cd79	[s2s] add BartTranslationDistiller for distilling mBART (#6363 )	2020-08-12 11:41:04 -04:00
Sylvain Gugger	d2370e1bd8	Adding PaddingDataCollator (#6442 ) * Data collator with padding * Add type annotation * Support tensors as well * Add comment * Fix for labels wrong shape * Data collator with padding * Add type annotation * Support tensors as well * Add comment * Fix for labels wrong shape * Remove changes rendered unnecessary	2020-08-12 11:32:27 -04:00
Sylvain Gugger	96c3329f19	Fix #6428 (#6437 )	2020-08-12 08:47:30 -04:00
Sylvain Gugger	a8db954cda	Activate check on the CI (#6427 ) * Activate check on the CI * Fix repo inconsistencies * Don't document too much	2020-08-12 08:42:14 -04:00
Sylvain Gugger	34fabe1697	Move prediction_loss_only to TrainingArguments (#6426 )	2020-08-12 08:03:45 -04:00
Sylvain Gugger	e9c3031463	Fixes to make life easier with the nlp library (#6423 ) * allow using tokenizer.pad as a collate_fn in pytorch * allow using tokenizer.pad as a collate_fn in pytorch * Add documentation and tests * Make attention mask the right shape * Better test Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-08-12 08:00:56 -04:00
Stas Bekman	87b359439f	[test] replace capsys with the more refined CaptureStderr/CaptureStdout (#6422 ) * replace capsys with the more refined CaptureStderr/CaptureStdout * Update examples/seq2seq/test_seq2seq_examples.py Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-08-12 07:54:28 -04:00
Jared T Nielsen	ac5bcf236e	Fix FFN dropout in TFAlbertLayer, and split dropout in TFAlbertAttent… (#4323 ) * Fix FFN dropout in TFAlbertLayer, and split dropout in TFAlbertAttention into two separate dropout layers. * Same dropout fixes for PyTorch.	2020-08-12 07:52:42 -04:00
Lysandre Debut	4ffea5ce2f	Disabled pabee test (#6431 )	2020-08-12 02:52:50 -04:00
Rohan Rajpal	155288f04b	[model_card] rohanrajpal/bert-base-codemixed-uncased-sentiment (#6324 ) * Create README.md * Update model_cards/rohanrajpal/bert-base-codemixed-uncased-sentiment/README.md * Update model_cards/rohanrajpal/bert-base-codemixed-uncased-sentiment/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-08-11 18:38:18 -04:00
Manuel Romero	4e6245fc7e	Create model card T5-base fine-tuned on event2Mind for Intent Prediction (#6412 )	2020-08-11 18:35:27 -04:00
Manuel Romero	46e3a0a6ec	Create README.md (#6381 )	2020-08-11 18:34:11 -04:00
Manuel Romero	31dfde7429	Create README.md (#6378 )	2020-08-11 18:32:37 -04:00
Manuel Romero	25e29150a2	Add metadata to be indexed properly (#6380 )	2020-08-11 18:32:29 -04:00
Manuel Romero	471be5f279	Change metadata to be indexed correctly (#6379 )	2020-08-11 18:32:18 -04:00
Rohan Rajpal	42ee0bc63d	Create README.md (#6346 ) * Create README.md * add results on SAIL dataset * Update model_cards/rohanrajpal/bert-base-multilingual-codemixed-cased-sentiment/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-08-11 18:31:34 -04:00
Sam Shleifer	3f071c4b6e	[examples] add pytest dependency (#6425 )	2020-08-11 17:58:09 -04:00
Stas Bekman	ece0903e11	lr_schedulers: add get_polynomial_decay_schedule_with_warmup (#6361 ) * [wip] add get_polynomial_decay_schedule_with_warmup * style * add assert * change lr_end to a much smaller default number * check for exact equality * [model_cards] electra-base-turkish-cased-ner (#6350) * for electra-base-turkish-cased-ner * Add metadata Co-authored-by: Julien Chaumond <chaumond@gmail.com> * Temporarily de-activate TPU CI * Update modeling_tf_utils.py (#6372) fix typo: ckeckpoint->checkpoint * the test now works again (#6371) * correct pl link in readme (#6364) * refactor almost identical tests (#6339) * refactor almost identical tests * important to add a clear assert error message * make the assert error even more descriptive than the original bt * Small docfile fixes (#6328) * Patch models (#6326) * TFAlbertFor{TokenClassification, MultipleChoice} * Patch models * BERT and TF BERT info s * Update check_repo * Ci GitHub caching (#6382) * Cache Github Actions CI * Remove useless file * Colab button (#6389) * Add colab button * Add colab link for tutorials * Fix links for open in colab (#6391) * Update src/transformers/optimization.py consistently use lr_end=1e-7 default Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * [wip] add get_polynomial_decay_schedule_with_warmup * style * add assert * change lr_end to a much smaller default number * check for exact equality * Update src/transformers/optimization.py consistently use lr_end=1e-7 default Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * remove dup (leftover from merge) * convert the test into the new refactored format * stick to using the current_step as is, without ++ Co-authored-by: M. Yusuf Sarıgöz <yusufsarigoz@gmail.com> Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by: Alexander Measure <ameasure@gmail.com> Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-08-11 17:56:41 -04:00

... 17 18 19 20 21 ...

5759 Commits