transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-30 17:52:35 +06:00

Author	SHA1	Message	Date
Sam Shleifer	ab42d74850	Fix bart base test (#6587 )	2020-08-18 21:28:10 -04:00
Sam Shleifer	1529bf9680	add BartConfig.force_bos_token_to_be_generated (#6526 ) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-08-18 19:15:50 -04:00
Patrick von Platen	974bb4af26	[Model card] Bert2GPT2 EncoderDecoder model (#6569 ) * Bert2GPT2 EncoderDecoder model * Update README.md	2020-08-18 19:28:17 +02:00
Suraj Patil	6f972e1423	update xnli-mt url (#6580 )	2020-08-18 13:10:47 -04:00
Suraj Patil	fb6844aff5	[Pegasus Doc] minor typo (#6579 ) Minor typo correction @sshleifer	2020-08-18 12:47:47 -04:00
Manuel Romero	aaab9ab187	Create README.md (#6556 )	2020-08-18 12:43:20 -04:00
Manuel Romero	1dfce0f08a	Create README.md (#6557 )	2020-08-18 12:42:14 -04:00
Romain Rigaux	7516bcf273	[docs] Fix number of 'ug' occurrences in tokenizer_summary (#6574 )	2020-08-18 10:23:25 -04:00
Romain Rigaux	5a5af22ed5	[docs] Fix wrong newline in the middle of a paragraph (#6573 )	2020-08-18 10:22:43 -04:00
Stas Bekman	7659a8eb37	fix incorrect codecov reports (#6553 ) As discussed at https://github.com/huggingface/transformers/issues/6317 codecov currently sends an invalid report when it fails to find a code coverage report for the base it checks against, so this gets fixed by: - require_base: yes # don't report if there is no base coverage report let's add this for clarity, this supposedly is already the default. - require_head: yes # don't report if there is no head coverage report and perhaps no point reporting on doc changes as they don't make any difference and it just generates noise: - require_changes: true # only comment if there was change in coverage	2020-08-18 10:21:13 -04:00
Stefan Schweter	cfa26d2b41	github: add @stefan-it to bug-report template for all token-classification related bugs (#6489 )	2020-08-18 08:38:54 -04:00
Philip May	1fdf372f8c	Small typo fixes for model card: electra-base-german-uncased (#6555 ) * Update README.md * Update model_cards/german-nlp-group/electra-base-german-uncased/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-08-18 08:21:52 -04:00
Ali Modarressi	5a81195ea9	Fixed label datatype for STS-B (#6492 ) * fixed label datatype for sts-b * naming update * make style * make style	2020-08-18 08:09:39 -04:00
Sam Shleifer	12d7624199	[marian] converter supports models from new Tatoeba project (#6342 )	2020-08-17 23:55:42 -04:00
Jim Regan	fb7330b30e	update with #s of sentences/tokens (#6546 )	2020-08-17 16:48:05 -04:00
onepointconsulting	63144701ed	Added first model card (#6530 ) * Added first model card * Add metadata Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-08-17 16:24:10 -04:00
Ikram Ali	98ee802023	[model_cards] Add model cards for Urduhack model (roberta-urdu-small) (#6536 ) * [model_cards] roberta-urdu-small added. * [model_cards] typo fixed. * Tweak license format (yaml expects a simple string) Co-authored-by: Ikram Ali <mrikram1989> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-08-17 16:04:29 -04:00
Jim Regan	3a302904cb	[model_cards] Add a new model for Irish (#6544 )	2020-08-17 15:56:56 -04:00
Julien Chaumond	07971d8b18	[model_cards] Fix yaml for cedpsam/chatbot_fr	2020-08-17 21:33:32 +02:00
Suraj Patil	407da12ef1	[T5Tokenizer] add prepare_seq2seq_batch method (#6122 ) * tests	2020-08-17 13:57:19 -04:00
Suraj Patil	c9564f5343	[Doc] add more MBart and other doc (#6490 ) * add mbart example * add Pegasus and MBart in readme * typo * add MBart in Pretrained models * add pre-proc doc * add DPR in readme * fix indent * doc fix	2020-08-17 12:30:26 -04:00
Stas Bekman	f68c873100	replace _ with __ rst links (#6541 )	2020-08-17 12:27:02 -04:00
sgugger	7ca6ab67fc	Fix CI	2020-08-17 12:20:40 -04:00
Stas Bekman	b732e7e111	[doc] multiple corrections to "Summary of the tasks" (#6509 ) * [doc] multiple corrections to "Summary of the tasks" * fix indentation * correction * fix links, add links to examples/seq2seq/README.md instead of non-existing script	2020-08-17 11:49:16 -04:00
Suraj Patil	2a77813d53	[BartTokenizer] add prepare s2s batch (#6212 ) Co-authored-by: sgugger <sylvain.gugger@gmail.com>	2020-08-17 11:44:46 -04:00
Stas Bekman	84d33317ae	[doc] make the text more readable, fix some typos, add some disambiguation (#6508 ) * [doc] make the text more readable, fix some typos, add some disambiguation * Update docs/source/glossary.rst Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-08-17 11:07:58 -04:00
Joe Davison	d0c2389f48	add custom datasets tutorial (#6466 ) * add custom datasets tutorial * python -> bash code blocks * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * minor review feedback changes * add working native QA snippet Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-08-17 09:15:34 -04:00
Sam Shleifer	d2da2cb232	allow spaces in bash args with "$@" (#6521 )	2020-08-17 09:06:35 -04:00
Funtowicz Morgan	b41cc0b86a	Fix flaky ONNX tests (#6531 )	2020-08-17 09:04:35 -04:00
Stas Bekman	39c3b1d9de	[sched] polynomial_decay_schedule use default power=1.0 (#6473 )	2020-08-17 08:33:12 -04:00
Stas Bekman	9dbe4094f2	[testing] a new TestCasePlus subclass + get_auto_remove_tmp_dir() (#6494 ) * [testing] switch to a new TestCasePlus + get_auto_remove_tmp_dir() for auto-removal of tmp dirs * respect after=True for tempfile, simplify code * comments * comment fix * put `before` last in args, so can make debug even faster	2020-08-17 08:12:19 -04:00
Patrick von Platen	36010cb1e2	fix pegasus doc (#6533 )	2020-08-17 12:24:43 +02:00
Kevin Canwen Xu	37709b5909	Remove deprecated assertEquals (#6532 ) `assertEquals` is deprecated: https://stackoverflow.com/questions/930995/assertequals-vs-assertequal-in-python/931011 This PR replaces these deprecated methods.	2020-08-17 17:13:58 +08:00
Stas Bekman	49d8076fa2	[doc] Summary of the models fixes (#6511 ) * [doc] Summary of the models fixes * correction	2020-08-17 16:04:53 +08:00
Cahya Wirawan	72911c893a	Create model cards for indonesian models (#6522 ) * added model cards for indonesian gpt2-small, bert-base and roberta-base models * removed bibtex entries	2020-08-17 15:42:25 +08:00
Masatoshi Suzuki	48c6c6139f	Support additional dictionaries for BERT Japanese tokenizers (#6515 ) * Update BERT Japanese tokenizers * Update CircleCI config to download unidic * Specify to use the latest dictionary packages	2020-08-17 12:00:23 +08:00
Stas Bekman	423eb5b1d7	[doc] fix invalid env vars (#6504 ) - remove invalid `ENV_` prefix. - add a few ':' while at it	2020-08-17 11:11:40 +08:00
Philip May	3c72f5584b	Add Model Card for electra-base-german-uncased (#6496 ) * Add Model Card for electra-base-german-uncased * Update README.md Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>	2020-08-17 11:02:32 +08:00
Stas Bekman	df15c7c226	typos (#6505 )	2020-08-17 10:57:36 +08:00
fabiocapsouza	6d38ab1cc3	Update bert-base-portuguese-cased and bert-large-portuguese-cased model cards (#6527 ) Co-authored-by: Fabio Souza <fabiosouza@neuralmind.ai>	2020-08-17 10:49:49 +08:00
Sam Shleifer	84c265ffcc	[lightning_base] fix s2s logging, only make train_loader once (#6404 )	2020-08-16 22:49:41 -04:00
Sam Shleifer	72add6c98f	[s2s] docs, document desired filenames nicely (#6525 )	2020-08-16 20:31:22 -04:00
Kyle Piira	2060181126	Fixes paths with spaces in seq2seq example (#6493 )	2020-08-16 13:36:38 -04:00
Kevin Canwen Xu	fe61c05b85	Add examples/bert-loses-patience who can help (#6499 )	2020-08-16 16:30:16 +08:00
Jin Young (Daniel) Sohn	24107c2c83	Fix TPU Convergence bug introduced by PR#6151 (#6488 ) Currently with the bug introduced we're taking two optimizer steps per batch: one global one, where `xm.optimizer_step` injects a CRS between all cores in training, and one without. This has been affecting training accuracy (for example, XLNet GLUE on MNLI is not converging, etc.).	2020-08-14 12:47:37 -04:00
Sylvain Gugger	895ed8f451	Generation doc (#6470 ) * Generation doc * MBartForConditionalGeneration (#6441) * add MBartForConditionalGeneration * style * rebase and fixes * add mbart test in TEST_FILES_WITH_NO_COMMON_TESTS * fix docs * don't ignore mbart * doc * fix mbart fairseq link * put mbart before bart * apply doc suggestions * Use hash to clean the test dirs (#6475) * Use hash to clean the test dirs * Use hash to clean the test dirs * Use hash to clean the test dirs * fix * [EncoderDecoder] Add Cross Attention for GPT2 (#6415) * add cross attention layers for gpt2 * make gpt2 cross attention work * finish bert2gpt2 * add explicit comments * remove attention mask since not yet supported * revert attn mask in pipeline * Update src/transformers/modeling_gpt2.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_encoder_decoder.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Sort unique_no_split_tokens to make it deterministic (#6461) * change unique_no_split_tokens's type to set * use sorted list instead of set * style * Import accuracy_score (#6480) * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address comments * Styling * Generation doc * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address comments * Styling Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Kevin Canwen Xu <canwenxu@126.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com> Co-authored-by: gijswijnholds <gijswijnholds@gmail.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-08-14 09:46:39 -04:00
gijswijnholds	b5ba758ba9	Import accuracy_score (#6480 )	2020-08-14 08:16:16 -04:00
Quentin Lhoest	9a8c168f56	Sort unique_no_split_tokens to make it deterministic (#6461 ) * change unique_no_split_tokens's type to set * use sorted list instead of set * style	2020-08-14 10:36:58 +02:00
Patrick von Platen	1d6e71e116	[EncoderDecoder] Add Cross Attention for GPT2 (#6415 ) * add cross attention layers for gpt2 * make gpt2 cross attention work * finish bert2gpt2 * add explicit comments * remove attention mask since not yet supported * revert attn mask in pipeline * Update src/transformers/modeling_gpt2.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_encoder_decoder.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-08-14 09:43:29 +02:00
Kevin Canwen Xu	eb613b566a	Use hash to clean the test dirs (#6475 ) * Use hash to clean the test dirs * Use hash to clean the test dirs * Use hash to clean the test dirs * fix	2020-08-14 15:34:39 +08:00

5342 Commits