Commit Graph

5759 Commits

Author SHA1 Message Date
Patrick von Platen
505f2d749e
[Tests] fix attention masks in Tests (#6621)
* fix distilbert

* fix typo
2020-08-20 13:23:47 -04:00
Denisa Roberts
c9454507cf
Add tests for Reformer tokenizer (#6485) 2020-08-20 18:58:44 +02:00
Joe Davison
f9d280a959
TFTrainer dataset doc & fix evaluation bug (#6618)
* TFTrainer dataset doc & fix evaluation bug

discussed in #6551

* add docstring to test/eval datasets
2020-08-20 12:11:36 -04:00
Sylvain Gugger
573bdb0a5d
Add tests to Trainer (#6605)
* Add tests to Trainer

* Test if removing long breaks everything

* Remove ugly hack

* Fix distributed test

* Use float for number of epochs
2020-08-20 11:13:50 -04:00
Joe Davison
039d8d65fc
add intro to nlp lib & dataset links to custom datasets tutorial (#6583)
* add intro to nlp lib + links

* unique links...
2020-08-20 10:32:51 -04:00
sgugger
b3e54698dd Fix CI 2020-08-20 08:34:02 -04:00
Prajjwal Bhargava
33bf426498
removed redundant arg in prepare_inputs (#6614)
* removed redundant arg in prepare_inputs

* made same change in prediction_loop
2020-08-20 08:23:35 -04:00
Romain Rigaux
cabfdfafc0
Docs copy button misses "..."-prefixed code (#6518)
Tested in a local build of the docs.

e.g. just above https://huggingface.co/transformers/task_summary.html#causal-language-modeling

Copy will copy the full code, e.g.

for token in top_5_tokens:
     print(sequence.replace(tokenizer.mask_token, tokenizer.decode([token])))

Instead of currently only:

for token in top_5_tokens:

The docs snippet in question, as rendered with its prompts and output:

>>> for token in top_5_tokens:
...     print(sequence.replace(tokenizer.mask_token, tokenizer.decode([token])))
Distilled models are smaller than the models they mimic. Using them instead of the large versions would help reduce our carbon footprint.
Distilled models are smaller than the models they mimic. Using them instead of the large versions would help increase our carbon footprint.
Distilled models are smaller than the models they mimic. Using them instead of the large versions would help decrease our carbon footprint.
Distilled models are smaller than the models they mimic. Using them instead of the large versions would help offset our carbon footprint.
Distilled models are smaller than the models they mimic. Using them instead of the large versions would help improve our carbon footprint.

Docs for the option fix:
https://sphinx-copybutton.readthedocs.io/en/latest/
2020-08-20 17:35:06 +08:00
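For context on the option fix above: sphinx-copybutton can be configured to treat ">>> " and "... " as prompts so the copy button strips them instead of dropping the continued lines. A minimal conf.py sketch (illustrative only; the exact regex in the repo's docs/source/conf.py may differ):

# docs/source/conf.py (sketch)
extensions = [
    "sphinx_copybutton",
    # ... other Sphinx extensions
]

# Treat ">>> " and "... " as prompts: the copy button strips them and, by
# default, copies only the prompted lines, so output lines are not copied.
copybutton_prompt_text = r">>> |\.\.\. "
copybutton_prompt_is_regexp = True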
Stas Bekman
61b5ee11e3
lighter 'make test' (#6512) 2020-08-20 17:24:25 +08:00
Siddharth Jain
3c3c46f563
Typo fix in 04-onnx-export (#6595) 2020-08-20 16:17:16 +08:00
Oren Amsalem
93c5c9a528
[cleanup] remove confusing newline (#6603) 2020-08-20 00:33:36 -04:00
Sylvain Gugger
18ca0e9140
Fix #6575 (#6596) 2020-08-19 13:04:33 -04:00
Suraj Patil
7581884dee
[BartTokenizerFast] add prepare_seq2seq_batch (#6543) 2020-08-19 10:37:48 -04:00
Patrick von Platen
8bcceaceff
fix model outputs test (#6593) 2020-08-19 16:18:51 +02:00
Sam Shleifer
9a86321b11
tf generation utils: remove unused kwargs (#6591) 2020-08-19 09:37:45 -04:00
Pradhy729
2a7402cbd3
Feed forward chunking others (#6365)
* Feed forward chunking for Distilbert & Albert

* Added ff chunking for many other models

* Change model signature

* Added chunking for XLM

* Cleaned up by removing some variables.

* remove test_chunking flag

Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
2020-08-19 14:31:10 +02:00
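The idea behind feed forward chunking, as a standalone PyTorch sketch (illustrative only; in the library this behaviour is driven by config.chunk_size_feed_forward rather than the helper below):

import torch

def chunked_feed_forward(ffn, hidden_states, chunk_size, chunk_dim=1):
    # Apply the feed-forward sublayer to slices along the sequence dimension
    # and concatenate the results: same output, lower peak memory.
    if chunk_size == 0:
        return ffn(hidden_states)
    chunks = hidden_states.split(chunk_size, dim=chunk_dim)
    return torch.cat([ffn(chunk) for chunk in chunks], dim=chunk_dim)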
Patrick von Platen
fe0b85e77a
[EncoderDecoder] Add functionality to tie encoder decoder weights (#6538)
* start adding tie encoder to decoder functionality

* finish model tying

* make style

* Apply suggestions from code review

* fix t5 list including cross attention

* apply sams suggestions

* Update src/transformers/modeling_encoder_decoder.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add max depth break point

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-08-19 14:23:45 +02:00
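A hedged usage sketch of the weight tying added here (checkpoint names are placeholders; sharing works when encoder and decoder architectures match):

from transformers import EncoderDecoderModel

# Tie encoder and decoder weights to roughly halve the parameter count of
# the warm-started seq2seq model.
model = EncoderDecoderModel.from_encoder_decoder_pretrained(
    "bert-base-uncased", "bert-base-uncased", tie_encoder_decoder=True
)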
Sam Shleifer
ab42d74850
Fix bart base test (#6587) 2020-08-18 21:28:10 -04:00
Sam Shleifer
1529bf9680
add BartConfig.force_bos_token_to_be_generated (#6526)
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-08-18 19:15:50 -04:00
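A hedged usage sketch of the new flag (checkpoint name is an example; the flag forces BOS as the first generated token, which some BART checkpoints expect):

from transformers import BartForConditionalGeneration

# Config kwargs passed to from_pretrained are forwarded to BartConfig.
model = BartForConditionalGeneration.from_pretrained(
    "facebook/bart-large-cnn", force_bos_token_to_be_generated=True
)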
Patrick von Platen
974bb4af26
[Model card] Bert2GPT2 EncoderDecoder model (#6569)
* Bert2GPT2 EncoderDecoder model

* Update README.md
2020-08-18 19:28:17 +02:00
Suraj Patil
6f972e1423
update xnli-mt url (#6580) 2020-08-18 13:10:47 -04:00
Suraj Patil
fb6844aff5
[Pegasus Doc] minor typo (#6579)
Minor typo correction
@sshleifer
2020-08-18 12:47:47 -04:00
Manuel Romero
aaab9ab187
Create README.md (#6556) 2020-08-18 12:43:20 -04:00
Manuel Romero
1dfce0f08a
Create README.md (#6557) 2020-08-18 12:42:14 -04:00
Romain Rigaux
7516bcf273
[docs] Fix number of 'ug' occurrences in tokenizer_summary (#6574) 2020-08-18 10:23:25 -04:00
Romain Rigaux
5a5af22ed5
[docs] Fix wrong newline in the middle of a paragraph (#6573) 2020-08-18 10:22:43 -04:00
Stas Bekman
7659a8eb37
fix incorrect codecov reports (#6553)
As discussed at https://github.com/huggingface/transformers/issues/6317, codecov currently sends an invalid report when it fails to find a code coverage report for the base it checks against. This gets fixed by:

-  require_base: yes        # don't report if there is no base coverage report

Let's also add this for clarity; it is supposedly already the default:

-  require_head: yes        # don't report if there is no head coverage report

And there is no point reporting on doc-only changes, since they don't change coverage and the comment just generates noise:

-  require_changes: true    # only comment if there was change in coverage
2020-08-18 10:21:13 -04:00
Stefan Schweter
cfa26d2b41
github: add @stefan-it to bug-report template for all token-classification related bugs (#6489) 2020-08-18 08:38:54 -04:00
Philip May
1fdf372f8c
Small typo fixes for model card: electra-base-german-uncased (#6555)
* Update README.md

* Update model_cards/german-nlp-group/electra-base-german-uncased/README.md

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-08-18 08:21:52 -04:00
Ali Modarressi
5a81195ea9
Fixed label datatype for STS-B (#6492)
* fixed label datatype for sts-b

* naming update

* make style

* make style
2020-08-18 08:09:39 -04:00
Sam Shleifer
12d7624199
[marian] converter supports models from new Tatoeba project (#6342) 2020-08-17 23:55:42 -04:00
Jim Regan
fb7330b30e
update with #s of sentences/tokens (#6546) 2020-08-17 16:48:05 -04:00
onepointconsulting
63144701ed
Added first model card (#6530)
* Added first model card

* Add metadata

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-08-17 16:24:10 -04:00
Ikram Ali
98ee802023
[model_cards] Add model cards for Urduhack model (roberta-urdu-small) (#6536)
* [model_cards] roberta-urdu-small added.

* [model_cards] typo fixed.

* Tweak license format (yaml expects a simple string)

Co-authored-by: Ikram Ali <mrikram1989>
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-08-17 16:04:29 -04:00
Jim Regan
3a302904cb
[model_cards] Add a new model for Irish (#6544) 2020-08-17 15:56:56 -04:00
Julien Chaumond
07971d8b18 [model_cards] Fix yaml for cedpsam/chatbot_fr 2020-08-17 21:33:32 +02:00
Suraj Patil
407da12ef1
[T5Tokenizer] add prepare_seq2seq_batch method (#6122)
* tests
2020-08-17 13:57:19 -04:00
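A hedged usage sketch of prepare_seq2seq_batch (returned keys and defaults may differ slightly across versions):

from transformers import T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-small")

# Tokenize source and target texts in one call; the result is a BatchEncoding
# with input_ids and attention_mask among its keys.
batch = tokenizer.prepare_seq2seq_batch(
    src_texts=["translate English to German: How old are you?"],
    tgt_texts=["Wie alt bist du?"],
    return_tensors="pt",
)
input_ids = batch["input_ids"]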
Suraj Patil
c9564f5343
[Doc] add more MBart and other doc (#6490)
* add mbart example

* add Pegasus and MBart in readme

* typo

* add MBart in Pretrained models

* add pre-proc doc

* add DPR in readme

* fix indent

* doc fix
2020-08-17 12:30:26 -04:00
Stas Bekman
f68c873100
replace _ with __ rst links (#6541) 2020-08-17 12:27:02 -04:00
sgugger
7ca6ab67fc Fix CI 2020-08-17 12:20:40 -04:00
Stas Bekman
b732e7e111
[doc] multiple corrections to "Summary of the tasks" (#6509)
* [doc] multiple corrections to "Summary of the tasks"

* fix indentation

* correction

* fix links, add links to examples/seq2seq/README.md instead of non-existing script
2020-08-17 11:49:16 -04:00
Suraj Patil
2a77813d53
[BartTokenizer] add prepare s2s batch (#6212)
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
2020-08-17 11:44:46 -04:00
Stas Bekman
84d33317ae
[doc] make the text more readable, fix some typos, add some disambiguation (#6508)
* [doc] make the text more readable, fix some typos, add some disambiguation

* Update docs/source/glossary.rst

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-08-17 11:07:58 -04:00
Joe Davison
d0c2389f48
add custom datasets tutorial (#6466)
* add custom datasets tutorial

* python -> bash code blocks

* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* minor review feedback changes

* add working native QA snippet

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-08-17 09:15:34 -04:00
Sam Shleifer
d2da2cb232
allow spaces in bash args with "$@" (#6521) 2020-08-17 09:06:35 -04:00
Funtowicz Morgan
b41cc0b86a
Fix flaky ONNX tests (#6531) 2020-08-17 09:04:35 -04:00
Stas Bekman
39c3b1d9de
[sched] polynomial_decay_schedule use default power=1.0 (#6473) 2020-08-17 08:33:12 -04:00
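A hedged usage sketch (the scheduler is get_polynomial_decay_schedule_with_warmup in transformers.optimization; with the default power=1.0 the decay is linear after warmup):

import torch
from transformers.optimization import get_polynomial_decay_schedule_with_warmup

model = torch.nn.Linear(10, 2)  # placeholder model
optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)
scheduler = get_polynomial_decay_schedule_with_warmup(
    optimizer, num_warmup_steps=100, num_training_steps=1000
)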
Stas Bekman
9dbe4094f2
[testing] a new TestCasePlus subclass + get_auto_remove_tmp_dir() (#6494)
* [testing] switch to a new TestCasePlus + get_auto_remove_tmp_dir() for auto-removal of tmp dirs

* respect after=True for tempfile, simplify code

* comments

* comment fix

* put `before` last in args, so debugging can be even faster
2020-08-17 08:12:19 -04:00
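A hedged usage sketch of the new test helper (method name as introduced here; the exact signature may vary):

from transformers.testing_utils import TestCasePlus

class TmpDirExampleTest(TestCasePlus):
    def test_writes_scratch_files(self):
        # The directory is removed automatically after the test unless the
        # test opts to keep it for debugging.
        tmp_dir = self.get_auto_remove_tmp_dir()
        with open(f"{tmp_dir}/dump.txt", "w") as f:
            f.write("scratch output")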
Patrick von Platen
36010cb1e2
fix pegasus doc (#6533) 2020-08-17 12:24:43 +02:00
Kevin Canwen Xu
37709b5909
Remove deprecated assertEquals (#6532)
`assertEquals` is deprecated: https://stackoverflow.com/questions/930995/assertequals-vs-assertequal-in-python/931011
This PR replaces these deprecated methods.
2020-08-17 17:13:58 +08:00
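For reference, the preferred spelling in a minimal unittest example (not code from the PR itself):

import unittest

class DeprecationExample(unittest.TestCase):
    def test_preferred_spelling(self):
        # assertEquals is a deprecated alias; assertEqual is the supported name.
        self.assertEqual(2 + 2, 4)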