transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-20 21:18:21 +06:00

Author	SHA1	Message	Date
Gunnlaugur Thor Briem	cd30f98fd2	doc: fix apparent copy-paste error in docstring (#5626 )	2020-07-14 09:47:41 +02:00
as-stevens	f867000f56	[Reformer classification head] Implement the reformer model classification head for text classification (#5198 ) * Reformer model head classification implementation for text classification * Reformat the reformer model classification code * PR review comments, and test case implementation for reformer for classification head changes * CI/CD reformer for classification head test import error fix * CI/CD test case implementation added ReformerForSequenceClassification to all_model_classes * Code formatting- fixed * Normal test cases added for reformer classification head * Fix test cases implementation for the reformer classification head * removed token_type_id parameter from the reformer classification head * fixed the test case for reformer classification head * merge conflict with master fixed * merge conflict, changed reformer classification to accept the choice_label parameter added in latest code * refactored the the reformer classification head test code * reformer classification head, common transform test cases fixed * final set of the review comment, rearranging the reformer classes and docstring add to classification forward method * fixed the compilation error and text case fix for reformer classification head * Apply suggestions from code review Remove unnecessary dup Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-07-14 09:16:22 +02:00
Gaurav Mishra	f0bda06f43	Update tokenization_t5.py (#5717 ) Minor doc fix.	2020-07-14 00:02:03 -04:00
Sam Shleifer	c3c61ea017	[Fix] github actions CI by reverting #5138 (#5686 )	2020-07-13 17:12:18 -04:00
Stas Bekman	45addfe96d	FlaubertForTokenClassification (#5644 ) * implement FlaubertForTokenClassification as a subclass of XLMForTokenClassification * fix mapping order * add the doc * add common tests	2020-07-13 14:59:53 -04:00
Patrick von Platen	7096e47513	[Longformer] fix longformer global attention output (#5659 ) * fix longformer global attention output * fix multi gpu problem * replace -10000 with 0 * better comment * make attention output equal local and global * Update src/transformers/modeling_longformer.py	2020-07-13 17:23:22 +02:00
Sylvain Gugger	ce374ba877	Fix Trainer in DataParallel setting (#5685 ) * Fix Trainer in DataParallel setting * Fix typo Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-07-13 08:37:38 -04:00
Stas Bekman	0a19a49dfe	doc improvements (#5688 )	2020-07-13 18:10:17 +08:00
Stas Bekman	443b0cad96	rename the function to match the rest of the test convention (#5692 )	2020-07-13 18:09:49 +08:00
onepointconsulting	74843695eb	Added first description of the model (#5672 ) Added general description, information about the tags and also some example usage code.	2020-07-13 02:53:48 -04:00
Kevin Canwen Xu	0befb51327	Pipeline model type check (#5679 ) * Add model type check for pipelines * Add model type check for pipelines * rename func * Fix the init parameters * Fix format * rollback unnecessary refactor	2020-07-12 12:34:21 +08:00
Kevin Canwen Xu	dc31a72f50	Add Microsoft's CodeBERT (#5683 ) * Add Microsoft's CodeBERT * link style * single modal * unused import	2020-07-11 21:37:30 +08:00
Sylvain Gugger	7fad617dc1	Document model outputs (#5673 ) * Document model outputs * Update docs/source/main_classes/output.rst Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-07-10 17:31:02 -04:00
Sylvain Gugger	df983b7483	Deprecate old past arguments (#5671 )	2020-07-10 17:25:52 -04:00
Tomo Lazovich	cdf4cd7068	[squad] add version tag to squad cache (#5669 )	2020-07-10 16:34:21 -04:00
Patrick von Platen	223084e42b	Add Reformer to notebooks	2020-07-10 18:34:25 +02:00
Julien Chaumond	201d23f285	Update The Big Table of Tasks Co-Authored-By: Suraj Patil <surajp815@gmail.com> Co-Authored-By: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-07-10 18:07:29 +02:00
Bashar Talafha	82f7bbbd93	Update README.md (#5617 ) * Update README.md * Update README.md	2020-07-10 11:43:27 -04:00
Manuel Romero	bf497376ee	Create README.md (#5572 )	2020-07-10 11:42:49 -04:00
kolk	3653d01f2a	Create README.md for electra-base-squad2 (#5574 )	2020-07-10 11:39:44 -04:00
Txus	aa69c81f29	Add freshly trained `base` version (#5621 )	2020-07-10 11:39:04 -04:00
Teven	227e0a406d	Fixed use of memories in XLNet (caching for language generation + warning when loading improper memoryless model) (#5632 ) * Pytorch gpu => cpu proper device * Memoryless XLNet warning + fixed memories during generation * Revert "Pytorch gpu => cpu proper device" This reverts commit `93489b36` * made black happy * TF generation with memories * dim => axis * added padding_text to TF XL models * Added comment, added TF	2020-07-10 17:38:36 +02:00
Manuel Romero	3b7b646563	Create README.md (#5638 )	2020-07-10 11:38:23 -04:00
Manuel Romero	0039b965db	Create model card (#5655 ) Create model card for T5-small fine-tuned on SQUAD v2	2020-07-10 11:38:11 -04:00
Nils Reimers	46982d612f	Create README.md - Model card (#5657 ) Model card for sentence-transformers/bert-base-nli-cls-token	2020-07-10 11:38:03 -04:00
Nils Reimers	c483803d1b	Create README.md - Model card (#5658 ) Model card for sentence-transformers/bert-base-nli-max-tokens	2020-07-10 11:37:56 -04:00
Sylvain Gugger	edfd82f5ff	Change model outputs types to self-document outputs (#5438 ) * [WIP] Proposal for model outputs * All Bert models * Make CI green maybe? * Fix ONNX test * Isolate ModelOutput from pt and tf * Formatting * Add Electra models * Auto-generate docstrings from outputs * Add TF outputs * Add some BERT models * Revert TF side * Remove last traces of TF changes * Fail with a clear error message * Add Albert and work through Bart * Add CTRL and DistilBert * Formatting * Progress on Bart * Renames and finish Bart * Formatting * Fix last test * Add DPR * Finish Electra and add FlauBERT * Add GPT2 * Add Longformer * Add MMBT * Add MobileBert * Add GPT * Formatting * Add Reformer * Add Roberta * Add T5 * Add Transformer XL * Fix test * Add XLM + fix XLMForTokenClassification * Style + XLMRoberta * Add XLNet * Formatting * Add doc of return_tuple arg	2020-07-10 11:36:53 -04:00
Suraj Parmar	fa265230a2	Create Model card for RoBERTa-hindi-guj-san (#5661 )	2020-07-10 11:34:23 -04:00
Sylvain Gugger	b2747af543	Improvements to PretrainedConfig documentation (#5642 ) * Update PretrainedConfig doc * Formatting * Small fixes * Forgotten args and more cleanup	2020-07-10 10:31:47 -04:00
Julien Chaumond	bfacb2e34f	[model_card] BART for ELI5 cc @yjernite	2020-07-10 08:10:24 -04:00
Nils Reimers	2e6bb0e9c3	Create README.md (#5652 )	2020-07-10 05:41:10 -04:00
Julien Chaumond	552e4591f5	[model_card] Add meta + fix link to image (hotlinking to image works on GitHub but not on external sites) cc @bashartalafha	2020-07-10 05:07:33 -04:00
Teven	02a0b43014	Fixed TextGenerationPipeline on torch + GPU (#5629 ) * Pytorch gpu => cpu proper device * Memoryless XLNet warning + fixed memories during generation * Revert "Memoryless XLNet warning + fixed memories during generation" This reverts commit `3d3251ff` * Took the operations on the generated_sequence out of the ensure_device scope	2020-07-09 16:29:32 -04:00
Sylvain Gugger	760f726e51	Add forum link in the docs (#5637 )	2020-07-09 15:13:22 -04:00
Stas Bekman	bfeaae2235	fix 404 (#5616 )	2020-07-09 15:12:29 -04:00
Lysandre Debut	b25f7802de	Should check that torch TPU is available (#5636 )	2020-07-09 13:54:32 -04:00
Lysandre Debut	3cc23eee06	More explicit error when failing to tensorize overflowing tokens (#5633 )	2020-07-09 13:35:21 -04:00
Lysandre	b9d8af07e6	Update stable doc	2020-07-09 11:06:23 -04:00
Lysandre Debut	1158e56551	Correct extension (#5631 )	2020-07-09 11:03:07 -04:00
Lysandre	5c82bf6831	Update stable doc	2020-07-09 10:16:13 -04:00
Lysandre Debut	0533cf4706	Test XLA examples (#5583 ) * Test XLA examples * Style * Using `require_torch_tpu` * Style * No need for pytest	2020-07-09 09:19:19 -04:00
Funtowicz Morgan	3bd55199cd	QA pipeline BART compatible (#5496 ) * Ensure padding and question cannot have higher probs than context. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Add bart the the list of tokenizers adding two <sep> tokens for squad_convert_example_to_feature Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Format. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Addressing @patrickvonplaten comments. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Addressing @patrickvonplaten comments about masking non-context element when generating the answer. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Addressing @sshleifer comments. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Make sure we mask CLS after handling impossible answers Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Mask in the correct vectors ... Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>	2020-07-09 15:11:40 +02:00
Stas Bekman	fa5423b169	doc fixes (#5613 )	2020-07-08 19:52:44 -04:00
Txus	7d0ef00420	Add newly trained `calbert-tiny-uncased` (#5599 ) * Create README.md Add newly trained `calbert-tiny-uncased` (complete rewrite with SentencePiece) * Add Exbert link * Apply suggestions from code review Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-07-08 17:54:51 -04:00
Lorenzo Ampil	0cc4eae0e6	Fix Inconsistent NER Grouping (Pipeline) (#4987 ) * Add B I handling to grouping * Add fix to include separate entity as last token * move last_idx definition outside loop * Use first entity in entity group as reference for entity type * Add test cases * Take out extra class accidentally added * Return tf ner grouped test to original * Take out redundant last entity * Get last_idx safely Co-authored-by: ColleterVi <36503688+ColleterVi@users.noreply.github.com> * Fix first entity comment * Create separate functions for group_sub_entities and group_entities (splitting call method to testable functions) * Take out unnecessary last_idx * Remove additional forward pass test * Move token classification basic tests to separate class * Move token classification basic tests back to monocolumninputtestcase * Move base ner tests to nerpipelinetests * Take out unused kwargs * Add back mandatory_keys argument * Add unitary tests for group_entities in _test_ner_pipeline * Fix last entity handling * Fix grouping fucntion used * Add typing to group_sub_entities and group_entities Co-authored-by: ColleterVi <36503688+ColleterVi@users.noreply.github.com>	2020-07-08 16:18:17 -04:00
Suraj Patil	82ce8488bb	create model cards for qg models (#5610 )	2020-07-08 16:08:56 -04:00
Bashar Talafha	d6b6ab11f0	Create README.md (#5601 )	2020-07-08 16:07:48 -04:00
Patrick von Platen	40d98ebf50	Update benchmark notebook (#5603 ) * Créé avec Colaboratory * delete old file	2020-07-08 16:03:59 +02:00
Sylvain Gugger	281e394889	Update question template (#5585 )	2020-07-08 08:46:35 -04:00
Patrick von Platen	f82a2a5e8e	[Benchmark] Add benchmarks for TF Training (#5594 ) * tf_train * adapt timing for tpu * fix timing * fix timing * fix timing * fix timing * update notebook * add tests	2020-07-08 12:11:09 +02:00

... 24 25 26 27 28 ...

5759 Commits