Julien Chaumond
5cb463a714
Update test_tokenization_auto.py
2020-01-13 22:38:29 -05:00
Julien Chaumond
afc24ea5d4
In a parallel setup this could fail
2020-01-13 23:44:08 +00:00
Julien Chaumond
894812c652
Fixup mapping
2020-01-13 23:34:19 +00:00
Julien Chaumond
b20f11d4ca
🔫 Python35
2020-01-13 23:20:44 +00:00
Julien Chaumond
0304628590
Map configs to models and tokenizers
2020-01-13 23:11:44 +00:00
Julien Chaumond
1fc855e456
[tests] Safety checks on CONFIG_MAPPING
2020-01-13 21:52:55 +00:00
Julien Chaumond
3c86b6f3c5
Py35 doesn't like inline variable types
2020-01-13 20:44:33 +00:00
Julien Chaumond
b803b067bf
Config to Model mapping
2020-01-13 20:05:20 +00:00
Thomas Wolf
896a0eb1fd
Merge pull request #2459 from Perseus14/patch-4
...
Update pipelines.py
2020-01-13 16:02:54 +01:00
Morgan Funtowicz
0d6c17fc1b
black formatting
2020-01-13 11:18:27 +01:00
IWillPull
a3085020ed
Added repetition penalty to PPLM example ( #2436 )
...
* Added repetition penalty
* Default PPLM repetition_penalty to neutral
* Minor modifications to comply with reviewer's suggestions. (j -> token_idx)
* Formatted code with `make style`
2020-01-10 23:00:07 -05:00
Julien Chaumond
cf8a70bf68
More AutoConfig tests
2020-01-11 03:43:57 +00:00
Julien Chaumond
6bb3edc300
Serialize model_type if exists
2020-01-11 03:18:56 +00:00
Julien Chaumond
c6f682c1eb
flake
2020-01-11 03:18:31 +00:00
Julien Chaumond
4d1c98c012
AutoConfig + other Auto classes honor model_type
2020-01-11 02:46:17 +00:00
Julien Chaumond
2f32dfd33b
Convention: name mixins mixins
2020-01-11 01:24:29 +00:00
VictorSanh
e83d9f1c1d
cleaning - change ' to " (black requirements)
2020-01-10 19:34:25 -05:00
VictorSanh
ebba9e929d
minor spring cleaning - missing configs + processing
2020-01-10 19:14:58 -05:00
Julien Chaumond
055e80cfad
rm old ConfigTester
2020-01-10 21:36:18 +00:00
Thomas Wolf
b1e1a9f9b2
Merge pull request #2495 from mschrimpf/patch-1
...
T5: move rp_bucket to relative_attention_bias' device
2020-01-10 22:18:54 +01:00
Julien Chaumond
fd8423321f
keep list sorted
2020-01-10 20:36:46 +00:00
Julien Chaumond
0cd81fb99f
[isort] declare more third-parties in case no tf install
2020-01-10 20:35:45 +00:00
Martin Schrimpf
90d3b787f6
move rp_bucket to relative_attention_bias' device
...
otherwise, `rp_bucket` will always be on cpu and fail if `self.relative_attention_bias` is on cuda
2020-01-10 15:09:10 -05:00
Julien Chaumond
84c0aa1868
num_parameters helper
2020-01-10 17:40:02 +00:00
Victor SANH
331065e62d
missing import
2020-01-10 11:42:53 +01:00
Victor SANH
414e9e7122
indents test
2020-01-10 11:42:53 +01:00
Victor SANH
3cdb38a7c0
indents
2020-01-10 11:42:53 +01:00
Victor SANH
ebd45980a0
Align with run_squad
+ fix some errors
2020-01-10 11:42:53 +01:00
Victor SANH
45634f87f8
fix Sampler in distributed training - evaluation
2020-01-10 11:42:53 +01:00
Victor SANH
af1ee9e648
Move torch.nn.utils.clip_grad_norm_
2020-01-10 11:42:53 +01:00
Lysandre
164c794eb3
New SQuAD API for distillation script
2020-01-10 11:42:53 +01:00
Lysandre
801f2ac8c7
Add PRETRAINED_INIT_CONFIGURATION to DistilBERT tokenizer
2020-01-10 11:42:21 +01:00
Yohei Tamura
bfec203d4e
modified: src/transformers/tokenization_utils.py
2020-01-09 12:54:28 +01:00
Julien Chaumond
f599623a99
PreTrainedTokenizerFast: hotfix _convert_encoding
...
cc @n1t0
2020-01-08 15:46:37 -05:00
Rishabh Manoj
f26a353057
Update pipelines.py
...
Modified QA pipeline to consider all features for each example before generating topk answers.
Current pipeline only takes one SquadExample, one SquadFeature, one start logit list, one end logit list to retrieve the answer, this is not correct as one SquadExample can produce multiple SquadFeatures.
2020-01-08 21:12:34 +05:30
Lysandre
16ce15ed4b
DistilBERT token type ids removed from inputs in run_squad
2020-01-08 13:18:30 +01:00
Lysandre Debut
f24232cd1b
Fix error with global step in run_squad.py
2020-01-08 11:39:00 +01:00
thomwolf
1b59b57b57
ignore_index equal -100 in T5 model
2020-01-08 09:52:10 +01:00
Romain Keramitas
569da80ced
Make doc regarding masked indices more clear.
...
Signed-off-by: Romain Keramitas <r.keramitas@gmail.com>
2020-01-07 17:37:27 +01:00
Oren Amsalem
43114b89ba
spelling correction ( #2434 )
2020-01-07 17:25:25 +01:00
Genta Indra Winata
d6a677b14b
Fix typograpical errors ( #2438 )
2020-01-07 17:21:23 +01:00
Lysandre Debut
27c1b656cc
Fix error with global step in run_lm_finetuning.py
2020-01-07 16:16:12 +01:00
Lysandre
24df44d9c7
Black version python 3.5
2020-01-07 15:53:42 +01:00
Lysandre Debut
73be60c47b
Quotes
2020-01-07 15:34:23 +01:00
Lysandre
6806f8204e
fix #2410
2020-01-07 15:20:45 +01:00
Simone Primarosa
176d3b3079
Add support for Albert and XLMRoberta for the Glue example ( #2403 )
...
* Add support for Albert and XLMRoberta for the Glue example
2020-01-07 14:55:55 +01:00
Morgan Funtowicz
9261c7f771
Remove f-string device creation on PyTorch GPU pipelines.
...
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
2020-01-07 11:46:44 +01:00
Morgan Funtowicz
91d33c798b
Fix issue on pipelines where pytorch's tensors are not copied on the user-specified GPU device.
...
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
2020-01-07 11:12:31 +01:00
Dima Galat
2926852f14
fixed formatting
2020-01-07 11:56:03 +11:00
Dima Galat
e2810edc8f
removing redundant .flush
2020-01-07 11:47:25 +11:00