Juha Kiili
66fc8d25a5
Change ref to original GLUE downloader script
2019-12-03 10:49:50 +02:00
Juha Kiili
2421e54f8c
Add link to original source and license to download_glue_data.py
2019-11-29 15:39:28 +02:00
Juha Kiili
41aa0e8003
Refactor logs and fix loss bug
2019-11-29 15:33:25 +02:00
Juha Kiili
05d4232f63
Add valohai.yaml
2019-11-21 12:38:17 +02:00
Aarni Koskela
aac3551407
Add download_glue_data.py from kamalkraj/ALBERT-TF2.0
...
Original source: fa90194e5f/download_glue_data.py
Original license: fa90194e5f/LICENSE
(Apache-2.0)
2019-11-21 12:37:41 +02:00
Juha Kiili
2cf3447e0a
Glue: log in Valohai-compatible JSON format too
2019-11-21 12:35:25 +02:00
Thomas Wolf
0cdfcca24b
Merge pull request #1860 from stefan-it/camembert-for-token-classification
...
[WIP] Add support for CamembertForTokenClassification
2019-11-21 10:56:07 +01:00
Jin Young Sohn
e70cdf083d
Cleanup TPU bits from run_glue.py
...
The TPU runner is currently implemented in:
https://github.com/pytorch-tpu/transformers/blob/tpu/examples/run_glue_tpu.py
We plan to upstream this directly into `huggingface/transformers`
(either the `master` or the `tpu` branch) once it has been more thoroughly tested.
2019-11-20 17:54:34 -05:00
Lysandre
454455c695
fix #1879
2019-11-20 09:42:48 -05:00
Kazutoshi Shinoda
f3386d9383
typo "deay" -> "decay"
2019-11-18 11:50:06 -05:00
Stefan Schweter
56c84863a1
camembert: add support for CamemBERT in run_ner example
2019-11-18 17:06:57 +01:00
Stefan Schweter
0b3d45eb64
camembert: add implementation for save_vocabulary method
2019-11-18 15:49:44 +01:00
Julien Chaumond
3916b334a8
[camembert] Acknowledge the full author list
2019-11-18 09:29:11 -05:00
Sebastian Stabinger
44455eb5b6
Adds CamemBERT to Model architectures list
2019-11-18 09:23:14 -05:00
Stefan Schweter
33753d9139
module: import CamembertForTokenClassification
2019-11-18 14:14:54 +01:00
Stefan Schweter
d32ce2c8df
camembert: add wrapper for CamembertForTokenClassification
2019-11-18 14:14:19 +01:00
Julien Chaumond
0477b307c7
[camembert] tokenizer: use additional_special_tokens
2019-11-16 00:11:07 -05:00
Julien Chaumond
f9abf73e31
[camembert] realign w/ recent changes
2019-11-16 00:11:07 -05:00
Julien Chaumond
26858f27cb
[camembert] Upload to s3 + rename script
2019-11-16 00:11:07 -05:00
Louis MARTIN
035fea5315
Add CamemBERT to auto files and docs
2019-11-16 00:11:07 -05:00
Louis MARTIN
694d4fcbb6
Add CamemBERT classes to __init__.py
2019-11-16 00:11:07 -05:00
Louis MARTIN
3e20c2e871
Update demo_camembert.py with new classes
2019-11-16 00:11:07 -05:00
Louis MARTIN
f12e4d8da7
Move demo_camembert.py to examples/contrib
2019-11-16 00:11:07 -05:00
Louis MARTIN
fb6c70a91d
Update tokenization_camembert.py with urls
2019-11-16 00:11:07 -05:00
Louis MARTIN
e44b939e71
Add configuration_camembert.py and modeling_camembert.py
2019-11-16 00:11:07 -05:00
Louis MARTIN
6e72fd094c
Add demo_camembert.py
2019-11-16 00:11:07 -05:00
Louis MARTIN
14b3aa3b3c
Add tokenization_camembert.py
2019-11-16 00:11:07 -05:00
Thomas Wolf
74ce8de7d8
Merge pull request #1792 from stefan-it/distilbert-for-token-classification
...
DistilBERT for token classification
2019-11-14 22:47:53 +01:00
Thomas Wolf
05db5bc1af
added small comparison between BERT, RoBERTa and DistilBERT
2019-11-14 22:40:22 +01:00
Thomas Wolf
9629e2c676
Merge pull request #1804 from ronakice/master
...
fix multi-gpu eval in torch examples
2019-11-14 22:24:05 +01:00
Thomas Wolf
5b322a36db
Merge pull request #1811 from huggingface/special-tokens
...
Fix special tokens addition in decoder #1807
2019-11-14 22:17:24 +01:00
Thomas Wolf
1a237d7f42
Merge pull request #1831 from iedmrc/gpt2-tokenization-sum-func-replacement
...
sum() is replaced by itertools.chain.from_iterable()
2019-11-14 22:11:54 +01:00
Thomas Wolf
df99f8c5a1
Merge pull request #1832 from huggingface/memory-leak-schedulers
...
replace LambdaLR scheduler wrappers by function
2019-11-14 22:10:31 +01:00
Thomas Wolf
0be9ae7b3e
Merge pull request #1833 from huggingface/max-length-warning
...
Token indices sequence length is longer than the specified maximum sequence length for this model
2019-11-14 22:04:49 +01:00
Lysandre
be7f2aacce
[CI][DOC] Don't rebuild if folder exists - Correct directory.
2019-11-14 14:54:44 -05:00
Lysandre
8f8d69716a
[CI][DOC] Don't rebuild if folder exists.
2019-11-14 14:48:21 -05:00
Rémi Louf
2276bf69b7
update the examples, docs and template
2019-11-14 20:38:02 +01:00
Lysandre
d7929899da
Specify checkpoint in saved file for run_lm_finetuning.py
2019-11-14 10:49:00 -05:00
Lysandre
a67e747889
Reorganized max_len warning
2019-11-14 10:30:22 -05:00
Lysandre
e18f786cd5
Quickstart example showcasing past
2019-11-14 10:06:00 -05:00
Rémi Louf
022525b003
replace LambdaLR scheduler wrappers by function
...
Custom schedulers are currently instantiated by wrapping PyTorch's LambdaLR
class and passing a method of the wrapping class to the __init__
function of LambdaLR. This approach is not appropriate for two
reasons:
1. one does not need to define a class when it only defines an
__init__() method;
2. instantiating the parent class by passing a method of the child class
creates a cyclical reference which leads to memory leaks. See issues #1742 and #1134.
In this commit we replace the wrapper classes with functions that
instantiate `LambdaLR` with a custom learning rate function. We use a
closure to specify the parameters of the latter. We also do a bit of
renaming within the functions to make the behaviour explicit, and remove
docstrings that became unnecessary as a result.
2019-11-14 15:39:08 +01:00
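The reference cycle described in the commit above can be illustrated with a minimal sketch. It deliberately avoids importing PyTorch: `LambdaLRStandIn` is a hypothetical stand-in that, for this illustration only, just stores the callable it is given, and `WarmupWrapper`, `get_warmup_schedule` and `freed_without_gc` are illustrative names, not the ones used in the repository. The warmup formula is likewise only an example.

```python
import gc
import weakref


class LambdaLRStandIn:
    """Hypothetical stand-in for LambdaLR: it only stores the callable."""

    def __init__(self, lr_lambda):
        self.lr_lambda = lr_lambda

    def factor(self, step):
        return self.lr_lambda(step)


class WarmupWrapper(LambdaLRStandIn):
    """Old pattern: the child class passes its own bound method to the parent."""

    def __init__(self, warmup_steps):
        self.warmup_steps = warmup_steps
        # self -> self.lr_lambda (bound method) -> self: a reference cycle.
        super().__init__(self._factor)

    def _factor(self, step):
        return min(1.0, step / max(1, self.warmup_steps))


def get_warmup_schedule(warmup_steps):
    """New pattern: a closure captures the parameter, so no cycle is created."""

    def lr_lambda(step):
        return min(1.0, step / max(1, warmup_steps))

    return LambdaLRStandIn(lr_lambda)


def freed_without_gc(make):
    """Return True if the object is reclaimed by reference counting alone."""
    gc.disable()
    try:
        obj = make()
        ref = weakref.ref(obj)
        del obj
        return ref() is None
    finally:
        gc.enable()
        gc.collect()


closure_freed = freed_without_gc(lambda: get_warmup_schedule(10))
wrapper_freed = freed_without_gc(lambda: WarmupWrapper(10))
```

Under CPython, the closure-based object is released as soon as its last reference goes away, while the wrapper-based object survives until the cyclic garbage collector runs. Both variants compute the same multiplicative factor for a given step.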
İbrahim Ethem Demirci
7627dde1f8
sum() is an inefficient way to flatten a list of string lists, so it has been replaced by itertools.chain.from_iterable()
2019-11-14 17:06:15 +03:00
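As a small illustration of the replacement described in the commit above, the sketch below flattens a list of string lists both ways. The function names and sample data are made up for this example and are not taken from the tokenizer code.

```python
from itertools import chain


def flatten_with_sum(nested):
    # Each addition builds a new list, so the total copying work grows
    # quadratically with the number of sublists.
    return sum(nested, [])


def flatten_with_chain(nested):
    # chain.from_iterable walks the sublists lazily; list() copies each
    # element exactly once.
    return list(chain.from_iterable(nested))


nested_tokens = [["Hello", ","], ["world"], [], ["!", "?"]]
flat_sum = flatten_with_sum(nested_tokens)
flat_chain = flatten_with_chain(nested_tokens)
```

Both functions return the same flat list; the difference is only in how much intermediate copying is done, which matters once the number of sublists is large.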
Lysandre
74d0bcb6ff
Fix special tokens addition in decoder
2019-11-12 15:27:57 -05:00
Julien Chaumond
155c782a2c
[inputs_embeds] All TF models + tests
2019-11-12 11:29:21 -05:00
Julien Chaumond
2aef2f0bbc
[common attributes] Fix previous commit for transfo-xl
2019-11-12 11:29:21 -05:00
Julien Chaumond
2f17464266
[common attributes] Slightly sharper test coverage
2019-11-12 11:29:21 -05:00
Julien Chaumond
9d2398fd99
Ooopsie
2019-11-12 11:29:21 -05:00
Julien Chaumond
70d97ddd60
[TF models] Common attributes as per #1721
2019-11-12 11:29:21 -05:00
Julien Chaumond
872403be1c
This is not a @property after all
2019-11-12 11:29:21 -05:00
Julien Chaumond
dd6b2e05e1
whitespace
2019-11-12 11:29:21 -05:00