Stefan Schweter
3e89fca543
readme: add XLM-RoBERTa to model architecture list
2019-12-18 19:44:23 +01:00
Gunnlaugur Thor Briem
d303f84e7b
fix: wrong architecture count in README
...
Just say “the following” so that this intro doesn't so easily fall out of date :) )
2019-12-17 16:18:00 +00:00
Julien Chaumond
3f5ccb183e
[doc] Clarify uploads
...
cf 855ff0e91d (commitcomment-36452545)
2019-12-16 18:20:29 -05:00
Julien Chaumond
855ff0e91d
[doc] Model upload and sharing
...
ping @lysandrejik @thomwolf
Is this clear enough? Anything we should add?
2019-12-16 12:42:22 -05:00
Thomas Wolf
e92bcb7eb6
Merge pull request #1739 from huggingface/t5
...
[WIP] Adding Google T5 model
2019-12-14 09:40:43 +01:00
Lysandre
7bd11dda6f
Release: v2.2.2
2019-12-13 16:45:30 -05:00
thomwolf
0558c9cb9b
Merge branch 'master' into t5
2019-12-10 12:58:48 +01:00
Suvrat Bhooshan
df3961121f
Add MMBT Model to Transformers Repo
2019-12-09 18:36:48 -08:00
Pierric Cistac
5c877fe94a
fix albert links
2019-12-09 18:53:00 -05:00
Aymeric Augustin
35401fe50f
Remove dependency on pytest for running tests ( #2055 )
...
* Switch to plain unittest for skipping slow tests.
Add a RUN_SLOW environment variable for running them.
* Switch to plain unittest for PyTorch dependency.
* Switch to plain unittest for TensorFlow dependency.
* Avoid leaking open files in the test suite.
This prevents spurious warnings when running tests.
* Fix unicode warning on Python 2 when running tests.
The warning was:
UnicodeWarning: Unicode equal comparison failed to convert both arguments to Unicode - interpreting them as being unequal
* Support running PyTorch tests on a GPU.
Reverts 27e015bd
.
* Tests no longer require pytest.
* Make tests pass on cuda
2019-12-06 13:57:38 -05:00
VictorSanh
552c44a9b1
release distilm-bert
2019-12-05 10:14:58 -05:00
LysandreJik
8101924a68
Patch: v2.2.1
2019-12-03 11:20:26 -05:00
Julien Chaumond
b5d884d25c
Uniformize #1952
2019-11-27 11:05:55 -05:00
Lysandre
cf26a0c85e
Fix pretrained models table
2019-11-26 15:40:03 -05:00
Lysandre Debut
b632145273
Update master documentation link in README
2019-11-26 14:27:15 -05:00
Lysandre
ae98d45991
Release: v2.2.0
2019-11-26 14:12:44 -05:00
Julien Chaumond
176cd1ce1b
[doc] homogenize instructions slightly
2019-11-23 11:18:54 -05:00
Rémi Louf
6f70bb8c69
add instructions to run the examples
2019-11-21 14:41:19 -05:00
Julien Chaumond
3916b334a8
[camembert] Acknowledge the full author list
2019-11-18 09:29:11 -05:00
Sebastian Stabinger
44455eb5b6
Adds CamemBERT to Model architectures list
2019-11-18 09:23:14 -05:00
Thomas Wolf
df99f8c5a1
Merge pull request #1832 from huggingface/memory-leak-schedulers
...
replace LambdaLR scheduler wrappers by function
2019-11-14 22:10:31 +01:00
Rémi Louf
2276bf69b7
update the examples, docs and template
2019-11-14 20:38:02 +01:00
thomwolf
8aba81a0b6
fix #1789
2019-11-12 08:52:43 +01:00
thomwolf
f03c0c1423
adding models in readme and auto classes
2019-11-08 11:49:46 +01:00
Lysandre
68f7064a3e
Add model.train()
line to ReadMe training example
...
Co-Authored-By: Santosh-Gupta <San.Gupta.ML@gmail.com>
2019-11-04 11:52:35 -05:00
Thomas Wolf
7f84fc571a
Merge pull request #1670 from huggingface/templates
...
Templates and explanation for adding a new model and example script
2019-10-30 17:05:58 +01:00
Thomas Wolf
5c6a19a94a
Merge pull request #1604 from huggingface/deploy_doc
...
Versioning in documentation
2019-10-30 17:03:14 +01:00
thomwolf
328a86d2af
adding links to the templates in readme and contributing
2019-10-30 11:37:55 +01:00
Lysandre
b82bfbd0c3
Updated README to show all available documentation
2019-10-24 15:55:31 +00:00
Julien Chaumond
ef1b8b2ae5
[CTRL] warn if generation prompt does not start with a control code
...
see also https://github.com/salesforce/ctrl/pull/50
2019-10-22 21:30:32 +00:00
Julián Peller (dataista)
e16d46843a
Fix architectures count
2019-10-22 15:13:47 -04:00
thomwolf
4d456542e9
Fix citation
2019-10-21 16:34:14 +02:00
Lysandre Debut
c544194611
Remove special_tokens_mask
from inputs in README
...
Co-authored-by: Thomas Wolf @thomwolf
2019-10-16 11:05:13 -04:00
Emrah Budur
5a8c6e771a
Fixed the sample code in the title 'Quick tour'.
2019-10-12 14:17:17 +03:00
thomwolf
4b8f3e8f32
adding citation
2019-10-11 16:18:16 +02:00
thomwolf
d9e60f4f0d
Merge branch 'master' into pr/1383
2019-10-09 17:25:08 +02:00
Julien Chaumond
d688af19e5
Update link to swift-coreml-transformers
...
cc @lysandrejik
2019-10-08 16:37:52 -04:00
seanBE
6dc6c716c5
fix pytorch-transformers migration description in README
2019-10-07 09:59:54 +01:00
Christopher Goh
904158ac4d
Rephrase forward method to reduce ambiguity
2019-10-06 23:40:52 -04:00
Christopher Goh
0f65d8cbbe
Fix some typos in README
2019-10-06 23:40:52 -04:00
keskarnitish
dbed1c5d94
Adding CTRL (squashed commit)
...
adding conversion script
adding first draft of modeling & tokenization
adding placeholder for test files
bunch of changes
registering the tokenizer/model/etc
tests
change link; something is very VERY wrong here
weird end-of-word thingy going on
i think the tokenization works now ; wrote the unit tests
overall structure works;load w next
the monster is alive!
works after some cleanup as well
adding emacs autosave to gitignore
currently only supporting the 48 layer one; seems to infer fine on my macbook
cleanup
fixing some documentation
fixing some documentation
tests passing?
now works on CUDA also
adding greedy?
adding greedy sampling
works well
2019-10-03 22:29:03 -07:00
VictorSanh
35071007cb
incoming release 🔥 update links to arxiv preprint
2019-10-03 10:27:11 -04:00
DenysNahurnyi
6971556ab8
Fix syntax typo in README.md
2019-10-01 14:59:31 -04:00
Santosh Gupta
5c3b32d44d
Update README.md
...
Lines 183 - 200, fixed indentation. Line 198, replaced `tokenizer_class` with `BertTokenizer`, since `tokenizer_class` is not defined in the loop it belongs to.
2019-09-30 18:48:01 +00:00
wangfei
60f791631b
Fix link in readme
2019-09-28 16:20:17 +08:00
BramVanroy
15749bfc10
Add small note about the output of hidden states
2019-09-27 10:01:36 +02:00
thomwolf
6c3b131516
typo in readme/doc
2019-09-26 16:23:28 +02:00
thomwolf
4e63c90720
update installation instructions in readme
2019-09-26 16:14:21 +02:00
Lysandre Debut
0f92f76ca3
CircleCI reference in README
2019-09-26 08:59:52 -04:00
thomwolf
9676d1a2a8
update readme and setup.py
2019-09-26 13:47:58 +02:00
thomwolf
4dde31cb76
update readme
2019-09-26 12:18:26 +02:00
thomwolf
4ddc31ff40
update readme with migration change
2019-09-26 12:00:38 +02:00
thomwolf
f47f7f4611
add logo
2019-09-26 11:28:44 +02:00
thomwolf
9fabc0b6a9
wip readme
2019-09-26 11:21:34 +02:00
thomwolf
31c23bd5ee
[BIG] pytorch-transformers => transformers
2019-09-26 10:15:53 +02:00
Julien Chaumond
62760baf46
tiny fixes
2019-09-17 18:29:15 -04:00
Julien Chaumond
f9453d15e5
Fix broken link
2019-09-05 12:35:22 -04:00
Julien Chaumond
f7ee2e5d20
[README] link to Write With Transformer
2019-09-05 12:33:46 -04:00
Thomas Wolf
50e615f43d
Merge branch 'master' into improved_testing
2019-08-30 13:40:35 +02:00
thomwolf
306af132d7
update readme to mention add_special_tokens more clearly in example
2019-08-30 11:30:51 +02:00
LysandreJik
75bc2a03cc
Updated article link
2019-08-28 10:05:15 -04:00
thomwolf
912a377e90
dilbert -> distilbert
2019-08-28 13:59:42 +02:00
thomwolf
4ce5f36f78
update readmes
2019-08-28 12:14:31 +02:00
VictorSanh
497f73c964
add DilBERT to master REAME
2019-08-28 07:16:30 +00:00
thomwolf
e00b4ff1de
fix #1017
2019-08-21 22:22:17 +02:00
Nikolay Korolev
ad6e62cd82
Fix typo. configuratoin -> configuration
2019-08-20 15:43:06 +03:00
Christophe Bourguignat
189ff9b664
Update README after RoBERTa addition
2019-08-17 13:18:37 -04:00
LysandreJik
9d0029e215
Added RoBERTa example to README
2019-08-15 17:17:35 -04:00
Lysandre Debut
88efc65bac
Merge pull request #964 from huggingface/RoBERTa
...
RoBERTa: model conversion, inference, tests 🔥
2019-08-15 11:11:10 -04:00
Julien Chaumond
c4ef103447
[RoBERTa] First 4 authors
...
cf. https://github.com/huggingface/pytorch-transformers/pull/964#discussion_r313574354
Co-Authored-By: Myle Ott <myleott@fb.com>
2019-08-14 12:31:09 -04:00
carefree0910
a7b4cfe919
Update README.md
...
I assume that it should test the `re-load` functionality after testing the `save` functionality, however I'm also surprised that nobody points this out after such a long time, so maybe I've misunderstood the purpose. This PR is just in case :)
2019-08-12 09:53:05 -04:00
LysandreJik
d2cc6b101e
Merge branch 'master' into RoBERTa
2019-08-08 09:42:05 -04:00
Christopher Goh
a6f412da01
Fixed typo in migration guide
2019-08-07 02:19:14 +08:00
Thomas Wolf
d43dc48b34
Merge branch 'master' into auto_models
2019-08-05 19:17:35 +02:00
thomwolf
7223886dc9
fix #944
2019-08-05 17:16:56 +02:00
thomwolf
58830807d1
inidicate we only support pytorch 1.0.0+ now
2019-08-05 14:38:59 +02:00
thomwolf
328afb7097
cleaning up tokenizer tests structure (at last) - last remaining ppb refs
2019-08-05 14:08:56 +02:00
Julien Chaumond
05c083520a
[RoBERTa] model conversion, inference, tests 🔥
2019-08-04 21:39:21 -04:00
thomwolf
009273dbdd
big doc update [WIP]
2019-08-04 12:14:57 +02:00
Julien Chaumond
44dd941efb
link to swift-coreml-transformers
2019-08-01 09:50:30 -04:00
Anthony MOI
f2a3eb987e
Fix small typos
2019-07-31 11:05:06 -04:00
Pierric Cistac
97091acb8c
Small spelling fix
2019-07-31 10:37:56 -04:00
Grégory Châtel
769bb643ce
Fixing a broken link.
2019-07-31 10:22:41 -04:00
Thomas Wolf
fec76a481d
Update readme
2019-07-23 16:05:29 +02:00
thomwolf
ba52fe69d5
update breaking change section regarding from_pretrained keyword arguments
2019-07-23 15:10:02 +02:00
rish-16
2f869dc665
Fixed typo
2019-07-21 11:05:36 -04:00
Thomas Wolf
dbecfcf321
Merge pull request #815 from praateekmahajan/update-readme-link
...
Update Readme link for Fine Tune/Usage section
2019-07-18 18:30:32 +02:00
Peiqin Lin
acc48a0cc9
typos
2019-07-18 09:54:04 -04:00
Praateek Mahajan
0d46b17553
Update Readme
...
Incorrect link for `Quick tour: Fine-tuning/usage scripts`
2019-07-17 22:50:10 -07:00
thomwolf
c5b3d86a91
Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT
2019-07-16 21:21:05 +02:00
thomwolf
6b70760204
typos
2019-07-16 21:21:03 +02:00
Thomas Wolf
b33a385091
update readme
2019-07-16 16:18:37 +02:00
thomwolf
6a72d9aa52
updated examples in readme
2019-07-16 16:09:29 +02:00
thomwolf
b59043bf8f
update readme
2019-07-16 16:03:48 +02:00
thomwolf
edc79acb3b
simpler quick tour
2019-07-16 16:02:32 +02:00
thomwolf
5c82d3488f
indicate default evaluation in breaking changes
2019-07-16 15:45:58 +02:00
thomwolf
4acaa65068
model in evaluation mode by default after from_pretrained
2019-07-16 15:41:57 +02:00
thomwolf
1849aa7d39
update readme and pretrained model weight files
2019-07-16 15:11:29 +02:00
thomwolf
43e0e8fa04
updates to readme and doc
2019-07-16 13:56:47 +02:00
thomwolf
352e3ff998
added migration guide to readme
2019-07-16 09:03:49 +02:00
thomwolf
8ad7e5b4f2
indeed
2019-07-16 00:29:15 +02:00
thomwolf
064d0a0b76
update readme
2019-07-16 00:21:33 +02:00
thomwolf
3b8b0e01bb
update readme
2019-07-16 00:12:55 +02:00
thomwolf
2397f958f9
updating examples and doc
2019-07-14 23:20:10 +02:00
thomwolf
6135de2fa3
readme update
2019-07-11 15:39:49 +02:00
thomwolf
e468192e2f
Merge branch 'pytorch-transformers' into xlnet
2019-07-09 17:05:37 +02:00
LysandreJik
ab30651802
Hugging Face theme.
2019-07-08 16:05:26 -04:00
thomwolf
eb91f6437e
update readme and setup
2019-07-05 12:30:15 +02:00
thomwolf
0231ba291e
circle-ci
2019-07-05 11:59:04 +02:00
thomwolf
0bab55d5d5
[BIG] name change
2019-07-05 11:55:36 +02:00
thomwolf
93e9971c54
fix tests
2019-06-26 10:02:45 +02:00
thomwolf
e55d4c4ede
various updates to conversion, models and examples
2019-06-26 00:57:53 +02:00
thomwolf
603c513b35
update main conversion script and readme
2019-06-25 10:45:07 +02:00
thomwolf
62d78aa37e
updating GLUE utils for compatibility with XLNet
2019-06-24 14:36:11 +02:00
thomwolf
c304593d8f
BERTology details in readme
2019-06-20 10:05:06 +02:00
thomwolf
34d706a0e1
pruning in bertology
2019-06-19 15:25:49 +02:00
thomwolf
dc8e0019b7
updating examples
2019-06-19 13:23:20 +02:00
thomwolf
68ab9599ce
small fix and updates to readme
2019-06-19 09:38:38 +02:00
thomwolf
4d8c4337ae
test barrier in distrib training
2019-06-18 22:41:28 +02:00
thomwolf
15ebd67d4e
cache in run_classifier + various fixes to the examples
2019-06-18 15:58:22 +02:00
thomwolf
d82e5deeb1
set find_unused_parameters=True in DDP
2019-06-18 12:13:14 +02:00
thomwolf
f964753090
explanation on the current location of the caching folder
2019-06-18 11:36:28 +02:00
thomwolf
382e2d1e50
spliting config and weight files for bert also
2019-06-18 10:37:16 +02:00
thomwolf
4447f270b2
updating hub
2019-06-17 16:21:28 +02:00
thomwolf
33d3db5c43
updating head masking, readme and docstrings
2019-06-17 15:51:28 +02:00
thomwolf
34858ae1d9
adding bert whole words, bertgerman and gpt-2 medium models, head masking
2019-06-17 11:02:39 +02:00
timoeller
16af9ff7b0
Add German Bert model to code, update readme
2019-06-14 17:42:46 +02:00
Colanim
1eba8b9d96
Fix link in README
2019-05-30 14:01:46 +09:00
lukovnikov
331a46ff04
- replaced OpenAIGPTAdam with OpenAIAdam in docs
2019-04-25 16:04:37 +02:00
lukovnikov
704037ad51
- updated docs for new LR API
...
- added some images for illustration
- updated comments in optimization
2019-04-25 15:59:39 +02:00
thomwolf
18a8a15f78
improving GPT2 tokenization and adding tests
2019-04-16 17:00:55 +02:00
thomwolf
1135f2384a
clean up logger in examples for distributed case
2019-04-15 15:22:40 +02:00
thomwolf
cc43307023
update readme
2019-04-15 15:06:10 +02:00
thomwolf
60ea6c59d2
added best practices for serialization in README and examples
2019-04-15 15:00:33 +02:00
thomwolf
20577d8a7c
add configuration serialization to readme
2019-04-15 14:21:41 +02:00
thomwolf
b17963d82f
update readme
2019-04-15 13:44:30 +02:00
Weixin Wang
f26ce6992e
Fix links in README
2019-04-02 17:20:32 +08:00
Sepehr Sameni
b588ff362a
fix lm_finetuning's link
2019-03-29 12:39:24 +04:30
Thomas Wolf
694e2117f3
Merge pull request #388 from ananyahjha93/master
...
Added remaining GLUE tasks to 'run_classifier.py'
2019-03-28 09:06:53 +01:00
Thomas Wolf
bbff03fbfc
Merge pull request #394 from desireevl/master
...
Minor change in README
2019-03-27 12:03:00 +01:00
thomwolf
34561e61a5
update main readme also
2019-03-27 12:00:04 +01:00
Ananya Harsh Jha
f471979167
added GLUE dev set results and details on how to run GLUE tasks
2019-03-21 15:38:30 -04:00
Desiree Vogt-Lee
d52f914e24
weigths to weights
2019-03-21 15:02:59 +10:00
Junjie Qian
d648a02203
Correct line number in README for classes
2019-03-08 16:28:03 -08:00
thomwolf
7cc35c3104
fix openai gpt example and updating readme
2019-03-06 11:43:21 +01:00
thomwolf
906b638efa
updating readme
2019-03-06 10:24:19 +01:00
John Hewitt
e14c6b52e3
add BertTokenizer flag to skip basic tokenization
2019-02-26 20:11:24 -08:00
Joel Grus
8722e9eb3b
finish updating docstrings
2019-02-23 06:31:59 -08:00
Stanislas Polu
ff22b3acc0
Few small nits in GPT-2's code examples
2019-02-21 09:15:27 +00:00
Tong Guo
09efcece75
Update README.md
2019-02-21 11:25:33 +08:00