thomwolf
|
411981a080
|
remove slow circle-ci
|
2019-06-20 08:54:18 +02:00 |
|
chrislarson1
|
716cc1c4d9
|
added main() for programmatic call to convert pytorch->tf
|
2019-06-19 23:18:57 -04:00 |
|
chrislarson1
|
a8e071c690
|
added notebook to check correctness of the pytorch->tensorflow conversion
|
2019-06-19 23:08:08 -04:00 |
|
chrislarson1
|
0a4fb0da57
|
Merge remote-tracking branch 'upstream/master' into convert-back-to-tf
merging in latest changes from upstream
|
2019-06-19 22:56:20 -04:00 |
|
thomwolf
|
edfe91c36e
|
first version bertology ok
|
2019-06-19 23:43:04 +02:00 |
|
thomwolf
|
7766ce66dd
|
update bertology
|
2019-06-19 22:29:51 +02:00 |
|
thomwolf
|
7f00a36e27
|
pruning should keep on device
|
2019-06-19 22:23:12 +02:00 |
|
thomwolf
|
e4b46d86ce
|
update head pruning
|
2019-06-19 22:16:30 +02:00 |
|
timoeller
|
939cf29157
|
Adjust s3 german Bert file storage
|
2019-06-19 18:38:42 +02:00 |
|
thomwolf
|
0f40e8d6a6
|
debugger
|
2019-06-19 15:38:46 +02:00 |
|
thomwolf
|
0e1e8128bf
|
more logging
|
2019-06-19 15:35:49 +02:00 |
|
thomwolf
|
909d4f1af2
|
cuda again
|
2019-06-19 15:32:10 +02:00 |
|
thomwolf
|
14f0e8e557
|
fix cuda
|
2019-06-19 15:29:28 +02:00 |
|
thomwolf
|
34d706a0e1
|
pruning in bertology
|
2019-06-19 15:25:49 +02:00 |
|
thomwolf
|
dc8e0019b7
|
updating examples
|
2019-06-19 13:23:20 +02:00 |
|
thomwolf
|
68ab9599ce
|
small fix and updates to readme
|
2019-06-19 09:38:38 +02:00 |
|
thomwolf
|
f7e2ac01ea
|
update barrier
|
2019-06-18 22:43:35 +02:00 |
|
thomwolf
|
4d8c4337ae
|
test barrier in distrib training
|
2019-06-18 22:41:28 +02:00 |
|
thomwolf
|
3359955622
|
updating run_classif
|
2019-06-18 22:23:10 +02:00 |
|
thomwolf
|
29b7b30eaa
|
updating evaluation on a single gpu
|
2019-06-18 22:20:21 +02:00 |
|
thomwolf
|
7d2001aa44
|
overwrite_output_dir
|
2019-06-18 22:13:30 +02:00 |
|
thomwolf
|
16a1f338c4
|
fixing
|
2019-06-18 17:06:31 +02:00 |
|
thomwolf
|
92e0ad5aba
|
no numpy
|
2019-06-18 17:00:52 +02:00 |
|
thomwolf
|
4e6edc3274
|
hop
|
2019-06-18 16:57:15 +02:00 |
|
thomwolf
|
f55b60b9ee
|
fixing again
|
2019-06-18 16:56:52 +02:00 |
|
thomwolf
|
8bd9118294
|
quick fix
|
2019-06-18 16:54:41 +02:00 |
|
thomwolf
|
3e847449ad
|
fix out_label_ids
|
2019-06-18 16:53:31 +02:00 |
|
thomwolf
|
aad3a54e9c
|
fix paths
|
2019-06-18 16:48:04 +02:00 |
|
thomwolf
|
40dbda6871
|
updating classification example
|
2019-06-18 16:45:52 +02:00 |
|
thomwolf
|
7388c83b60
|
update run_classifier for distributed eval
|
2019-06-18 16:32:49 +02:00 |
|
thomwolf
|
9727723243
|
fix pickle
|
2019-06-18 16:02:42 +02:00 |
|
thomwolf
|
9710b68dbc
|
fix pickles
|
2019-06-18 16:01:15 +02:00 |
|
thomwolf
|
15ebd67d4e
|
cache in run_classifier + various fixes to the examples
|
2019-06-18 15:58:22 +02:00 |
|
thomwolf
|
e6e5f19257
|
fix
|
2019-06-18 14:45:14 +02:00 |
|
thomwolf
|
a432b3d466
|
distributed training t_total
|
2019-06-18 14:39:09 +02:00 |
|
thomwolf
|
c5407f343f
|
split squad example in two
|
2019-06-18 14:29:03 +02:00 |
|
thomwolf
|
335f57baf8
|
only on main process
|
2019-06-18 14:03:46 +02:00 |
|
thomwolf
|
326944d627
|
add tensorboard to run_squad
|
2019-06-18 14:02:42 +02:00 |
|
thomwolf
|
d82e5deeb1
|
set find_unused_parameters=True in DDP
|
2019-06-18 12:13:14 +02:00 |
|
thomwolf
|
a59abedfb5
|
DDP update
|
2019-06-18 12:06:26 +02:00 |
|
thomwolf
|
2ef5e0de87
|
switch to pytorch DistributedDataParallel
|
2019-06-18 12:03:13 +02:00 |
|
thomwolf
|
9ce37af99b
|
oups
|
2019-06-18 11:47:54 +02:00 |
|
thomwolf
|
a40955f071
|
no need to duplicate models anymore
|
2019-06-18 11:46:14 +02:00 |
|
Thomas Wolf
|
3763f8944d
|
Merge pull request #696 from huggingface/split_config_weights
Split config weights
|
2019-06-18 11:42:57 +02:00 |
|
thomwolf
|
f964753090
|
explanation on the current location of the caching folder
|
2019-06-18 11:36:28 +02:00 |
|
thomwolf
|
868de8d1d7
|
updating weights loading
|
2019-06-18 10:58:20 +02:00 |
|
thomwolf
|
64e0adda81
|
better error message
|
2019-06-18 10:51:31 +02:00 |
|
thomwolf
|
382e2d1e50
|
splitting config and weight files for bert also
|
2019-06-18 10:37:16 +02:00 |
|
Thomas Wolf
|
a6f2511811
|
Merge pull request #694 from huggingface/release_0.6.3
Release 0.6.3
|
2019-06-17 16:27:25 +02:00 |
|
thomwolf
|
4447f270b2
|
updating hub
|
2019-06-17 16:21:28 +02:00 |
|