transformers/docs/source
Forrest Iandola 02ef825be2
SqueezeBERT architecture (#7083)
* configuration_squeezebert.py

thin wrapper around bert tokenizer

fix typos

wip sb model code

wip modeling_squeezebert.py. Next step is to get the multi-layer-output interface working

set up squeezebert to use BertModelOutput when returning results.

squeezebert documentation

formatting

allow head mask that is an array of [None, ..., None]

docs

docs cont'd

path to vocab

docs and pointers to cloud files (WIP)

line length and indentation

squeezebert model cards

formatting of model cards

untrack modeling_squeezebert_scratchpad.py

update aws paths to vocab and config files

get rid of stub of NSP code, and advise users to pretrain with mlm only

fix rebase issues

redo rebase of modeling_auto.py

fix issues with code formatting

more code format auto-fixes

move squeezebert before bert in tokenization_auto.py and modeling_auto.py because squeezebert inherits from bert

tests for squeezebert modeling and tokenization

fix typo

move squeezebert before bert in modeling_auto.py to fix inheritance problem

disable test_head_masking, since squeezebert doesn't yet implement head masking

fix issues exposed by the test_modeling_squeezebert.py

fix an issue exposed by test_tokenization_squeezebert.py

fix issue exposed by test_modeling_squeezebert.py

auto generated code style improvement

issue that we inherited from modeling_xxx.py: SqueezeBertForMaskedLM.forward() calls self.cls(), but there is no self.cls, and I think the goal was actually to call self.lm_head()

update copyright

resolve failing 'test_hidden_states_output' and remove unused encoder_hidden_states and encoder_attention_mask

docs

add integration test. rename squeezebert-mnli --> squeezebert/squeezebert-mnli

autogenerated formatting tweaks

integrate feedback from patrickvonplaten and sgugger to programming style and documentation strings

* tiny change to order of imports
2020-10-05 04:25:43 -04:00
..
_static Add documentation for v3.3.1 2020-09-29 14:31:43 -04:00
imgs Guide to fixed-length model perplexity evaluation (#5449) 2020-07-07 16:04:15 -06:00
internal Models doc (#7345) 2020-09-23 13:20:45 -04:00
main_classes Formatter (#7368) 2020-09-24 10:59:21 -04:00
model_doc SqueezeBERT architecture (#7083) 2020-10-05 04:25:43 -04:00
benchmarks.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
bertology.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
conf.py Release: v3.3.1 2020-09-29 14:17:34 -04:00
contributing.md Update installation page and add contributing to the doc (#5084) 2020-06-17 14:01:10 -04:00
converting_tensorflow_models.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
custom_datasets.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
examples.md per_device instead of per_gpu/error thrown when argument unknown (#4618) 2020-05-27 11:36:55 -04:00
favicon.ico Adding usage examples for common tasks (#2850) 2020-02-25 13:48:24 -05:00
glossary.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
index.rst SqueezeBERT architecture (#7083) 2020-10-05 04:25:43 -04:00
installation.md Make transformers install check positive (#7473) 2020-09-30 07:44:40 -04:00
migration.md Add hugs (#5225) 2020-06-24 07:56:14 -04:00
model_sharing.rst docs: fix model sharing file names (#5855) 2020-09-28 08:17:30 -04:00
model_summary.rst Document RAG again (#7377) 2020-09-28 08:31:46 -04:00
multilingual.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
notebooks.md Update notebooks (#3620) 2020-04-06 14:32:39 -04:00
perplexity.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
philosophy.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
preprocessing.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
pretrained_models.rst SqueezeBERT architecture (#7083) 2020-10-05 04:25:43 -04:00
quicktour.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
serialization.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
task_summary.rst Add forgotten return_dict argument in the docs (#7483) 2020-10-01 04:41:29 -04:00
testing.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
tokenizer_summary.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
training.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00