transformers/docs/source
Li-Huai (Allan) Lin 22454ae492
Add REALM (#13292)
* REALM initial commit

* Retriever OK (Update new_gelu).

* Encoder prediction score OK

* Encoder pretrained model OK

* Update retriever comments

* Update docs, tests, and imports

* Prune unused models

* Make embedder as a module `RealmEmbedder`

* Add RealmRetrieverOutput

* Update tokenization

* Pass all tests in test_modeling_realm.py

* Prune RealmModel

* Update docs

* Add training test.

* Remove completed TODO

* Style & Quality

* Prune `RealmModel`

* Fixup

* Changes:
1. Remove RealmTokenizerFast
2. Update docstrings
3. Add a method to RealmTokenizer to handle candidates tokenization.

* Fix up

* Style

* Add tokenization tests

* Update `from_pretrained` tests

* Apply suggestions

* Style & Quality

* Copy BERT model

* Fix comment to avoid docstring copying

* Make RealmBertModel private

* Fix bug

* Style

* Basic QA

* Save

* Complete reader logits

* Add searcher

* Complete searcher & reader

* Move block records init to constructor

* Fix training bug

* Add some outputs to RealmReader

* Add finetuned checkpoint variable names parsing

* Fix bug

* Update REALM config

* Add RealmForOpenQA

* Update convert_tfrecord logits

* Fix bugs

* Complete imports

* Update docs

* Update naming

* Add brute-force searcher

* Pass realm model tests

* Style

* Exclude RealmReader from common tests

* Fix

* Fix

* convert docs

* up

* up

* more make style

* up

* upload

* up

* Fix

* Update src/transformers/__init__.py

* adapt testing

* change modeling code

* fix test

* up

* up

* up

* correct more

* make retriever work

* update

* make style

* finish main structure

* Resolve merge conflict

* Make everything work

* Style

* Fixup

* Fixup

* Update training test

* fix retriever

* remove hardcoded path

* Fix

* Fix modeling test

* Update model links

* Initial retrieval test

* Fix modeling test

* Complete retrieval tests

* Fix

* style

* Fix tests

* Fix docstring example

* Minor fix of retrieval test

* Update license headers and docs

* Apply suggestions from code review

* Style

* Apply suggestions from code review

* Add an example to RealmEmbedder

* Fix

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-01-18 07:24:13 -05:00
internal Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
main_classes Add ONNX configuration classes to docs (#15121) 2022-01-12 16:33:32 +01:00
model_doc Add REALM (#13292) 2022-01-18 07:24:13 -05:00
_config.py Prevent style_doc from tampering with the config file 2021-12-10 15:31:43 -05:00
_toctree.yml Add REALM (#13292) 2022-01-18 07:24:13 -05:00
add_new_model.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
add_new_pipeline.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
benchmarks.mdx [doc] normalize HF Transformers string (#15023) 2022-01-10 08:44:33 -08:00
bertology.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
community.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
contributing.md Update installation page and add contributing to the doc (#5084) 2020-06-17 14:01:10 -04:00
converting_tensorflow_models.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
custom_datasets.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
debugging.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
examples.md per_device instead of per_gpu/error thrown when argument unknown (#4618) 2020-05-27 11:36:55 -04:00
fast_tokenizers.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
glossary.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
index.mdx Add REALM (#13292) 2022-01-18 07:24:13 -05:00
installation.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
migration.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
model_sharing.mdx Update model_sharing.mdx (#15142) 2022-01-13 12:26:02 -05:00
model_summary.mdx Add "open in hf spaces" gradio button issue #73 (#15106) 2022-01-14 10:12:30 -05:00
multilingual.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
notebooks.md Update notebooks (#3620) 2020-04-06 14:32:39 -04:00
parallelism.mdx [doc] Update parallelism.mdx (#15018) 2022-01-04 09:58:27 -08:00
performance.mdx [doc] new MoE paper (#15184) 2022-01-17 09:10:51 -08:00
perplexity.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
philosophy.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
pr_checks.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
preprocessing.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
quicktour.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
sagemaker.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
serialization.mdx Add ONNX configuration classes to docs (#15121) 2022-01-12 16:33:32 +01:00
task_summary.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
testing.mdx [doc] normalize HF Transformers string (#15023) 2022-01-10 08:44:33 -08:00
tokenizer_summary.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
training.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
troubleshooting.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00