transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-20 13:08:21 +06:00

amyeroberts 8581a798c0 Add TF DeiT implementation (#17806)	2022-07-13 18:04:08 +01:00

* Initial TF DeiT implementation
* Fix copies naming issues
* Fix up + docs
* Properly same main layer
* Name layers properly
* Fixup
* Fix import
* Fix weight loading for tests whilst not on hub
* Add doc tests and remove to_2tuple
* Add back to_2tuple (removing to_2tuple results in many downstream changes needed because of the copies checks)
* Incorporate updates in Improve vision models #17731 PR
* Don't hard code num_channels
* Copy PyTorch DeiT embeddings and remove pytorch operations with mask
* Fix patch embeddings & tidy up
* Update PixelShuffle to move logic into class layer
* Update doc strings - remove PT references
* Use NHWC format in internal layers
* Fix up
* Use linear activation layer
* Remove unused import
* Apply suggestions from code review
* Move dataclass to top of file
* Remove from_pt now weights on hub
* Fixup

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Amy Roberts <amyeroberts@users.noreply.github.com>
internal	Allow from transformers import TypicalLogitsWarper (#17477)	2022-06-03 11:08:35 +02:00
main_classes	Add Visual Question Answering (VQA) pipeline (#17286)	2022-06-13 07:49:44 -04:00
model_doc	Add TF DeiT implementation (#17806)	2022-07-13 18:04:08 +01:00
tasks	Doc to dataset (#18037)	2022-07-06 12:10:06 -04:00
_config.py	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
_toctree.yml	Sort doc toc (#18034)	2022-07-07 08:17:58 -04:00
accelerate.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
add_new_model.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
add_new_pipeline.mdx	feat: add pipeline registry abstraction (#17905)	2022-06-30 12:11:08 -04:00
autoclass_tutorial.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
benchmarks.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
bertology.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
big_models.mdx	Add link to existing documentation (#17931)	2022-07-04 04:13:05 -04:00
community.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
contributing.md	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
converting_tensorflow_models.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
create_a_model.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
custom_models.mdx	Fix some typos. (#17560)	2022-07-11 05:00:13 -04:00
debugging.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
fast_tokenizers.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
glossary.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
index.mdx	Add TF DeiT implementation (#17806)	2022-07-13 18:04:08 +01:00
installation.mdx	Added Command for windows VENV activation in installation docs (#18008)	2022-07-07 08:18:44 -04:00
migration.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
model_sharing.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
model_summary.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
multilingual.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
notebooks.md	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
pad_truncation.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
perf_hardware.mdx	[WIP] [doc] performance/scalability revamp (#15723)	2022-05-16 13:36:41 +02:00
perf_infer_cpu.mdx	Extend Transformers Trainer Class to Enable PyTorch Torchscript for Inference (#17153)	2022-06-14 07:56:47 -04:00
perf_infer_gpu_many.mdx	Improve performance docs (#17750)	2022-06-23 14:51:54 +02:00
perf_infer_gpu_one.mdx	Improve performance docs (#17750)	2022-06-23 14:51:54 +02:00
perf_infer_special.mdx	Improve performance docs (#17750)	2022-06-23 14:51:54 +02:00
perf_train_cpu.mdx	Extend Transformers Trainer Class to Enable CPU AMP and Integrate Intel Extension for PyTorch (#17138)	2022-06-08 09:41:57 -04:00
perf_train_gpu_many.mdx	Improve performance docs (#17750)	2022-06-23 14:51:54 +02:00
perf_train_gpu_one.mdx	Enable torchdynamo with torch_tensorrt(fx path) (#17765)	2022-07-13 12:43:28 -04:00
perf_train_special.mdx	Improve performance docs (#17750)	2022-06-23 14:51:54 +02:00
perf_train_tpu.mdx	Improve performance docs (#17750)	2022-06-23 14:51:54 +02:00
performance.mdx	Improve performance docs (#17750)	2022-06-23 14:51:54 +02:00
perplexity.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
philosophy.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
pipeline_tutorial.mdx	docs(transformers): fix typo (#17263)	2022-05-16 17:04:30 -04:00
pr_checks.mdx	Add a check on config classes docstring checkpoints (#17012)	2022-04-30 10:40:46 +02:00
preprocessing.mdx	Doc to dataset (#18037)	2022-07-06 12:10:06 -04:00
quicktour.mdx	Fix doc test quicktour dataset (#16929)	2022-04-25 16:26:59 +02:00
run_scripts.mdx	Fix all docs for accelerate install directions (#17145)	2022-05-09 15:45:18 -04:00
sagemaker.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
serialization.mdx	Squash commits (#17981)	2022-07-06 08:11:48 -04:00
task_summary.mdx	[Doctests] Correct task summary (#16644)	2022-04-11 14:59:35 +02:00
testing.mdx	Fix some typos. (#17560)	2022-07-11 05:00:13 -04:00
tokenizer_summary.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00
training.mdx	Doc to dataset (#18037)	2022-07-06 12:10:06 -04:00
troubleshooting.mdx	Enable doc in Spanish (#16518)	2022-04-04 10:25:46 -04:00