Mirror of https://github.com/huggingface/transformers.git (synced 2025-07-15 02:28:24 +06:00)
* Copy RoBERTa
* formatting
* implement RoBERTa with pre-layer normalization
* update test expectations
* add documentation
* add conversion script for DinkyTrain weights
* update checkpoint repo (unfortunately the original checkpoints assume a hacked RoBERTa model)
* add RoBERTa-PreLayerNorm docs to toc
* run utils/check_copies.py
* lint files
* remove unused import
* fix check_repo wrongly reporting a missing test
* fix import error caused by rebase
* run make fix-copies
* add RobertaPreLayerNormConfig to ROBERTA_EMBEDDING_ADJUSMENT_CONFIGS
* fix documentation: <Facebook> -> Facebook
* fixup: fix documentation: <Facebook> -> Facebook
* add missing Flax header
* expected_slice -> EXPECTED_SLICE
* update copies after rebase
* add missing "Copied from" statements
* make fix-copies
* make pre-layer norm explicit in code
* fix checkpoint path for the original implementation
* add Flax integration tests
* improve docs
* update utils/documentation_tests.txt
* lint files
* remove Copyright notice
* make fix-copies
* remove EXPECTED_SLICE calculation comments

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
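The key change this PR describes is moving LayerNorm *before* each sublayer instead of after it. A minimal NumPy sketch of that difference (an illustration of the technique only, not the actual Transformers modeling code; `sublayer` stands in for attention or the feed-forward network):

```python
import numpy as np

def layer_norm(x, eps=1e-5):
    """Normalize over the hidden (last) dimension."""
    mu = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return (x - mu) / np.sqrt(var + eps)

def post_ln_block(x, sublayer):
    # Original RoBERTa/BERT ordering: residual add, then normalize.
    return layer_norm(x + sublayer(x))

def pre_ln_block(x, sublayer):
    # RoBERTa-PreLayerNorm ordering: normalize first, then residual add.
    return x + sublayer(layer_norm(x))

x = np.random.randn(2, 8, 16)          # (batch, sequence, hidden)
sublayer = lambda h: np.tanh(h)        # hypothetical stand-in sublayer
print(pre_ln_block(x, sublayer).shape)  # (2, 8, 16)
```

Both orderings preserve the tensor shape; they differ only in where the normalization sits relative to the residual connection, which is why the checkpoints are incompatible with the stock RoBERTa model class.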
internal
main_classes
model_doc
tasks
_config.py
_toctree.yml
accelerate.mdx
add_new_model.mdx
add_new_pipeline.mdx
add_tensorflow_model.mdx
autoclass_tutorial.mdx
benchmarks.mdx
bertology.mdx
big_models.mdx
community.mdx
contributing.md
converting_tensorflow_models.mdx
create_a_model.mdx
custom_models.mdx
debugging.mdx
fast_tokenizers.mdx
glossary.mdx
hpo_train.mdx
index.mdx
installation.mdx
migration.mdx
model_sharing.mdx
model_summary.mdx
multilingual.mdx
notebooks.md
pad_truncation.mdx
perf_hardware.mdx
perf_infer_cpu.mdx
perf_infer_gpu_many.mdx
perf_infer_gpu_one.mdx
perf_infer_special.mdx
perf_train_cpu_many.mdx
perf_train_cpu.mdx
perf_train_gpu_many.mdx
perf_train_gpu_one.mdx
perf_train_special.mdx
perf_train_tpu.mdx
performance.mdx
perplexity.mdx
philosophy.mdx
pipeline_tutorial.mdx
pipeline_webserver.mdx
pr_checks.mdx
preprocessing.mdx
quicktour.mdx
run_scripts.mdx
sagemaker.mdx
serialization.mdx
task_summary.mdx
testing.mdx
tokenizer_summary.mdx
torchscript.mdx
training.mdx
troubleshooting.mdx