transformers/docs/source/en/main_classes
Ankur Goyal 2ef7742117
Add DocumentQuestionAnswering pipeline (#18414)
* [WIP] Skeleton of VisualQuestionAnweringPipeline extended to support LayoutLM-like models

* Fixup

* Use the full encoding

* Basic refactoring to DocumentQuestionAnsweringPipeline

* Cleanup

* Improve args, docs, and implement preprocessing

* Integrate OCR

* Refactor question_answering pipeline

* Use refactored QA code in the document qa pipeline

* Fix tests

* Some small cleanups

* Use a string type annotation for Image.Image

* Update encoding with image features

* Wire through the basic docs

* Handle invalid response

* Handle empty word_boxes properly

* Docstring fix

* Integrate Donut model

* Fixup

* Incorporate comments

* Address comments

* Initial incorporation of tests

* Address Comments

* Change assert to ValueError

* Comments

* Wrap `score` in float to make it JSON serializable

* Incorporate AutoModeLForDocumentQuestionAnswering changes

* Fixup

* Rename postprocess function

* Fix auto import

* Applying comments

* Improve docs

* Remove extra assets and add copyright

* Address comments

Co-authored-by: Ankur Goyal <ankur@impira.com>
2022-09-07 13:38:49 -04:00
..
callback.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
configuration.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
data_collator.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
deepspeed.mdx Fix broken link DeepSpeed documentation link (#18783) 2022-08-28 19:32:19 -07:00
feature_extractor.mdx Update feature extractor docs (#18324) 2022-07-27 15:32:57 -05:00
keras_callbacks.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
logging.mdx logging documentation update (#17174) 2022-05-16 16:47:28 -04:00
model.mdx [bnb] Minor modifications (#18631) 2022-08-17 00:48:10 +02:00
onnx.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
optimizer_schedules.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
output.mdx Minor typo in prose of model outputs documentation. (#18848) 2022-09-01 12:05:40 +02:00
pipelines.mdx Add DocumentQuestionAnswering pipeline (#18414) 2022-09-07 13:38:49 -04:00
processors.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
text_generation.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
tokenizer.mdx documentation: some minor clean up (#16850) 2022-04-26 16:56:08 -04:00
trainer.mdx mac m1 mps integration (#18598) 2022-08-16 16:34:51 +05:30