transformers/docs/source
Ankur Goyal 2ef7742117
Add DocumentQuestionAnswering pipeline (#18414)
* [WIP] Skeleton of VisualQuestionAnweringPipeline extended to support LayoutLM-like models

* Fixup

* Use the full encoding

* Basic refactoring to DocumentQuestionAnsweringPipeline

* Cleanup

* Improve args, docs, and implement preprocessing

* Integrate OCR

* Refactor question_answering pipeline

* Use refactored QA code in the document qa pipeline

* Fix tests

* Some small cleanups

* Use a string type annotation for Image.Image

* Update encoding with image features

* Wire through the basic docs

* Handle invalid response

* Handle empty word_boxes properly

* Docstring fix

* Integrate Donut model

* Fixup

* Incorporate comments

* Address comments

* Initial incorporation of tests

* Address Comments

* Change assert to ValueError

* Comments

* Wrap `score` in float to make it JSON serializable

* Incorporate AutoModeLForDocumentQuestionAnswering changes

* Fixup

* Rename postprocess function

* Fix auto import

* Applying comments

* Improve docs

* Remove extra assets and add copyright

* Address comments

Co-authored-by: Ankur Goyal <ankur@impira.com>
2022-09-07 13:38:49 -04:00
..
de Create pipeline_tutorial.mdx german docs (#18625) 2022-09-01 09:57:59 +02:00
en Add DocumentQuestionAnswering pipeline (#18414) 2022-09-07 13:38:49 -04:00
es Spanish translation of summarization.mdx (#15947) (#18477) 2022-08-08 15:54:11 -04:00
it [LayoutLMv3] Add TensorFlow implementation (#18678) 2022-08-30 11:48:11 +01:00
pt Just re-reading the whole doc every couple of months 😬 (#18489) 2022-08-06 09:38:55 +02:00
_config.py Use doc builder styler (#16412) 2022-03-28 07:45:18 -04:00