MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding

на сайте с June 14, 2023 21:39
Multimodal pre-training with text, layout, and image has made significant progress for Visually Rich Document Understanding (VRDU), especially the fixed-layout documents such as scanned document...