DI-2021 @ KDD 2021: Yunyao Li Talk Recording

Recording of invited talk 3/6 in Document Intelligence Workshop @ KDD2021 given by Yunyao Li, Distinguished Research Staff Member and Senior Research Manager at IBM Research.
Title: Towards Deep Table Understanding https://youtu.be/UT2wzBEJAWk
Abstract: Harvesting information from complex documents such as in financial reports and scientific publications is critical to building AI applications for business and research. Such documents are often in PDF format with critical facts and data conveyed in table and graphs. Extracting such information is essential to extract insights from these documents. In IBM Research, we have a rich agenda in this area that we call Deep Document Understanding. In this talk, I will focus on our research on Deep Table Understanding — extracting and understanding tables from PDF documents. I will introduce key challenges in table extraction and understanding and how we address such challenges, from how to acquire data at scale to enable deep neural network models to how to build, customize and evaluate such models. I will also describe how our work enables real-world use cases in domains such as finance and life science. Finally, I will briefly present TableQA, an important downstream task enabled by Deep Table Understanding.
Program committee (alphabetical): Doug Burdick, Dave Lewis, Yijuan Lu, Hamid Motahari, Sandeep Tata Chair: Benjamin Han
Originally posted on LinkedIn.