MITB Banner

Google releases FormNet; works on improving how text is read in forms

FormNet outperforms existing methods using less pre-training data and achieves SOTA performance on the CORD, FUNSD, and Payment benchmarks.

Share

Research scientists from the Cloud AI team of Google Research published a blog saying that the recent sequence modelling that directly models relationships between all words in a text selection has demonstrated SOTA performance on natural language tasks.

A natural approach to handling form document understanding tasks is first to serialise the form documents and then apply SOTA sequence models. However, form documents often have complex layouts that contain structured objects like – tables, columns, and text blocks. Their variety of layout patterns makes serialisation difficult and limits the performance of strict serialisation approaches. These unique challenges in form document structural modelling have been underexplored.

An illustration of the form document information extraction task using an example from the FUNSD dataset.

Image – Google blog 

The paper by research scientists Chen-Yu Lee, Chun-Liang Li and co-authors, “FormNet: Structural Encoding Beyond Sequential Modeling in Form Document Information Extraction”, proposed a structure-aware sequence model, called FormNet, to mitigate the sub-optimal serialisation of forms for document information extraction. 

They explained their process like this: To begin with, they designed a Rich Attention (RichAtt) mechanism that leverages the 2D spatial relationship between word tokens for attention weight calculation. Then, they constructed Super-Tokens for each word by embedding representations from their neighbouring tokens through a graph convolutional network. In the end, they demonstrated that FormNet outperforms existing methods using less pre-training data and achieves SOTA performance on the CORD, FUNSD, and Payment benchmarks.

Share
Picture of Poornima Nataraj

Poornima Nataraj

Poornima Nataraj has worked in the mainstream media as a journalist for 12 years, she is always eager to learn anything new and evolving. Witnessing a revolution in the world of Analytics, she thinks she is in the right place at the right time.
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Courses & Careers

Become a Certified Generative AI Engineer

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Subscribe to Our Newsletter

The Belamy, our weekly Newsletter is a rage. Just enter your email below.