Google releases FormNet; works on improving how text is read in forms

FormNet outperforms existing methods using less pre-training data and achieves SOTA performance on the CORD, FUNSD, and Payment benchmarks.

Research scientists from the Cloud AI team of Google Research published a blog saying that the recent sequence modelling that directly models relationships between all words in a text selection has demonstrated SOTA performance on natural language tasks.

A natural approach to handling form document understanding tasks is first to serialise the form documents and then apply SOTA sequence models. However, form documents often have complex layouts that contain structured objects like – tables, columns, and text blocks. Their variety of layout patterns makes serialisation difficult and limits the performance of strict serialisation approaches. These unique challenges in form document structural modelling have been underexplored.

An illustration of the form document information extraction task using an example from the FUNSD dataset.

Image – Google blog 

The paper by research scientists Chen-Yu Lee, Chun-Liang Li and co-authors, “FormNet: Structural Encoding Beyond Sequential Modeling in Form Document Information Extraction”, proposed a structure-aware sequence model, called FormNet, to mitigate the sub-optimal serialisation of forms for document information extraction. 

They explained their process like this: To begin with, they designed a Rich Attention (RichAtt) mechanism that leverages the 2D spatial relationship between word tokens for attention weight calculation. Then, they constructed Super-Tokens for each word by embedding representations from their neighbouring tokens through a graph convolutional network. In the end, they demonstrated that FormNet outperforms existing methods using less pre-training data and achieves SOTA performance on the CORD, FUNSD, and Payment benchmarks.

Download our Mobile App

Subscribe to our newsletter

Join our editors every weekday evening as they steer you through the most significant news of the day.
Your newsletter subscriptions are subject to AIM Privacy Policy and Terms and Conditions.

Our Recent Stories

Our Upcoming Events

3 Ways to Join our Community

Telegram group

Discover special offers, top stories, upcoming events, and more.

Discord Server

Stay Connected with a larger ecosystem of data science and ML Professionals

Subscribe to our Daily newsletter

Get our daily awesome stories & videos in your inbox
MOST POPULAR