Researchers Release An AI-Based Tool To Summarise Research Papers

Researchers Release An AI-Based Research Paper Summariser

In an attempt to simplify the process of summarising complex scientific research papers, researchers at the Allen Institute for Artificial Intelligence have released a new AI-based tool that summarises the text from scientific papers and present it in a few sentences.

Considering scientific research papers are complex to understand because of the language it is presented in, it becomes a challenge for many who are willing to work on the same or trying to be updated with scientific literature. And, that is why the researchers from Allen Institute for Artificial Intelligence came out with this new AI-based model — Semantic Scholar — that automatically generates a single-sentence summary using GPT-3 style techniques. This helps in locating the right paper and deciding whether to dedicate time to read that complex paper or not, stated by the official website.

Credit: Semantic Scholar // The NextWeb


Sign up for your weekly dose of what's up in emerging technology.

Researchers released this new feature is available in beta for over 10 million papers in the computer science domain. According to the news media, currently, visitors will be able to see these TLDR (Too Long; Didn’t Read) summaries of the papers on Allen Institute’s Semantic Scholar search engine.

These AI-generated summaries will help the readers make quick, informed decisions about the relevancy of the scientific and where to invest the time in further reading. These summaries explain the work in various contexts, such as sharing a paper on social media.

Download our Mobile App

According to the news media, the AI collates the most important parts of the paper from the key sections of abstract, introduction, and conclusion to create the summary.

According to the research paper by the institute — to build this model, the researchers first pre-trained the model on the English language and then created a SciTLDR data set of 3200 documents, including over 5,400 summaries of computer science papers. Further, the model has been trained on more than 20,000 titles of research papers to reduce dependency on domain knowledge while writing a synopsis. Once the training was done, the model was able to summarise documents over 5,000 average words in the document with just 21 average words in summary, which created a compression ratio of 238.1. 

With such advancements, the researchers are now looking to expand this AI-based solution to other fields beyond computer science.
Read more about the research here.

More Great AIM Stories

Sejuti Das
Sejuti currently works as Associate Editor at Analytics India Magazine (AIM). Reach out at

AIM Upcoming Events

Conference, in-person (Bangalore)
Rising 2023 | Women in Tech Conference
16-17th Mar, 2023

Early Bird Passes expire on 10th Feb

Conference, in-person (Bangalore)
Data Engineering Summit (DES) 2023
27-28th Apr, 2023

3 Ways to Join our Community

Telegram group

Discover special offers, top stories, upcoming events, and more.

Discord Server

Stay Connected with a larger ecosystem of data science and ML Professionals

Subscribe to our Daily newsletter

Get our daily awesome stories & videos in your inbox