NVIDIA Releases Eighth Generation Of Its Popular Conversational AI Software TensorRT

The latest version of TensorRT brings BERT-Large inference latency down to 1.2 milliseconds.
NVIDIA recently released the eighth generation of its popular AI software, TensorRT, which cuts inference time in half for language queries, enabling developers to build the best-performing search engines, ad recommendations and chatbots and deliver them from the cloud to the edge.

TensorRT 8 is now generally available and free of charge to members of the NVIDIA developer programme. The latest versions of plug-ins, samples and parsers are available on the TensorRT GitHub repository.

https://youtu.be/0e5TRStkkLM

What's new?

The latest version of TensorRT brings BERT-Large inference latency down to 1.2 milliseconds with new optimisations. BERT-Large is one of the world's most widely used transformer-based models. Further, it delivers 2x the accuracy for INT8 precision with quantisation-aware training.
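A typical way to try the new release is to export a trained model to ONNX and build an optimised TensorRT engine from it. The minimal sketch below uses the TensorRT 8 Python API; the file names (bert_large.onnx, bert_large.engine) and the FP16 setting are illustrative assumptions rather than part of NVIDIA's announcement, and a model exported with dynamic input shapes would additionally need an optimisation profile.

```python
# Minimal sketch: build a TensorRT 8 engine from an ONNX model.
# "bert_large.onnx" is a placeholder path; any ONNX export with static
# input shapes works the same way.
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)

# Explicit-batch networks are required for ONNX models.
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
)
parser = trt.OnnxParser(network, logger)

# Parse the ONNX file into a TensorRT network definition.
with open("bert_large.onnx", "rb") as f:
    if not parser.parse(f.read()):
        for i in range(parser.num_errors):
            print(parser.get_error(i))
        raise RuntimeError("Failed to parse the ONNX model")

config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.FP16)  # enable mixed precision if the GPU supports it

# TensorRT 8 can return a serialized engine directly.
serialized_engine = builder.build_serialized_network(network, config)
with open("bert_large.engine", "wb") as f:
    f.write(serialized_engine)
```

The saved engine can then be loaded with the TensorRT runtime for inference; the same build step can also be done from the command line with the trtexec tool that ships with TensorRT.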