Last updated April 15, 2024

DeepLearning.AI Launches New Course on Unstructured Data Handling for LLMs

Taught by Matt Robinson, head of product at Unstructured, the course is free for a limited time and takes about an hour to complete.

Published on April 11, 2024

by Shritama Saha

Andrew Ng has rolled out a new course called “Preprocessing Unstructured Data for LLM Applications,” this time in collaboration with San Francisco-based startup Unstructured. Unstructured essentially captures unstructured data wherever it is stored and transforms it into AI-friendly JSON files for companies eager to incorporate AI into their business.

You’ll learn to extract and standardise content from a range of document types, including PDF, PowerPoint, Word and HTML files, as well as tables and images, into a common JSON format. This broadens the range of information available to your LLM applications. Enriching that content with metadata improves retrieval-augmented generation (RAG) results and enables more nuanced search capabilities.
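To make the idea of a common JSON format concrete, here is a minimal sketch using only Python’s standard library. It is not the Unstructured library’s API or schema: the `ElementExtractor` class, the `TAG_TO_TYPE` mapping, the `html_to_elements` helper and the metadata field names are all hypothetical, chosen for illustration. It turns a small HTML snippet into a flat list of typed elements, each carrying metadata a retriever could filter on.

```python
import json
from html.parser import HTMLParser

# Hypothetical mapping from HTML tags to element types (an assumption for
# this sketch, not the schema any particular library uses).
TAG_TO_TYPE = {"h1": "Title", "h2": "Title", "p": "NarrativeText", "li": "ListItem"}


class ElementExtractor(HTMLParser):
    """Collects text from selected tags as a flat list of typed elements."""

    def __init__(self, filename):
        super().__init__()
        self.filename = filename
        self.elements = []
        self._current_tag = None
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        # Start buffering text when a tag we care about opens.
        if tag in TAG_TO_TYPE:
            self._current_tag = tag
            self._buffer = []

    def handle_data(self, data):
        if self._current_tag is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == self._current_tag:
            # Collapse runs of whitespace so the text is normalised.
            text = " ".join("".join(self._buffer).split())
            if text:
                self.elements.append({
                    "type": TAG_TO_TYPE[tag],
                    "text": text,
                    "metadata": {
                        "filename": self.filename,
                        "filetype": "text/html",
                        "tag": tag,
                    },
                })
            self._current_tag = None


def html_to_elements(html, filename):
    """Parse an HTML string into a list of JSON-serialisable element dicts."""
    parser = ElementExtractor(filename)
    parser.feed(html)
    parser.close()
    return parser.elements


sample = "<h1>Quarterly Report</h1><p>Revenue grew  in Q1.</p><ul><li>Item one</li></ul>"
elements = html_to_elements(sample, "report.html")
print(json.dumps(elements, indent=2))

# Metadata and element types allow filtering before retrieval, e.g. titles only.
titles = [e["text"] for e in elements if e["type"] == "Title"]
```

The same element shape could in principle be produced from PDFs or slides by other parsers, which is what would let a downstream RAG pipeline treat every source uniformly.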

The course covers techniques for document image analysis, including layout detection, vision transformers and table transformers. You’ll discover how to apply these methods to preprocess PDFs, images, and tables. It is suitable for anyone interested in effectively processing diverse data types and formats to build high-performing LLM RAG systems.

Shritama Saha

Shritama (she/her) is a technology journalist at AIM who is passionate about exploring the influence of AI on different domains, including fashion, healthcare and banking.

© Analytics India Magazine Pvt Ltd & AIM Media House LLC 2024