Forest Cover Classification: Weekend Hackathon #12

Another weekend, another exciting hackathon, and this time with an open dataset. Yes, you heard that right!

In this weekend hackathon, we are using an open dataset, with some noise added to the target variable to keep the competition fair. Participants are given 54 distinguishing features (the training file has 55 columns, including the target) to build a classification model that predicts the forest cover type.

The goal of this competition is to predict the forest cover type (the predominant kind of tree cover) from strictly cartographic variables (as opposed to remotely sensed data).

The challenge starts on Friday, July 17th, at 6 pm IST.

Problem Statement & Description

The dataset is taken from the UCI Machine Learning Repository, but to keep the competition fair, we have added some noise to the labels. In this hackathon, we challenge all Machinehackers to predict the forest cover type (the predominant kind of tree cover) from strictly cartographic variables (as opposed to remotely sensed data).

The actual forest cover type for a given 30 x 30 meter cell was determined from US Forest Service (USFS) Region 2 Resource Information System data. Independent variables were then derived from data obtained from the US Geological Survey and the USFS.

The data is in raw form (not scaled) and contains binary, one-hot-encoded columns for qualitative independent variables such as wilderness area and soil type.
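For tree-based models it can be convenient to fold a group of one-hot columns back into a single categorical column. The sketch below shows one way to do that with pandas. The column prefixes `Soil_Type` and `Wilderness_Area` follow the naming commonly used for the UCI version of this dataset and are an assumption here; check the header of Train.csv before relying on them. The helper name `collapse_one_hot` is hypothetical.

```python
import pandas as pd


def collapse_one_hot(df, prefix, new_col):
    """Replace the one-hot columns starting with `prefix` by one integer column.

    Assumes the columns are named like `<prefix><number>` (e.g. Soil_Type7),
    which is an assumption about the file, not something the brief confirms.
    """
    cols = [c for c in df.columns if c.startswith(prefix)]
    # A valid one-hot group has exactly one 1 in every row.
    if not (df[cols].sum(axis=1) == 1).all():
        raise ValueError(f"columns with prefix {prefix!r} are not strictly one-hot")
    out = df.drop(columns=cols)
    # idxmax gives the name of the column holding the 1; keep its numeric suffix.
    out[new_col] = df[cols].idxmax(axis=1).str[len(prefix):].astype(int)
    return out


# Tiny made-up frame, only to illustrate the call.
demo = pd.DataFrame(
    {"Elevation": [2596, 2804], "Soil_Type1": [0, 1], "Soil_Type2": [1, 0]}
)
print(collapse_one_hot(demo, "Soil_Type", "Soil_Type"))
```

Whether this helps depends on the model: linear models usually prefer the original binary columns, so treat the collapsed column as an optional extra feature representation.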

You are given 54 distinguishing features (55 columns in the training file, including the target) that can predict the forest cover type. Your objective as a data scientist is to build a machine learning model that accurately classifies the forest cover type (the predominant kind of tree cover) from strictly cartographic variables.

Data Description

The unzipped folder will have the following files.

  • Train.csv –  29050 rows x 55 columns
  • Test.csv –  551962 rows x 54 columns
  • Sample Submission – Sample format for the submission.

Target Variable: Cover_Type
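Since the training file has one more column than the test file, a quick sanity check after unzipping is to confirm that the only extra column is the target. The sketch below uses only the standard library; the function name `extra_train_columns` is hypothetical, and the in-memory CSV text is made up for illustration.

```python
import csv
import io


def extra_train_columns(train_file, test_file):
    """Return the columns found in the training header but not in the test header."""
    train_header = next(csv.reader(train_file))
    test_header = next(csv.reader(test_file))
    test_columns = set(test_header)
    return [c for c in train_header if c not in test_columns]


# In-memory stand-ins for the real files (contents are invented).
train_csv = io.StringIO("Elevation,Slope,Cover_Type\n2596,3,5\n")
test_csv = io.StringIO("Elevation,Slope\n2590,2\n")
print(extra_train_columns(train_csv, test_csv))  # ['Cover_Type']

# With the real files, assuming they sit in the working directory:
# with open("Train.csv", newline="") as tr, open("Test.csv", newline="") as te:
#     print(extra_train_columns(tr, te))
```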

The datasets will be made available for download on Friday, July 17th, at 6 pm IST.

This hackathon and the bounty will expire on Monday, July 20th, at 7 am IST.

Below are the file formats for the provided data.

Train.csv

A glimpse of the training set; not all features are shown.

Test.csv

A glimpse of the test set; not all features are shown.

Sample_Submission.xlsx

A glimpse of the sample submission.

Bounties

The top 3 competitors will receive a free pass to the Computer Vision DevCon 2020.

Learn more about the Computer Vision DevCon 2020.

Rules

  1. One account per participant. Submissions from multiple accounts will lead to disqualification.
  2. The submission limit for the hackathon is 10 per day; submissions beyond this limit will not be evaluated.
  3. All registered participants are eligible to compete in the hackathon.
  4. This competition counts towards your overall ranking points.
  5. You will not be able to submit once you click the “Complete Hackathon” button. You may ignore this feature.
  6. We ask that you respect the spirit of the competition and do not cheat.
  7. This hackathon will expire on Monday, July 20th, at 7 am IST.
  8. Use of any external dataset is prohibited and will lead to disqualification.

Evaluation

Submissions are ranked on the leaderboard by multi-class log loss (lower is better).
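Multi-class log loss is the average negative log of the probability a model assigns to the true class, so it rewards well-calibrated probabilities rather than hard labels. The following is a minimal standard-library sketch for checking scores locally. The clipping constant `eps` and the row renormalisation are common conventions, not confirmed details of how the leaderboard computes the metric, and the three-class labels are made up for illustration.

```python
import math


def multiclass_log_loss(y_true, y_prob, labels, eps=1e-15):
    """Mean negative log-probability assigned to the true class.

    y_true: sequence of true labels
    y_prob: sequence of rows, one probability per entry in `labels`
    labels: class labels, in the same order as the columns of y_prob
    eps:    clipping constant (an assumption; the platform may differ)
    """
    index = {label: i for i, label in enumerate(labels)}
    total = 0.0
    for true_label, probs in zip(y_true, y_prob):
        # Clip to avoid log(0), then renormalise so the row sums to 1.
        clipped = [min(max(p, eps), 1 - eps) for p in probs]
        row_sum = sum(clipped)
        total -= math.log(clipped[index[true_label]] / row_sum)
    return total / len(y_true)


# A uniform guess over K classes scores ln(K): a useful baseline to beat.
uniform = [[1 / 3, 1 / 3, 1 / 3]] * 2
print(round(multiclass_log_loss([1, 3], uniform, labels=[1, 2, 3]), 4))  # 1.0986, i.e. ln(3)
```

Because a confident wrong prediction is penalised heavily, and the labels here contain added noise, it may be worth avoiding probabilities very close to 0 or 1 in a submission.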

Anurag Upadhyaya
Experienced Data Scientist with a demonstrated history of working in the Industrial IoT (IIoT), Industry 4.0, Power Systems and Manufacturing domains. I have experience in designing robust solutions for various clients using Machine Learning, Artificial Intelligence, and Deep Learning. I have been instrumental in developing end-to-end solutions from scratch and deploying them independently at scale.
