The application of data mining and analysis techniques is not new to football. However, it has not been exploited to its potential by football clubs in India.
A team of Industrial Engineering students from College of Engineering Trivandrum, completed a project on Football Analytics. The football analytics system developed includes methods for data collection from matches, algorithms to extract meaningful results and patterns of play from the data and methods of visualization of the findings. The patterns in passing and movement of players can be used as a tool for building the right strategy against the opposition. This vital information about the team and players makes the difference between winning and losing.
Attached below is the report of the match between Atletico Paranaense and Shamrock Rovers which was analysed using this system.
MATCH REPORT (FIRST HALF)
ATLETICO PARANAENSE vs SHAMROCK ROVERS
Match : 1st Semi-final
Time : 19:00 hrs
Date : 18 February, 2016
Venue : EMS Corporation Stadium, Kozhikode, Kerala
Tournament : Sait Nagjee International Club Football 2016
The algorithms when implemented on the datasets collected gave the following results:
- GENERAL STATISTICS
Total Passes in first half : 335
- Total Passes : 338 (63.2%)
- Successful passes : 273 (80.77%)
- Failed passes : 65 (19.23%)
- Very short passes : 44 (13%)
- Short passes : 164 (49%)
- Long passes : 102 (30%)
- Long overhead pass : 28 (8%)
- Total Passes : 197 (36.8%)
- Successful passes : 134 (68.1%)
- Failed passes : 63 (31.9%)
- Very short passes : 20 (10%)
- Short passes : 90 (46%)
- Long passes : 55 (28%)
- Long overhead pass : 32 (16%)
PASSES VISUALISATION OF ATLETICO PARANAENSE
All the passes made in a match can be visualised, helping teams and players know their strengths and weaknesses in passing. Different passes visualisation have been done by classifying passes on the basis of outcome of the pass, the player that makes the pass, the type of pass based on distance of the pass namely very short pass, short pass, long pass and long overhead pass.
- Center defenders : Players 3 and 13
- Wing defenders : Players 2 and 6
- Midfielders : Players 5 and 17
- Forwards : Players 8, 10 and 16
- Striker : Player 11
- Successful Passes : Blue colour
- Failed Passes : Red colour
Base on length of passes:
- Very short passes : less than 7 meters
- Short passes : 7 to 21 meters
- Long passes : greater than 21 meters
Passing combination identifies the number of passes made between different players of the same team and can reveal the passing patterns of a player, who he passes to and from whom he receives passes frequently. The rows represent the player at the ‘from’ end of the pass and columns represent the player at the ‘to’ end of the pass.
Table 1: Passing combination of Atletico Paranaense
DISTANCE AND SPEED OF PLAYERS
The distance run by and speed of the players were calculated using algorithms from the tracking data of a few minutes.
Table 2: Distance run and average speed of players of Atletico Paranaense
|Player Jersey number||Distance in meters||Speed in meters/second|
Speed profile of a player shows the sprints made by the player during the game. Gradual dip in the average speed values towards the end of the game for ageing players can be used to substitute them.
AVERAGE TEAM FORMATION
Team formation is the arrangement of players of a team on the field. The formation varies depending on the strategy of the team. The team formation of Atletico Paranaense for this match was 4-2-3-1.
Even though the players have an assigned region to play, it was found they move significantly from their positions. The average team formation can plot their mean position of play.
The statistics generated reveals areas of strength and weaknesses of the team. The visualisation of passes show patterns of play of the team and the players. The distance run by players and their speed profile can be used to substitute the players that show a below average performance. The passing combination reveals the sync between different players around whom the game revolves.
Sushant T. Eapen (email@example.com)
I did this project on Football Analytics out of my passion for football, analytics, coding and sports photography. Quite a crazy combination!
The other members of my team that did this project are:
- Cyriac Pius
- Harikrishnan Menoth
- Navaneeth K S
Register for our upcoming events:
- Meetup: NVIDIA RAPIDS GPU-Accelerated Data Analytics & Machine Learning Workshop, 18th Oct, Bangalore
- Join the Grand Finale of Intel Python HackFury2: 21st Oct, Bangalore
- Machine Learning Developers Summit 2020: 22-23rd Jan, Bangalore | 30-31st Jan, Hyderabad