The goal of this project is to scrape this website to collect the data to analysis 2022 T-20 Cricket World Cup. Then, I used the scraped data to analyze the following question based on the analysis of the Tableau Dashboard:
- Top 10 batsmans of the tournament based on the run scores.
- Top 10 bowlers of the tournament based on the taken wickets.
- Top 5 All rounders of the tournament based on the run scores and taken wickets.
- To find out the Player of The Tournament based on the batting and bowling performance including strike rate and economy rate.
- To find out the best 11 players squad of the tournament based on their performance.
- To find out the Top 10 Sixers.
- To find out the Top 10 Four takers.
- To find out the Top 10 Hard-hitter Batsman.
- To find out the Best Maiden taker bowler.
- To find out the Top 10 Death Over Bowlers.
- To find out the Top 10 extra runs given bowlers.
Here, I scraped 699 and 500 data for Batsmans and Bowlers respectively.
You can look the dashboard at a glance here.
My interesting findings from the Dashboard
Top 10 Batsman of the tournament as per my analysis:
- Virat Kohli
- Max O'Dowd
- Suryakumar Yadav
- Jos Buttler
- Kusal Mendis
- Sikandar Raza
- Pathum Nissanka
- Alex Hales
- Lorcan Tucker
- Glenn Philips
Top 10 Bowlers of the tournament as per my analysis:
- Wanindu Hasaranga de Silva
- Sam Curran
- Bas de Leede
- Blessing Muzarabani
- Shaheen Shah Afridi
- Shadab Khan
- Paul Van Meekeren
- Josh Little
- Anrich Nortje
- Sikandar Raza
Top 5 All-rounders of the tournament as per my analysis:
- Sikandar Raza
- Hardik Pandya
- Dhananjaya de Silva
- Ben Stokes
- Bas de Leede
Player of the tournament as per my analysis:
Sikandar Raza
Taken Runs - 219
Strike Rate - 110.5
Taken Wickets - 10
Economy - 6.89
Best 11 players squad based on the performance of the tournament as per my analysis (I consider strike rate and economy rate to choose players as well):
- Virat Kohli
- Suryakumar Yadav
- Kusal Mendis
- Jos Buttler (c & wk)
- Glenn Philips
- Sikandar Raza
- Hardik Pandya
- Wanindu Hasaranga de Silva
- Sam Curran
- Shaheen Shah Afridi
- Shadab Khan
Top 10 Sixers:
- Sikandar Raza (11)
- Kusal Mendis (10)
- Alex Hales (10)
- Suryakumar Yadav (9)
- Rilee Rossouw (9)
- Marcus Stoinis (9)
- Andy Balbirnie (9)
- Virat Kohli (8)
- Pathum Nissanka (8)
- Max O'Dowd (8)
Top 10 Four takers:
- Suryakumar Yadav (26)
- Virat Kohli (25)
- Jos Buttler (24)
- Max O'Dowd (22)
- Najmul Hossain Shanto (20)
- Lorcan Tucker (19)
- Glenn Phillips (19)
- Alex Hales (19)
- Kusal Mendis (17)
- Sikandar Raza (16)
Top 10 Hard-hitter Batsman (I considered highest Avg. Strike Rates and those batsman who faced at least 30 balls in the tournament):
- Suryakumar Yadav (182.6)
- Glenn Maxwell (161.2)
- Marcus Stoinis (159.2)
- Mohammad Haris (152.2)
- Dawid Malan (151.5)
- Finn Allen (150.5)
- Glenn Phillips (143.7)
- Rahmanullah Gurbaz (134.7)
- Litton Das (131.1)
- Gareth Delany (131.0)
Best Maiden taker bowler:
Bhuvneshwar Kumar
3 Overs maiden
Top 10 Death Over Bowlers (I considered lowest Avg. Economy bowlers and those who did at least 10 overs in the tournament):
- Jason Holder (4.67)
- Zahoor Khan (4.75)
- Bernard Scholtz (5.083)
- Anrich Nortje (5.27)
- Mustafizur Rahman (5.60)
- Mark Watt (5.83)
- Sean Williams (6.093)
- Shaheen Shah Afridi (6.094)
- Adil Rashid (6.125)
- Naseem Shah (6.19)
Top 10 6s consumed Bowlers:
- Wanindu Hasaranga (8)
- Ravichandran Ashwin (8)
- Maheesh Theekshana (8)
- Richard Ngarava (7)
- Logan van Beek (7)
- Barry McCarthy (7)
- Kagiso Rabada (6)
- Josh Hazlewood (6)
- Gareth Delany (6)
- Chris Woakes (6)
Top 10 extra runs given bowlers:
- Mark Adair (17 runs)
- Lahiru Kumara (12 runs)
- Richard Ngarava (10 runs)
- Fred Klaassen (10 runs)
- Maheesh Theekshana (9 runs)
- Chris Woakes (9 runs)
- Bas de Leede (9 runs)
- Barry McCarthy (9 runs)
- Mohammad Wasim ( 8 runs)
- Josh Little (8 runs)
- Clone the repo
git clone https://github.com/armanulalam/2022-T20-WorldCup-Cricket-Tournament-Analysis.git
- Initialize and activate virtual environment
virtualenv --no-site-packages venv source venv/bin/activate
- Install dependencies
pip install -r requirements.txt
- Download the Chrome Webdriver.
- Run the
scrpers/batsmans_scraper.py
andscrpers/bowlers_scraper.py
to scrape the necessary data from the website or Download the two files fromdata/batsman/batsman_summary_scraped.csv
anddata/bowlers/bowler_summary_scraped.csv
.python scrapers/batsmans_scraper.py
python scrapers/bowlers_scraper.py
- Run data_preprocessing.ipynb file in jupyter notebook to preprocess,manipulate and transform the dataset which you found in
data/batsman/batsman_summary_preprocessed.csv
anddata/bowlers/bowler_summary_preprocessed.csv
- Analysis the data through Tableau Public Dashboard based on the
batsman_summary_preprocessed.csv
andbowler_summary_preprocessed.csv
.
If you have any further queries then you can send me an email to [email protected]