About Me
I'm a self-motivated, hardworking and positive Data Scientist with a drive for continuous improvement and an ambition to fulfil a career that enables development of technical and interpersonal skills. Having taken on a number of varied roles within large-scale aerospace programmes, I have experience of data analysis, data service delivery and defining technology strategy.
I love developing and applying my technical skills to tackle interesting problems and draw meaningful insights from data. In particular, I'm passionate about the role of data analytics in football. I've set this page up to give an overview of some of the work I've undertaken in my spare time!
Projects
Listed below are a number of projects that I have undertaken, either for personal interest or to complement learning (see Courses & Learning section). Click on a project image to open its source repository or site, which contains source code and additional information:
Having decided on a move to London, I found myself trawling through endless pages of rental property listings on Rightmove. On a mission to find a value-for-money home, I realised that I had subconsicously formulated biases and hypothesese about the market. I therefore produced a Tableau dashboard to test these hypothesese and analyse the London rental market, clearly presenting market trends whilst providing the information I required to find a suitable home. This work also involves the development of a Rightmove data extraction tool to acquire property listing data.
  Web scraping: BS4  
  HTML basics  
  Data processing: pandas  
  Data engineering - Joins  
  Data vis.: Tableau  
A collection of Python projects focusing on the analysis and visualisation of football data, including scraping data from informative football websites and processing large public-domain match event datasets. This work includes production of appealing visuals and assessments of player performance, as well as the development of expected goals models using supervised machine learning techniques.
  Data processing: pandas  
  Data vis.: Matplotlib  
  Web scraping: BS4 & Selenium  
  Package & Module development  
  Supervised ML - Linear Regression  
  Supervised ML - Logistic Regression  
  Supervised ML - Neural Networks  
Implementation of a variety of Machine Learning algorithms from scratch in Python, drawing inspiration from books and tutorials. This works inolves the application of Object-Oriented Programming to produce supervised and unsupervised ML models, with the neccessary testing completed in order to demonstrate performance. Each algorithm is represented as an object with fit, predict and visualise methods that not only provide functionality, but give the user a visual representation of the statistical methods applied.
  Package & Module development  
  Module documentation  
  Object-oriented Programming  
  Supervised ML - Linear Regression  
  Supervised ML - kNN  
  Supervised ML - SVMs  
  Unsupervised ML - k Means  
  Unsupervised ML - Mean Shift  
Courses & Learning
I have completed a number of courses and learning activities to improve my competence and
proficiency as a Data Scientist. These are listed below, sorted by date of completion
(most recent top), along with key learning and certificates as appropriate. When viewing
on mobile, a landscape orientation is recommended.
Course | Summary of Learning | Evidence | |
---|---|---|---|
Stanford Online - Machine Learning Specialisation | Supervised Learning, Linear regression, Logistic regression, Regularisation, Gradient Descent, Tensorflow, Neural Networks, Decision trees, Tree ensembles, Anomaly detection, Unsupervised learning, Recommender Systems, Reinforcement Learning | Certificate | |
Udemy - Tableau for Data Science | Time series, Aggregation, Filters, Joining, Blending, Relationships, Visualising data, Geographic data, Table calculations, LOD calculations, Clustering, Dashboards & stories. | Certificate | |
Udemy - Python for Data Science & Machine Learning | Numpy, Pandas, Matplotlib, Seaborn, Sci-kit learn, Supervised ML, Unsupervised ML, Tensorflow, Keras, Deep learning, Neural networks, PCA, Recommender systems, NLP, Big data. | Certificate | |
Udemy - Complete Python Bootcamp | Jupyter notebooks, Data types, Basic operations, Methods, Functions, OOP, Modules & packages, Numpy, Pandas, Matplotlib, Handling errors, Decorators, Generators, Importing files, Web scraping, Image processing. | Certificate Coursework |
Contact Me
Please use the icons at the bottom of the navigation pane to get in touch via Twitter, GitHub, Linked-in, Tableau public or Email.