Python has become a powerful language for data science and is now widely used as a leading programming language for predictive analytics and artificial intelligence. During this hands-on “Machine Learning with Python” training course, you will learn to use cutting-edge Python libraries for clustering, customer segmentation, predictive analytics and machine learning on real-world data.
The course explores practical applications of the most frequently used machine learning approaches, such as Multiple Linear, Polynomial (non-linear) and Logistic Regression, k-Means and Hierarchical Clustering, k-Nearest Neighbours, Naive Bayes, Decision Trees and ensemble algorithms (e.g. Random Forests, Adaptive Boosting and Extreme Gradient Boosting). You will work with Python’s major scientific libraries such as NumPy, pandas and SciPy, as well as more specialised statistical and machine learning packages, e.g. scikit-learn, statsmodels and h2o.
You will also learn to evaluate predictive models using metrics such as sensitivity, specificity, F-score and Kappa, and to optimise the accuracy and efficiency of these models using various methods of cross-validation, grid search and performance boosting. Please note that this training course doesn’t include Neural Networks and Deep Learning approaches – our “Deep Learning with Python” course is specifically designed to cover these methods in detail.
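To give a flavour of the evaluation metrics named above, here is a minimal sketch in plain Python. The labels are made up for illustration, the function names are hypothetical, and the code is not taken from the course materials. It assumes a binary problem where both classes appear in the true and the predicted labels.

```python
# Illustrative labels only (1 = positive class, 0 = negative class).
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]


def confusion_counts(actual, predicted):
    """Return (tp, fp, fn, tn) for binary labels coded as 0/1."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == 1 and p == 1)
    fp = sum(1 for a, p in pairs if a == 0 and p == 1)
    fn = sum(1 for a, p in pairs if a == 1 and p == 0)
    tn = sum(1 for a, p in pairs if a == 0 and p == 0)
    return tp, fp, fn, tn


def classification_metrics(actual, predicted):
    """Compute common binary classification metrics.

    Assumes both classes occur in the true and predicted labels,
    so none of the denominators is zero.
    """
    tp, fp, fn, tn = confusion_counts(actual, predicted)
    n = tp + fp + fn + tn
    sensitivity = tp / (tp + fn)          # true positive rate (recall)
    specificity = tn / (tn + fp)          # true negative rate
    precision = tp / (tp + fp)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    accuracy = (tp + tn) / n
    # Agreement expected by chance, used by Cohen's Kappa.
    expected = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / n ** 2
    kappa = (accuracy - expected) / (1 - expected)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "f1": f1,
        "accuracy": accuracy,
        "kappa": kappa,
    }


metrics = classification_metrics(y_true, y_pred)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

In practice, libraries such as scikit-learn provide tested implementations of these metrics; the hand-written version is only meant to show what each number measures.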
Who is this course for?
This tutor-led online course is suitable for data scientists, researchers, data analysts, developers and engineers who currently use Python (preferably at intermediate level) and would like to expand their skills to include a machine learning and predictive analytics toolkit. This course is also recommended for undergraduate and postgraduate students in social sciences, computer science, data science, economics, finance/banking, public health and medical science, and all other fields which implement machine and statistical learning methods.
This training course is tutor-led – all online tutorials are presented live by our expert instructor, and you can ask questions, discuss the topic and interact with other learners,
In total, this course includes 15 hours of live teaching, which equates to a 3-day classroom-based training course excluding tea/coffee and lunch breaks,
Exercises and tasks that you complete between the live tutorial sessions will help you retain the material and enhance your learning progress,
Small group size allows easy interaction and creates a stimulating environment for successful learning,
Email supervision and support provided during the course period maximises learning outcomes and improves your learning experience,
You will have access to self-paced online learning materials e.g. tutorial recordings, exercises and quizzes, Python code scripts used during tutorials, example datasets, optional and mandatory reading (e.g. blog articles, academic papers, industry reports), and other external recommended resources e.g. online books.
Your course instructor
Your instructor for this course will be Simon Walkowiak. Simon is a director at Mind Project Limited and a Ph.D. researcher in Artificial Intelligence at the Bartlett Centre for Advanced Spatial Analysis (University College London) and the Alan Turing Institute in London. Simon holds a BSc (First Class Honours) in Psychology with Neuroscience and an MSc (Distinction) in Big Data Science. He conducts and manages research projects on the implementation and computational optimisation of novel AI approaches applicable to large-scale datasets to predict human behaviour and spatial cognition. Simon is the author of “Big Data Analytics with R” (2016) – a widely used textbook on high-performance computing with the R language and its compatibility with the ecosystem of Big Data tools e.g. SQL/NoSQL databases, Spark, Hadoop etc. Apart from research and data management consultancy, during the past several years, Simon has taught more than 150 in-house or open-to-public statistical training courses (in R, Python, SQL and Scala for Spark) in the UK, Europe, Asia and USA. His major clients include organisations from finance and banking (HSBC, RBS, GE Capital, European Central Bank, Credit Suisse, ING etc.), research and academia (GSMA, CERN, University of Cambridge, UK Data Archive, Agri-Food Biosciences Institute, Newcastle University etc.), health (NHS), insurance (Liberty IT), transport (Steer Group) and government (Home Office, Ministry of Justice, Government Actuary’s Department etc.).
This instructor-led course runs over 6 teaching weeks.
In between the six weekly online live tutorials (2.5 hours each) you will improve your skills by watching pre-recorded video tutorials on our Mind Project Learning Platform and working through set tasks (e.g. quizzes) as well as homework coding exercises, which will require 4-6 hours of your time per week (24-36 hours in total). We estimate that the total time commitment is 40-50 hours over 6 teaching weeks.
Start date: Thursday, 24th of February 2022 at 14:00 London (UK) time
Schedule of sessions: Every Thursday at 14:00 London (UK) time for 6 weeks
Deadline for registrations: Tuesday, 22nd of February 2022 at 17:00 London (UK) time
Week 1: Introduction to Machine Learning with Python
- Concepts, terminology and context: unsupervised vs. supervised vs. semi-supervised approaches,
- Overview of methods and applications,
- Preparing data for Machine Learning tasks: revision of probability distributions, data normalisation and standardisation techniques, feature engineering, dealing with missing values,
- Dimensionality reduction with Singular Value Decomposition, Principal Component Analysis and Factor Analysis.
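As an illustration of the normalisation and standardisation techniques listed for Week 1, the following is a minimal sketch using only the Python standard library. The `ages` values and the function names are hypothetical, and the sketch assumes the input values are not all identical (otherwise the denominators would be zero).

```python
from statistics import mean, pstdev


def min_max_normalise(values):
    """Rescale values linearly so they fall in the range [0, 1]."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]


def standardise(values):
    """Convert values to z-scores: zero mean, unit standard deviation.

    Uses the population standard deviation (pstdev).
    """
    mu = mean(values)
    sigma = pstdev(values)
    return [(v - mu) / sigma for v in values]


# Made-up example feature.
ages = [20, 30, 40, 50, 60]

normalised = min_max_normalise(ages)
standardised = standardise(ages)

print("normalised:  ", normalised)
print("standardised:", [round(z, 3) for z in standardised])
```

Normalisation keeps the values within fixed bounds, whereas standardisation expresses each value as a distance from the mean in units of standard deviation; which one is more suitable depends on the algorithm and the data.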
Week 2: Unsupervised learning with clustering approaches
- K-means clustering,
- Hierarchical clustering,
- Evaluating clustering solutions, describing clusters and estimating cluster profiles,
- Overview of other important clustering methods: mean-shift, DBSCAN, and affinity propagation.
Week 3: Predicting continuous data with linear and non-linear models
- Multiple linear regression and selecting suitable predictors with stepwise regression,
- Ridge and lasso regularisation,
- Regression metrics for model evaluation, comparing models,
- Polynomial regression, splines and generalised additive models (GAMs).
Week 4: Binary and multinomial classification – part 1: methods, evaluation metrics, model selection
- Introduction to classification with logistic regression – understanding probabilities and log-odds,
- Model selection and classification metrics: sensitivity, specificity, F-score, Kappa, log-loss, R-squared etc.,
- Cross-validation and re-sampling methods e.g. bootstrapping.
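The relationship between probabilities and log-odds mentioned in the Week 4 outline can be sketched in a few lines of plain Python. The intercept and slope below are made-up numbers chosen for illustration, not fitted coefficients, and the function names are hypothetical.

```python
import math


def log_odds(p):
    """Logit: map a probability in (0, 1) to log-odds on the real line."""
    return math.log(p / (1 - p))


def probability(z):
    """Logistic (sigmoid) function: map log-odds back to a probability."""
    return 1 / (1 + math.exp(-z))


# Hypothetical logistic regression with one predictor:
# log-odds = intercept + slope * x
intercept, slope = -3.0, 0.5
x = 8

z = intercept + slope * x      # log-odds of the positive class for this x
p = probability(z)             # corresponding predicted probability

print(f"log-odds: {z:.2f}, probability: {p:.3f}")
```

A logistic regression model is linear on the log-odds scale, which is why each unit increase in `x` adds `slope` to the log-odds rather than a fixed amount to the probability.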
Week 5: Binary and multinomial classification – part 2: overview of other important approaches
- Distance-based classification: k-Nearest Neighbours algorithm,
- Probabilistic Naive Bayes classifier and kernel-based Support Vector Machines,
- Semi-automated and automated tuning of classification models.
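To illustrate the idea behind distance-based classification from the Week 5 outline, here is a minimal k-Nearest Neighbours sketch in plain Python. The data points, labels and the `knn_predict` helper are hypothetical and deliberately simplified (Euclidean distance, majority vote, no feature scaling).

```python
import math
from collections import Counter


def euclidean(a, b):
    """Euclidean distance between two points of equal dimension."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_predict(train_points, train_labels, query, k=3):
    """Predict the label of `query` by majority vote of its k nearest neighbours."""
    distances = sorted(
        (euclidean(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]


# Made-up training data: two well-separated groups.
points = [(1, 1), (1, 2), (2, 1), (6, 6), (7, 6), (6, 7)]
labels = ["low", "low", "low", "high", "high", "high"]

print(knn_predict(points, labels, (2, 2)))
print(knn_predict(points, labels, (6, 5)))
```

The choice of `k` and of the distance measure are tuning decisions; an odd `k` is a common way to avoid tied votes in binary problems.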
Week 6: From decision trees to ensembles
- Classification and Regression Trees (CART),
- Estimating variable importance,
- Tree-based Random Forests ensemble,
- Bagging and boosting (e.g. Adaptive Boosting – AdaBoost and Extreme Gradient Boosting – XGBoost algorithms).
Course pre-requisites and further instructions
We recommend that all attendees have the most recent version of Anaconda Individual Edition of Python 3.9 (or at least Python 3.5) installed on their PCs (any operating system). Anaconda’s Python is a free and fully-supported distribution and you can download it directly from https://www.anaconda.com/products/individual#Downloads. Please contact us should you have any questions or issues with the installation process. A short list of additional Python libraries to pre-install before the course will be sent to the enrolled attendees in the Welcome Pack alongside other Joining Instructions.
We recommend that attendees have practical experience in data processing or quantitative research, gained from either professional work or university education/research. A good knowledge of statistics would be beneficial. We suggest that the course is preceded by our “Python for Data Analysis” open-to-public tutor-led online training course.
Your PC needs to be connected to a stable WiFi/Internet network (either home or office-based) and have the Zoom video-conferencing application installed.
You will need at least one commonly used web browser installed on your PC (e.g. Chrome, Safari, Firefox, Edge etc.) to access our Mind Project Learning Platform.
The primary spoken and written language of the course is English.
Discounts and multiple bookings
The Early Bird Offer allows you to save up to 30% on the total course enrolment price (on top of other discounts). This offer is usually valid up until 3-4 weeks before the start of the course.
We offer two enrolment options:
- Regular Fee – full-priced enrolment for learners representing commercial organisations or self-funded individuals who do not meet our eligibility criteria for discounted rates (please see below),
- Discounted Fee – applicable to undergraduate and postgraduate students as well as representatives of registered charitable organisations and non-governmental organisations (NGOs) – this category also includes employees of the National Health Service (NHS).
Students and individuals eligible for the Discounted Fee should submit a copy of their student or organisation ID card (with their name and card expiry date visible) when purchasing their place on the course, so that we can verify their eligibility for the discount. Alternatively, eligibility can be verified by submitting either (i) a copy of a letter from the university registrar or student’s department confirming your status, or (ii) a copy of a letter from your employer (on company letterhead with a charity/NGO registration number) which confirms your current position within the organisation.
Apart from the Early Bird Offer and discounted fees for students or employees of charitable organisations and NGOs, we are able to offer further discounts on the overall cost of your training if you wish to attend multiple related courses or enrol several delegates on this specific course. Please note that this offer is only available through our website.
- If you book 5 or more tickets on any of our tutor-led open-to-public online training courses, you will receive a 10% discount on the total price of your booking.
All discounts are calculated automatically when tickets are added to the Cart. For bookings of 6 or more delegates on one course, we recommend that you contact us directly – we may be able to arrange a separate course just for your delegates at a discounted rate.
Arrange this course at your organisation
This open-to-public online course is a more generalised version of our fully-customisable in-house / online training course “Machine Learning with Python”. If your delegates cannot attend this public course, if you are interested in arranging this training course exclusively for your delegates (or at your premises), or if you simply need a bespoke, made-to-measure training solution, please request a quote for the in-house version of this course based on your specific needs and desired outcomes of the training.
You may email us directly at info(at)mindproject.io and include the following information in your enquiry:
- contact details of the person who should receive the quote,
- number of delegates you would like to train,
- approximate number of online sessions (or half-days / full days for an on-site in-house course) you would like to arrange the course for (including additional support/project guidance if needed),
- location of the training venue if not online,
- any details on course customisation or specific topics you would like the course to address – most importantly, please indicate the desired outcomes of the course if different from those presented above,
- any other questions you may have.
If you don’t know the answers to the questions above or you are at an early stage of course planning, we would be happy to arrange an informal chat and help you choose the most suitable and budget-efficient option.