Notebooks for introducing the machine learning toolbox MLJ (Machine Learning in Julia)
Based on tutorials originally part of a 3.5 hour online workshop.
-
Familiarity with basic data manipulation in Julia: vectors, tuples, dictionaries, arrays, generating random numbers, tabular data (e.g., DataDrames.jl) basic stats, Distributions.jl.
-
Familiarity with Machine Learning fundamentals and best practice.
-
Part 1 - Data Representation
-
Part 2 - Selecting, Training and Evaluating Models
-
Part 3 - Transformers and Pipelines
-
Part 4 - Tuning hyper-parameters
-
Part 5 - Advanced model composition
The tutorials include links to external resources and exercises with solutions.
-
The tutorials focus on the machine learning part of the data science workflow, and less on exploratory data analysis and other conventional "data analytics" methodology
-
Here "machine learning" is meant in a broad sense, and is not restricted to so-called deep learning (neural networks)
-
The tutorials are crafted to rapidly familiarize the user with what MLJ can do and how to do it, and are not a substitute for a course on machine learning fundamentals. Examples do not necessarily represent best practice or the best solution to a problem.
- Slides from workshop given at ResBaz 2021 (Auckland)
- HelloJulia.jl - Resources from an Introduction to Julia workshop
- DataFrames.jl Cheatsheets
- MLJ Cheatsheet
- Common MLJ Workflows
- MLJ manual
- Data Science Tutorials in Julia
The author and maintainer of this repository is @ablaom. Pluto notebooks have been adapted from the julia scripts by @roland-KA who is also a maintainer.