Dr Eugene Dubossarsky

Eugene is Managing Partner of AlphaZetta Academy and a leader in the analytics field in Australia, with 20 years’ commercial data science experience. He is the head of the Sydney Data Science group (3,000+ members), the Sydney Users of R Forum (1,900+ members), and Datapreneurs (400+ members). He is regularly invited to be a conference presenter, consultant and advisor, and appears in print and on television to discuss data science and analytics. Eugene also applies data science in an entrepreneurial setting, to financial trading and online startups, and is the creator of ggraptR, an interactive visualisation package in R.

He is a Director at Presciient, which provides analytics capability, development services including team selection, training, and executive coaching for team owners and sponsors. Presciient also provides strategic advisory, communications and specialised advanced data analysis.

Data Literacy for Everyone

With the advent of automation, humans’ role has become to do what computers cannot. Many more white-collar workers—perhaps all of them—will end up “working with data” to some extent. This course for managers and workers without a strong quantitative background introduces a range of skills and applications related to critical thinking in such areas as forecasting, population measurement, set theory and logic, causal impact and attribution, scientific reasoning and the danger of cognitive biases. There are no prerequisites beyond high-school mathematics; this course has been designed to be approachable for everyone.

Intro to Predictive Analytics, Machine Learning, Data Science and AI

Our leading course has transformed the machine-learning and data-science practice of the many managers, sponsors, key stakeholders, entrepreneurs and beginning data-science practitioners who have attended it. This course is an intuitive, hands-on introduction to data science and machine learning. The training focuses on central concepts and key skills, leaving the trainee with a deep understanding of the foundations of data science and even some of the more advanced tools used in the field. The course does not involve coding, or require any coding knowledge or experience.

Intro to R (+ data visualisation)

R is the world’s most popular data mining and statistics package. It’s also free, and easy to use, with a range of intuitive graphical interfaces. This two-day course will introduce you to the R programming language, teaching you to create functions and customise code so you can manipulate data and begin to use R self-sufficiently in your work.

Intro to Python for Data Analysis

Python is a high-level, general-purpose language used by a thriving community of millions. Data-science teams often use it in their production environments and analysis pipelines, and it’s the tool of choice for elite data-mining competition winners and deep-learning innovations. This course provides a foundation for using Python in exploratory data analysis and visualisation, and as a stepping stone to machine learning.

Strategic Decision Making with Data for Executives

This course is for executives and managers who want to use analytics to support their most vital decisions. It gives senior executives the skills to make more effective use of data analytics in strategic decision-making. Attendees will learn how to receive, understand and make decisions from a range of analytics methods, including visualisation and dashboards. They will also be taught to work with analysts as effective customers.

Soft skills for analytics professionals and data scientists

This course is for specialists working in the data domain. It teaches some of the vital skills that are not part of the formal training of quantitative professionals, but which are essential in the modern workplace and crucial to the success of analytics efforts as well as the careers of analytics professionals. Issues covered include effective communication, including presentation skills, “storytelling”, and effective listening and elicitation. Technical methods will be presented in the context of their communication value. The course also shares key insights and “trade secrets” that have served Eugene Dubossarsky well over decades of consulting, enterprise and startup work. These include effective ways to structure teams, projects, and analytics functions and careers, as well as “managing up”, branding and work style.

Advanced Python 1

This class builds on the introductory Python class. Advanced use and customisation of Jupyter Notebook are covered, as well as configuring multiple environments and kernels. The NumPy package is introduced for working with arrays and matrices, and deeper coverage of Pandas data analysis and manipulation methods is provided, including working with time series data. Data exploration and advanced visualisations are taught using the Plotly and Seaborn libraries.

Advanced Python 2

This class builds on the introductory Python class. Advanced use and customisation of Jupyter Notebook are covered, as well as configuring multiple environments and kernels. The NumPy package is introduced for working with arrays and matrices, and deeper coverage of Pandas data analysis and manipulation methods is provided, including working with time series data. Data exploration and advanced visualisations are taught using the Plotly and Seaborn libraries.

Advanced R 1

This class builds on “Intro to R (+ data visualisation)” by providing students with powerful, modern R tools including pipes, the tidyverse, and many other packages that make coding for data analysis easier, more intuitive and more readable. The course will also provide a deeper view of functional programming in R, which allows cleaner and more powerful coding, as well as R Markdown, R Notebooks, and the shiny package for interactive documentation, browser-based dashboards and GUIs for R code.

Advanced R 2

This course goes deeper into the tidyverse family of packages, with a focus on advanced data handling, as well as advanced data structures such as list columns in tibbles, and their application to model management. Another key topic is advanced functional programming with the purrr package, and advanced use of the pipe operator. Optional topics may include dplyr on databases, and use of rmarkdown and RStudio notebooks.

Deep Learning and AI

This course is an introduction to the highly celebrated area of neural networks, popularised as “deep learning” and “AI”. The course will cover the key concepts underlying neural network technology, as well as the unique capabilities of a number of advanced deep learning technologies, including convolutional neural nets for image recognition, recurrent neural nets for time series and text modelling, and new artificial intelligence techniques including generative adversarial networks and reinforcement learning. Practical exercises will present these methods in some of the most popular deep learning packages available in Python, including Keras and TensorFlow. Trainees are expected to be familiar with the basics of machine learning from the introductory course, as well as the Python language.

Quantum Computing

This is an introduction to the exciting new field of quantum computing, including programming actual quantum computers in the cloud. Quantum computing promises to revolutionise cryptography, machine learning, cyber security, weather forecasting and a host of other mathematical and high-performance computing fields. A practical component will include writing quantum programs and executing them on simulators as well as on actual quantum computers in the cloud.

Fraud and Anomaly Detection

This course presents statistical, computational and machine-learning techniques for predictive detection of fraud and security breaches. These methods are shown in the context of use cases for their application, and include the extraction of business rules and a framework for the interoperation of human, rule-based, predictive and outlier-detection methods. Methods presented include predictive tools that do not rely on explicit fraud labels, as well as a range of outlier-detection techniques including unsupervised learning methods, notably the powerful random-forest algorithm, which can be used for both supervised and unsupervised applications, as well as cluster analysis, visualisation and fraud detection based on Benford’s law. The course will also cover the analysis and visualisation of social-network data. A basic knowledge of R and predictive analytics is advantageous.
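
For readers unfamiliar with Benford's law, the following is a minimal sketch of the idea and is not taken from the course material. Benford's law says that in many naturally occurring sets of numbers the leading digit d appears with probability log10(1 + 1/d), so unusually large gaps between observed and expected digit frequencies can flag data worth a closer look. The function names and the sample data (powers of two, which are known to follow the law closely) are assumptions chosen purely for illustration.

```python
import math
from collections import Counter


def benford_expected():
    """Expected leading-digit probabilities under Benford's law."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def first_digit(x):
    """Leading non-zero digit of a non-zero number."""
    # Scientific notation puts the leading digit first, e.g. '3.27...e+02'.
    return int(f"{abs(x):.15e}"[0])


def first_digit_frequencies(values):
    """Observed share of each leading digit 1-9, ignoring zeros."""
    digits = [first_digit(v) for v in values if v != 0]
    counts = Counter(digits)
    return {d: counts.get(d, 0) / len(digits) for d in range(1, 10)}


# Illustrative data only: the first 100 powers of two.
values = [2 ** n for n in range(100)]

expected = benford_expected()
observed = first_digit_frequencies(values)

# A simple deviation score: the largest gap between observed and expected.
max_deviation = max(abs(observed[d] - expected[d]) for d in range(1, 10))

for d in range(1, 10):
    print(f"{d}: observed {observed[d]:.3f}, expected {expected[d]:.3f}")
print(f"largest deviation: {max_deviation:.3f}")
```

In practice a formal test (for example a chi-square test) would normally replace the simple deviation score used here, and a large deviation is only a prompt for investigation, not proof of fraud.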

Advanced Fraud and Anomaly Detection

The detection of anomalies is one of the most eclectic and difficult activities in data analysis. This course builds on the basics introduced in “Fraud and Anomaly Detection”, and provides more advanced methods including supervised and unsupervised learning, advanced use of Benford’s law, and more on statistical anomaly detection. Optional topics may include anomalies in time series, deception in text and the use of social network analysis to detect fraud and other undesirable behaviours.

Forecasting and Trend Analysis

This course is an intuitive introduction to forecasting and analysis of time-series data. We will review a range of standard forecasting methods, including ARIMA and exponential smoothing, along with standard means of measuring forecast error and benchmarking with naive forecasts, and standard pre-processing/de-trending methods such as differencing and missing value imputation. Other topics will include trend/seasonality/noise decomposition, autocorrelation, visualisation of time series, and forecasting with uncertainty.
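
To make two of the ideas above concrete, here is a small sketch (not taken from the course material) of differencing and of benchmarking a forecast against a naive forecast using mean absolute error. The series is made-up illustrative data, and the function names and the simple "last value plus last change" candidate forecast are assumptions for the example, standing in for methods such as ARIMA or exponential smoothing.

```python
def difference(series, lag=1):
    """Differencing: the change between each value and the one `lag` steps back."""
    return [series[i] - series[i - lag] for i in range(lag, len(series))]


def mean_absolute_error(actual, predicted):
    """Average absolute gap between actual and predicted values."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


# Made-up data with an upward trend, for illustration only.
series = [10, 12, 13, 15, 16, 18, 19, 21]

# First differences remove the trend and leave the step-to-step changes.
changes = difference(series)

# Evaluate one-step-ahead forecasts for every point from the third onwards.
actual = series[2:]

# Naive forecast: tomorrow equals today.
naive = series[1:-1]

# Candidate forecast: last value plus the most recent change.
candidate = [
    series[t - 1] + (series[t - 1] - series[t - 2])
    for t in range(2, len(series))
]

naive_mae = mean_absolute_error(actual, naive)
candidate_mae = mean_absolute_error(actual, candidate)

# A ratio below 1 means the candidate beats the naive benchmark on this data.
relative_error = candidate_mae / naive_mae

print("differences:", changes)
print("naive MAE:", naive_mae)
print("candidate MAE:", candidate_mae)
print("candidate / naive:", round(relative_error, 3))
```

The point of the benchmark is that a forecasting method is only worth its complexity if it does better than the naive forecast on data it has not already seen.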

Advanced Machine Learning Masterclass 1

This course is for experienced machine-learning practitioners who want to take their skills to the next level by using R to hone their abilities as predictive modellers. Trainees will learn essential techniques for real machine-learning model development, helping them to build more accurate models. In the masterclass, participants will work to deploy, test, and improve their models.

Advanced Machine Learning Masterclass 2: Random Forests

This course is for experienced machine-learning practitioners who want to take their skills to the next level by using R to hone their abilities as predictive modellers. Trainees will learn essential techniques for real machine-learning model development, helping them to build more accurate models. In the masterclass, participants will work to deploy, test, and improve their models.

Text and Language Analytics

Text analytics is a crucial skill set in nearly all contexts where data science has an impact, whether that be customer analytics, fraud detection, automation or fintech. In this course, you will learn a toolbox of skills and techniques, starting from effective data preparation and stretching right through to advanced modelling with deep-learning and neural-network approaches such as word2vec.

Advanced Deep Learning

This course provides a more rigorous, mathematically based view of modern neural networks, their training, applications, strengths and weaknesses, focusing on key architectures such as convolutional nets for image processing and recurrent nets for text and time series. This course will also include use of dedicated hardware such as GPUs and multiple computing nodes on the cloud. There will also be an overview of the most common available platforms for neural computation. Some topics touched on in the introductory course will be revisited in more thorough detail. Optional advanced topics may include generative adversarial networks, reinforcement learning, transfer learning and probabilistic neural networks.