Machine Learning orientation
1
Introduction
I Machine learning: Shall we?
2
What is machine learning?
2.1
What is intelligence?
2.1.1
Definition of artificial intelligence sub domains
2.2
Is AI smarter than humans?
2.2.1
Thinking, fast and slow
(Kahneman
2011
)
2.3
Comparisons between AI and humans
2.3.1
Breast cancer detection
2.3.2
Working together: Lung cancer detection
2.3.3
ImageNet Large Scale Visual Recognition Challenge (ILSVRC)
2.3.4
AlphaGo Zero
2.4
ML models with bias
2.5
Attacks on ML models
2.5.1
Adding noise to image leads to misclassification
2.5.2
But what about attacks on human perception?
2.5.3
Model Hacking ADAS to Pave Safer Roads for Autonomous Vehicles
2.6
Measuring the Algorithmic Efficiency of Neural Networks
3
Outlook
3.1
Development of life
3.1.1
When will superhuman AI come, and will it be good?
3.1.2
AI aftermath scenario
3.2
Data religion: Dataism
4
Discussion points
4.1
Researcher stops his work due to ethical concerns
4.2
Career: Oxford seeks AI ethics professor
4.3
Shaping Europe’s digital future: Commission presents strategies for data and Artificial Intelligence
5
Strategies for machine learning
5.1
Management ML strategy: 7 steps for a successful ML project
5.2
Project management ML strategy: The Drivetrain Approach
5.2.1
Recommendation system
5.2.2
Exercise: Optimizing lifetime customer value
5.3
Developer ML strategy
II Machine learning fundamentals
6
Machine learning fundamentals
7
ML project process
7.1
Identify if ML is suited to fulfill need
7.2
Gather data
7.2.1
What kind of data can be used?
7.2.2
How much data is necessary?
7.2.3
Which data is useful?
7.3
Exploratory and quantitative data analysis
7.3.1
Example for exploratory and quantitative data analysis
7.3.2
Visualizations for Categorical Data: Exploring the OkCupid Data
7.4
Feature engineering
7.4.1
Encoding Categorical Predictors
7.4.2
Engineering numeric features
7.4.3
Feature importance
7.5
Model fit
7.6
Model tuning
7.6.1
Metrics
7.7
Deploy model
8
Machine learning types
8.1
Supervised learning
8.1.1
Self supervised learning
8.2
Unsupervised learning
8.2.1
Discovering clusters
8.2.2
Discovering latent factors
8.3
Reinforcement learning
8.3.1
Elements of reinforcement learning
8.3.2
RL algorithms
8.3.3
Example self driving car MIT
9
ML application examples
9.1
AI translation by DeepL
9.2
Artificial intelligence detects myocardial infarctions in the ECG more reliably than cardiologistsn
9.2.1
First in Germany: Artificial intelligence recognizes COVID-19 in clinical routine
9.3
A Deep Learning Approach to Antibiotic Discovery
9.4
Fundamental limits from chaos on instability time predictions in compact planetary systems
9.4.1
Comparison between SPOCK and previous models
9.5
Machine Learning Algorithms and Global Optimization Methods for SPICE Model Parameter Extraction
9.6
How many yards will an NFL player gain after receiving a handoff?
9.7
Predictive Maintenance for the elevator and escalator industry TBD
9.8
Disease outbreak risk software
9.9
Neural networks enable autonomous navigation of catheters
9.10
Bosch FLEXIDOME IP starlight 8000i
9.11
Master Autonomous Driving
9.12
University Suttgart: Indoor-Ortung mit Mobilfunk
10
ML algorithms
10.1
Linear regression
10.1.1
Example for linear regression
10.2
Logistic regression
10.2.1
Python example logistic regression
10.3
Tree based methods
10.3.1
Splitting metrics
10.3.2
Ensembles
10.3.3
Random forest
10.3.4
Boosted trees
10.4
Support Vector Machine (SVM) TBD
10.4.1
Kernels
10.4.2
Python example for SVM
10.5
Neural networks
10.5.1
Geometric Intuition for Training Neural Networks
10.5.2
Convolutional Neural Network (CNN) TBD
10.5.3
RNN TBD
10.5.4
GANs
10.6
A Gentle Introduction to CycleGAN for Image Translation
10.6.1
Examples for GANs
10.7
Software that can generate photos from paintings, turn horses into zebras, perform style transfer, and more.
10.8
Transformers TBD
10.8.1
Example transformer for time series forecasting
11
Food for the algorithms: Data
11.1
NLP
11.2
Dataset search from Google
III Explainable ML
12
Explainable ML tbd
12.1
Method: Layer-Wise Relevance Propagation
12.2
Method: SpRay
12.3
Method: Lime tbd
12.4
alibi tbd
12.5
tf-explain tbd
12.6
keras-salient-object-visualization
12.6.1
Explaining How a Deep Neural Network Trained with End-to-End Learning Steers a Car
12.6.2
VisualBackProp: efficient visualization of CNNs
IV ML online resources
13
ML online courses
13.1
Coursera
13.2
Udemy
13.3
DataCamp
13.4
Udacity
13.4.1
Example for self-driving car course project
13.5
fast.ai
13.5.1
fast.ai Book
13.6
Kaggle Courses
13.7
Full Stack Deep Learning
14
ML online resources
14.1
In-depth introduction to machine learning in 15 hours of expert videos
14.1.1
An Introduction to Statistical Learning
14.2
The learning machine
14.3
DeepAI: The front page of A.I.
14.4
TensorFlow tutorials
14.4.1
MIT 6.S191 Introduction to Deep Learning
14.5
Embedding Projector
14.6
Tensorboard playground
14.7
Empowering companies to jumpstart AI and generate real-world value
14.8
TensorFlow, Keras and deep learning, without a PhD
14.9
Neural Networks and Deep Learning
14.10
Platform.ai: produce high-quality labels
15
ML online books
15.1
Neural Networks and Deep Learning
15.2
Deep Learning
V Examples from Kaggle
16
Examples in Kaggle
17
Melbourne University AES/MathWorks/NIH Seizure Prediction
17.1
Winning solution (1st)
17.1.1
Alex / Gilberto models
17.1.2
Feng models
17.1.3
Andriy models
17.1.4
Code on GitHub
17.2
Solution(4th place)
17.2.1
Pre-processing
17.2.2
Features
17.2.3
Model
17.2.4
GitHub code
18
Bosch Production Line Performance
18.1
1st place solution
18.1.1
Data exploration
18.1.2
Hand crafted features
18.1.3
Hardware
18.2
3rd place solution TBD
18.3
8th place solution with GitHub
18.3.1
Overall architecture
18.3.2
Input data sets
18.3.3
Ensembling
18.3.4
Features
18.3.5
Validation method
18.3.6
Software
18.3.7
Code on GitHub
19
Corporación Favorita Grocery Sales Forecasting
19.1
1st place solution
19.2
4th-Place Solution Overview
19.3
5th Place Solution
20
Rossmann Store Sales
20.1
1st place solution
20.2
3rd place solution
21
Severstal: Steel Defect Detection
22
Lyft 3D Object Detection for Autonomous Vehicles
22.1
3rd place solution
23
APTOS 2019 Blindness Detection
23.1
1st place solution summary
24
Predicting Molecular Properties
24.1
#1 Solution - hybrid
24.1.1
Overall architecture
24.1.2
Ensembling
24.1.3
Hardware
24.1.4
Software
24.1.5
Code on GitHub
24.2
#2 solution 🤖 Quantum Uncertainty 🤖
24.2.1
Overall architecture
24.2.2
Input features and embeddings
24.2.3
Data augmentation
24.2.4
Ensembling
24.2.5
Hardware
24.2.6
Software
24.2.7
Code on GitHub
VI Real world example
25
Real world example
25.1
Subject of the project
Depending from where you were looking:
Looking from the perspective of machine learning expert
25.2
Project phases
The main project phases are:
After data gathering iteration is trump
25.2.1
Feature engineering
25.3
Algorithm selection
25.3.1
Logistic regression
25.3.2
Tree based
25.3.3
Support Vector Machine (SVM) TBD
25.4
Performance measurement
25.4.1
Sensitivity and specificity
25.4.2
Receiver operating characteristic (ROC)
25.5
Confusion matrix and receiver operating characterstic (ROC) for pulse
25.5.1
Receiver operating characterstic (ROC) and probability density plots
25.6
Create augmented labeled data
25.6.1
Features of time signals
25.7
Features generated
25.7.1
Analysis of generated features
25.7.2
Dynamic time warp (DTW) for signal
25.8
Algorithm
25.9
Confusion matrix results logistic regression for measured data
25.9.1
ROC results for measured data
25.10
Several algorithms results for SNR = 18dB
25.10.1
ROC results for SNR 18dB
25.11
Calculation of return of invest (ROI)
25.11.1
Calculation of ML project invest
25.11.2
Calculation of ML profit
25.11.3
Resulting ROI
25.12
Compare models for SNR = 18dB
25.13
Optimize ML hyper parameter
VII Cloud-based machine learning
26
Cloud-based machine learning
VIII Kaggle Survey
27
Kaggle survey introduction
27.1
Kaggle survey details
27.2
Purpose
27.3
Navigation and handling
28
Results
28.1
Survey participants education level
28.2
Who uses which algorithm
28.3
Machine learning experience and algorithms
28.4
Experience and new algorithms
28.5
Role of participants
28.6
Company size
28.7
Company incorporation of machine learning
28.8
Favourite media sources on data science topics
28.9
Favourite online course platform
28.10
Favourite data analyzing tool
28.11
Experience in data analysis coding
28.12
Favourite integrated development environments (IDE’s)
28.13
Favourite hosted notebook products
28.14
Favourite programming languages
28.15
Recommended entry programming language
28.16
Favourite data visualization libraries or tools
28.17
Favourite specialized hardware
28.18
Favourite machine learning frameworks
28.19
Favourite cloud computing platforms
28.20
Favourite big data / analytics products
28.21
Favourite automated machine learning tools (or partial AutoML tools)
References
Published with bookdown
Machine learning orientation
12.5
tf-explain tbd
https://github.com/sicara/tf-explain