Microsoft Azure Machine Learning Algorithm Cheat Sheet



Microsoft released a one-page PDF cheat sheet that shows which machine learning algorithms to use, and when. It lists problem types as groups, along with the algorithms Azure supports in each group. The groups include:

Regression: for predicting values.
Anomaly detection: for finding unusual data points. Algorithms: One-class SVM (>100 features, aggressive boundary) and PCA-based anomaly detection (fast training).
Clustering: K-means.
Two-class classification: Two-class decision forest, Two-class boosted decision tree, Two-class decision jungle, Two-class locally deep SVM, Two-class SVM, Two-class averaged perceptron, and Two-class logistic regression.

Below is some background on Microsoft Azure Machine Learning Studio, along with the free Microsoft Azure Machine Learning Algorithm Cheat Sheet.

Azure Machine Learning Studio comes with a large number of machine learning algorithms that you can use to build your predictive analytics solutions. These algorithms fall into the general machine learning categories of regression, classification, clustering, and anomaly detection, and each one is designed to address a different type of machine learning problem.

The question is, is there something that can help me quickly figure out how to choose a machine learning algorithm for my specific solution?

The Microsoft Azure Machine Learning Algorithm Cheat Sheet is designed to help you sift through the available machine learning algorithms and choose the appropriate one to use for your predictive analytics solution. The cheat sheet asks you questions about both the nature of your data and the problem you're working to address, and then suggests an algorithm for you to try.
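
The question-and-answer flow of the cheat sheet can be pictured as a small lookup. The sketch below is only an illustration: the `suggest_algorithms` helper and its goal strings are hypothetical names, not part of any Azure API, and the candidate lists are simply the algorithm names that appear in this article.

```python
# Hypothetical helper: maps a stated goal to candidate algorithm names.
# The names come from this article, not from an Azure API.
REGRESSION = ["Boosted Decision Tree Regression", "Decision Forest Regression",
              "Fast Forest Quantile Regression", "Linear Regression",
              "Neural Network Regression", "Poisson Regression"]
CLUSTERING = ["K-Means Clustering"]
ANOMALY = ["One-class SVM", "PCA-Based Anomaly Detection"]
TWO_CLASS = ["Two-Class Averaged Perceptron", "Two-Class Boosted Decision Tree",
             "Two-Class Decision Forest", "Two-Class Logistic Regression",
             "Two-Class Neural Network", "Two-Class Support Vector Machine"]
MULTICLASS = ["Multiclass Boosted Decision Tree", "Multiclass Decision Forest",
              "Multiclass Logistic Regression", "Multiclass Neural Network",
              "One vs. All Multiclass", "One vs. One Multiclass"]


def suggest_algorithms(goal, n_classes=None):
    """Return candidate algorithms for a goal (illustrative goal strings)."""
    if goal == "predict_value":
        return REGRESSION
    if goal == "group_data":
        return CLUSTERING
    if goal == "find_unusual":
        return ANOMALY
    if goal == "predict_class":
        if n_classes is None or n_classes < 2:
            raise ValueError("n_classes must be 2 or more for classification")
        return TWO_CLASS if n_classes == 2 else MULTICLASS
    raise ValueError(f"unknown goal: {goal!r}")


print(suggest_algorithms("predict_class", n_classes=2))
```

The real cheat sheet also weighs properties such as training speed and the number of features, which this lookup leaves out.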

This reference content provides the technical background on each of the machine learning algorithms and modules available in Azure Machine Learning designer.

Each module represents a set of code that can run independently and perform a machine learning task, given the required inputs. A module might contain a particular algorithm, or perform a task that is important in machine learning, such as missing value replacement, or statistical analysis.
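
To make the idea of a self-contained module concrete, here is a minimal sketch of missing value replacement as a standalone function. It is not the designer's implementation: the function name `clean_missing_data` and the choice of mean substitution are assumptions made for illustration.

```python
from statistics import mean


def clean_missing_data(column):
    """Replace None entries with the mean of the observed values.

    Illustrative stand-in for a data-cleaning step, not Azure's code.
    """
    observed = [v for v in column if v is not None]
    if not observed:
        raise ValueError("column has no observed values to average")
    fill = mean(observed)
    return [fill if v is None else v for v in column]


print(clean_missing_data([1.0, None, 3.0]))
```

Like a module, the function depends only on its input and can be run and tested on its own.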

For help with choosing algorithms, see

Tip

In any pipeline in the designer, you can get information about a specific module. Select the Learn more link in the module card when hovering on the module in the module list, or in the right pane of the module.

Data preparation modules

| Functionality | Description | Module |
| --- | --- | --- |
| Data Input and Output | Move data from cloud sources into your pipeline. Write your results or intermediate data to Azure Storage, SQL Database, or Hive, while running a pipeline, or use cloud storage to exchange data between pipelines. | Enter Data Manually, Export Data, Import Data |
| Data Transformation | Operations on data that are unique to machine learning, such as normalizing or binning data, dimensionality reduction, and converting data among various file formats. | Add Columns, Add Rows, Apply Math Operation, Apply SQL Transformation, Clean Missing Data, Clip Values, Convert to CSV, Convert to Dataset, Convert to Indicator Values, Edit Metadata, Group Data into Bins, Join Data, Normalize Data, Partition and Sample, Remove Duplicate Rows, SMOTE, Select Columns Transform, Select Columns in Dataset, Split Data |
| Feature Selection | Select a subset of relevant, useful features to use in building an analytical model. | Filter Based Feature Selection, Permutation Feature Importance |
| Statistical Functions | Provide a wide variety of statistical methods related to data science. | Summarize Data |
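
Two of the transformations named above, normalizing data and splitting data, are easy to sketch in plain Python. The function names, the min-max scaling choice, and the `fraction` and `seed` parameters are assumptions for illustration. They are not the options exposed by the Normalize Data or Split Data modules.

```python
import random


def min_max_normalize(values):
    """Rescale values linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant column carries no spread; map everything to 0.0.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def split_rows(rows, fraction=0.7, seed=0):
    """Shuffle rows reproducibly and split them into two parts."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = round(len(shuffled) * fraction)
    return shuffled[:cut], shuffled[cut:]


print(min_max_normalize([2, 4, 6]))
train, test = split_rows(range(10), fraction=0.5)
print(len(train), len(test))
```

Fixing the seed keeps the split repeatable, which matters when you compare models trained on the same data.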

Machine learning algorithms

| Functionality | Description | Module |
| --- | --- | --- |
| Regression | Predict a value. | Boosted Decision Tree Regression, Decision Forest Regression, Fast Forest Quantile Regression, Linear Regression, Neural Network Regression, Poisson Regression |
| Clustering | Group data together. | K-Means Clustering |
| Classification | Predict a class. Choose from binary (two-class) or multiclass algorithms. | Multiclass Boosted Decision Tree, Multiclass Decision Forest, Multiclass Logistic Regression, Multiclass Neural Network, One vs. All Multiclass, One vs. One Multiclass, Two-Class Averaged Perceptron, Two-Class Boosted Decision Tree, Two-Class Decision Forest, Two-Class Logistic Regression, Two-Class Neural Network, Two-Class Support Vector Machine |
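
As an example of what "group data together" means in practice, the following is a minimal sketch of the standard k-means procedure (Lloyd's algorithm) on one-dimensional points. It assumes fixed starting centroids and a fixed iteration count. The K-Means Clustering module will differ in initialization and stopping rules, and `k_means_1d` is a hypothetical name.

```python
def k_means_1d(points, centroids, iterations=10):
    """Run Lloyd's algorithm on 1-D points from the given starting centroids."""
    centroids = list(centroids)
    clusters = [[] for _ in centroids]
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        # An empty cluster keeps its previous centroid.
        centroids = [sum(c) / len(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    return centroids, clusters


centers, groups = k_means_1d([1.0, 2.0, 3.0, 10.0, 11.0, 12.0], [0.0, 5.0])
print(centers, groups)
```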

Modules for building and evaluating models

| Functionality | Description | Module |
| --- | --- | --- |
| Model Training | Run data through the algorithm. | Train Clustering Model, Train Model, Train PyTorch Model, Tune Model Hyperparameters |
| Model Scoring and Evaluation | Measure the accuracy of the trained model. | Apply Transformation, Assign Data to Clusters, Cross Validate Model, Evaluate Model, Score Image Model, Score Model |
| Python Language | Write code and embed it in a module to integrate Python with your pipeline. | Create Python Model, Execute Python Script |
| R Language | Write code and embed it in a module to integrate R with your pipeline. | Execute R Script |
| Text Analytics | Provide specialized computational tools for working with both structured and unstructured text. | Convert Word to Vector, Extract N Gram Features from Text, Feature Hashing, Preprocess Text, Latent Dirichlet Allocation, Score Vowpal Wabbit Model, Train Vowpal Wabbit Model |
| Computer Vision | Image data preprocessing and image recognition modules. | Apply Image Transformation, Convert to Image Directory, Init Image Transformation, Split Image Directory, DenseNet, ResNet |
| Recommendation | Build recommendation models. | Evaluate Recommender, Score SVD Recommender, Score Wide and Deep Recommender, Train SVD Recommender, Train Wide and Deep Recommender |
| Anomaly Detection | Build anomaly detection models. | PCA-Based Anomaly Detection, Train Anomaly Detection Model |
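
Evaluation of a two-class model usually comes down to comparing scored labels with the true labels. The sketch below computes accuracy, precision, and recall from two label lists. The `evaluate_binary` name and the returned dictionary are assumptions for illustration. This is not the output format of the Evaluate Model module.

```python
def evaluate_binary(actual, predicted):
    """Compute accuracy, precision, and recall for 0/1 labels."""
    if not actual or len(actual) != len(predicted):
        raise ValueError("label lists must be non-empty and of equal length")
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == 1 and p == 1)  # true positives
    tn = sum(1 for a, p in pairs if a == 0 and p == 0)  # true negatives
    fp = sum(1 for a, p in pairs if a == 0 and p == 1)  # false positives
    fn = sum(1 for a, p in pairs if a == 1 and p == 0)  # false negatives
    return {
        "accuracy": (tp + tn) / len(pairs),
        # Report 0.0 when a ratio is undefined (no predicted or no actual positives).
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


print(evaluate_binary([1, 0, 1, 1, 0], [1, 0, 0, 1, 1]))
```

Accuracy alone can mislead on imbalanced data, which is why precision and recall are reported alongside it.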

Web service

Learn about the web service modules that are necessary for real-time inference in Azure Machine Learning designer.

Error messages

Learn about the error messages and exception codes you might encounter when using modules in Azure Machine Learning designer.
