Machine Learning for Business and Economics - Why

I. Ozkan

Fall 2025

Machine Learning

“Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of statistical algorithms that can learn from data and generalise to unseen data, and thus perform tasks without explicit instructions. […]The application of ML to business problems is known as predictive analyticsWikipedia

Machine Learning

Other Fields and Machine Learning (ML):

Machine Learning and Automation

Machine Learning Tasks

Machine Learning

Examples of Recent Success stories:

Speech Recognition

NLP

Translation

Image Processing

Machine Learning: Lots of Keywords about Learning

Learning:

Task and Data:

Why Machine Learning in Business and Finance: An Example

Example: Loan Applications (digitization vs. ML)

The New World of Data

Traditional Statistics is helpful (and necessary) but new methods/approaches are essential for business decisions

Machine Learning Approaches

Supervised Learning Unsupervised Learning Reinforcement Learning
{Y;X} available {X} available Actions in Dynamic Environment Ex: Game
\(E[Y \: given \: X]\) Pattern inside data
\(P(Y=y \: given \:X=x)\) Homogeneous Groups
Ex: Regression Ex: Clustering

Supervised Learning: Labelled Data

\(Data=Pattern + Error, \: y=f(X)+\varepsilon\)


Unsupervised Learning: Unlabeled Data

\(Data \propto Pattern, \: X \propto Pattern\)

Reinforcement Learning: An Intelligent Agent should Take Actions in Dynamic Environment to maximize a reward

Data Rich Environment: [Very] High Dimensionality

Supervised Learning: main goal is to find:

\(Data=Pattern(s)+Error(s)\)

Example: Standard Regression

\(y=\beta_0+\beta_1 x_1+\beta_2 x_2+ \cdots + \beta_k x_k + \varepsilon\)

for some \(k>>2\)

This is equivalent to

\(Pattern=\beta_0+\beta_1 x_1+\beta_2 x_2+ \cdots + \beta_k x_k \; and \; \; error=\varepsilon\)

Or put in another form:

\(\mu(X)=E[Y|X=x]=\hat \beta_0+\hat \beta_1 x_1+\hat \beta_2 x_2+ \cdots +\hat \beta_k x_k\)

given \(E[\varepsilon]=0\) and \(\hat \beta_i\) are the estimated coefficients.

How to find the parameters, \(\hat \beta_i\):

\(MSE=\frac{1}{N+1} \sum_{i=0}^{N} (y_i-\mu(x_i))^2=\frac{1}{N+1} \sum_{i=0}^{N} \varepsilon_i^2\)

High Dimensionality

\(\implies\) high dimensionality comes with difficulties.

Fundamental Table

Data Causal Predictive
Observational Good/Bad Good/Bad
Experimental Good/Bad Good/Bad

Lets think two variables, \(y\) and \(x\), and the causality structure such that \(X\) causes \(Y\). All of the alternatives are:

Causality

It is possible then,

\(X \implies Y\)

\(Y\) do not causes \(X\) since the sample is split by chance then chance causes \(X\)

\(Z\) may cause both possible but by chance

It could still be by chance

It could be by selection, but it should be excluded by the experimenter

Fundamental Table

Data Causal Predictive
Observational Bad Good
Experimental Good Bad


In economics Observational Data set is used for Causal Inference


In \(Theories \implies Models \implies Validate \: with \: Data\) flow, causal structure is dictated by \(Theories\). Hence the word EconoMetrics had been used in Economics.

In Economics

Means:

Correlation vs Causation must be discussed (this one is the main critique)

Error structure is important

Behavioral assessments to model is crucial

Goodness of fit is not the main focus (though it is important)

In Econometrics