Sometimes the model can overfit the training data and hence underperforms with real data. To reduce complexity, we use regularization.
Add a penalty factor on complexity i.e. number of features and values of weights.
Lasso: Force coefficients to zero if not relevant. Ends up performing feature selection.
Ridge: Reduces coefficients of irrelevant features but does not drop them to zero.
Dictionary
Regularization
Jul 13, 2023
Derive new features
Dictionary
Feature Extraction
Jul 13, 2023
Identify frequently occurring patterns and rules
Dictionary
Association
Jul 13, 2023
Decision trees are versatile tools that can be employed for both Regression and Classification tasks.
When developing a decision tree, it’s crucial to ask a series of targeted questions in order to zero in on the most accurate label. The objective is to construct an efficient tree that minimizes the number of splits necessary to divide the data. This process continues until no further splits can be made.
It’s important to find the right balance, as shallow trees tend to under-fit the data, while deep trees can lead to overfitting, with each example ending up as its own leaf node.
Dictionary
Decision Tree
Jul 13, 2023
Predict numeric values
Linear Regression builds relationship between inputs (features) and outputs (targets). It is a simple yet effective model to start with to get a benchmark.
Using training data, the goal is to minimize the loss.
You can measure loss function using SSE
Polynomial Regression: Linear Regression can also be used to model a non-linear relationship. To achieve this, we transform an input feature by a non-linear transformation function and create a new feature that can be used as input in the model.
Dictionary
Regression
Jul 13, 2023
Chinese proverb: One chopstick is fragile, but a group of chopsticks is robust.
A big challenge with ML models is overfitting to training data. And hence, we can use ensemble models — average multiple models into a meta model, which in turn reduces variance.
Dictionary
Ensemble Model
Jul 13, 2023
Predict likely categories
One of the unsupervised classification techniques is clustering.
Dictionary
Classification
Jul 13, 2023
Identify unusual cases
Dictionary
Anomaly Detection
Jul 13, 2023
Chinese proverb: Make a mistake and learn a lesson
Similar to unsupervised learning, it does not need labeled data. The model learns from interactions (trial and error)and hence takes time to perfect. Initially the algorithm acts randomly, exploring the environment, but it learns with time by being rewarded when it makes the right choices.
It is best suited for use cases that require achieving a goal, for example, autonomous vehicles, headline testing or personalizing a website.
Why it matters: In any business you want to make sure that you are adding customers faster than you are losing them and at the right Customer Acquisition Cost.
Sometimes churn is good because your product isn’t for everyone! It is better to let them go than to keep supporting them or serving their needs.
Types: E-commerce businesses will measure transaction churn. In contrast, Subscription businesses will measure two forms of churn:
Customer Churn in a specific period (say a month) is the number of customers lost divided by the number of customers you had at the start of the period. Use this if you have only one Pricing Tier. However, if you’ve more than one Pricing Tier, then you should calculate revenue churn or visualize churn by customer cohorts.
Revenue Churn in a specific period (say a month) is the total revenue on the first day of the period minus the revenue from the same customers at the end of the period minus any value from Upsell. We need to subtract the upsells because they hide loss from churned customers.
Challenges:Churn is a delayed backward looking and you cannot get a live number.
Examples. Everyone has faced challenges with churn:
The Wall Street Journal worries about aging subscribers.
Once someone is on your Internet Products, always push them to consume one more piece of content. Pages Per Session captures how interested are audiences in exploring your website beyond the landing page.
You can do this by capturing the audience’s intent of coming to your platform and serving them basis that intent using:
Search interfaces give audiences exactly what they want.
To understand their intent you can rely on the piece of content they are consuming right now or the organic keywords they were searching for prior to landing on your platform.