Kurious Fox

Multiple linear regression

by Kurious Fox
September 8, 2024November 16, 2024

Multiple linear regression is a powerful tool for modeling relationships between multiple independent variables and a single dependent variable. Let’s take a look at some examples with codes in Python and R to demonstrate its practical application

Review: Maximum Likelihood Estimation

by Kurious Fox
September 7, 2024November 16, 2024

Maximum Likelihood Estimation (MLE) is a statistical method that estimates parameters by maximizing the likelihood function. For example, in a Poisson distribution, the MLE for the rate parameter ? is the sample mean. And here is the detailed derivation

Comparing forward, backward, stepwise feature selection

by Kurious Fox
September 7, 2024November 16, 2024

Forward selection adds features one by one, optimizing model performance but potentially missing the best subset. Backward selection starts with all features and removes the least significant, refining the model but being more computationally intensive. Stepwise selection combines both methods, adding or removing features for a balanced approach but can be complex.

Line and Bar Plot in the same graph with Error Bars

by Kurious Fox
September 7, 2024November 8, 2024

The codes to for this graph is as below, with the following keypoints: Legend Handling: The legend is constructed from both plots (line plot & bar plot), ensuring that all data series are labeled correctly.… Line and Bar Plot in the same graph with Error Bars

What does AI see about Asian- A Generative Image Experiment

by Kurious Fox
September 7, 2024November 27, 2024

In this experiment, I used Pikaso to generate 20 images with a provided command, with “AI prompt” on (that means Freepik AI will automatically improve short prompts). Why? The generative model learns the patterns from… What does AI see about Asian- A Generative Image Experiment

Hyperparameter tuning by train-validation-test split – process & example

by Kurious Fox
September 7, 2024November 16, 2024

implementing Lasso regression with train-validation-test split and finding the optimal regularization parameter. In Python, it involves splitting the data, training Lasso model with different alpha values, finding the best alpha, retraining the model, and evaluating on the test set. In R, it includes data splitting, training Lasso models, finding the best lambda, retraining, and testing.

Grid search and train-validation-test split for hyperparameter tuning – intro

by Kurious Fox
September 6, 2024November 16, 2024

The training-validation-test split involves using the training set to fit the model, the validation set to tune hyperparameters, and the test set to evaluate performance. Python’s scikit-learn library can be used for this process, ensuring the model generalizes well to new data by evaluating it on unseen data and avoiding overfitting.

A comic guide to underfitting

by Kurious Fox
September 6, 2024November 19, 2024

Underfitting in machine learning occurs when a model fails to capture underlying data patterns due to simplicity or insufficient training data. To address underfitting, select complex models, add features, and obtain more training data. Also, fine-tune hyperparameters and optimize the model’s architecture. Few features in a model can also cause underfitting, requiring the identification of relevant additional features or more advanced modeling techniques.

Evaluation measure: MSE versus MAE, RMSE

by Kurious Fox
September 6, 2024November 19, 2024

This comic explains MSE and MAE, the commonly used evaluation metrics for regression. MSE emphasizes large deviations, while MAE provides a more robust measure when outliers are less significant. MSE is preferred as a loss function due to its ability to penalize larger errors more heavily and its suitability for mathematical optimization, stability, and statistical interpretation. RMSE is the square root of MSE and also penalizes large errors.

Parameters and Loss function

by Kurious Fox
September 6, 2024November 19, 2024

Machine learning parameters are values learned from training data to minimize prediction errors. For example, in a uniform distribution for bus arrival times, parameters $latex a$ and $latex b$ define the range. They are the model’s knobs for accurate predictions.

Unsupervised learning helps detect shady people

by Kurious Fox
September 6, 2024November 19, 2024

Unsupervised learning is a type of machine learning algorithm used to draw inferences from datasets consisting of input data without labeled responses. In unsupervised learning, the goal is to infer the natural structure present within… Unsupervised learning helps detect shady people

The model that’s not a girl & time machine

by Kurious Fox
September 6, 2024December 13, 2024

Comments: I already asked my student, and he confirmed that the reason he studied the ML class was because there was a model in that class ?. So, Mr. Fox left the class after he… The model that’s not a girl & time machine

Supervised learning: who’s supervising the forest?

by Kurious Fox
September 6, 2024November 19, 2024

Supervised learning involves training an algorithm on labeled data and pairing input with correct output. Unsupervised learning uses unlabeled data to find patterns. For example, predicting pizza delivery tips involves features like time, pizza type, distance, and tip history, with the goal of predicting tip outcomes.

A comic guide to Train – test split + Python & R codes

by Kurious Fox
September 6, 2024April 8, 2025

After collecting and preprocessing the dataset, it is essential to divide it into two distinct sets: training set and testing set. The training set is used to train the model while the testing set is used to evaluate its performance. This allows assessment of the model’s generalization to new data. Two code examples in Python and R demonstrate how to create synthetic data and split it into training and testing sets using popular libraries.

A comic guide to model generalization and overfitting

by Kurious Fox
September 6, 2024April 8, 2025

Model generalization in machine learning is a crucial concept that refers to the ability of a trained model to perform well on new, unseen data. When a model generalizes well, it demonstrates an understanding of… A comic guide to model generalization and overfitting

Simple Linear Regression Review: Sunlight & Selfie

by Kurious Fox
September 6, 2024November 19, 2024

Simple linear regression is a statistical method used to model and analyze the relationship between two continuous variables. Specifically, it aims to predict the value of one variable (the dependent or response variable) based on… Simple Linear Regression Review: Sunlight & Selfie

How to open a jupyter notebook in Google Drive/Github using Colab

by Kurious Fox
August 28, 2024August 28, 2024

Method 1: Go to Google Colab Visit Google Colab. A dialog like this will appear Do similarly if you want to open a notebook from Github. Alternatively, if you’re already in Google Colab and want… How to open a jupyter notebook in Google Drive/Github using Colab

Create a Sitemap to improve SEO Using a Plugin in WordPress

by Kurious Fox
August 26, 2024September 6, 2024

Why using a sitemap? Sitemaps are essential for helping search engines understand and index your site effectively. They ensure that search engines are aware of all your important content, even the less discoverable pages. This… Create a Sitemap to improve SEO Using a Plugin in WordPress

18. Decision tree for squirrels to find place to store her acorns

by Kurious Fox
August 24, 2024October 13, 2025

This comic introduces the concept of a decision tree in machine learning when a rabbit was trying to help a squirrel select a tree to store her her acorns

16. What is an outlier?

by Kurious Fox
August 24, 2024October 13, 2025

this comic illustrate what is an outlier when some birds detect a cute funny zebra with green stripes

15. Clustering for organizing your room

by Kurious Fox
August 24, 2024October 13, 2025

This funny comic introduces clustering, a machine-learning technique for grouping similar data points, with applications including customer segmentation, image segmentation, document clustering, anomaly detection, and social network analysis. Businesses utilize it for targeted marketing, while it helps in organizing images, categorizing documents, identifying unusual behavior in cybersecurity, and discovering communities in social networks.

14. The Forest Snack Company

by Kurious Fox
August 24, 2024October 13, 2025

this funny forest snack comic introduces surveying, a method of data collection involving structured questionnaires or interviews to gather specific information from a sample of individuals. They offer first-hand insights, enable large-scale data collection, support informed decision-making, and are cost-effective, making them essential across research, marketing, and social science for actionable data.

13. What is surveying?

by Kurious Fox
August 24, 2024October 13, 2025

This funny comic about the duck family introduces what is data and surveying. Data comprises various forms of information, such as numbers and text, collected for analysis. Surveys are effective tools for gathering opinions and preferences, enabling better decision-making by capturing diverse insights quickly. They facilitate understanding of collective preferences, helping individuals, businesses, and organizations make informed choices based on real feedback.

12.What’s Generative Music

by Kurious Fox
August 24, 2024October 13, 2025

This comic about monkeys learning music introduces generative music, which is a type of music composed using algorithms that enable its evolution over time, producing unique pieces with every playback. This interactive form of music allows user input to personalize the experience. It’s applied in diverse areas like video games, art installations, and film scoring, enhancing the dynamism of soundtracks.

11. Generative AI in Content Creation

by Kurious Fox
August 24, 2024October 13, 2025

Generative AI refers to the exciting and innovative capabilities of artificial intelligence systems that can create new content and ideas. Unlike traditional AI, which typically analyzes data to make decisions or predictions, generative AI goes a step further by producing original outputs such as text, images, music, and even videos. Generative AI has numerous real-world… 11. Generative AI in Content Creation

10. How can we predict the population of owls in the future?

by Kurious Fox
August 24, 2024October 13, 2025

This funny comic illustrates how scientists forecast future animal population sizes using various methods, including mathematical models, data collection through field surveys, and statistical techniques to analyze trends. They incorporate environmental factors and utilize simulation software to predict changes. These approaches are essential for effective conservation and wildlife management strategies.

9.How Lengthy Passwords Enhance Your Online Safety

by Kurious Fox
August 24, 2024October 13, 2025

Longer passwords enhance security by increasing complexity and resistance to attacks. They allow for more character combinations, making guessing and brute-force cracking significantly harder. Additionally, they enable the use of diverse characters, reduce reliance on predictable patterns, and considerably extend the time required for potential breaches, thus safeguarding unauthorized access.

8.How randomness encourages fairness with skunk’s fragrant dishes

by Kurious Fox
August 24, 2024October 13, 2025

This funny comic about skunk fragrance dishes helps us understand how randomness encourages fairness. It can help unbiased judicial and political processes and combat algorithmic biases. It fosters diversity in group participation and enhances transparency and trust. By integrating randomness, organizations can achieve equitable outcomes and ensure equal opportunities for all participants.

7.What’s random? Random MC for a show

by Kurious Fox
August 24, 2024October 15, 2025

Randomness is vital in various domains, enhancing decision-making, creativity, and research. It fosters innovation while underpinning cryptographic security and statistical sampling, ensuring unbiased data collection. By promoting equal selection chances, randomness strengthens conclusions and generalizations, …

6. Misclassification, applications of classification

by Kurious Fox
August 24, 2024October 13, 2025

This cute and funny comic about bats and owls features misclassification and applications of classification. Misclassification occurs when a model wrongly categorizes data, leading to consequences like financial loss and safety risks. Classification applications span healthcare, finance, marketing, image and speech recognition, and natural language processing, enhancing decision-making and efficiency.

5. What’s classification?

by Kurious Fox
August 24, 2024October 13, 2025

This cute, funny comic about the zebras and chihuahuas helps us understand what classification is. Classification systematically organizes entities into categories based on common traits, enhancing identification and analysis across various fields, including biology and library science, facilitating knowledge organization, retrieval, and effective communication in research and decision-making.

4. What is an algorithm?

by Kurious Fox
August 24, 2024October 13, 2025

This cute comic helps to understand what an algorithm is. An algorithm is a structured set of instructions aimed at completing a task or solving a problem. It can range from basic tasks, like sorting, to complex AI systems, and is essential in various fields, including technology and science.

« Previous
1
…
10
11
12
13
14
…
17
Next »