AI-Machine Learning - Statistics - CS Archives - Page 20 of 20

Multiple Imputation with Chained Equations method & Python codes

MICE (Multiple Imputation by Chained Equations) is a statistical method used for handling missing data by creating multiple imputations or…

by Kurious Fox | May 17, 2024 | October 12, 2024

data preprocessing

K-Nearest Neighbors (KNN) imputation in sklearn

K-Nearest Neighbors (KNN) imputation is another method to handle missing data. It uses the ‘k’ closest instances (rows) to each…

by Kurious Fox | May 17, 2024 | October 12, 2024

data preprocessing

A comic guide to mean/median/mode imputation & Python codes

Handling missing data is a common preprocessing task in machine learning. In scikit-learn, you can handle missing data by using…

by Kurious Fox | May 17, 2024 | October 12, 2024

data preprocessing

SVD for dimension reduction

Singular Value Decomposition (SVD) is a powerful matrix decomposition technique that generalizes the concept of eigenvalue decomposition to non-square matrices.…

by Kurious Fox | May 16, 2024 | August 18, 2024

data preprocessing

Test for outliers in multivariate data in Python

To test for outliers in multivariate data in Python, you can use several libraries like numpy, scipy, pandas, sklearn, etc. Here’s how you can…

by Kurious Fox | May 16, 2024 | August 18, 2024

grazing the maze of probability

Application of Bayesian theorem in spam detection & medical diagnosis

Example 1: Spam Detection. Let’s say that historically 20% of emails are spam, so P(Spam) = 0.2, and the probability that the email is…

by Kurious Fox | May 2, 2024 | October 1, 2024