Category: AI-Machine Learning – Statistics – CS
Examples of Exponential distribution
The exponential distribution is commonly used to model the time between events in a Poisson process. It is defined by…
denoising via dimension reduction in python
Dimension reduction methods like Principal Component Analysis (PCA) or Singular Value Decomposition (SVD) can be used for denoising data because…
why we can & probably should use missing at random imputation methods for data that’s not missing at random?
Missing At Random (MAR) imputation methods are based on the assumption that the chance of missing data is not related…
Missing data analysis: where’s your missing piece?
Why missing data occurs can be attributed to various reasons, including human error, malfunctioning equipment, or even intentional omission. It…
Imputation using SoftImpute in python
SoftImpute is a matrix completion algorithm in Python that allows you to fill in missing data in your dataset. This…
Multiple Imputation with Chained Equations method & Python codes
MICE (Multiple Imputation by Chained Equations) is a statistical method used for handling missing data by creating multiple imputations or…
K-Nearest Neighbors (KNN) imputation in sklearn
K-Nearest Neighbors (KNN) imputation is another method to handle missing data. It uses the ‘k’ closest instances (rows) to each…
A comic guide to mean/median/mode imputation & Python codes
Handling missing data is a common preprocessing task in machine learning. In scikit-learn, you can handle missing data by using…
SVD for dimension reduction
Singular Value Decomposition (SVD) is a powerful matrix decomposition technique that generalizes the concept of eigenvalue decomposition to non-square matrices.…