A comic guide to mean/median/mode imputation & Python codes
Handling missing data is a common preprocessing task in machine learning. In scikit-learn, you can handle missing data by using…
SVD for dimension reduction
Singular Value Decomposition (SVD) is a powerful matrix decomposition technique that generalizes the concept of eigenvalue decomposition to non-square matrices.…
test for outliers in multivariate data in Python
To test for outliers in multivariate data in Python, you can use several libraries like numpy, scipy, pandas, sklearn, etc. Here’s how you can…
Application of Bayesian theorem in spam detection & medical diagnosis
Example 1: Spam Detection Let’s say historically, 20% of emails are spam, so and the probability that the email is…