What’s hypothesis testing
Hypothesis testing is a structured way to use sample data to make decisions or draw conclusions about a population. It answers questions like: It’s the backbone of inferential statistics. 🎯 The Core Idea You start…
Hypothesis testing is a structured way to use sample data to make decisions or draw conclusions about a population. It answers questions like: It’s the backbone of inferential statistics. 🎯 The Core Idea You start…
⭐ Central Limit Theorem (CLT) The Central Limit Theorem says something surprisingly powerful: If you take many random samples and compute their means,the distribution of those sample means will be approximately normal,even if the original…
⭐ Z‑Score A z‑score tells you how many standard deviations an observation is from the mean. What it does Example Population mean , standard deviation .What is the z‑score of ? Interpretation:The value is 1.5…
⭐ Histogram vs. Density Plot Both visualize distributions, but they answer slightly different questions and behave differently. 📊 Histogram A histogram groups data into bins and shows counts (or proportions) in each bin. Key features…
A random variable that follows a geometric distribution satisfies: This means: The probability you still have to wait more trials does NOT depend on how long you’ve already been waiting. Your past failures don’t change…
The geometric distribution models the number of trials needed until the first success occurs in a sequence of independent Bernoulli trials (like repeated coin flips). Think of it as the math of “How long until…
⭐ Binomial Distribution The binomial distribution models the number of successes in a fixed number of independent trials, where each trial has the same probability of success. Think of it as the math of “How…
The expected value (mean) of random variables adds even if the variables are dependent. This is the magic part: Expectation is always linear — no independence required. Formally, for any random variables and : And…
A histogram is a graph that shows how data are distributed by grouping values into bins (intervals) and showing how many observations fall into each bin. It’s perfect for visualizing: Think of it as stacking…
A probability density function describes the distribution of a continuous random variable. If is continuous, its PDF is a function such that: The key idea For continuous variables: The PDF is not a probability. Probability…
A probability mass function is a function that gives the probability of each individual value of a discrete random variable. If is a discrete random variable, then its PMF is: It tells you: A PMF…
Most random variables fall into two big categories: Everything else is a refinement of these two. 🎯 1. Discrete Random Variables A discrete random variable takes countable values — usually integers. Key features Examples Common…
Independent vs. Mutually Exclusive 🎯 Mutually Exclusive (Disjoint) Events Two events are mutually exclusive if they cannot happen at the same time. Example:Rolling a die: 🎯 Independent Events Two events are independent if knowing one…
⭐ General Addition Rule The general addition rule tells you how to find the probability that A or B happens — even when the events overlap. 📌 The formula Why subtract the intersection?Because if A…
⭐ Independent Events Two events are independent when one happening does not change the probability of the other. That’s the whole heart of it. 📌 Formal definition Events and are independent if This equation is…
⭐ Complementary Events Two events are complements when they cover the entire sample space together and cannot happen at the same time. If is an event, then its complement is: 📌 Key properties 🎯 Examples…
⭐ Sample vs. Event Think of probability as a story with two levels: 🎯 Sample Space (S) The sample space is the complete list of everything that could happen in an experiment. Examples Key idea…
Prism (at https://prism.openai.com/) is OpenAI’s AI-powered collaborative LaTeX editor designed for researchers, scientists, and academics. It integrates frontier models (like GPT variants) directly into the LaTeX writing workflow for drafting, editing, rewriting, fixing errors, improving…
Downsampling techniques can be used to create custom batches for training a machine learning model. These custom batches can be tailored to specific needs, such as improving training efficiency, handling imbalanced data, or focusing on…
Downsampling for hyperparameter tuning reduces the dataset size to speed up model training and experimentation while preserving key data characteristics. Here’s a concise overview: Why Downsample for Hyperparameter Tuning? Key Considerations Practical Steps Pitfalls to…