I want to create a probability density function for a particular dataset. First of all, I calculate the mean and the variance of my dataset. So, I use the mean and the variance to create a probability density function, for example, Gaussian distribution. Is my thinking correct?
2026-03-31 06:28:06.1774938486
On
Creating a probability density function for a particular dataset
153 Views Asked by Bumbble Comm https://math.techqa.club/user/bumbble-comm/detail At
2
There are 2 best solutions below
0
On
A nonparametric way to estimate a density corresponding to your data is through kernel density estimation.
Given an iid sample $(x_1,...,x_n),$ this method estimates your density function as
$$\widehat f=\frac{1}{nh}\sum_{i=1}^n K\left(\frac{x-x_i}{h}\right)$$
for a suitable choice of a bandwidth parameter $h$ and kernel $K(\cdot)$.
I encourage you to read the wiki article for details. Further lecture notes are here.
I encourage you to visualize the dataset.
For example, you have to consider if your data is symmetrical.
If your data is symmetrical and you believe that Normal distribution would be a good fit, then using the mean and unbiased estimator is indeed a common practice.
This Wikipedia page describes your approach as follows: