机器学习之参数估计

大家好，我是讯享网，很高兴认识大家。

Probability Theory focus on computing the probability of data arising from a parametric model with known parameters. Statistical Inference flips this on its head: we will estimate the probability of parameters given a parametric model and observed data drawn from it.

比如我得到了一些样本数据，并已知这些数据底层的分布是指数分布，但是并不知道具体是哪个指数分布！因为指数分布不是一个确定的分布，而是 one-parameter family of distributions. 不同的参数 λ 会得到不同的指数分布。正态分布，二项分布也都是同样的道理，不同的参数会得到不同的结果。我们通常把这样的分布叫做 parametric distributions or parametric models.

在这篇文章中，我将介绍一些方法，用给定的数据和参数模型，来估算出这些未知的 population parameters：

a population mean μ
the difference in two population means μ1−μ2
a population variance σ2
the ratio of two population variances σ21/σ22

Point Estimation VS Interval Estimation

下面是维基百科中关于 Point Estimation 的定义：

In statistics, point estimation involves the use of sample data to calculate a single value which is to serve as a “best guess” or “best estimate” of an unknown population parameter. More formally, it is the application of a point estimator to the data.

下面是维基百科中关于 Interval Estimation 的定义：

In statistics, interval estimation is the use of sample data to calculate an interval of plausible values of an unknown population parameter; this is in contrast to point estimation, which gives a single value.

下面是维基百科中关于 Confidence interval 的定义：

讯享网

In statistics, a confidence interval is a type of interval estimate (of a population parameter) that is computed from the observed data. The confidence level is the frequency (i.e., the proportion) of possible confidence intervals that contain the true value of their corresponding parameter. In other words, if confidence intervals are constructed using a given confidence level in an infinite number of independent experiments, the proportion of those intervals that contain the true value of the parameter will match the confidence level.

如果你对上面关于 Confidence interval 的定义有些不太理解，没有关系。当我介绍到如何解释一个 Confidence interval 的含义时，你会对这个定义理解的更加深刻。实际上，Interval Estimation 包含很多种方法，但是在这篇文章中我只介绍 confidence intervals.

Point Estimation

假设我们想知道中国人每天读书的平均时间， μ ，由于我们不可能去问到每个中国人他们每天拿出多少时间来读书，因此我们只能随机抽取出一些国人，得到他们的读书时间，然后用得到的这些数据去估算整个所有国人的每天平均读书时间。

我们有2种方法可以做这样的估算，它们分别是 maximum likelihood estimation 和 method of moments. 在这个小节中，我也会介绍一种方法来评估某个点估计是否为一个 “好” 的点估计。

在介绍这个点估计的方法之前，我先来介绍一下 point estimator（点估计量） 与 point estimate（点估计值） 的含义。

point estimator VS point estimate

We denote the n random variables arising from a random sample as subscripted uppercase letters:

X1,X2,⋯,Xn

机器学习之参数估计

Point Estimation VS Interval Estimation

Point Estimation

point estimator VS point estimate

相关推荐