交叉驗證：修订间差异 - 求闻百科，共笔求闻

第1行：

'''交叉验证'''，有時亦稱'''循環估計'''<ref name="Kohavi95">{{cite journal | last = Kohavi | first = Ron | year = 1995 | title = A study of cross-validation and bootstrap for accuracy estimation and model selection | journal = Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence | url = http://citeseer.ist.psu.edu/kohavi95study.html | volume = 2 | issue = 12 | pages = 1137–1143 | access-date = 2008-07-14 | | | }}(Morgan Kaufmann, San Mateo)</ref><ref name="Chang92">Chang, J., Luo, Y., and Su, K. 1992. GPSM: a Generalized Probabilistic Semantic Model for ambiguity resolution. In Proceedings of the 30th Annual Meeting on Association For Computational Linguistics (Newark, Delaware, June 28 - July 02, 1992). Annual Meeting of the ACL. Association for Computational Linguistics, Morristown, NJ, 177-184</ref><ref name="Devijver82">Devijver, P. A., and J. Kittler, Pattern Recognition: A Statistical Approach, Prentice-Hall, London, 1982</ref>，

'''交叉验证'''，有时亦称'''循环估计'''<ref name="Kohavi95">{{cite journal | last = Kohavi | first = Ron | year = 1995 | title = A study of cross-validation and bootstrap for accuracy estimation and model selection | journal = Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence | url = http://citeseer.ist.psu.edu/kohavi95study.html | volume = 2 | issue = 12 | pages = 1137–1143 | access-date = 2008-07-14 | | | }}(Morgan Kaufmann, San Mateo)</ref><ref name="Chang92">Chang, J., Luo, Y., and Su, K. 1992. GPSM: a Generalized Probabilistic Semantic Model for ambiguity resolution. In Proceedings of the 30th Annual Meeting on Association For Computational Linguistics (Newark, Delaware, June 28 - July 02, 1992). Annual Meeting of the ACL. Association for Computational Linguistics, Morristown, NJ, 177-184</ref><ref name="Devijver82">Devijver, P. A., and J. Kittler, Pattern Recognition: A Statistical Approach, Prentice-Hall, London, 1982</ref>，

是一種[[~~統計學~~]]上將[[数据]][[樣本]][[集合划分|切割]]成較小子集的實用方法。於是可以先在一個子集上做分析，而其它子集則用來做~~後續對~~此分析的確認及驗證。一開始的子集被稱為'''訓練集'''。而其它的子集則被稱為'''驗證集'''或'''測試集'''。交叉验证的目的，是用未用来给模型作训练的新数据，测试模型的性能，以便減少诸如过拟合和选择偏差等問題，并给出模型如何在一个独立的数据集上通用化（即，一个未知的数据集，如实际问题中的数据）。

是一种[[统计学]]上将[[数据]][[样本]][[集合划分|切割]]成较小子集的实用方法。于是可以先在一个子集上做分析，而其它子集则用来做后续对此分析的确认及验证。一开始的子集被称为'''训练集'''。而其它的子集则被称为'''验证集'''或'''测试集'''。交叉验证的目的，是用未用来给模型作训练的新数据，测试模型的性能，以便減少诸如过拟合和选择偏差等问题，并给出模型如何在一个独立的数据集上通用化（即，一个未知的数据集，如实际问题中的数据）。

交叉驗證的理論是由{{tsl|en|Seymour Geisser|}}所開始的。它對於防範根据数据建议的测试假设是非常重要的，特別是~~當後續~~的[[樣本]]是危險、成本過高或科学上不适合时去搜集。

交叉验证的理论是由{{tsl|en|Seymour Geisser|}}所开始的。它对于防范根据数据建议的测试假设是非常重要的，特别是当后续的[[样本]]是危险、成本过高或科学上不适合时去搜集。

== 交叉验证的使用 ==

第8行：

交叉验证是一种预测模型拟合性能的方法。

== 常見的交叉驗證形式 ==

== 常见的交叉验证形式 ==

=== Holdout 驗證 ===

=== Holdout 验证 ===

常~~識來說~~，Holdout ~~驗證並~~非一種交叉驗證，因為数据並沒有交叉使用。

常识来说，Holdout 验证并非一种交叉验证，因为数据并沒有交叉使用。

~~隨機從~~最初的樣本中選出部分，形成交叉驗證数据，而剩餘的就當做訓練数据。

随机从最初的样本中选出部分，形成交叉验证数据，而剩餘的就当做训练数据。

一般來說，少於原本樣本三分之一的数据被選做驗證数据。

一般来说，少于原本样本三分之一的数据被选做验证数据。

<ref>{{cite web | title=Tutorial 12 | work=Decision Trees Interactive Tutorial and Resources | url=http://decisiontrees.net/node/36 | accessdate=2006-06-21 | | | }}</ref>

第21行：

''k''折交叉验证（{{lang-en|''k''-fold cross-validation}}），将训练集分割成''k''个子样本，一个单独的子样本被保留作为验证模型的数据，其他''k'' − 1个样本用来训练。交叉验证重复''k''次，每个子样本验证一次，平均''k''次的结果或者使用其它结合方式，最终得到一个单一估测。这个方法的优势在于，同时重复运用随机产生的子样本进行训练和验证，每次的结果验证一次，10次交叉验证是最常用的。

=== 留一驗證 ===

=== 留一验证 ===

正如名稱所建議，留一驗證（{{lang-en|leave-one-out cross-validation, LOOCV}}）意指只使用原本樣本中的一~~項來當~~做~~驗證資~~料，而剩餘的則留下來當做~~訓練資~~料。這個步驟一直持續到每個樣本都被當做一次~~驗證資~~料。

正如名称所建议，留一验证（{{lang-en|leave-one-out cross-validation, LOOCV}}）意指只使用原本样本中的一项来当做验证资料，而剩餘的则留下来当做训练资料。这个步驟一直持续到每个样本都被当做一次验证资料。

事實上，這等同於''k''折交叉验证，其中''k''為原本樣本個數。<ref>{{Cite web|url=https://web.stanford.edu/~hastie/ElemStatLearn/|title=Elements of Statistical Learning: data mining, inference, and prediction. 2nd Edition.|website=web.stanford.edu|access-date=2019-04-04|||}}</ref>

事实上，这等同于''k''折交叉验证，其中''k''为原本样本个数。<ref>{{Cite web|url=https://web.stanford.edu/~hastie/ElemStatLearn/|title=Elements of Statistical Learning: data mining, inference, and prediction. 2nd Edition.|website=web.stanford.edu|access-date=2019-04-04|||}}</ref>

在某些情況下是存在有效率的演算法，如使用{{tsl|en|kernel regression|}} 和[[吉洪诺夫正则化]]。

== 誤差估計 ==

== 误差估计 ==

可以計算估計誤差。常見的誤差衡量標準是[[均方差]]和[[方根均方差]]，

可以计算估计误差。常见的误差衡量标準是[[均方差]]和[[方根均方差]]，

分別為交叉驗證的[[方差]]和[[標準差]]。

分别为交叉验证的[[方差]]和[[标準差]]。

== 另見 ==

== 另见 ==

* [[重抽样]]

* [[提升方法]]

* [[Bagging算法|引导聚集算法]]

== 參考文獻 ==

== 参考文献 ==

== 外部連結 ==

== 外部链接 ==

* [https://web.archive.org/web/20090214023159/http://paul.luminos.nl/documents/show_document.php?d=198 Naive Bayes implementation with cross-validation in Visual Basic] (includes executable and source code)

* [https://web.archive.org/web/20081201180042/http://www.cs.technion.ac.il/%7Eronbeg/gcv/index.html A generic k-fold cross-validation implementation] (free open source; includes a distributed version that can utilize multiple computers and in principle can speed up the running time by several orders of magnitude.)

[[Category:统计检验]]

@@ 第1行： / 第1行： @@
-'''交叉验证'''，有時亦稱'''循環估計'''<ref name="Kohavi95">{{cite journal | last = Kohavi | first = Ron | year = 1995 | title = A study of cross-validation and bootstrap for accuracy estimation and model selection | journal = Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence | url = http://citeseer.ist.psu.edu/kohavi95study.html | volume = 2 | issue = 12 | pages = 1137–1143 | access-date = 2008-07-14 | | | }}(Morgan Kaufmann, San Mateo)</ref><ref name="Chang92">Chang, J., Luo, Y., and Su, K. 1992. GPSM: a Generalized Probabilistic Semantic Model for ambiguity resolution. In Proceedings of the 30th Annual Meeting on Association For Computational Linguistics (Newark, Delaware, June 28 - July 02, 1992). Annual Meeting of the ACL. Association for Computational Linguistics, Morristown, NJ, 177-184</ref><ref name="Devijver82">Devijver, P. A., and J. Kittler, Pattern Recognition: A Statistical Approach, Prentice-Hall, London, 1982</ref>，
+'''交叉验证'''，有时亦称'''循环估计'''<ref name="Kohavi95">{{cite journal | last = Kohavi | first = Ron | year = 1995 | title = A study of cross-validation and bootstrap for accuracy estimation and model selection | journal = Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence | url = http://citeseer.ist.psu.edu/kohavi95study.html | volume = 2 | issue = 12 | pages = 1137–1143 | access-date = 2008-07-14 | | | }}(Morgan Kaufmann, San Mateo)</ref><ref name="Chang92">Chang, J., Luo, Y., and Su, K. 1992. GPSM: a Generalized Probabilistic Semantic Model for ambiguity resolution. In Proceedings of the 30th Annual Meeting on Association For Computational Linguistics (Newark, Delaware, June 28 - July 02, 1992). Annual Meeting of the ACL. Association for Computational Linguistics, Morristown, NJ, 177-184</ref><ref name="Devijver82">Devijver, P. A., and J. Kittler, Pattern Recognition: A Statistical Approach, Prentice-Hall, London, 1982</ref>，
-是一種[[統計學]]上將[[数据]][[樣本]][[集合划分|切割]]成較小子集的實用方法。於是可以先在一個子集上做分析，而其它子集則用來做後續對此分析的確認及驗證。一開始的子集被稱為'''訓練集'''。而其它的子集則被稱為'''驗證集'''或'''測試集'''。交叉验证的目的，是用未用来给模型作训练的新数据，测试模型的性能，以便減少诸如过拟合和选择偏差等問題，并给出模型如何在一个独立的数据集上通用化（即，一个未知的数据集，如实际问题中的数据）。
+是一种[[统计学]]上将[[数据]][[样本]][[集合划分|切割]]成较小子集的实用方法。于是可以先在一个子集上做分析，而其它子集则用来做后续对此分析的确认及验证。一开始的子集被称为'''训练集'''。而其它的子集则被称为'''验证集'''或'''测试集'''。交叉验证的目的，是用未用来给模型作训练的新数据，测试模型的性能，以便減少诸如过拟合和选择偏差等问题，并给出模型如何在一个独立的数据集上通用化（即，一个未知的数据集，如实际问题中的数据）。
-交叉驗證的理論是由{{tsl|en|Seymour Geisser|}}所開始的。它對於防範根据数据建议的测试假设是非常重要的，特別是當後續的[[樣本]]是危險、成本過高或科学上不适合时去搜集。
+交叉验证的理论是由{{tsl|en|Seymour Geisser|}}所开始的。它对于防范根据数据建议的测试假设是非常重要的，特别是当后续的[[样本]]是危险、成本过高或科学上不适合时去搜集。
 == 交叉验证的使用 ==
@@ 第8行： / 第8行： @@
 交叉验证是一种预测模型拟合性能的方法。
-== 常見的交叉驗證形式 ==
+== 常见的交叉验证形式 ==
-=== Holdout 驗證 ===
+=== Holdout 验证 ===
-常識來說，Holdout 驗證並非一種交叉驗證，因為数据並沒有交叉使用。
+常识来说，Holdout 验证并非一种交叉验证，因为数据并沒有交叉使用。
-隨機從最初的樣本中選出部分，形成交叉驗證数据，而剩餘的就當做訓練数据。
+随机从最初的样本中选出部分，形成交叉验证数据，而剩餘的就当做训练数据。
-一般來說，少於原本樣本三分之一的数据被選做驗證数据。
+一般来说，少于原本样本三分之一的数据被选做验证数据。
 <ref>{{cite web | title=Tutorial 12 | work=Decision Trees Interactive Tutorial and Resources | url=http://decisiontrees.net/node/36 | accessdate=2006-06-21 | | | }}</ref>
@@ 第21行： / 第21行： @@
 ''k''折交叉验证（{{lang-en|''k''-fold cross-validation}}），将训练集分割成''k''个子样本，一个单独的子样本被保留作为验证模型的数据，其他''k''&nbsp;−&nbsp;1个样本用来训练。交叉验证重复''k''次，每个子样本验证一次，平均''k''次的结果或者使用其它结合方式，最终得到一个单一估测。这个方法的优势在于，同时重复运用随机产生的子样本进行训练和验证，每次的结果验证一次，10次交叉验证是最常用的。
-=== 留一驗證 ===
+=== 留一验证 ===
 <!-- This section is linked from [[数据挖掘]] -->
-正如名稱所建議，留一驗證（{{lang-en|leave-one-out cross-validation, LOOCV}}）意指只使用原本樣本中的一項來當做驗證資料，而剩餘的則留下來當做訓練資料。這個步驟一直持續到每個樣本都被當做一次驗證資料。
+正如名称所建议，留一验证（{{lang-en|leave-one-out cross-validation, LOOCV}}）意指只使用原本样本中的一项来当做验证资料，而剩餘的则留下来当做训练资料。这个步驟一直持续到每个样本都被当做一次验证资料。
-事實上，這等同於''k''折交叉验证，其中''k''為原本樣本個數。<ref>{{Cite web|url=https://web.stanford.edu/~hastie/ElemStatLearn/|title=Elements of Statistical Learning: data mining, inference, and prediction. 2nd Edition.|website=web.stanford.edu|access-date=2019-04-04|||}}</ref>
+事实上，这等同于''k''折交叉验证，其中''k''为原本样本个数。<ref>{{Cite web|url=https://web.stanford.edu/~hastie/ElemStatLearn/|title=Elements of Statistical Learning: data mining, inference, and prediction. 2nd Edition.|website=web.stanford.edu|access-date=2019-04-04|||}}</ref>
 在某些情況下是存在有效率的演算法，如使用{{tsl|en|kernel regression|}} 和[[吉洪诺夫正则化]]。
-== 誤差估計 ==
+== 误差估计 ==
-可以計算估計誤差。常見的誤差衡量標準是[[均方差]]和[[方根均方差]]，
+可以计算估计误差。常见的误差衡量标準是[[均方差]]和[[方根均方差]]，
-分別為交叉驗證的[[方差]]和[[標準差]]。
+分别为交叉验证的[[方差]]和[[标準差]]。
-== 另見 ==
+== 另见 ==
 * [[重抽样]]
 * [[提升方法]]
 * [[Bagging算法|引导聚集算法]]
-== 參考文獻 ==
+== 参考文献 ==
 {{reflist|30em}}
-== 外部連結 ==
+== 外部链接 ==
 * [https://web.archive.org/web/20090214023159/http://paul.luminos.nl/documents/show_document.php?d=198 Naive Bayes implementation with cross-validation in Visual Basic] (includes executable and source code)
 * [https://web.archive.org/web/20081201180042/http://www.cs.technion.ac.il/%7Eronbeg/gcv/index.html A generic k-fold cross-validation implementation] (free open source; includes a distributed version that can utilize multiple computers and in principle can speed up the running time by several orders of magnitude.)
-{{統計學}}
+{{统计学}}
 [[Category:统计检验]]