<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>
<channel>
	<title>Comments on: Why practitioners discretize their continuous data</title>
	<atom:link href="http://lixiaoxu.lxxm.com/why-practitioners-discretize-their-continous-data/feed/" rel="self" type="application/rss+xml" />
	<link>http://lixiaoxu.lxxm.com/why-practitioners-discretize-their-continous-data/</link>
	<description>Teaching notes of szpku dot lixiaoxu at gmail dot com</description>
	<pubDate>Fri, 30 Jul 2010 07:59:13 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: discrete data</title>
		<link>http://lixiaoxu.lxxm.com/why-practitioners-discretize-their-continous-data/comment-page-1/#comment-342</link>
		<dc:creator>discrete data</dc:creator>
		<pubDate>Tue, 30 Mar 2010 01:27:22 +0000</pubDate>
		<guid isPermaLink="false">http://ap2007.72pines.com/?p=61#comment-342</guid>
		<description>[...] setbacks. ... adopted DRT (Discrete Reportable Transcription) technology recognizes the need for ...10001036: Why practitioners discretize their continuous data. Yihui asked this question yesterday. ... [...]</description>
		<content:encoded><![CDATA[<p>[...] setbacks. ... adopted DRT (Discrete Reportable Transcription) technology recognizes the need for ...10001036: Why practitioners discretize their continuous data. Yihui asked this question yesterday. ... [...]</p>
]]></content:encoded>
	</item>
	<item>
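		<title>By: Yihui Xie: A guide to statistics blog posts: Rockets games and classification trees, neural networks and dimension reduction &#124; 统计之都 (Capital of Statistics)</title>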
		<link>http://lixiaoxu.lxxm.com/why-practitioners-discretize-their-continous-data/comment-page-1/#comment-296</link>
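		<dc:creator>Yihui Xie: A guide to statistics blog posts: Rockets games and classification trees, neural networks and dimension reduction &#124; 统计之都 (Capital of Statistics)</dc:creator>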
		<pubDate>Sun, 15 Mar 2009 10:04:34 +0000</pubDate>
		<guid isPermaLink="false">http://ap2007.72pines.com/?p=61#comment-296</guid>
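		<description>[...] Li Xiaoxu's blog: very professional, and one of the few blogs that writes its mathematical formulas in LaTeX. Li studies the details of statistical theory very carefully, much in the manner of statistics researchers abroad; posts such as Why practitioners discretize their continuous data explain one of the reasons people like to discretize continuous data. [...]</description>
		<content:encoded><![CDATA[<p>[...] Li Xiaoxu's blog: very professional, and one of the few blogs that writes its mathematical formulas in LaTeX. Li studies the details of statistical theory very carefully, much in the manner of statistics researchers abroad; posts such as Why practitioners discretize their continuous data explain one of the reasons people like to discretize continuous data. [...]</p>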
]]></content:encoded>
	</item>
	<item>
		<title>By: lixiaoxu</title>
		<link>http://lixiaoxu.lxxm.com/why-practitioners-discretize-their-continous-data/comment-page-1/#comment-295</link>
		<dc:creator>lixiaoxu</dc:creator>
		<pubDate>Sun, 08 Mar 2009 09:56:27 +0000</pubDate>
		<guid isPermaLink="false">http://ap2007.72pines.com/?p=61#comment-295</guid>
		<description>Residuals and errors are different. As the number of intervals grows, the squared residuals decrease while the squared errors increase. So the black points, i.e. the discretization with the maximum number of intervals, predict the red population worst.

Discretization fades out micro information (mostly errors) while highlighting macro information (usually non-linear). Once LOESS is popular enough, discretization will be abandoned. What practitioners really need is local smoothing to preview the macro models they care about.</description>
		<content:encoded><![CDATA[<p>Residuals and errors are different. As the number of intervals grows, the squared residuals decrease while the squared errors increase. So the black points, i.e. the discretization with the maximum number of intervals, predict the red population worst.</p>
<p>Discretization fades out micro information (mostly errors) while highlighting macro information (usually non-linear). Once LOESS is popular enough, discretization will be abandoned. What practitioners really need is local smoothing to preview the macro models they care about.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Yihui</title>
		<link>http://lixiaoxu.lxxm.com/why-practitioners-discretize-their-continous-data/comment-page-1/#comment-294</link>
		<dc:creator>Yihui</dc:creator>
		<pubDate>Sun, 08 Mar 2009 07:26:08 +0000</pubDate>
		<guid isPermaLink="false">http://ap2007.72pines.com/?p=61#comment-294</guid>
		<description>The discretization here is essentially a kind of local smoothing technique with a constant kernel function. Generally speaking, local modeling can effectively improve the fit (a lower error sum of squares), but we have to be careful to avoid overfitting. If you discretize x into more intervals, the fit will be even better.</description>
		<content:encoded><![CDATA[<p>The discretization here is essentially a kind of local smoothing technique with a constant kernel function. Generally speaking, local modeling can effectively improve the fit (a lower error sum of squares), but we have to be careful to avoid overfitting. If you discretize x into more intervals, the fit will be even better.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Keep on Fighting!</title>
		<link>http://lixiaoxu.lxxm.com/why-practitioners-discretize-their-continous-data/comment-page-1/#comment-293</link>
		<dc:creator>Keep on Fighting!</dc:creator>
		<pubDate>Fri, 06 Mar 2009 16:01:17 +0000</pubDate>
		<guid isPermaLink="false">http://ap2007.72pines.com/?p=61#comment-293</guid>
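		<description>&lt;strong&gt;Discretization: an effective way to destroy information...&lt;/strong&gt;

If you want to obscure your data, then discretize it! I don't know why so many people are so fond of discretizing continuous data: for example, when age data is clearly available, they insist on splitting it into categorical variables such as old, young, and middle-aged for the analysis; when the raw count data is clearly available...</description>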
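		<content:encoded><![CDATA[<p><strong>Discretization: an effective way to destroy information...</strong></p>
<p>If you want to obscure your data, then discretize it! I don't know why so many people are so fond of discretizing continuous data: for example, when age data is clearly available, they insist on splitting it into categorical variables such as old, young, and middle-aged for the analysis; when the raw count data is clearly available...</p>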
]]></content:encoded>
	</item>
</channel>
</rss>
