<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Histograms &#8212; matplotlib vs. R</title>
	<atom:link href="http://brenocon.com/blog/2012/02/histograms-matplotlib-vs-r/feed/" rel="self" type="application/rss+xml" />
	<link>http://brenocon.com/blog/2012/02/histograms-matplotlib-vs-r/</link>
	<description>cognition, language, social systems; statistics, visualization, computation</description>
	<lastBuildDate>Tue, 25 Nov 2025 13:11:20 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
	<item>
		<title>By: brendano</title>
		<link>http://brenocon.com/blog/2012/02/histograms-matplotlib-vs-r/#comment-171505</link>
		<dc:creator>brendano</dc:creator>
		<pubDate>Mon, 06 Aug 2012 14:52:07 +0000</pubDate>
		<guid isPermaLink="false">http://brenocon.com/blog/?p=1112#comment-171505</guid>
		<description><![CDATA[O.R.: Good question... QQ Plots are good for continuous data, of course.  You can use them for discrete data, if you impose and ordering on the levels/values.  This does seem artificial though.]]></description>
		<content:encoded><![CDATA[<p>O.R.: Good question&#8230; QQ Plots are good for continuous data, of course.  You can use them for discrete data, if you impose and ordering on the levels/values.  This does seem artificial though.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: O.R.</title>
		<link>http://brenocon.com/blog/2012/02/histograms-matplotlib-vs-r/#comment-167869</link>
		<dc:creator>O.R.</dc:creator>
		<pubDate>Wed, 25 Jul 2012 14:27:40 +0000</pubDate>
		<guid isPermaLink="false">http://brenocon.com/blog/?p=1112#comment-167869</guid>
		<description><![CDATA[I&#039;m curious, if histograms aren&#039;t optimal,  what kind of exploratory visualization would you use to check the distribution shape of discrete data?]]></description>
		<content:encoded><![CDATA[<p>I&#8217;m curious, if histograms aren&#8217;t optimal,  what kind of exploratory visualization would you use to check the distribution shape of discrete data?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Brendan O'Connor</title>
		<link>http://brenocon.com/blog/2012/02/histograms-matplotlib-vs-r/#comment-120218</link>
		<dc:creator>Brendan O'Connor</dc:creator>
		<pubDate>Wed, 08 Feb 2012 19:05:16 +0000</pubDate>
		<guid isPermaLink="false">http://brenocon.com/blog/?p=1112#comment-120218</guid>
		<description><![CDATA[Lukas -- yeah it&#039;s dumb to use histograms-designed-for-continuous-data at all; I just think the R histogram is doing better in a challenging situation.]]></description>
		<content:encoded><![CDATA[<p>Lukas &#8212; yeah it&#8217;s dumb to use histograms-designed-for-continuous-data at all; I just think the R histogram is doing better in a challenging situation.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Lukas Biewald</title>
		<link>http://brenocon.com/blog/2012/02/histograms-matplotlib-vs-r/#comment-120212</link>
		<dc:creator>Lukas Biewald</dc:creator>
		<pubDate>Wed, 08 Feb 2012 18:19:27 +0000</pubDate>
		<guid isPermaLink="false">http://brenocon.com/blog/?p=1112#comment-120212</guid>
		<description><![CDATA[Brendan - are you sure the R plot is really better?  Just looking at the first comparison you have - R magically picked a perfect bin size that doesn&#039;t have aliasing effects with you underlying discrete data.  But it completely hides the fact that your data is discrete.  It sure looks prettier, but I think the prettiness is misleading - I would guess from that histogram that the data comes from a continuous distribution.

If the possible values of your data is a small set of discrete points, why use a histogram at all?

Anyway, just a thought :).  Come back to California!]]></description>
		<content:encoded><![CDATA[<p>Brendan &#8211; are you sure the R plot is really better?  Just looking at the first comparison you have &#8211; R magically picked a perfect bin size that doesn&#8217;t have aliasing effects with you underlying discrete data.  But it completely hides the fact that your data is discrete.  It sure looks prettier, but I think the prettiness is misleading &#8211; I would guess from that histogram that the data comes from a continuous distribution.</p>
<p>If the possible values of your data is a small set of discrete points, why use a histogram at all?</p>
<p>Anyway, just a thought :).  Come back to California!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: brendano</title>
		<link>http://brenocon.com/blog/2012/02/histograms-matplotlib-vs-r/#comment-118993</link>
		<dc:creator>brendano</dc:creator>
		<pubDate>Sat, 04 Feb 2012 05:19:08 +0000</pubDate>
		<guid isPermaLink="false">http://brenocon.com/blog/?p=1112#comment-118993</guid>
		<description><![CDATA[OK I figured out my problem.  RPy2 crashes like crazy.  But RPy works nicely.  (I installed rpy with https://gist.github.com/1732879 ).   To get plotting to work, I have to start Python with &quot;ipython --pylab&quot;.  And this seems to work for R&#039;s graphics commands, not just matplotlib (which it&#039;s supposed to be for).  If you don&#039;t say &quot;--pylab&quot;, then the plot command is unreliable.]]></description>
		<content:encoded><![CDATA[<p>OK I figured out my problem.  RPy2 crashes like crazy.  But RPy works nicely.  (I installed rpy with <a href="https://gist.github.com/1732879" rel="nofollow">https://gist.github.com/1732879</a> ).   To get plotting to work, I have to start Python with &#8220;ipython &#8211;pylab&#8221;.  And this seems to work for R&#8217;s graphics commands, not just matplotlib (which it&#8217;s supposed to be for).  If you don&#8217;t say &#8220;&#8211;pylab&#8221;, then the plot command is unreliable.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: michael toomim</title>
		<link>http://brenocon.com/blog/2012/02/histograms-matplotlib-vs-r/#comment-118703</link>
		<dc:creator>michael toomim</dc:creator>
		<pubDate>Fri, 03 Feb 2012 01:25:55 +0000</pubDate>
		<guid isPermaLink="false">http://brenocon.com/blog/?p=1112#comment-118703</guid>
		<description><![CDATA[I used rpy. Rpy2 seemed lame.]]></description>
		<content:encoded><![CDATA[<p>I used rpy. Rpy2 seemed lame.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: brendano</title>
		<link>http://brenocon.com/blog/2012/02/histograms-matplotlib-vs-r/#comment-118683</link>
		<dc:creator>brendano</dc:creator>
		<pubDate>Thu, 02 Feb 2012 22:15:40 +0000</pubDate>
		<guid isPermaLink="false">http://brenocon.com/blog/?p=1112#comment-118683</guid>
		<description><![CDATA[RPy crashes like crazy for me.  Do you use &quot;rpy&quot; or &quot;rpy2&quot; ?]]></description>
		<content:encoded><![CDATA[<p>RPy crashes like crazy for me.  Do you use &#8220;rpy&#8221; or &#8220;rpy2&#8243; ?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Yang</title>
		<link>http://brenocon.com/blog/2012/02/histograms-matplotlib-vs-r/#comment-118681</link>
		<dc:creator>Yang</dc:creator>
		<pubDate>Thu, 02 Feb 2012 22:05:35 +0000</pubDate>
		<guid isPermaLink="false">http://brenocon.com/blog/?p=1112#comment-118681</guid>
		<description><![CDATA[You probably know about it already, but I like using RPy to leverage R for plotting and other &quot;better-in-R&quot; routines.]]></description>
		<content:encoded><![CDATA[<p>You probably know about it already, but I like using RPy to leverage R for plotting and other &#8220;better-in-R&#8221; routines.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
