<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Why Reddit is broken, and how to fix it</title>
	<atom:link href="http://codeulate.com/2007/12/reddit-is-broken/feed/" rel="self" type="application/rss+xml" />
	<link>http://codeulate.com/2007/12/reddit-is-broken/</link>
	<description></description>
	<lastBuildDate>Mon, 30 Jan 2012 11:57:11 -0700</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.3</generator>
	<item>
		<title>By: Sold Out Activist</title>
		<link>http://codeulate.com/2007/12/reddit-is-broken/comment-page-1/#comment-585</link>
		<dc:creator>Sold Out Activist</dc:creator>
		<pubDate>Sat, 26 Jan 2008 07:13:06 +0000</pubDate>
		<guid isPermaLink="false">http://codeulate.com/?p=3#comment-585</guid>
		<description>I&#039;ve just launched a site that has public user tagging that&#039;s flavored (good/bad/neutral) so you can customize the site according to your preferences with each tag you add. When many users tag something the same way, it becomes a permanent tag of that submission. My website link will send you there. I&#039;m looking forward to comments if anyone is interested.</description>
		<content:encoded><![CDATA[<p>I&#8217;ve just launched a site that has public user tagging that&#8217;s flavored (good/bad/neutral) so you can customize the site according to your preferences with each tag you add. When many users tag something the same way, it becomes a permanent tag of that submission. My website link will send you there. I&#8217;m looking forward to comments if anyone is interested.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kari Chisholm</title>
		<link>http://codeulate.com/2007/12/reddit-is-broken/comment-page-1/#comment-74</link>
		<dc:creator>Kari Chisholm</dc:creator>
		<pubDate>Sun, 16 Dec 2007 22:05:42 +0000</pubDate>
		<guid isPermaLink="false">http://codeulate.com/?p=3#comment-74</guid>
		<description>&lt;i&gt;Then recommend stuff HE likes to me! It’s like wisdom of the crowd TIMES bayesian filtering. We can call it Web 3.0!&lt;/i&gt;

Or, you can call it Web 1.0.  Amazon worked like this from the beginning, right?</description>
		<content:encoded><![CDATA[<p><i>Then recommend stuff HE likes to me! It’s like wisdom of the crowd TIMES bayesian filtering. We can call it Web 3.0!</i></p>
<p>Or, you can call it Web 1.0.  Amazon worked like this from the beginning, right?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tad Chef</title>
		<link>http://codeulate.com/2007/12/reddit-is-broken/comment-page-1/#comment-44</link>
		<dc:creator>Tad Chef</dc:creator>
		<pubDate>Fri, 14 Dec 2007 14:27:24 +0000</pubDate>
		<guid isPermaLink="false">http://codeulate.com/?p=3#comment-44</guid>
		<description>Btw. it&#039;s not social bookmarking but social news. del.icio.us is for bookmarking. Aside of that reddit is the most hostile &quot;social&quot; site I ever encountered.</description>
		<content:encoded><![CDATA[<p>Btw. it&#8217;s not social bookmarking but social news. del.icio.us is for bookmarking. Aside of that reddit is the most hostile &#8220;social&#8221; site I ever encountered.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Aidan Finn</title>
		<link>http://codeulate.com/2007/12/reddit-is-broken/comment-page-1/#comment-43</link>
		<dc:creator>Aidan Finn</dc:creator>
		<pubDate>Thu, 13 Dec 2007 22:31:02 +0000</pubDate>
		<guid isPermaLink="false">http://codeulate.com/?p=3#comment-43</guid>
		<description>&quot;After all, reddit-funder Paul Graham was the guy who thought up using bayesian filtering for spam.&quot;

http://citeseer.ist.psu.edu/sahami98bayesian.html</description>
		<content:encoded><![CDATA[<p>&#8220;After all, reddit-funder Paul Graham was the guy who thought up using bayesian filtering for spam.&#8221;</p>
<p><a href="http://citeseer.ist.psu.edu/sahami98bayesian.html" rel="nofollow">http://citeseer.ist.psu.edu/sahami98bayesian.html</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: matt knox</title>
		<link>http://codeulate.com/2007/12/reddit-is-broken/comment-page-1/#comment-42</link>
		<dc:creator>matt knox</dc:creator>
		<pubDate>Thu, 13 Dec 2007 16:24:21 +0000</pubDate>
		<guid isPermaLink="false">http://codeulate.com/?p=3#comment-42</guid>
		<description>For bayesian inference to work, you want there to be a large number of features per judgement, so that you can multiply together the probabilities of buckethood vs. nonbuckethood.  So in the spam filtering case, every word is a feature.  Reddit does not see the sites to which it links, so all it has as a featureset is the article title and the URL.  Crawling the linked sites, though possible, would mean that they would have to store a ton of data-maybe as much as 100x the amount they presently store per article.  

But let&#039;s assume they get around that.  They have a huge featureset for every article.  Then they have a giant computational challenge-they have to run every article against  the &#039;goodness-filter&#039; trained by each user who views a page.  Doing that at page-view time would be a disaster, so they&#039;d have to run it beforehand and store it, which means they&#039;d need MxN goodness values for M users and N articles.  

They know about bayesian inference, but it&#039;s not such a good fit here.</description>
		<content:encoded><![CDATA[<p>For bayesian inference to work, you want there to be a large number of features per judgement, so that you can multiply together the probabilities of buckethood vs. nonbuckethood.  So in the spam filtering case, every word is a feature.  Reddit does not see the sites to which it links, so all it has as a featureset is the article title and the URL.  Crawling the linked sites, though possible, would mean that they would have to store a ton of data-maybe as much as 100x the amount they presently store per article.  </p>
<p>But let&#8217;s assume they get around that.  They have a huge featureset for every article.  Then they have a giant computational challenge-they have to run every article against  the &#8216;goodness-filter&#8217; trained by each user who views a page.  Doing that at page-view time would be a disaster, so they&#8217;d have to run it beforehand and store it, which means they&#8217;d need MxN goodness values for M users and N articles.  </p>
<p>They know about bayesian inference, but it&#8217;s not such a good fit here.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: shestheoneforme</title>
		<link>http://codeulate.com/2007/12/reddit-is-broken/comment-page-1/#comment-41</link>
		<dc:creator>shestheoneforme</dc:creator>
		<pubDate>Thu, 13 Dec 2007 14:58:49 +0000</pubDate>
		<guid isPermaLink="false">http://codeulate.com/?p=3#comment-41</guid>
		<description>When I saw Paul Graham&#039;s name, I thought it looked very familiar.
I&#039;m reading Hackers and Painters right now, and I love it.
I didn&#039;t know he had anything to do with Reddit!</description>
		<content:encoded><![CDATA[<p>When I saw Paul Graham&#8217;s name, I thought it looked very familiar.<br />
I&#8217;m reading Hackers and Painters right now, and I love it.<br />
I didn&#8217;t know he had anything to do with Reddit!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jack Alexander</title>
		<link>http://codeulate.com/2007/12/reddit-is-broken/comment-page-1/#comment-40</link>
		<dc:creator>Jack Alexander</dc:creator>
		<pubDate>Thu, 13 Dec 2007 13:17:05 +0000</pubDate>
		<guid isPermaLink="false">http://codeulate.com/?p=3#comment-40</guid>
		<description>I installed the &#039;greasemonkey&#039; script to get rid of ron paul and his spambots....reddit might need more work, but you have to admit that it is better, by far, than digg and propeller.</description>
		<content:encoded><![CDATA[<p>I installed the &#8216;greasemonkey&#8217; script to get rid of ron paul and his spambots&#8230;.reddit might need more work, but you have to admit that it is better, by far, than digg and propeller.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dov</title>
		<link>http://codeulate.com/2007/12/reddit-is-broken/comment-page-1/#comment-39</link>
		<dc:creator>Dov</dc:creator>
		<pubDate>Thu, 13 Dec 2007 11:50:18 +0000</pubDate>
		<guid isPermaLink="false">http://codeulate.com/?p=3#comment-39</guid>
		<description>I don&#039;t care what the aims and objectives are of this site. There is no excuse for permitting repeated and vile anti-Semitic attacks. At some point the people who run broad-spectrum internet sites have to be put the same standards of accountability as others in the mass media.

http:/www.root-1.co.il/reddit.htm</description>
		<content:encoded><![CDATA[<p>I don&#8217;t care what the aims and objectives are of this site. There is no excuse for permitting repeated and vile anti-Semitic attacks. At some point the people who run broad-spectrum internet sites have to be put the same standards of accountability as others in the mass media.</p>
<p>http:/www.root-1.co.il/reddit.htm</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: quirkyalone</title>
		<link>http://codeulate.com/2007/12/reddit-is-broken/comment-page-1/#comment-38</link>
		<dc:creator>quirkyalone</dc:creator>
		<pubDate>Thu, 13 Dec 2007 10:25:18 +0000</pubDate>
		<guid isPermaLink="false">http://codeulate.com/?p=3#comment-38</guid>
		<description>The question is, whether the Bayeasian Filtering will work in this content. According my limited knowledge, BF is good at filtering out what you do not want (=spam), but not good at recommends what you want.

Some time ago, I played with one open source rss reader (sorry can&#039;t remember the name), which used BF to filter out &quot;interesting&quot; RSS posts based on my votes, which is the similar principle you advocates. But it just didnt work too well...</description>
		<content:encoded><![CDATA[<p>The question is, whether the Bayeasian Filtering will work in this content. According my limited knowledge, BF is good at filtering out what you do not want (=spam), but not good at recommends what you want.</p>
<p>Some time ago, I played with one open source rss reader (sorry can&#8217;t remember the name), which used BF to filter out &#8220;interesting&#8221; RSS posts based on my votes, which is the similar principle you advocates. But it just didnt work too well&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: pp</title>
		<link>http://codeulate.com/2007/12/reddit-is-broken/comment-page-1/#comment-37</link>
		<dc:creator>pp</dc:creator>
		<pubDate>Thu, 13 Dec 2007 08:18:54 +0000</pubDate>
		<guid isPermaLink="false">http://codeulate.com/?p=3#comment-37</guid>
		<description>But hey. You like lisp people, paul graham, and you don&#039;t like sexprs? :)</description>
		<content:encoded><![CDATA[<p>But hey. You like lisp people, paul graham, and you don&#8217;t like sexprs? :)</p>
]]></content:encoded>
	</item>
</channel>
</rss>

