<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Extracting The Main Content From a Webpage</title>
	<atom:link href="http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/feed/" rel="self" type="application/rss+xml" />
	<link>http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/</link>
	<description>A blog about web development, software business, and WordPress</description>
	<lastBuildDate>Wed, 08 Feb 2012 21:10:53 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: ag gründen</title>
		<link>http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/comment-page-1/#comment-187604</link>
		<dc:creator>ag gründen</dc:creator>
		<pubDate>Wed, 05 Oct 2011 07:40:49 +0000</pubDate>
		<guid isPermaLink="false">http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/#comment-187604</guid>
		<description>You really make it appear so easy with your presentation however I find this topic to be actually something that I believe I&#039;d never understand. It sort of feels too complex and extremely huge for me. I&#039;m taking a look ahead on your subsequent post, I?ll try to get the hold of it!</description>
		<content:encoded><![CDATA[<p>You really make it appear so easy with your presentation however I find this topic to be actually something that I believe I&#8217;d never understand. It sort of feels too complex and extremely huge for me. I&#8217;m taking a look ahead on your subsequent post, I?ll try to get the hold of it!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sreejith</title>
		<link>http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/comment-page-1/#comment-61812</link>
		<dc:creator>Sreejith</dc:creator>
		<pubDate>Tue, 06 Jul 2010 12:02:32 +0000</pubDate>
		<guid isPermaLink="false">http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/#comment-61812</guid>
		<description>Hi...DOM implementation is very slow while using bulk url&#039;s...is it a bug??any solution to solve this???</description>
		<content:encoded><![CDATA[<p>Hi&#8230;DOM implementation is very slow while using bulk url&#8217;s&#8230;is it a bug??any solution to solve this???</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sreejith</title>
		<link>http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/comment-page-1/#comment-54202</link>
		<dc:creator>Sreejith</dc:creator>
		<pubDate>Tue, 15 Jun 2010 04:48:58 +0000</pubDate>
		<guid isPermaLink="false">http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/#comment-54202</guid>
		<description>I have tried the program..Great Result...But some it extracts unnecessary contents (comments,wheather etc) from some sites..

http://www.omaha.com/article/20100526/NEWS11/100529757
http://www.billboard.com/features/will-lee-or-crystal-win-idol-stars-weigh-1004093694.story

pls chk the links....</description>
		<content:encoded><![CDATA[<p>I have tried the program..Great Result&#8230;But some it extracts unnecessary contents (comments,wheather etc) from some sites..</p>
<p><a href="http://www.omaha.com/article/20100526/NEWS11/100529757" rel="nofollow">http://www.omaha.com/article/20100526/NEWS11/100529757</a><br />
<a href="http://www.billboard.com/features/will-lee-or-crystal-win-idol-stars-weigh-1004093694.story" rel="nofollow">http://www.billboard.com/features/will-lee-or-crystal-win-idol-stars-weigh-1004093694.story</a></p>
<p>pls chk the links&#8230;.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sreejith</title>
		<link>http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/comment-page-1/#comment-51907</link>
		<dc:creator>Sreejith</dc:creator>
		<pubDate>Wed, 09 Jun 2010 03:43:55 +0000</pubDate>
		<guid isPermaLink="false">http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/#comment-51907</guid>
		<description>Hi...

$utf_spaces = array(&quot;\xC2\xA0&quot;, &quot;\xE1\x9A\x80&quot;, &quot;\xE2\x80\x83&quot;, 
			&quot;\xE2\x80\x82&quot;, &quot;\xE2\x80\x84&quot;, &quot;\xE2\x80\xAF&quot;, &quot;\xA0&quot;);

Pls say me the details for the above coding....</description>
		<content:encoded><![CDATA[<p>Hi&#8230;</p>
<p>$utf_spaces = array(&#8220;\xC2\xA0&#8243;, &#8220;\xE1\x9A\x80&#8243;, &#8220;\xE2\x80\x83&#8243;,<br />
			&#8220;\xE2\x80\x82&#8243;, &#8220;\xE2\x80\x84&#8243;, &#8220;\xE2\x80\xAF&#8221;, &#8220;\xA0&#8243;);</p>
<p>Pls say me the details for the above coding&#8230;.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Web Design Wellington</title>
		<link>http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/comment-page-1/#comment-49675</link>
		<dc:creator>Web Design Wellington</dc:creator>
		<pubDate>Tue, 01 Jun 2010 11:42:18 +0000</pubDate>
		<guid isPermaLink="false">http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/#comment-49675</guid>
		<description>Works like a charm, thanks for posting this it has saved me hours of development time</description>
		<content:encoded><![CDATA[<p>Works like a charm, thanks for posting this it has saved me hours of development time</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: website reviews</title>
		<link>http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/comment-page-1/#comment-33936</link>
		<dc:creator>website reviews</dc:creator>
		<pubDate>Tue, 09 Mar 2010 08:45:32 +0000</pubDate>
		<guid isPermaLink="false">http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/#comment-33936</guid>
		<description>I was just looking for this script to detect parked domain</description>
		<content:encoded><![CDATA[<p>I was just looking for this script to detect parked domain</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: ø Detecting Parked Domains &#124; W-Shadow.com ø</title>
		<link>http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/comment-page-1/#comment-32392</link>
		<dc:creator>ø Detecting Parked Domains &#124; W-Shadow.com ø</dc:creator>
		<pubDate>Fri, 13 Nov 2009 20:14:30 +0000</pubDate>
		<guid isPermaLink="false">http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/#comment-32392</guid>
		<description>[...] a related note, my content extraction script could also be used for detecting is a site is low on actual content. Its algorithm is much simpler [...]</description>
		<content:encoded><![CDATA[<p>[...] a related note, my content extraction script could also be used for detecting is a site is low on actual content. Its algorithm is much simpler [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dohn &#187; links for 2009-07-15</title>
		<link>http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/comment-page-1/#comment-30856</link>
		<dc:creator>Dohn &#187; links for 2009-07-15</dc:creator>
		<pubDate>Wed, 15 Jul 2009 08:33:03 +0000</pubDate>
		<guid isPermaLink="false">http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/#comment-30856</guid>
		<description>[...] ø Extracting The Main Content From a Webpage &#124; W-Shadow.com ø [...]</description>
		<content:encoded><![CDATA[<p>[...] ø Extracting The Main Content From a Webpage | W-Shadow.com ø [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: digiwebtools</title>
		<link>http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/comment-page-1/#comment-30170</link>
		<dc:creator>digiwebtools</dc:creator>
		<pubDate>Tue, 12 May 2009 13:55:30 +0000</pubDate>
		<guid isPermaLink="false">http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/#comment-30170</guid>
		<description>Wow, that&#039;s what i am looking for!
I&#039;ll try and give some feedback here.
Thank you.</description>
		<content:encoded><![CDATA[<p>Wow, that&#8217;s what i am looking for!<br />
I&#8217;ll try and give some feedback here.<br />
Thank you.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: White Shadow</title>
		<link>http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/comment-page-1/#comment-20388</link>
		<dc:creator>White Shadow</dc:creator>
		<pubDate>Wed, 11 Mar 2009 18:29:25 +0000</pubDate>
		<guid isPermaLink="false">http://w-shadow.com/blog/2008/01/25/extracting-the-main-content-from-a-webpage/#comment-20388</guid>
		<description>You could easily do that with a few regular expressions. However, I&#039;m not doing your homework :P</description>
		<content:encoded><![CDATA[<p>You could easily do that with a few regular expressions. However, I&#8217;m not doing your homework :P</p>
]]></content:encoded>
	</item>
</channel>
</rss>

