<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule">

<channel>
	<title>The Tlog - a technology blog &#187; Databases</title>
	<atom:link href="http://www.thetlog.net/category/databases/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.thetlog.net</link>
	<description></description>
	<lastBuildDate>Fri, 27 Nov 2009 15:18:58 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
	<creativeCommons:license>http://creativecommons.org/licenses/by-nc-nd/2.5/pt/</creativeCommons:license>
	<item>
		<title>Adventures with my Technorati ranks &quot;toy&quot;</title>
		<link>http://www.thetlog.net/2007/05/23/adventures-with-my-technorati-ranks-toy/</link>
		<comments>http://www.thetlog.net/2007/05/23/adventures-with-my-technorati-ranks-toy/#comments</comments>
		<pubDate>Wed, 23 May 2007 23:18:10 +0000</pubDate>
		<dc:creator>Pedro Timóteo</dc:creator>
				<category><![CDATA[Databases]]></category>
		<category><![CDATA[PHP]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[mysql]]></category>
		<category><![CDATA[technorati-ranks]]></category>

		<guid isPermaLink="false">http://www.thetlog.net/2007/05/23/adventures-with-my-technorati-ranks-toy/</guid>
		<description><![CDATA[As I mentioned here before, a couple of days ago I coded a program to take an OPML file and generate a table in which the sites listed on that file appear ordered by Technorati ranks. It also shows the number of incoming links (again, from Technorati), and each site&#8217;s PageRank. (By the way: no, [...]


Related posts:<ol><li><a href='http://www.thetlog.net/2007/05/21/my-technorati-ranks-toy/' rel='bookmark' title='Permanent Link: My Technorati ranks &quot;toy&quot;'>My Technorati ranks &quot;toy&quot;</a></li>
<li><a href='http://www.thetlog.net/2007/05/25/more-additions-to-the-pa-top-technorati-ranks-table/' rel='bookmark' title='Permanent Link: More additions to the PA top Technorati ranks table'>More additions to the PA top Technorati ranks table</a></li>
<li><a href='http://www.thetlog.net/2006/03/29/html-css-and-tables/' rel='bookmark' title='Permanent Link: HTML, CSS and tables'>HTML, CSS and tables</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>As I mentioned here <a href="http://www.thetlog.net/2007/05/21/my-technorati-ranks-toy/">before</a>, a couple of days ago I coded a program to take an OPML file and generate a <a href="http://planetatheism.com/technorati/tr-show.php">table</a> in which the sites listed on that file appear ordered by <strong><a href="http://www.technorati.com/">Technorati</a> ranks</strong>. It also shows the number of <strong>incoming links</strong> (again, from Technorati), and each site&#8217;s <strong>PageRank</strong>.</p>
<p><small>(By the way: no, this is not ready for release yet. But it will be. Soon.)</small></p>
<p>Initially, the data-collecting part of my program started by clearing a table in a MySQL database, which it then filled with the values it got from Technorati and Google. However, this had two problems:</p>
<ol>
<li>Technorati allows only a <em>limited number</em> of accesses per day. I discovered this while running several tests: after half a dozen or so, it stopped giving me data. By then, the program had already cleared the table&#8230; so I ended up with an empty one.</li>
<li>From time to time, Technorati gives me &#8220;wrong&#8221; ranks / links for a blog &#8211; values much <em>lower</em> than they should be (not absurd or &#8220;bogus&#8221;, just wrong). It&#8217;s weird and not reproducible, and usually asking Technorati again returns the correct value.</li>
</ol>
<p>To solve the first problem, I obviously needed some way of keeping the data from the previous run while getting the new values, so that, if Technorati told me to get stuffed, I would still have the data from the day before.</p>
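<p>As a rough illustration of that idea (not the actual script, which is PHP with MySQL), here is a minimal Python sketch that keeps the previous run whenever the new fetch fails or comes back empty. The names <code>refresh</code> and <code>fetch</code> are hypothetical, and it assumes that a refused request shows up either as an exception or as an empty result.</p>

```python
from typing import Callable, Dict

# One row of numeric values per site, keyed by the site's URL (illustrative shape).
Row = Dict[str, int]


def refresh(previous: Dict[str, Row],
            fetch: Callable[[], Dict[str, Row]]) -> Dict[str, Row]:
    """Return freshly fetched data, or the previous run's data if the
    fetch fails or returns nothing."""
    try:
        fresh = fetch()
    except Exception:
        # Assumption: a rate-limited or refused request raises an exception.
        return previous
    # An empty result is treated like a failure, so the old data survives.
    return fresh if fresh else previous
```

<p>The point is only that the old data is never discarded before the new data is known to be usable.</p>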
<p>The second problem was a little more complicated, though, in a way, the solution to the first helped me crack it.</p>
<p>My method was this: when running the script, start by copying the original table to another (let&#8217;s call it <code>temp1</code>) and clearing the original table. Then fetch the new data into yet another table (<code>temp2</code>). Afterwards, regenerate the original table with data from <code>temp1</code> and <code>temp2</code>, in the following way:</p>
<ul>
<li>If an entry (identified by the site&#8217;s URL) exists in only <em>one</em> of the tables, use it.</li>
<li>If an entry exists in <em>both</em>, use the common values (URL, site&#8217;s name), and for each of the three numeric values, choose the <b>best</b> value from the two tables. &#8220;Best&#8221; means the highest number of incoming links, the highest PageRank, and the lowest Technorati rank.</li>
</ul>
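<p>The two rules above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the original works on MySQL tables, while here <code>previous</code> and <code>fresh</code> are plain dictionaries standing in for <code>temp1</code> and <code>temp2</code>, and the names <code>SiteStats</code> and <code>merge_runs</code> are made up for the example.</p>

```python
from typing import Dict, NamedTuple


class SiteStats(NamedTuple):
    """Hypothetical row layout: one entry per site, keyed elsewhere by URL."""
    name: str
    technorati_rank: int  # lower is better
    incoming_links: int   # higher is better
    pagerank: int         # higher is better


def merge_runs(previous: Dict[str, SiteStats],
               fresh: Dict[str, SiteStats]) -> Dict[str, SiteStats]:
    """Merge two runs keyed by site URL, keeping the best value of each metric."""
    merged: Dict[str, SiteStats] = {}
    for url in previous.keys() | fresh.keys():
        old = previous.get(url)
        new = fresh.get(url)
        if old is None:
            # Only in the new run: use it as-is.
            merged[url] = new
        elif new is None:
            # Only in the previous run: use it as-is.
            merged[url] = old
        else:
            # In both runs: take the best of each numeric value.
            # Assumption: the site name is the same in both, so either works.
            merged[url] = SiteStats(
                name=new.name,
                technorati_rank=min(old.technorati_rank, new.technorati_rank),
                incoming_links=max(old.incoming_links, new.incoming_links),
                pagerank=max(old.pagerank, new.pagerank),
            )
    return merged
```

<p>For instance, if the previous run has a site at rank 1200 with 85 links and the new run reports rank 95000 with 12 links, the merged entry keeps rank 1200 and 85 links.</p>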
<p>This way, if once in a while Technorati returns a much worse value than it should (I&#8217;ve never seen it rate a blog <i>better</i> than reality), the script still has a more correct value to use instead.</p>
<p>Sounds fine, doesn&#8217;t it? But there&#8217;s a problem with this method&#8230; which I solved later, but which I&#8217;ll discuss in the next post. Until then&#8230; any guesses as to what it was? <img src='http://www.thetlog.net/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<hr/>Copyright &copy; 2010 <strong><a href="http://www.thetlog.net">The Tlog - a technology blog</a></strong> 

]]></content:encoded>
			<wfw:commentRss>http://www.thetlog.net/2007/05/23/adventures-with-my-technorati-ranks-toy/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>
