<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Jorge Alegre Blog &#187; amazon</title>
	<atom:link href="http://www.jorgealegre.com/tag/amazon/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.jorgealegre.com</link>
	<description>Internet Startups, Technology, My Music and Me</description>
	<lastBuildDate>Sat, 19 Jun 2010 09:04:09 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Amazon EC2 y el hosting VPS no mola nada</title>
		<link>http://www.jorgealegre.com/2009/07/amazon-ec2-y-el-hosting-vps-no-mola-nada/</link>
		<comments>http://www.jorgealegre.com/2009/07/amazon-ec2-y-el-hosting-vps-no-mola-nada/#comments</comments>
		<pubDate>Mon, 13 Jul 2009 19:05:22 +0000</pubDate>
		<dc:creator>Jorge Alegre</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[ec2]]></category>

		<guid isPermaLink="false">http://www.jorgealegre.com/es/?p=255</guid>
		<description><![CDATA[
			
				
			
		
Mi enamoramiento con el hosting virtual creo que ha llegado a su fin. La idea es interesante, pero al utilizar los sistemas que tanto Amazon como otros ofrecen terminas preguntándote las razones por las que uno se ha metido en esta movida en vez de un hosting de servidor dedicado o co-location.
No inspiran confianza
A ver, [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.jorgealegre.com%2F2009%2F07%2Famazon-ec2-y-el-hosting-vps-no-mola-nada%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.jorgealegre.com%2F2009%2F07%2Famazon-ec2-y-el-hosting-vps-no-mola-nada%2F&amp;source=jalegre&amp;style=normal" height="61" width="50" /><br />
			</a>
		</div>
<p>Mi enamoramiento con el hosting virtual creo que ha llegado a su fin. La idea es interesante, pero al utilizar los sistemas que tanto Amazon como otros ofrecen terminas preguntándote las razones por las que uno se ha metido en esta movida en vez de un hosting de servidor dedicado o co-location.</p>
<p><strong>No inspiran confianza</strong></p>
<p>A ver, te ofrecen por X precio un servicio que se supone que aprovecha mejor los recursos de un sistema, que estas compartiendo una infraestructura sin malgastar recursos. Pero la realidad es otra, estas pagando un dinero por un nivel de servicio hosting (mucha gente en una máquina) a un precio de servidor dedicado. Cuando las cosas van mal (mala velocidad), te preguntas por qué, por ejemplo, una aplicación funciona 4X peor en una imagen Small de EC2 que en una máquina lenta (Pemtium). Lo peor es que no puedes realizar re-ingenería de procesos cuando las &#8220;performances&#8221; varían tanto.</p>
<p>Amazon debería empezar por hacer que funcionen bien sus servicios virtuales, desde el pequeño hasta el grande. Y luego hablamos de crear más imágenes para escalar. La cuestión es por qué voy a tener que tener 1-5 imagenes pequeñas para hacer lo que un buen servidor dedicado puede hacer. En mi experiencia Amazon debería también cobrar on-demand, pero de verdad, por tiempos de cuellos de botella de sus imágenes, mal I/O, etc&#8230;</p>
<p><strong>¿Necesitas potencia o te apañas con poco?</strong></p>
<p>Esta es la gran cuestión, es el &#8220;To be or not To Be&#8221; de los VPS. Si tienes una infraestructura que puede variar en carga y sabes que no utilizas el 100% de una máquina, Amazon puede valer. También puede valer para ampliar la carga de tu granja de servidores sin tener que pagar servidores &#8220;por si acaso&#8221;, que solo utilizas cuando hipoteticamente haya más carga.</p>
<p>Pero si no estás en un caso u otro, piénsatelo.</p>
<p>En el caso de Buscaplus, no creo que se use Amazon EC2, ya que los procesos necesitan el 100% de las máquinas la mayoría del tiempo y muy pocos procesos están en variable (quizás las búsquedas). Siempre es preferible montar un VPS donde tu controles todo que ir con el EC2, usando las aplicaciones de Open Source que están saliendo en el mercado.</p>
<p>Ahora, si tu Start-Up tiene menos de 5 millones de páginas vistas y poco &#8220;overhead&#8221; en tus sistemas, puedes ir mejor con 2 o 3 imágenes (aunque sean menos potente que un servidor dedicado tipo Dell), y tener redundancia, replicación, etc&#8230;</p>
<p><strong>Competidores</strong></p>
<p>De los nuevos que he visto hoy:</p>
<ul>
<li>vps.net</li>
<li>bytemark.co.uk</li>
</ul>
<p>La forma en la que yo veo esto es que un ISP te ofrezca una infraestructura, decidas sobre tipos de discos, ancho de banda, y luego seas tu quien decida como quieres dividir dicha infraestructura. Para sistemas grandes como el de Buscaplus y muchas Startups esto es una necesidad imperiosa.</p>
<p align="left"><a class="tt" href="http://twitter.com/home/?status=Amazon+EC2+y+el+hosting+VPS+no+mola+nada+http://www.jorgealegre.com/?p=352" title="Post to Twitter"><img class="nothumb" src="http://www.jorgealegre.com/wp-content/plugins/tweet-this/icons/tt-twitter.png" alt="Post to Twitter" /></a> <a class="tt" href="http://twitter.com/home/?status=Amazon+EC2+y+el+hosting+VPS+no+mola+nada+http://www.jorgealegre.com/?p=352" title="Post to Twitter">Tweet This Post</a> <a class="tt" href="http://plurk.com/?status=Amazon+EC2+y+el+hosting+VPS+no+mola+nada+http://www.jorgealegre.com/?p=352" title="Post to Plurk"><img class="nothumb" src="http://www.jorgealegre.com/wp-content/plugins/tweet-this/icons/tt-plurk.png" alt="Post to Plurk" /></a> <a class="tt" href="http://plurk.com/?status=Amazon+EC2+y+el+hosting+VPS+no+mola+nada+http://www.jorgealegre.com/?p=352" title="Post to Plurk">Plurk This Post</a> <a class="tt" href="http://digg.com/submit?url=http://www.jorgealegre.com/2009/07/amazon-ec2-y-el-hosting-vps-no-mola-nada/&amp;title=Amazon+EC2+y+el+hosting+VPS+no+mola+nada" title="Post to Digg"><img class="nothumb" src="http://www.jorgealegre.com/wp-content/plugins/tweet-this/icons/tt-digg.png" alt="Post to Digg" /></a> <a class="tt" href="http://digg.com/submit?url=http://www.jorgealegre.com/2009/07/amazon-ec2-y-el-hosting-vps-no-mola-nada/&amp;title=Amazon+EC2+y+el+hosting+VPS+no+mola+nada" title="Post to Digg">Digg This Post</a> <a class="tt" href="http://ping.fm/ref/?method=microblog&amp;title=Amazon+EC2+y+el+hosting+VPS+no+mola+nada&amp;link=http://www.jorgealegre.com/2009/07/amazon-ec2-y-el-hosting-vps-no-mola-nada/" title="Post to Ping.fm"><img class="nothumb" src="http://www.jorgealegre.com/wp-content/plugins/tweet-this/icons/tt-ping.png" alt="Post to Ping.fm" /></a> <a class="tt" href="http://ping.fm/ref/?method=microblog&amp;title=Amazon+EC2+y+el+hosting+VPS+no+mola+nada&amp;link=http://www.jorgealegre.com/2009/07/amazon-ec2-y-el-hosting-vps-no-mola-nada/" title="Post to Ping.fm">Ping This Post</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.jorgealegre.com/2009/07/amazon-ec2-y-el-hosting-vps-no-mola-nada/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Using Amazon EC2 As Infrastructure For An Internet Search Engine</title>
		<link>http://www.jorgealegre.com/2009/06/using-amazon-ec2-as-infrastructure-for-an-internet-search-engine/</link>
		<comments>http://www.jorgealegre.com/2009/06/using-amazon-ec2-as-infrastructure-for-an-internet-search-engine/#comments</comments>
		<pubDate>Tue, 23 Jun 2009 21:38:26 +0000</pubDate>
		<dc:creator>Jorge Alegre</dc:creator>
				<category><![CDATA[Technology]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[ec2]]></category>

		<guid isPermaLink="false">http://www.jorgealegre.com/?p=111</guid>
		<description><![CDATA[
			
				
			
		

Today I was doing testing on different Amazon EC2 images with bulk writing activities usually performed in my startup Buscaplus, an Internet search engine framework. Currently I have a set of 4 servers with SATA disks and I am planning to move to Amazon.
We use Berkeley DB as index database engine. It is pretty fast, [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.jorgealegre.com%2F2009%2F06%2Fusing-amazon-ec2-as-infrastructure-for-an-internet-search-engine%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.jorgealegre.com%2F2009%2F06%2Fusing-amazon-ec2-as-infrastructure-for-an-internet-search-engine%2F&amp;source=jalegre&amp;style=normal" height="61" width="50" /><br />
			</a>
		</div>
<p><img class="alignnone size-full wp-image-112" title="logo_aws" src="http://www.jorgealegre.com/wp-content/uploads/2009/06/logo_aws.gif" alt="logo_aws" width="164" height="60" /></p>
<p>Today I was doing testing on different Amazon EC2 images with bulk writing activities usually performed in my startup Buscaplus, an Internet search engine framework. Currently I have a set of 4 servers with SATA disks and I am planning to move to Amazon.</p>
<p>We use Berkeley DB as index database engine. It is pretty fast, specially  if you define correctly the memory cache, etc&#8230; In Buscaplus we need to write huge amounts of data to disk and bottlenecks are often found due to the high database requirements for a search engine. So this is crucial if we ever move to amazon, speed of writing stuff to disk. A deployment and cloud design for many instances has not been accomplished but with today´s tests seems clear that Amazon EC2 is an option for Buscaplus.</p>
<p><strong>Tests</strong></p>
<p>Berkeley DB writes data in key-&gt; value sets. You can select BTREE as well as other engines. We use BTREE and a cache spool of 128MB for all tests. Also, we write 100 Bytes for each row of data. The keys are simply a counter with zeros on right, like &#8216;0000000345&#8242;.</p>
<table class="wiki" border="0">
<tbody>
<tr>
<td>Sample</td>
<td>index-1</td>
<td>ec2 small</td>
<td>ec2 large</td>
<td>ec2 ultra large</td>
<td>ec2 medium</td>
<td>ec2 high extra large</td>
</tr>
<tr>
<td>1.000.000</td>
<td>13.35</td>
<td>18.60</td>
<td>9.50</td>
<td>9.50</td>
<td>9.00</td>
<td>7.99</td>
</tr>
<tr>
<td>3.000.000</td>
<td>39.81</td>
<td>44.62</td>
<td>27.47</td>
<td>26.19</td>
<td>26.14</td>
<td>25.90</td>
</tr>
<tr>
<td>20.000.000</td>
<td>&#8211;</td>
<td>&#8211;</td>
<td>&#8211;</td>
<td>&#8211;</td>
<td>&#8211;</td>
<td>Unstable</td>
</tr>
</tbody>
</table>
<p>index-1 shows one of the current servers. I would conclude that the &#8220;medium&#8221; instance is a great option. At only $0.20 / hour has great performance, better than current infrastructure.</p>
<p>I also found that when dealing with a lot of data, small instance of course are a &#8220;no-no&#8221;, but also higher instances with local disks. I noticed that when dealing with high I/O even big instances may do bad if load at that time is high. I found that this is not the case when having EBS. With high I/O and EBS I got great results all the time. So I would go for sure with EBS.</p>
<p>The 20 million rows tests were unstable even with a $0.80 High CPU Extra Large instance. This ended up in a DB table of more than 3GB.<span id="more-111"></span></p>
<p><strong>EBS</strong></p>
<table class="wiki" border="0">
<tbody>
<tr>
<td>Sample</td>
<td>Time</td>
</tr>
<tr>
<td>1.000.000</td>
<td>8.86</td>
</tr>
<tr>
<td>3.000.000</td>
<td>24.49</td>
</tr>
<tr>
<td>5.000.000</td>
<td>41.57</td>
</tr>
<tr>
<td>10.000.000</td>
<td>76.86</td>
</tr>
<tr>
<td>20.000.000</td>
<td>178.82</td>
</tr>
</tbody>
</table>
<p>So EBS does a great job at performance and stability (no load bottlenecks found).</p>
<p>Also was found that disks mounted in / had better performance than those in /mnt and EBS volumes.</p>
<p><strong>Conclusion</strong></p>
<p>I think testing in the cloud could lead to wrong conclusions. The testing data will vary as the load in the server hosting your instance increases or decreases so it is a good option to repeat tests in different time period, like morning and night or something similar.</p>
<p>In my particular case where I need lots of writing and reading I would go with medium instances for the database nodes and probably medium instances for processing nodes. I would stay away with small instances since the I/O it is very bad. Would get small instances only for testing purposes, development servers or early stage beta when load is not high.</p>
<p>I prefer to pay for 4 medium instances ($0.80) than for one $0.80 instance.</p>
<p align="left"><a class="tt" href="http://twitter.com/home/?status=Using+Amazon+EC2+As+Infrastructure+For+An+Internet+Search+Engine+http://www.jorgealegre.com/?p=111" title="Post to Twitter"><img class="nothumb" src="http://www.jorgealegre.com/wp-content/plugins/tweet-this/icons/tt-twitter.png" alt="Post to Twitter" /></a> <a class="tt" href="http://twitter.com/home/?status=Using+Amazon+EC2+As+Infrastructure+For+An+Internet+Search+Engine+http://www.jorgealegre.com/?p=111" title="Post to Twitter">Tweet This Post</a> <a class="tt" href="http://plurk.com/?status=Using+Amazon+EC2+As+Infrastructure+For+An+Internet+Search+Engine+http://www.jorgealegre.com/?p=111" title="Post to Plurk"><img class="nothumb" src="http://www.jorgealegre.com/wp-content/plugins/tweet-this/icons/tt-plurk.png" alt="Post to Plurk" /></a> <a class="tt" href="http://plurk.com/?status=Using+Amazon+EC2+As+Infrastructure+For+An+Internet+Search+Engine+http://www.jorgealegre.com/?p=111" title="Post to Plurk">Plurk This Post</a> <a class="tt" href="http://digg.com/submit?url=http://www.jorgealegre.com/2009/06/using-amazon-ec2-as-infrastructure-for-an-internet-search-engine/&amp;title=Using+Amazon+EC2+As+Infrastructure+For+An+Internet+Search+Engine" title="Post to Digg"><img class="nothumb" src="http://www.jorgealegre.com/wp-content/plugins/tweet-this/icons/tt-digg.png" alt="Post to Digg" /></a> <a class="tt" href="http://digg.com/submit?url=http://www.jorgealegre.com/2009/06/using-amazon-ec2-as-infrastructure-for-an-internet-search-engine/&amp;title=Using+Amazon+EC2+As+Infrastructure+For+An+Internet+Search+Engine" title="Post to Digg">Digg This Post</a> <a class="tt" href="http://ping.fm/ref/?method=microblog&amp;title=Using+Amazon+EC2+As+Infrastructure+For+An+Internet+Search+Engine&amp;link=http://www.jorgealegre.com/2009/06/using-amazon-ec2-as-infrastructure-for-an-internet-search-engine/" title="Post to Ping.fm"><img class="nothumb" src="http://www.jorgealegre.com/wp-content/plugins/tweet-this/icons/tt-ping.png" alt="Post to Ping.fm" /></a> <a class="tt" href="http://ping.fm/ref/?method=microblog&amp;title=Using+Amazon+EC2+As+Infrastructure+For+An+Internet+Search+Engine&amp;link=http://www.jorgealegre.com/2009/06/using-amazon-ec2-as-infrastructure-for-an-internet-search-engine/" title="Post to Ping.fm">Ping This Post</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.jorgealegre.com/2009/06/using-amazon-ec2-as-infrastructure-for-an-internet-search-engine/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
