<rss version="2.0">
	<channel>
		<title>All Things Distributed</title>
		<link>http://www.allthingsdistributed.com</link>
		<description> Werner Vogels' weblog on building scalable and robust distributed systems. </description>
		<language>en</language>
		<copyright>Copyright 2011</copyright>
		<lastBuildDate>Mon, 14 May 2012 19:43:51 EST</lastBuildDate>
		<generator>http://jekyllrb.com</generator>
		<docs>http://www.rssboard.org/rss-specification</docs>
		
	 	<item>
	   		<title>Dynamic Content Support in Amazon CloudFront</title>
	   		<link>http://www.allthingsdistributed.com/2012/05/cloudfront-dynamic-content-support.html</link>
	   		<guid>http://www.allthingsdistributed.com/2012/05/cloudfront-dynamic-content-support.html</guid>
	   		<pubDate>Mon, 14 May 2012 00:01:00 EST</pubDate>
	   		<description>&lt;p&gt;In the past three and a half years, &lt;a href=&quot;http://aws.amazon.com/cloudfront&quot;&gt;Amazon CloudFront&lt;/a&gt; has changed the content delivery landscape. It has demonstrated that a CDN does not have to be complex to use with expensive contracts, minimum commits, or upfront fees, such that you are forcibly locked into a single vendor for a long time. CloudFront is simple, fast and reliable with the usual pay-as-you-go model. With just one click you can enable content to be distributed to the customer with low latency and high-reliability.&lt;/p&gt;

&lt;p&gt;Today Amazon CloudFront has taken another major step forward in ease of use. It now supports delivery of entire websites containing both static objects and dynamic content. With these features CloudFront makes it as simple as possible for customers to use CloudFront to speed up delivery of their entire dynamic website running in Amazon EC2/ELB (or third-party origins), without needing to worry about which URLs should point to CloudFront and which ones should go directly to the origin.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Dynamic Content Support&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Recall that last month the CloudFront team announced lowering the minTTL customers can set on their objects, down to as low as 0 seconds to support delivery of dynamic content.  In addition to the TTLs, customers also need some other features to deliver dynamic websites through CloudFront.  The first set of features that CloudFront is launching today include:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Multiple Origin Servers:&lt;/strong&gt; the ability to specify multiple origin servers, including a default origin, for a CloudFront download distribution. This is useful when customers want to use different origin servers for different types of content. For example, an Amazon S3 bucket can be used as the origin for static objects and an Amazon EC2 instance as the origin for dynamic content, all fronted by the same CloudFront distribution domain name. Of course non-AWS origins are also permitted.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Query String based Caching:&lt;/strong&gt; the ability to include query string parameters as part of the object's cache key. Customers will have a switch to turn query strings 'on' or 'off'. When turned off, CloudFront's behavior will be the same as today - i.e., CloudFront will not pass the query string to the origin server nor include query string parameters as a part of the object's cache key. And when query strings are turned on, CloudFront will pass the full URL (including the query string) to the origin server and also use the full URL to uniquely identify an object in the cache.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;URL based configuration:&lt;/strong&gt; the ability to configure cache behaviors based on URL path patterns.  Each URL path pattern will include a set of cache behaviors associated with it. These cache behaviors include the target origin, a switch for query strings to be on/off, a list of trusted signers for private content, the viewer protocol policy, and the minTTL that CloudFront should apply for that URL path pattern.  See the graphic at the end of this post for an example configuration.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;More new features&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;In addition to these features, there are other things the CloudFront team has achieved to speed up delivery of content, but all customer will get these benefits by default without additional configuration.  These performance optimizations are available for all types of content (static and dynamic) delivered via CloudFront.  Specifically:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Optimal TCP Windows.&lt;/strong&gt; The TCP initcwnd has been increased for all CloudFront hosts to maximize the available bandwidth between the edge and the viewer.  This is in addition to the existing optimizations of routing viewers to the edge location with lowest latency for that user, and also persistent connections with the clients.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Persistent Connection to Origins.&lt;/strong&gt; Connections are improved from CloudFront edge locations to the origins by maintaining long-lived persistent connections.  This helps by reducing the connection set-up time from the edge to the origin for each new viewer.  When the viewer is far away from the origin, this is even more helpful in minimizing total latency between the viewer and the origin.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Selecting the best AWS region for Origin Fetch.&lt;/strong&gt; When customers run their origins in AWS, we expect that our network paths from each CloudFront edge to the various AWS Regions will perform better with less packet loss given that we monitor and optimize these network paths for availability and performance.  In addition, we have shown an optional configuration in the architecture diagram how developers can use Route 53’s LBR (Latency Based Routing) to run their origin servers in different AWS Regions.  Each CloudFront edge location will then go to the “best” AWS Region for the origin fetch.  And Route 53 already understands very well which CloudFront host is in which edge location (this is integration we’ve built between the two services).  This helps improve performance even further.&lt;/p&gt;

&lt;p&gt;Amazon CloudFront is expanding it functionality and feature set at an incredible pace. I am particularly excited about these features that help customers deliver both static and dynamic content through one distribution. CloudFront stays true to its mission in making a Content Delivery Network dead simple to use, and now they also do this for dynamic content.&lt;/p&gt;

&lt;p&gt;For more details, see the &lt;a href=&quot;http://aws.amazon.com/cloudfront&quot;&gt;CloudFront detail page&lt;/a&gt; and the posting on the &lt;a href=&quot;http://aws.typepad.com/aws/2012/05/amazon-cloudfront-support-for-dynamic-content.html&quot;&gt;AWS developer blog&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/images/cloudfront-dynamic.jpg&quot; width=&quot;600&quot;&gt;&lt;/p&gt;
</description>
	 	</item>
	 	
	 	<item>
	   		<title>AWS re: Invent</title>
	   		<link>http://www.allthingsdistributed.com/2012/05/aws-reinvent.html</link>
	   		<guid>http://www.allthingsdistributed.com/2012/05/aws-reinvent.html</guid>
	   		<pubDate>Wed, 09 May 2012 14:00:00 EST</pubDate>
	   		<description>&lt;br/&gt;


&lt;p&gt;&lt;i&gt;
Invention comes in many forms and at many scales. The most radical and transformative of inventions are often those that empower others to unleash their creativity – to pursue their dreams.&lt;/p&gt;

&lt;p&gt;We are creating powerful self-service platforms that allow thousands of people to boldly experiment and accomplish things that would otherwise be impossible or impractical.&lt;/p&gt;

&lt;p align=right&gt;
Jeff Bezos
&lt;br/&gt;&lt;a href=http://sec.gov/Archives/edgar/data/1018724/000119312512161812/d329990dex991.htm&gt;2011 Letter to Shareholders&lt;/a&gt;
&lt;/p&gt;


&lt;p&gt;&lt;/i&gt;&lt;/p&gt;

&lt;br/&gt;


&lt;br/&gt;


&lt;p&gt;Hundreds of thousands of businesses in over 190 countries are relying on AWS to support some or all of their business and IT needs. From SAP and Oracle production installations to risk management HPC, from Internet banking to protein folding, from social gaming to mobile collaboration tools, from video distribution to hotel reservations systems, supply chain management and medical archiving, the list of what our customers achieve using the AWS cloud is fascinating.  Many of our customers will tell you that although the cost savings that AWS brings them are important, more important is that they are able to be more agile, that they are able to move faster in a world with murderous competition and highly compressed time-to-markets. They will tell you that because of the new resource models they are able to reinvent the way they are doing business and have entered into a new world where they are no longer constrained by access to resources and can focus their best people on adding value to the customer instead of managing infrastructure.&lt;/p&gt;

&lt;p&gt;We want to help businesses to continue to reinvent themselves by sharing our experiences, sharing our great customer and partner’s stories, and to present our vision for the future. &lt;a href=&quot;http://reinvent.awsevents.com/&quot;&gt;AWS re: Invent&lt;/a&gt; is our inaugural global customer and partner conference that will be held November 27-29 at The Venetian in Las Vegas. Well over 100 sessions from deep technical workshops to strategy sessions for executives, there will high quality content for everyone. And enough food for thought for everyone about how to continue to reinvent...&lt;/p&gt;

&lt;p&gt;Save the date, and watch &lt;a href=&quot;http://reinvent.awsevents.com/&quot;&gt;this page&lt;/a&gt; for more details.&lt;/p&gt;

&lt;center&gt;
    &lt;a href=&quot;http://reinvent.awsevents.com/&quot;&gt;&lt;img src=&quot;/images/reinventlogo.png&quot; style=&quot;margin: 40px 40px 40px 40px;&quot;&gt;&lt;/a&gt;
&lt;/center&gt;

</description>
	 	</item>
	 	
	 	<item>
	   		<title>Expanding the Cloud for Windows Developers</title>
	   		<link>http://www.allthingsdistributed.com/2012/05/aws-rds-sql-server.html</link>
	   		<guid>http://www.allthingsdistributed.com/2012/05/aws-rds-sql-server.html</guid>
	   		<pubDate>Tue, 08 May 2012 14:00:00 EST</pubDate>
	   		<description>&lt;p&gt;The software that powers today’s world of Internet services has become incredibly diverse. Today’s &lt;a href=&quot;http://aws.typepad.com/aws/2012/05/net-support-for-aws-elastic-beanstalk-amazon-rds-for-sql-server-.html&quot;&gt;announcement&lt;/a&gt; of &lt;a href=&quot;http://aws.amazon.com/rds/sqlserver&quot;&gt;Amazon RDS for Microsoft SQL Server&lt;/a&gt; and &lt;a href=&quot;http://aws.amazon.com/elasticbeanstalk&quot;&gt;.NET support for AWS Elastic Beanstalk&lt;/a&gt; marks another important step in our commitment to increase the flexibility for AWS customers to use the choice of operating system, programming language, development tools and database software that meet their application requirements.&lt;/p&gt;

&lt;p&gt;Using the &lt;a href=&quot;http://aws.amazon.com/visualstudio/&quot;&gt;AWS Toolkit for Visual Studio&lt;/a&gt;, you can now deploy your .NET applications to AWS Elastic Beanstalk directly from your Visual Studio environment without changing any code. You can then off load the management and scaling of your database and application stack to &lt;a href=&quot;http://aws.amazon.com/rds/sqlserver&quot;&gt;Amazon RDS&lt;/a&gt; and &lt;a href=&quot;http://aws.amazon.com/elasticbeanstalk&quot;&gt;AWS Elastic Beanstalk&lt;/a&gt;, and focus on adding value to your customers.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Amazon RDS for SQL Server&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Managing databases has been a stumbling block for many of our customers, shifting their time away from developing innovative applications to the “muck” of administrative tasks such as OS and database software patching, storage management, and implementing reliable backup and disaster recovery solutions. Amazon RDS manages all these time consuming database administration tasks including patch management, striping the storage for better performance, and database and log backups for disaster recovery, enabling developers to focus more on their applications.&lt;/p&gt;

&lt;p&gt;Since we launched Amazon RDS for MySQL in &lt;a href=&quot;http://www.allthingsdistributed.com/2009/10/amazon_relational_database_service.html&quot;&gt;October 2009&lt;/a&gt;, it has become one of the most popular services on AWS, with customers such as &lt;a href=&quot;http://www.allthingsdistributed.com/2012/04/customer-conversations-amazon-rds.html&quot;&gt;Intuit&lt;/a&gt; using the service to keep up with the steep increase in traffic during the tax season. We introduced Amazon RDS for Oracle &lt;a href=&quot;http://aws.typepad.com/aws/2011/05/now-available-amazon-rds-for-oracle-db.html&quot;&gt;last year&lt;/a&gt;, and based on the demand from our Windows customers, are introducing Amazon RDS for SQL Server today. Amazon RDS currently supports SQL Server 2008 R2 and plans to add support for SQL Server 2012 later this year.&lt;/p&gt;

&lt;p&gt;Depending on your requirements, you can choose from four different SQL Server Editions: Express, Web, Standard and Enterprise to run on Amazon RDS. If you are a new Amazon RDS customer, you can get started with Amazon RDS for SQL Server with a &lt;a href=&quot;http://aws.amazon.com/rds/sqlserver/free&quot;&gt;Free Usage Tier&lt;/a&gt;, which includes 750 hours per month of Amazon RDS micro instances with SQL Server Express Edition, 20GB of database storage and 10 million I/O requests per month.&lt;/p&gt;

&lt;p&gt;After the Free Usage Tier, you can run Amazon RDS for SQL Server under two different licensing models - &quot;License Included&quot; and &lt;a href=&quot;http://aws.amazon.com/windows/mslicensemobility/&quot;&gt;Microsoft License Mobility&lt;/a&gt;.  Under the License Included service model, you do not need to purchase SQL Server software licenses. “License Included” pricing starts at $0.035/hour and is inclusive of SQL Server software, hardware, and Amazon RDS management capabilities. The Microsoft License Mobility program allows customers who already own SQL Server licenses to run SQL Server deployments on Amazon RDS. This benefit is available to Microsoft Volume Licensing customers with SQL Server licenses covered by active Microsoft Software Assurance contracts. The Microsoft License Mobility program is suited for customers who prefer to use existing SQL Server licenses or purchase new licenses directly from Microsoft.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;.NET support for Elastic Beanstalk&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;In our effort to let &lt;a href=&quot;http://www.allthingsdistributed.com/2012/03/a-thousand-platforms.html&quot;&gt;a thousand platforms bloom&lt;/a&gt; on AWS, I am excited to introduce .NET support for AWS Elastic Beanstalk. Elastic Beanstalk gives developers an easy way to quickly build and manage their Java, PHP and as of today, their .NET applications in the AWS cloud. As discussed here, Elastic Beanstalk targets both application developers by providing a simple set of tools to get started with development quickly and the platform developers by giving control over the underlying technology stack. Developers simply upload their application and Elastic Beanstalk automatically creates the AWS resources and application stack needed to run the application, freeing developers from worrying about server capacity, load balancing, scaling their application, and version control.&lt;/p&gt;

&lt;p&gt;Using the &lt;a href=&quot;http://aws.amazon.com/visualstudio/&quot;&gt;AWS Toolkit for Visual Studio&lt;/a&gt;, developers can now deploy their .NET applications directly to Elastic Beanstalk, without leaving their development environment. The incremental deployment capabilities allow for quick development and testing cycles by only uploading modified files. Within seconds, new application versions get updated on a set of Amazon EC2 instances.&lt;/p&gt;

&lt;p&gt;To get started with Amazon RDS for SQL Server and AWS Elastic Beanstalk, visit  &lt;a href=&quot;http://aws.amazon.com/rds/sqlserver&quot;&gt;http://aws.amazon.com/rds/sqlserver&lt;/a&gt; and &lt;a href=&quot;http://aws.amazon.com/elasticbeanstalk&quot;&gt;http://aws.amazon.com/elasticbeanstalk&lt;/a&gt;. For a hands-on demo on how to deploy .NET applications on Elastic Beanstalk with Amazon RDS for SQL Server, visit the &lt;a href=&quot;http://docs.amazonwebservices.com/elasticbeanstalk/latest/dg/create_deploy_NET.quickstart.html&quot;&gt;AWS Elastic Beanstalk Developer Guide&lt;/a&gt;.&lt;/p&gt;
</description>
	 	</item>
	 	
	 	<item>
	   		<title>Do You Want to Help Build the Next AWS Service?</title>
	   		<link>http://www.allthingsdistributed.com/2012/04/help-build-aws.html</link>
	   		<guid>http://www.allthingsdistributed.com/2012/04/help-build-aws.html</guid>
	   		<pubDate>Thu, 19 Apr 2012 02:00:00 EST</pubDate>
	   		<description>&lt;p&gt;Over the past several years I’ve spent much of my time traveling around the world speaking about distributed systems.  From building infinitely scalable data stores, architectures for high performance computing, to the challenges imposed by the CAP theorem, there are wonderful, complex, fascinating problems to be solved in the area of distributed computing. During my travels I’ve met thousands of brilliant engineers who are leveraging the cloud to deliver exciting new products and revolutionize IT as we know it. One thing that’s become obvious to me is that there are innovative, inspiring developers in every corner of the planet from Australia to Iceland and from Israel to Peru.&lt;/p&gt;

&lt;p&gt;And that leads me to another distributed problem – finding good engineers to help AWS build the next generation of cloud computing services.  We’ve got a big vision and to realize it we need to find qualified engineers to join us on our journey.  A quick look at the AWS career web sites reveals that we are hiring hundreds of people around the world.&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://us-amazon.icims.com/jobs/search?ss=1&amp;amp;in_iframe=1&amp;amp;searchKeyword=%22http%3A%2F%2Faws.amazon.com%22&quot;&gt;Click here for our current job openings in the U.S.&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://uk-amazon.icims.com/jobs/search?ss=1&amp;amp;in_iframe=1&amp;amp;searchKeyword=aws&amp;amp;searchLocation=&amp;amp;searchCategory=&quot;&gt;Click here for our current job openings in Europe, Asia, and South Africa&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Distributed problems call for innovative solutions. So next month we will be taking a distributed approach to finding engineers who want to join AWS. On May 17th and 18th we will be traveling to Houston, Minneapolis, and Nashville to interview candidates who want to join the AWS team.  If you live in or near one of those cities and are interested in a meeting with us about careers in AWS check out &lt;a href=&quot;http://aws.amazon.com/careers/local-events/&quot;&gt;this page&lt;/a&gt;.  You can also simply email your resume to aws-recruiting@amazon.com&lt;/p&gt;
</description>
	 	</item>
	 	
	 	<item>
	   		<title>Expanding the Cloud – Introducing AWS Marketplace</title>
	   		<link>http://www.allthingsdistributed.com/2012/04/AWS-Marketplace.html</link>
	   		<guid>http://www.allthingsdistributed.com/2012/04/AWS-Marketplace.html</guid>
	   		<pubDate>Thu, 19 Apr 2012 02:00:00 EST</pubDate>
	   		<description>&lt;p&gt;Today Amazon Web Services launched &lt;a href=&quot;http://aws.amazon.com/marketplace/ref=mkt_blg_wv&quot;&gt;AWS Marketplace&lt;/a&gt;, an online store that makes it easy for you to find, buy, and immediately start using software and services that run on the AWS Cloud.  You can use AWS Marketplace’s 1-Click deployment to quickly launch pre-configured software on your own Amazon EC2 instances and pay only for what you use, by the hour or month.  AWS handles billing and payments, and software charges appear on your AWS bill.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/images/marketplace.png&quot; width=&quot;151&quot; height=&quot;646&quot; style=&quot;float: right; margin: 0 0 20px 20px;&quot; /&gt;&lt;/p&gt;

&lt;p&gt;Marketplace has software listings from well-known vendors including 10gen, CA, Canonical, Couchbase, Check Point Software, IBM, Microsoft, SAP, Zend, and others, as well as many widely used open source offerings including Wordpress, Drupal, and MediaWiki.&lt;/p&gt;

&lt;p&gt;AWS Marketplace brings the same simple and trusted online shopping experience that customers enjoy on Amazon.com to software built for the AWS platform, streamlining the process of doing research and purchasing software.
It features a wide selection of development and business software, including software infrastructure, developer tools, and business applications. Product prices are clearly stated and appear on the same bill as your other AWS services.&lt;/p&gt;

&lt;p&gt;AWS Marketplace also simplifies many of the challenges software companies face, such as acquiring customers, developing distribution channels, and billing for their software.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Why shop here?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The way businesses are buying applications is changing. There is a new generation of leaders that have very different expectations about how they can select the products and tools they need to be successful.  Last week I met with a CIO for a discussion about how her IT department can use AWS to help make their business units be more agile and move faster. One of the stumbling blocks she mentioned was how to select the best software running on AWS, in a way that was completely in line with the “Cloud Experience”: no software to install, no sales cycle, no procurement delays, and a selection of licensing models to choose from. She jokingly asked for an “Amazon 1-Click” experience for software. I am sure she will be a very happy CIO today.&lt;/p&gt;

&lt;p&gt;AWS Marketplace features a wide selection of commercial and free IT and business software. AWS Marketplace enables you to compare options, read reviews, and quickly find the software you want.&lt;/p&gt;

&lt;p&gt;We wanted to shrink the time between finding what you want and getting it up and running. Once you find software you like, you can deploy that software to your own EC2 instance with 1-Click -- like the CIO suggested -- or using popular management tools like the AWS Console.&lt;/p&gt;

&lt;p&gt;In addition, for most products, software prices are clearly posted on the website so you can purchase software immediately, with the payment instrument you already have on file with Amazon Web Services. Software charges appear on the same monthly bill as your AWS infrastructure charges.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Why sell here?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The Amazon Web Services have helped to create great ecosystem of ISVs that are selling software and services to other customers running in the cloud.  It has had a true democratization effect: no longer does the dominant vendor in a market automatically get chosen. I have many IT decision makers ask me who are the young and exciting companies they should be paying attention to. Who are the companies that have a native cloud product, who are the ones that have innovative new licensing models, who are the young and hungry companies that break with the old style of enterprise software vending and are truly customer-centric.  At the same time the up-and-coming companies often ask me how we can help them get in front of more customers such that they can compete in an open and honest way. And they also often ask whether we can help them with what Amazon.com does so well for its sellers: handle billing and charging.&lt;/p&gt;

&lt;p&gt;AWS Marketplace includes both large, well known companies as well as exciting up and coming companies.  If you’re a software provider with an offering that runs on the AWS cloud, you can gain new customers, enable usage-based billing without much additional work, and ensure that customers have a fast and easy deployment experience with their software.&lt;/p&gt;

&lt;p&gt;AWS Marketplace helps software and SaaS providers find new customers by exposing their products to some of the hundreds of thousands of AWS customers, ranging from individual software developers to large enterprises.&lt;/p&gt;

&lt;p&gt;Additionally, if you are interested in adding hourly billing to your software, AWS Marketplace can help. Simply upload an Amazon Machine Image to AWS and provide the hourly cost. Billing is managed by AWS Marketplace, relieving sellers of the responsibility of metering usage, managing customer accounts, and processing payments, leaving software developers more time to focus on building great software.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Summary&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;At Amazon we have a long experience with buyers and sellers in a marketplace. We know that something great happens when you solve problems for both the people selling things and those buying things – the market becomes more and more vibrant. We know that for buyers it is important to have very convenient ways of discovering and buying products. For sellers it is important to get their products in front of as many relevant customers as possible and make the sales process as painless as possible.&lt;/p&gt;

&lt;p&gt;But more important that anything else for both parties is trust: easy to understand product information, high quality, relevant reviews by other customers, that the seller is reputable and has a history of delivery, and that the buyer will only be charged for his exact usage. For the seller it removes the burden of having to manage customers, measuring their usage and collecting payments for it.&lt;/p&gt;

&lt;p&gt;The AWS Marketplace is a great step forward in making easier to buy and deploy software. It also makes it dead simple for ISVs for add hourly billing to their offerings and get their software in from of hundreds of thousands of active AWS customers.&lt;/p&gt;

&lt;p&gt;For more information see the announcement at the &lt;a href=&quot;http://aws.amazon.com/about-aws/whats-new/2012/04/19/introducing-aws-marketplace/&quot;&gt;AWS website&lt;/a&gt;, the &lt;a href=&quot;http://www.youtube.com/watch?v=_fDmapRd_qU&quot;&gt;&quot;Introducing AWS Marketplace&quot; video&lt;/a&gt;, the posting on the &lt;a href=&quot;http://aws.typepad.com/aws/2012/04/the-aws-marketplace-find-compare-and-launch-cloud-software.html&quot;&gt;AWS blog&lt;/a&gt; and off course visit the &lt;a href=&quot;http://aws.amazon.com/marketplace/ref=mkt_blg_wv&quot;&gt;AWS Marketplace&lt;/a&gt; for a test drive.  Happy shopping!&lt;/p&gt;
</description>
	 	</item>
	 	
	 	<item>
	   		<title>Expanding the Cloud – Introducing Amazon CloudSearch</title>
	   		<link>http://www.allthingsdistributed.com/2012/04/amazon-cloudsearch.html</link>
	   		<guid>http://www.allthingsdistributed.com/2012/04/amazon-cloudsearch.html</guid>
	   		<pubDate>Thu, 12 Apr 2012 00:00:00 EST</pubDate>
	   		<description>&lt;p&gt;Today Amazon Web Services is introducing &lt;a href=&quot;http://aws.amazon.com/cloudsearch&quot;&gt;Amazon CloudSearch&lt;/a&gt;, a new web service that brings the power of the Amazon.com’s search technology to every developer. Amazon CloudSearch provides a fully-featured search engine that is easy to manage and scale.  It offers full-text search with features like faceting and user-defined rank functions.  And like most AWS services, Amazon CloudSearch scales automatically as your data and traffic grow, making it an easy choice for applications small to large. With Amazon CloudSearch, developers just create a Search Domain, upload data, and start querying.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Why Search?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Search is an essential part of many of today's cloud-centric applications.  While in our daily lives we are mostly familiar with the search functionality offered by web search, there are in fact many more cases where search is a fundamental component of an application.  Search is a much broader technology than just the indexing of large collections of web pages.  Many organizations have large collections of documents, structured and unstructured, that can benefit from  a specialized search service.  With the rise of the App developer culture there is an increasing number of consumer data sources that cannot be simply queried with a web search engine. Using specialized ranking functions these apps can give their customers a highly specialized search experience.&lt;/p&gt;

&lt;p&gt;And increasingly, search is applied to data that, though called a &quot;document&quot; for the purposes of search, is really just a record in a database or an object in a NoSQL system.  On the query side, we are used to seeing search results as users, but search results are increasingly being used at the core of complex distributed systems where the results are consumed by machines, not people.&lt;/p&gt;

&lt;p&gt;With these applications in mind, our customers have told us that a cloud-based managed search service is high on their wish lists.  Their main motivation is that existing search technologies, both commercial and open source, have proven to be hard to manage and complex to configure.&lt;/p&gt;

&lt;p&gt;Amazon CloudSearch will have democratization effect as it offers features that have been out of reach for many customers. With Amazon CloudSearch, a powerful search engine is now in the hands of every developer, at our familiar low prices, using a pay-as-you-go model. It will allow developers to improve functionality of their products, at lower costs with almost zero administration. It is very simple to get started; customers can create a Search Domain, upload their documents, and can immediately start querying.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;How it Works&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Developers set up a Search Domain -- a set of resources in AWS that will serve as the home for one collection of data.  Developers then access their domain through two HTTP-based endpoints: a document upload endpoint and a query endpoint.  As developers send documents to the upload endpoint they are quickly incorporated into the searchable index and become searchable.&lt;/p&gt;

&lt;p&gt;Developers can upload data either through the AWS console, from the command-line tools, or by sending their own HTTP POST requests to the upload endpoint.&lt;/p&gt;

&lt;p&gt;There are three features that make it easy to configure and customize the search results to meet exactly the needs of the application.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Filtering:&lt;/strong&gt; Conceptually, this is using a match in a document field to restrict the match set. For example, if documents have a &quot;color&quot; field, you can filter the matches for the color &quot;red&quot;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Ranking:&lt;/strong&gt; Search has at least two major phases: matching and ranking. The query specifies which documents match, generating a match set. After that, scores are computed (or direct sort criterion is applied) for each of the matching documents to rank them best to worst. Amazon CloudSearch provides the ability to have customized ranking functions to fine tune the search results.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Faceting:&lt;/strong&gt; Faceting allows you to categorize your search results into refinements on which the user can further search. For example, a user might search for ‘umbrellas’, and facets allow you to group the results by price, such as $0-$10, $10-$20, $20-$40, etc. Amazon CloudSearch also allows for result counts to be included in facets, so that each refinement has a count of the number of documents in that group. The example could then be: $0-$10 (4 items), $10-$20 (123 items), $20-$40 (57 items), etc.&lt;/p&gt;

&lt;p&gt;For more information on the different configuration possibilities visit the &lt;a href=&quot;http://aws.amazon.com/cloudsearch&quot;&gt;Amazon CloudSearch detail page&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Automatic Scaling&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Amazon CloudSearch is itself built on AWS, which enables it to handle scale.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/images/cloudsearch.png&quot; width=&quot;600&quot;&gt;&lt;/p&gt;

&lt;p&gt;Amazon CloudSearch supports both horizontal and vertical scaling. The main search index is kept in memory to ensure that requests can be served at very high rates.  As developers add data, CloudSearch increases either the size of your underlying node or it increases the number of nodes in the cluster.  To handle growing request rates, the service autoscales the number of instances handling queries.&lt;/p&gt;

&lt;p&gt;Amazon CloudSearch is based on more than a decade of developing high quality search technologies for Amazon.com. It has been developed by A9, the Amazon.com subsidiary that focuses on search technologies. The technology that is used at all the different places where you can search on Amazon.com is also at the core of at Amazon CloudSearch.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Summary&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;With the launch of Amazon CloudSearch the Amazon Web Services remove yet another pain point for developers. Almost every application these days needs some form of search and as such every developer has to spend significant time implementing it. With Amazon CloudSearch developers can now simply focus on their application and leave the management of search to the cloud.&lt;/p&gt;

&lt;p&gt;For more information see the &lt;a href=&quot;http://aws.amazon.com/cloudsearch&quot;&gt;Amazon CloudSearch detail pages&lt;/a&gt;, the &lt;a href=&quot;http://docs.amazonwebservices.com/cloudsearch/latest/developerguide/SvcIntro.html?r=1756&quot;&gt;Amazon CloudSearch Developer Guide&lt;/a&gt; and the &lt;a href=&quot;http://aws.typepad.com/aws/2012/04/amazon-cloudsearch-start-searching-in-one-hour.html&quot;&gt;posting on the AWS developer blog&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;You can sign up for the &lt;a href=&quot;https://www2.gotomeeting.com/register/699095922&quot;&gt;Introduction To Amazon CloudSearch webinar&lt;/a&gt; on May 10.&lt;/p&gt;
</description>
	 	</item>
	 	
	 	<item>
	   		<title>Customer Conversations - How Intuit and Edmodo Innovate using Amazon RDS</title>
	   		<link>http://www.allthingsdistributed.com/2012/04/customer-conversations-amazon-rds.html</link>
	   		<guid>http://www.allthingsdistributed.com/2012/04/customer-conversations-amazon-rds.html</guid>
	   		<pubDate>Fri, 06 Apr 2012 06:00:00 EST</pubDate>
	   		<description>&lt;p&gt;&lt;b&gt;From tax preparation to safe social networks, Amazon
RDS brings new and innovative applications to the cloud&lt;/b&gt;&lt;/p&gt;

&lt;p&gt;Empowering
innovation is at the heart of everything we do at Amazon Web Services (AWS). I often
get to meet, discuss, and learn from innovators how they are using AWS to
deliver transformative applications to their users, customers and partners. Often
we think about innovation as doing 'new things' or based on revolutionary new
technologies such as DynamoDB, but it is more important to ensure that one can
also innovate based on existing paradigms. One of the services that is very
successful in driving innovation at our customers in this context is &lt;a
href=&quot;http://aws.amazon.com/rds&quot;&gt;Amazon RDS&lt;/a&gt;, the Relational Database Service. Amazon RDS removes
the headaches of running a relational database service reliably at scale,
allowing Amazon RDS customers to focus on innovation for their customers.&lt;/p&gt;

&lt;p&gt;Recently
I had great conversations with Troy Otillio, Senior Development Manager at &lt;a
href=&quot;http://community.intuit.com/&quot;&gt;Intuit&lt;/a&gt; and Jack Murgia, Senior DevOps Engineer at &lt;a
href=&quot;http://www.edmodo.com&quot;&gt;Edmodo&lt;/a&gt;. Troy and his team have added a contextual
social offering to the popular TurboTax and Intuit applications. Jack and his
engineers have created a safe social app for teachers and students. These
innovators use Amazon RDS in conjunction with other Amazon Web Services to
build, scale and operate their applications. Below is my dialog with
them. Read on.&lt;/p&gt;

&lt;p&gt;&lt;i&gt;Note:
  If you want to see how Amazon RDS can enable your creative agenda, sign up
  for the 60 day &lt;a href=&quot;http://aws.amazon.com/rds/free-trial&quot;&gt;free trial&lt;/a&gt;.
&lt;/i&gt;&lt;/p&gt;

&lt;p&gt;&lt;i&gt;&lt;b&gt;Troy, Jack, Tell me a little bit about your app. What's unique and
innovative about your service?&lt;/b&gt;&lt;/i&gt;&lt;/p&gt;

&lt;p&gt;&lt;b&gt;Troy:&lt;/b&gt; &lt;a href=&quot;http://community.intuit.com/&quot;&gt;Live Community&lt;/a&gt;
Platform is Intuit's flagship Contextual Social offering &amp;#8211; Live Community
makes it easy to find answers when and where you need them. This is
a unique and innovative platform.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt; Intelligent Social network - Facilitate topical
Q&amp;amp;A conversations among employees, customers and our most valued super
contributors. &lt;/li&gt;

&lt;li&gt; Large Seasonal Peaks &amp;#8211; Our largest community
supports TurboTax where the peak traffic during February or April is often
100's of times greater than a quiet day in June. Live Community
Experience is deeply integrated into the tax experience, so we built a highly
responsive and reliable web experience.&lt;/li&gt;

&lt;li&gt; Read-your-mind contextual integration &amp;#8211; Our core
innovation and underlying secret sauce involves selecting the most relevant
content for a given page if not given user &amp;#8211; to provide the right answer
at the right time to our users.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;b&gt;Jack&lt;/b&gt;: &lt;a href=&quot;http://www.edmodo.com&quot;&gt;Edmodo&lt;/a&gt; is the safe
social network for education used by a network of over 6 million teachers and
students worldwide that allows teachers to create and maintain their classroom
communities. Some unique and innovative characteristics of Edmodo are:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
Edmodo is as easy to use as other
social network sites, but secure - the teacher has the same control over
access, content and behavior in Edmodo as he/she does in the classroom.&lt;/li&gt;

&lt;li&gt;
Students gain experience they need
for the modern workplace, learning how to work responsibly and
effectively in a collaborative, project-based manner on a social website.&lt;/li&gt;

&lt;li&gt;
Teachers can use Edmodo to share
educational content, manage projects and assignments, handle notifications, and
conduct quizzes and events.&lt;/li&gt;

&lt;li&gt;
Teachers can interact with their
colleagues in professional learning networks.&lt;/li&gt;

&lt;li&gt;
Schools and districts can claim
unique Edmodo web addresses for added communication and
customization.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;b&gt;&lt;i&gt;How are your users adopting and responding to your service? &lt;/i&gt;&lt;/b&gt;&lt;/p&gt;

&lt;p&gt;&lt;b&gt;Troy:&lt;/b&gt; We moved our service from internal servers to
AWS. Our 25+ million strong TurboTax and Intuit user community
grows every year and Live Community is an integral component of the overall
product experience. Moving to AWS has enabled us with operational agility to deliver
more value to those customers without having to worry about scale and
infrastructure maintenance. We now have more time to focus on
innovation while being confident that when demand increases we can easily add
more capacity.&lt;/p&gt;

&lt;p&gt;&lt;b&gt;Jack:&lt;/b&gt;Since our launch in late 2008, we've
grown to over 6 million teachers and students globally primarily through word
of mouth of teachers who have shared Edmodo with each other. In addition to
using Edmodo to engage students in classroom activities, teachers all over the world
build profile pages on Edmodo, which they use to discover and share content,
meet and stay in contact with other educators, and best practices and top
resources.&lt;/p&gt;

&lt;p&gt;&lt;b&gt;&lt;i&gt;Jack, how did this idea come about? How did you choose a SQL approach
to solve this problem? &lt;/i&gt;&lt;/b&gt;&lt;/p&gt;

&lt;p&gt;&lt;b&gt;Jack:&lt;/b&gt; After years of seeing teachers
struggle to share the web with their classroom, Edmodo founders Nic Borg and
Jeff O'Hara knew there was a need for a highly scalable, secure social network
targeted at K-12. SQL was the right choice because it was an
established and proven technology for use in similar environments, and the
massive knowledge base that exists around it.&lt;/p&gt;

&lt;p&gt;&lt;b&gt;&lt;i&gt;And how about you Troy? Why did you choose a SQL approach to build your
social community app?&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;

&lt;p&gt;&lt;b&gt;Troy:&lt;/b&gt; The initial architecture was based on MySQL&amp;#8211;
we've continued with use of SQL but are now leveraging RDS. Of course,
with as much textual data as we have we are leveraging Lucene/SOLR (a NoSQL
solution) for Search and Semantic processing. More recently we've
expanded our platform to include additional forms of user interaction
observation in support of our real-time analytics &amp;#8211; here we've begun to
leverage NoSQL technologies like Redis. Going forward we'll
continue to employ a hybrid approach using RDS for the necessary transactional
computation and services like &lt;a href=&quot;http://aws.amazon.com/dynamodb&quot;&gt;DynamoDB&lt;/a&gt; for
high performance and scalability for structured data.&lt;/p&gt;

&lt;p&gt;&lt;b&gt;&lt;i&gt;What did you find unique about RDS? What has been your experience so
far?&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;

&lt;p&gt;&lt;b&gt;Troy:&lt;/b&gt; We love RDS &amp;#8211; it's reduced our operational
workload by a noticeable factor but even more exciting is the benefits around fast
recovery enabled by the Multi-Availability Zone capability. My team
often brags about the one-click creation of read replicas, ability to upsize or
downsize the database without downtime and automatic back-up.
However, the shining moment occurred just last month &amp;#8211; during peak load
there was a hardware failure on the Server powering a RDS Master Database
&amp;#8211; RDS automatically failed over to the alternate zone within minutes and
our customers experience was fully functional shortly thereafter.
The best part was that the entire process was what I call Òhands freeÓ and took
near zero development effort. With self-hosted databases we
would have invested considerable engineering effort to implement, test and
retest failover &amp;#8211; to achieve fast recovery with RDS we simply changed our
configuration. And when the actual production event took place the
recovery required no manual intervention &amp;#8211; the response from our CTO
after hearing what happened: &amp;quot;that's cool&amp;quot;.&lt;/p&gt;

&lt;p&gt;We
encountered a few situations that required help from the Amazon team &amp;#8211;
for example, we didn't know that the I/O capacity of the Server is governed by
the size of storage and size of the server. When we first attempted
to load our production database it took 28 hours &amp;#8211; after a few days of
attempts to reduce the load time through well-known optimizations (&lt;a
href=&quot;http://aws.amazon.com/rds/faqs/#128&quot;&gt;mostly
documented on the RDS website&lt;/a&gt;) we
were stuck at 8 hours. We consulted directly with Amazon and
learned that the storage and the DB Server size affected I/O throughput &amp;#8211;
after altering our size we dropped our load time to 1 hour which was within
expectations relative to native database.&lt;/p&gt;

&lt;p&gt;&lt;b&gt;Jack:&lt;/b&gt; Based on our experience during this period of
phenomenal growth while our team productivity is stretched to the max, we see
that:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;RDS is a huge time-saver&lt;/li&gt;
&lt;li&gt;RDS provides peace of mind about our data&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Anything that saves time
and simplifies processes for employees of a young startup has a
positive affect that CAN NOT be overstated. The peace of mind part needs no
explanation. Nobody on our team regrets moving to RDS MySQL - quite the opposite;
we all agree we don't want to think about where we would have been without RDS.
We have been able to meet our goal of architecting our application for 0%
&amp;quot;maintenance downtime&amp;quot;&lt;/p&gt;

&lt;p&gt;Out of the box, RDS'
CloudWatch data and graphs speed up the troubleshooting process.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Complete certainty in a DB
environment is VERY unique- we never worry that:&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;our DB parameters are identical
across replicas, and changes propagate at a time of my choosing&lt;/li&gt;
&lt;li&gt;the recoverability of data&lt;/li&gt;
&lt;/ul&gt;
&lt;li&gt;Lack of root access to the
underlying server OS was unique &amp;#8211; but we have found it does not hinder us
in anyway. Anything we have needed to accomplish, including a late night
upgrade to MySQL 5.5 and implementation of Facebook's Online Schema
Change for MySQL has been easily accomplished.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;b&gt;&lt;i&gt;This is great to hear. We are glad we RDS meets your needs. Now, what's next on
your innovation agenda?&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;

&lt;p&gt;&lt;b&gt;Jack:&lt;/b&gt; We want to deliver an even more performance and rock
solid experience for our global user base of teachers, students, administrators
and parents. We will be:&lt;/p&gt;

&lt;ul type=disc&gt;
 &lt;li&gt;Building &amp;quot;incident
     managers&amp;quot; which utilize the cloud watch data and AWS APIs to
     automatically replace servers and/or re-deploy when problems arise.&lt;/li&gt;
 &lt;li&gt;Building &amp;quot;incident
     creators&amp;quot; - servers which test our ability to maintain peak
     performance.&amp;nbsp;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;We believe that by leveraging
the services that Amazon provides to the fullest we can continue to scale our
exceptional user experience so that Edmodo can be the platform for classrooms
around the world on devices of all shapes and sizes.&lt;/p&gt;

&lt;p&gt;&lt;b&gt;Troy:&lt;/b&gt; At Intuit, we can go further to leverage the
benefits of elasticity and further improve our resiliency. We
are investing in use of CloudFormation coupled with Chef &amp;#8211; the result
will enable us to lower costs and further reduce risk.&lt;/p&gt;

&lt;p&gt;Prior
to AWS we had several tiers that we now think can be delegated to AWS services
&amp;#8211; this should free up our team to focus on our domain
problems. For example, we are intending to replace our
EC2/Memcache tier with ElastiCache, our batch processing with Simple Workflow
and Web servers with CloudFront.&lt;/p&gt;

&lt;p&gt;With
our newfound agility we can launch new services quickly and there are a few on
our plate in the near term. In some cases we are refactoring our
system into smaller, discrete services, while in other cases we are creating
wholesale new services Our core problem domain consists of extracting greater
value out of textual and behavioral data which means that use of EMR and even
the newly released Workflow should enable us to focus more on the domain and
less on the system engineering.&lt;/p&gt;

&lt;p&gt;&lt;b&gt;&lt;i&gt;Troy, Jack, Thank you both very much for sharing your unique
experience. I look forward to hearing your progress.&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;

&lt;p&gt;&lt;b&gt;Jack: &lt;/b&gt;
Thank you. This has been a great dialog.&lt;/p&gt;

&lt;p&gt;&lt;b&gt;Troy: &lt;/b&gt;
Thank you Werner. We appreciate the opportunity.&lt;/p&gt;

&lt;p&gt;As
I noted before, it's a great pleasure to talk to these innovators and how AWS helps their journey. If you have never used RDS before, you can sign up for a &lt;a
href=&quot;http://aws.amazon.com/rds/free-trial&quot;&gt;60
day free trial&lt;/a&gt; What innovation
will you bring to market? How will it change the world? We won't know until you
try and build something.&lt;/p&gt;


</description>
	 	</item>
	 	
	 	<item>
	   		<title>A Thousand Platforms ...</title>
	   		<link>http://www.allthingsdistributed.com/2012/03/a-thousand-platforms.html</link>
	   		<guid>http://www.allthingsdistributed.com/2012/03/a-thousand-platforms.html</guid>
	   		<pubDate>Wed, 21 Mar 2012 15:00:00 EST</pubDate>
	   		<description>&lt;p&gt;Today’s &lt;a href=&quot;http://aws.amazon.com/elasticbeanstalk&quot;&gt;AWS Elastic Beanstalk&lt;/a&gt; &lt;a href=&quot;http://aws.amazon.com/about-aws/whats-new/2012/03/20/php-and-git-deployment-for-aws-elastic-beanstalk/&quot;&gt;announcement of PHP and Git support&lt;/a&gt; reminded me of the &lt;a href=&quot;http://www.allthingsdistributed.com/2011/01/aws_elastic_beanstalk.html&quot;&gt;post&lt;/a&gt;  where I mentioned that we want to let a thousand platforms bloom on AWS. Some might ask why AWS would want a thousand platforms.&lt;/p&gt;

&lt;p&gt;One of the most important AWS principles is &lt;em&gt;flexibility&lt;/em&gt;. Flexibility is in the choice of software and languages running on AWS, in the tools and interfaces available to manipulate resources and applications, and in the ability to leverage services from other providers. One of our customers I met last week was talking about his application and how it runs on AWS; He collects geo-location data, analyzes and crunches this data using Elastic Map Reduce, stores the data for quick access in DynamoDB, runs his user interface on Heroku and his web services layer for mobile devices on Elastic Beanstalk. This application is a great way to highlight how developers might leverage different services, abstractions, and tools to deliver the most value to their customers.&lt;/p&gt;

&lt;p&gt;If you’re seeking ultimate flexibility, AWS allows you to interact with services such as Amazon Elastic Compute Cloud (&lt;a href=&quot;http://aws.amazon.com/ec2&quot;&gt;Amazon EC2&lt;/a&gt;) and Amazon Simple Storage Service (&lt;a href=&quot;http://aws.amazon.com/S3&quot;&gt;Amazon S3&lt;/a&gt;) directly and to piece these services together in a building block fashion. This might incur some initial groundwork, especially if you just want to deploy a simple application. &lt;a href=&quot;http://aws.amazon.com/cloudformation&quot;&gt;AWS CloudFormation&lt;/a&gt; can help bring the building blocks together through its template mechanism. This simplifies the provisioning and updates, but you’re still responsible for the operational aspects of running your application.&lt;/p&gt;

&lt;p&gt;If you don’t need control over the software stack, you can use development platforms such as &lt;a href=&quot;http://www.appfog.com&quot;&gt;AppFog&lt;/a&gt;, &lt;a href=&quot;http://www.engineyard.com&quot;&gt;Engine Yard&lt;/a&gt;, and &lt;a href=&quot;http://www.heroku.com&quot;&gt;Heroku&lt;/a&gt; to help you manage, deploy, and monitor your applications on AWS more easily. We’ve seen some newcomers in this space over the last year such as &lt;a href=&quot;http://www.activestate.com/press-releases/activestate-brings-stackato-amazon-ec2&quot;&gt;Stackato&lt;/a&gt; and &lt;a href=&quot;http://www.nodejitsu.com/&quot;&gt;NodeJitsu&lt;/a&gt;, and each platform continues to add value through highly curated software stacks and a set of management automation.&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;http://aws.amazon.com/elasticbeanstalk&quot;&gt;AWS Elastic Beanstalk&lt;/a&gt; is another abstraction on top of the core AWS building blocks. It takes a different approach than most other development platforms by exposing the underlying resources. This approach provides the simplicity to quickly get started for application developers, but it also allows them to modify the stack to meet their goals. For example, one customer needed extensive Apache rewrite rules and a few other mods to meet his security requirements. He simply created a new AMI to use as his base for his Elastic Beanstalk container. Another pattern I have seen is customers attaching a debugger to the JVM running in their EC2 instance so that they can debug particular interaction patterns between their code and the JVM.&lt;/p&gt;

&lt;p&gt;So is there a “one-size-fits-all” in the development platform space? No, each platform fits the needs of different developers, applications, and use cases. Preference and familiarity also play a role in why some developers choose one over the other. Ultimately, we want developers to successfully run and manage reliable, highly scalable applications on AWS, irrespective of the abstraction that their development platform of choice offers.&lt;/p&gt;

&lt;p&gt;We will continue to work closely together with all current and future platform partners. Based on their feedback, we will develop new features and services that can help them be more successful by allowing them to focus on their customers instead of the infrastructure on which they run. This will also make it easier for new platforms to be developed such that developers will have more choice and flexibility, and they can really find the exact tools that make them most productive. AWS Elastic Beanstalk can play an important role there, too, because it is a good base for building new platforms. We are looking forward to seeing a thousand platforms bloom.&lt;/p&gt;

&lt;p&gt;AWS Elastic Beanstalk now supports PHP applications (in addition to Java) and the ability to deploy through the popular &lt;a href=&quot;http://git-scm.com/&quot;&gt;Git version control system&lt;/a&gt;. To get started using PHP and Git on AWS Elastic Beanstalk, visit &lt;a href=&quot;http://docs.amazonwebservices.com/elasticbeanstalk/latest/dg/create_deploy_PHP.html&quot;&gt;Deploying PHP Applications Using Git&lt;/a&gt; in the AWS Elastic Beanstalk Developer Guide. More details about the release at the &lt;a href=&quot;http://aws.typepad.com/aws/2012/03/aws-elastic-beanstalk-build-php-apps-using-git-based-deployment.html&quot;&gt;AWS developer blog&lt;/a&gt;.&lt;/p&gt;
</description>
	 	</item>
	 	
	 	<item>
	   		<title>Driving Compute Cost Down for AWS Customers</title>
	   		<link>http://www.allthingsdistributed.com/2012/03/ec2-price-reduction.html</link>
	   		<guid>http://www.allthingsdistributed.com/2012/03/ec2-price-reduction.html</guid>
	   		<pubDate>Tue, 06 Mar 2012 09:01:00 EST</pubDate>
	   		<description>&lt;p&gt;AWS today announced a substantial price drop from March 1, 2012 for many of the Amazon EC2, Amazon RDS, and Amazon ElastiCache instances types around the world. For example, the popular m1.small instance type will see a price drop of 6% for EC2 On-Demand usage and 33% for EC2 Reserved Instance usage. Some of the other instance types have even greater savings: for example, the high memory M2 instances will see a 10% price cut for On Demand and 37% for Reserved instances.  Similarly, Amazon RDS will cut its On-Demand prices by up to 10% and Reserved Instance prices by up to 42%. Amazon ElastiCache customers will see their prices drop by up to 10%, depending on their cache node types.&lt;/p&gt;

&lt;p&gt;Driving costs down for our customers is part of the DNA of Amazon and therefore also part of the DNA of AWS. Customers are often surprised when our Solution Architects that are working with them have a relentless focus on helping customers to reduce their AWS bill wherever possible. We strongly believe that if we can help our customers reduce their costs, they are likely to be more successful in the long term. The relationship with our customers often becomes more that of a partnership as AWS truly has the customer’s best interest at heart and we are equally interested in their long-term success.&lt;/p&gt;

&lt;p&gt; We will continue to drive AWS prices down, even without any competitive pressure to do so. And we will work hard to do this across all the different services, such as the price reduction for Amazon S3 and EBS last moth. I recently ran into a customer who explained that his invoice processing consolidation project was supposed to pay for itself within two years, but because of the continuous AWS price drops they were able to do it in 9 months. I am pretty sure that customer will adjust this estimate favorably again after today’s announcement.&lt;/p&gt;

&lt;p&gt;Reducing pricing is not just a matter of passing on the benefits of economies of scale, although that certainly plays a role. We continuously apply all our innovative skills to the design of datacenters, servers, storage, network, etc. to drive new efficiencies and higher reliability. Experiences with the highly scalable, ultra-efficient supply chains of Amazon.com drive great new innovations in the highly redundant supply chains for AWS, which lead to new efficiencies that we can pass on to our customers.  Also on the business model side, we continue to innovate, as the introduction of Reserved Instances and Spot Instances have helped customers make significant savings.  With the new announcement, we have introduced volume discount tiers for Reserved Instances, which will provide additional discounts of 10% and 20%, both of which are in addition to the price cuts available to all customers, to further help customers drive down costs as they scale on AWS infrastructure.&lt;/p&gt;

&lt;p&gt;For more details on the new pricing, visit the &lt;a href=&quot;http://aws.amazon.com/ec2/pricing/&quot;&gt;EC2 pricing page&lt;/a&gt;, &lt;a href=&quot;http://aws.amazon.com/rds/pricing/&quot;&gt;RDS pricing page&lt;/a&gt;, &lt;a href=&quot;http://aws.amazon.com/elasticache/&quot;&gt;ElastiCache page&lt;/a&gt;, forum announcement, and the &lt;a href=&quot;http://aws.typepad.com&quot;&gt;AWS developer blog&lt;/a&gt;.&lt;/p&gt;
</description>
	 	</item>
	 	
	 	<item>
	   		<title>Expanding the Cloud – The Amazon Simple Workflow Service</title>
	   		<link>http://www.allthingsdistributed.com/2012/02/Amazon-Simple-Workflow-Service.html</link>
	   		<guid>http://www.allthingsdistributed.com/2012/02/Amazon-Simple-Workflow-Service.html</guid>
	   		<pubDate>Wed, 22 Feb 2012 19:01:01 EST</pubDate>
	   		<description>&lt;p&gt;Today AWS launched an exciting new service for developers: the &lt;a href=&quot;http://aws.amazon.com/swf&quot;&gt;Amazon Simple Workflow Service&lt;/a&gt;. Amazon SWF is an orchestration service for building scalable distributed applications. Often an application consists of several different tasks to be performed in particular sequence driven by a set of dynamic conditions. Amazon SWF makes it very easy for developers to architect and implement these tasks, run them in the cloud or on premise and coordinate their flow. Amazon SWF manages the execution flow such that the tasks are load balanced across the registered workers, that inter-task dependencies are respected, that concurrency is handled appropriately and that child workflows are executed.&lt;/p&gt;

&lt;p&gt;A growing number of applications are relying on asynchronous and distributed processing, with scalability of the application as the primary motivation. By designing autonomous distributed components, developers get the flexibility to deploy and scale out parts of the application independently as load increases. The asynchronous and distributed model has the benefits of loose coupling and selective scalability, but it also creates new challenges. Application developers must coordinate multiple distributed components to get the desired results. They must deal with the increased latency and unreliability inherent in remote communication. Components may take extended periods of time to complete tasks, requests may fail and errors originating from remote systems must be handled. Today, to accomplish this, developers are forced to write complicated infrastructure that typically involves message queues and databases along with complex logic to synchronize them. All this ‘plumbing’ is extraneous to business logic and makes the application code unnecessarily complicated and hard to maintain.&lt;/p&gt;

&lt;p&gt;Amazon SWF enables applications to be built by orchestrating tasks coordinated by a &lt;em&gt;decider&lt;/em&gt; process. &lt;em&gt;Tasks&lt;/em&gt; represent logical units of work and are performed by application components that can take any form, including executable code, scripts, web service calls, and human actions. Developers have full control over implementing and orchestrating tasks, without worrying about underlying complexities such as tracking their progress and keeping their state.&lt;/p&gt;

&lt;p&gt;Developers implement &lt;em&gt;workers&lt;/em&gt; to perform tasks. They run their workers either on cloud infrastructure, such as &lt;a href=&quot;http://aws.amazon.com/ec2&quot;&gt;Amazon EC2&lt;/a&gt;, or on-premise. Tasks can be long-running, may fail, may timeout and may complete with varying throughputs and latencies. Amazon SWF stores tasks for workers, assigns them when workers are ready, tracks their progress, and keeps their latest state, including details on their completion. To orchestrate tasks, developers write programs that get the latest state of tasks from Amazon SWF and use it to initiate subsequent tasks in an ongoing manner. Amazon SWF maintains an application’s execution state durably so that the application can be resilient to failures in individual application components.&lt;/p&gt;

&lt;p&gt;An important feature of Amazon SWF is the &lt;em&gt;auditability&lt;/em&gt;; Amazon SWF gives visibility into the execution of each step in the application. The Management Console and APIs let you monitor all running executions of the application. The customer can zoom in on any execution to see the status of each task and its input and output data. To facilitate troubleshooting and historical analysis, Amazon SWF retains the history of executions for any number of days that the customer cab specify, up to a maximum of 90 days.&lt;/p&gt;

&lt;p&gt;Amazon SWF provides a collection of very powerful building blocks that also can be used to build higher-level execution engines. Some of our early customers used Amazon SWF to implement their domain specific languages (DL) for specialized business process execution, This is an area where I think the availability of Amazon SWF will drive a lot of innovation.&lt;/p&gt;

&lt;p&gt;As part of the AWS SDK, the AWS Flow Framework helps developers create Amazon SWF based application quickly and easily.  The Java version of the SDK includes really cool integration at the language level, making it easy for developers to automatically transform java code into tasks, create the right dependencies, and manage the execution of the workflow. This brings the power that some languages with built-in distribution and concurrency like Erlang offer to Java.&lt;/p&gt;

&lt;p&gt;For more insight into workflow execution, task coordination, task routing, task distribution, exception handling, child workflows, timers, signals, markers and much more see the &lt;a href=&quot;http://aws.amazon.com/swf&quot;&gt;Amazon SWF&lt;/a&gt; detail page. More information about the SDK see the developers guide. As always The &lt;a href=&quot;http://aws.typepad.com/aws/2012/02/amazon-simple-workflow-cloud-based-workflow-management.html&quot;&gt;AWS developer blog&lt;/a&gt; has additional details. At the &lt;a href=&quot;http://blog.rightscale.com/2012/02/22/rightscale-server-orchestration-and-amazon-swf-launch/&quot;&gt;Rightscale blog&lt;/a&gt; Thorsten von Eicken talks about their use of SWF.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/images/awsswf.png&quot;&gt;&lt;/p&gt;
</description>
	 	</item>
	 	
	 	<item>
	   		<title>Driving Storage Costs Down for AWS Customers</title>
	   		<link>http://www.allthingsdistributed.com/2012/02/amazon-s3-price-drop.html</link>
	   		<guid>http://www.allthingsdistributed.com/2012/02/amazon-s3-price-drop.html</guid>
	   		<pubDate>Mon, 06 Feb 2012 21:00:00 EST</pubDate>
	   		<description>&lt;p&gt;One of the things that differentiates Amazon Web Services from other technology providers is its commitment to let customers benefits from continuous cost-cutting innovations and from the economies of scale AWS is able to achieve. As we showed last week one of the services that is growing rapidly is the Amazon Simple Storage Service (S3).&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/images/AWSgrowth.jpg&quot;&gt;&lt;/p&gt;

&lt;p&gt;AWS today announced  a substantial price drop per February 1, 2012 for Amazon S3 standard storage to help customers drive their storage cost down. A customer storing 50TB will see on average a 12% drop in cost when they get their Amazon S3 bill for February. Other storage tiers may see even greater cost savings.&lt;/p&gt;

&lt;p&gt;These Amazon S3 cost savings will also help drive down the cost of Amazon EBS snapshots and Amazon Storage Gateway snapshots, for example in the US East (Virginia) Region, their cost will drop from $0.14 to $0.125 per Gigabyte.&lt;/p&gt;

&lt;p&gt;In a time where on-premise infrastructure costs are rising significantly it is great to see that AWS can let all of its customers, big and small, benefit from the cost cutting innovations in storage.&lt;/p&gt;

&lt;p&gt;More details can be found in the &lt;a href=&quot;https://forums.aws.amazon.com/ann.jspa?annID=1354&quot;&gt;Forum Announcement&lt;/a&gt;, on &lt;a href=&quot;http://aws.typepad.com/aws/2012/02/amazon-s3-price-reduction.html&quot;&gt;Jeff Barr's blog&lt;/a&gt; and on the &lt;a href=&quot;http://aws.amazon.com/s3/#pricing&quot;&gt;Amazon S3 Pricing Page&lt;/a&gt;.&lt;/p&gt;
</description>
	 	</item>
	 	
	 	<item>
	   		<title>Expanding the Cloud - The AWS Storage Gateway</title>
	   		<link>http://www.allthingsdistributed.com/2012/01/The-AWS-Storage-Gateway.html</link>
	   		<guid>http://www.allthingsdistributed.com/2012/01/The-AWS-Storage-Gateway.html</guid>
	   		<pubDate>Mon, 23 Jan 2012 19:01:01 EST</pubDate>
	   		<description>&lt;p&gt;Today Amazon Web Services has launched the AWS Storage Gateway, making the power of secure and reliable cloud storage accessible from customers’ on-premises applications.&lt;/p&gt;

&lt;p&gt;We have been working closely with our customers on their requests to bring the power of the Amazon Web Services cloud closer to their existing on-premises compute infrastructures. The Amazon Virtual Private Cloud extends on-premises compute with all the power of AWS, making it elastic, scalable and highly reliable. AWS Identity and Access Management brings together on-premises and cloud identity management.  VM Import allows our customers to move virtual machine images from their datacenters to the Cloud and Amazon Direct Connect makes the network latencies and bandwidth between on-premises and AWS more predictable. With the launch of the AWS Storage Gateway our customers can now integrate their on-premises IT environment with AWS’s storage infrastructure.&lt;/p&gt;

&lt;p&gt;The AWS Storage Gateway is a service connecting an on-premises software appliance with cloud-based storage.  Once the AWS Storage Gateway’s software appliance is installed on a local host, you can mount Storage Gateway volumes to your on-premises application servers as iSCSI devices, enabling a wide variety of systems and applications to make use of them.  Data written to these volumes is maintained on your on-premises storage hardware while being asynchronously backed up to AWS, where it is stored in Amazon S3 in the form of Amazon EBS snapshots.  Snapshots are encrypted to make sure that customers do not have to worry about encrypting sensitive data themselves.  When customers need to retrieve data, they can restore snapshots locally, or create Amazon EBS volumes from snapshots for use with applications running in Amazon EC2.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/images/arch_diagram_storagegateway.png&quot;/ width=&quot;650&quot;&gt;&lt;/p&gt;

&lt;p&gt;Here are three example use cases that we envision for the AWS Storage Gateway.  The first one is using the AWS Storage Gateway to back up your data to Amazon S3’s highly reliable storage environment.  Amazon S3 is designed to sustain the concurrent loss of data in two facilities, redundantly storing your data on multiple devices across multiple facilities in an AWS Region.  So, backing up your data to Amazon S3 means a lot less headaches worrying about your local storage environment.&lt;/p&gt;

&lt;p&gt;The second use case is where customers want to move data between local infrastructure and the Amazon Web Services cloud to provide access to applications and other computations running in Amazon EC2. The use of the Amazon EBS snapshot format means the data that was on-premises can be restored as an Amazon EBS volume mounted to an Amazon EC2 instance.&lt;/p&gt;

&lt;p&gt;The third use case, cloud-based Disaster Recovery, is a specific variation of the previous two.  If there is a failure in your local infrastructure, you can quickly launch a DR environment in Amazon EC2 which will have full access to the data snapshots backed up into Amazon S3 by the AWS Storage Gateway.&lt;/p&gt;

&lt;p&gt;For more information on the AWS Storage Gateway, you can visit the &lt;a href=&quot;http://aws.amazon.com/storagegateway&quot;&gt;detail page&lt;/a&gt;  Jeff Barr over at the &lt;a href=&quot;http://aws.typepad.com&quot;&gt;AWS Developer Blog&lt;/a&gt; has more details.&lt;/p&gt;
</description>
	 	</item>
	 	
	 	<item>
	   		<title>Amazon DynamoDB – a Fast and Scalable NoSQL Database Service Designed for Internet Scale Applications</title>
	   		<link>http://www.allthingsdistributed.com/2012/01/amazon-dynamodb.html</link>
	   		<guid>http://www.allthingsdistributed.com/2012/01/amazon-dynamodb.html</guid>
	   		<pubDate>Thu, 19 Jan 2012 02:00:00 EST</pubDate>
	   		<description>&lt;p&gt;Today is a very exciting day as we release &lt;a href=&quot;http://aws.amazon.com/DynamoDB&quot;&gt;Amazon DynamoDB&lt;/a&gt;, a fast, highly reliable and cost-effective NoSQL database service designed for internet scale applications. DynamoDB is the result of 15 years of learning in the areas of large scale non-relational databases and cloud services. Several years ago we published a paper on the details of &lt;a href=&quot;http://www.allthingsdistributed.com/2007/10/amazons_dynamo.html&quot;&gt;Amazon’s Dynamo technology&lt;/a&gt;, which was one of the first non-relational databases developed at Amazon. The original Dynamo design was based on a core set of strong distributed systems principles resulting in an ultra-scalable and highly reliable database system. Amazon DynamoDB, which is a new service, continues to build on these principles, and also builds on our years of experience with running non-relational databases and cloud services, such as Amazon SimpleDB and Amazon S3, at scale. It is very gratifying to see all of our learning and experience become available to our customers in the form of an easy-to-use managed service.&lt;/p&gt;

&lt;p&gt;Amazon DynamoDB is a fully managed NoSQL database service that provides fast performance at any scale. Today’s web-based applications often encounter database scaling challenges when faced with growth in users, traffic, and data. With Amazon DynamoDB, developers scaling cloud-based applications can start small with just the capacity they need and then increase the request capacity of a given table as their app grows in popularity. Their tables can also grow without limits as their users store increasing amounts of data. Behind the scenes, Amazon DynamoDB automatically spreads the data and traffic for a table over a sufficient number of servers to meet the request capacity specified by the customer. Amazon DynamoDB offers low, predictable latencies at any scale. Customers can typically achieve average service-side in the single-digit milliseconds. Amazon DynamoDB stores data on Solid State Drives (SSDs) and replicates it synchronously across multiple AWS Availability Zones in an AWS Region to provide built-in high availability and data durability.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;History of NoSQL at Amazon – Dynamo&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The Amazon.com ecommerce platform consists of hundreds of decoupled services developed and managed in a decentralized fashion. Each service encapsulates its own data and presents a hardened API for others to use. Most importantly, direct database access to the data from outside its respective service is not allowed. This architectural pattern was a response to the scaling challenges that had challenged Amazon.com through its first 5 years, when direct database access was one of the major bottlenecks in scaling and operating the business. While a service-oriented architecture addressed the problems of a centralized database architecture, each service was still using traditional data management systems. The growth of Amazon’s business meant that many of these services needed more scalable database solutions.&lt;/p&gt;

&lt;p&gt;In response, we began to develop a collection of storage and database technologies to address the demanding scalability and reliability requirements of the Amazon.com ecommerce platform. We had been pushing the scalability of commercially available technologies to their limits and finally reached a point where these third party technologies could no longer be used without significant risk. This was not our technology vendors’ fault; Amazon's scaling needs were beyond the specs for their technologies and we were using them in ways that most of their customers were not. A number of outages at the height of the 2004 holiday shopping season can be traced back to scaling commercial technologies beyond their boundaries.&lt;/p&gt;

&lt;p&gt;Dynamo was born out of our need for a highly reliable, ultra-scalable key/value database. This non-relational, or NoSQL, database was targeted at use cases that were core to the Amazon ecommerce operation, such as the shopping cart and session service. Any downtime or performance degradation in these services has an immediate financial impact and their fault-tolerance and performance requirements for their data systems are very strict. These services also require the ability to scale infrastructure incrementally to accommodate growth in request rates or dataset sizes. Another important requirement for Dynamo was predictability. This is not just predictability of median performance and latency, but also at the end of the distribution (the 99.9th percentile), so we could provide acceptable performance for virtually every customer.&lt;/p&gt;

&lt;p&gt;To achieve all of these goals, we needed to do groundbreaking work. After the successful launch of the first Dynamo system, we documented our experiences in a paper so others could benefit from them. Since then, several Dynamo clones have been built and the &lt;a href=&quot;http://www.allthingsdistributed.com/2007/10/amazons_dynamo.html&quot;&gt;Dynamo paper&lt;/a&gt; has been the basis for several other types of distributed databases. This demonstrates that Amazon is not the only company than needs better tools to meet their database needs.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Lessons learned from Amazon's Dynamo&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Dynamo has been in use by a number of core services in the ecommerce platform, and their engineers have been very satisfied by its performance and incremental scalability. However, we never saw much adoption beyond these core services. This was remarkable because although Dynamo was originally built to serve the needs of the shopping cart, its design and implementation were much broader and based on input from many other service architects. As we spoke to many senior engineers and service owners, we saw a clear pattern start to emerge in their explanations of why they didn't adopt Dynamo more broadly: while Dynamo gave them a system that met their reliability, performance, and scalability needs, it did nothing to reduce the operational complexity of running large database systems. Since they were responsible for running their own Dynamo installations, they had to become experts on the various components running in multiple data centers. Also, they needed to make complex tradeoff decisions between consistency, performance, and reliability. This operational complexity was a barrier that kept them from adopting Dynamo.&lt;/p&gt;

&lt;p&gt;During this period, several other systems appeared in the Amazon ecosystem that did meet their requirements for simplified operational complexity, notably Amazon S3 and Amazon SimpleDB. These were built as managed web services that eliminated the operational complexity of managing systems while still providing extremely high durability. Amazon engineers preferred to use these services instead of managing their own databases like Dynamo, even though Dynamo's functionality was better aligned with their applications’ needs.&lt;/p&gt;

&lt;p&gt;With Dynamo we had taken great care to build a system that met the requirements of our engineers. After evaluations, it was often obvious that Dynamo was ideal for many database use cases. But ... we learned that engineers found the prospect of running a large software system daunting and instead looked for less ideal design alternatives that freed them from the burden of managing databases and allowed them to focus on their applications.&lt;/p&gt;

&lt;p&gt;It became obvious that developers strongly preferred simplicity to fine-grained control as they voted &quot;with their feet&quot; and adopted cloud-based AWS solutions, like Amazon S3 and Amazon SimpleDB, over Dynamo. Dynamo might have been the best technology in the world at the time but it was still software you had to run yourself. And nobody wanted to learn how to do that if they didn’t have to. Ultimately, developers wanted a service.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;History of NoSQL at Amazon - SimpleDB&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;One of the cloud services Amazon developers preferred for their database needs was Amazon SimpleDB. In the 5 years that SimpleDB has been operational, we have learned a lot from its customers.&lt;/p&gt;

&lt;p&gt;First and foremost, we have learned that a database service that takes away the operational headache of managing distributed systems is extremely powerful. Customers like SimpleDB’s table interface and its flexible data model. Not having to update their schemas when their systems evolve makes life much easier. However, they most appreciate the fact that SimpleDB just works. It provides multi-data center replication, high availability, and offers rock-solid durability. And yet customers never need to worry about setting up, configuring, or patching their database.&lt;/p&gt;

&lt;p&gt;Second, most database workloads do not require the complex query and transaction capabilities of a full-blown relational database. A database service that only presents a table interface with a restricted query set is a very important building block for many developers.&lt;/p&gt;

&lt;p&gt;While SimpleDB has been successful and powers the applications of many customers, it has some limitations that customers have consistently asked us to address.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Domain scaling limitations&lt;/em&gt;. SimpleDB requires customers to manage their datasets in containers called Domains, which have a finite capacity in terms of storage (10 GB) and request throughput. Although many customers worked around SimpleDB’s scaling limitations by partitioning their workloads over many Domains, this side of SimpleDB is certainly not simple. It also fails to meet the requirement of incremental scalability, something that is critical to many customers looking to adopt a NoSQL solution.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Predictability of Performance&lt;/em&gt;. SimpleDB, in keeping with its goal to be simple, indexes all attributes for each item stored in a domain. While this simplifies the customer experience on schema design and provides query flexibility, it has a negative impact on the predictability of performance. For example, every database write needs to update not just the basic record, but also all attribute indices (regardless of whether the customer is using all the indices for querying). Similarly, since the Domain maintains a large number of indices, its working set does not always fit in memory. This impacts the predictability of a Domain’s read latency, particularly as dataset sizes grow.&lt;br/&gt;
Consistency. SimpleDB’s original implementation had taken the &quot;eventually consistent&quot; approach to the extreme and presented customers with consistency windows that were up to a second in duration. This meant the system was not intuitive to use and developers used to a more traditional database solution had trouble adapting to it. The SimpleDB team eventually addressed this issue by enabling customers to specify whether a given read operation should be strongly or eventually consistent.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Pricing complexity&lt;/em&gt;. SimpleDB introduced a very fine-grained pricing dimension called “Machine Hours.” Although most customers have eventually learned how to predict their costs, it was not really transparent or simple.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Introducing DynamoDB&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;As we thought about how to address the limitations of SimpleDB and provide 1) the most scalable NoSQL solution available and 2) predictable high performance, we realized our goals could not be met with the SimpleDB APIs. Some SimpleDB operations require that all data for a Domain is on a single server, which prevents us from providing the seamless scalability our customers are demanding. In addition, SimpleDB APIs assume all item attributes are automatically indexed, which limits performance.&lt;/p&gt;

&lt;p&gt;We concluded that an ideal solution would combine the best parts of the original Dynamo design (incremental scalability, predictable high performance) with the best parts of SimpleDB (ease of administration of a cloud service, consistency, and a table-based data model that is richer than a pure key-value store). These architectural discussions culminated in Amazon DynamoDB, a new NoSQL service that we are excited to release today.&lt;/p&gt;

&lt;p&gt;Amazon DynamoDB is based on the principles of Dynamo, a progenitor of NoSQL, and brings the power of the cloud to the NoSQL database world. It offers customers high-availability, reliability, and incremental scalability, with no limits on dataset size or request throughput for a given table. And it is fast – it runs on the latest in solid-state drive (SSD) technology and incorporates numerous other optimizations to deliver low latency at any scale.&lt;/p&gt;

&lt;p&gt;Amazon DynamoDB is the result of everything we’ve learned from building large-scale, non-relational databases for Amazon.com and building highly scalable and reliable cloud computing services at AWS.  Amazon DynamoDB is a NoSQL database service that offers the following benefits:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Managed&lt;/strong&gt;. DynamoDB frees developers from the headaches of provisioning hardware and software, setting up and configuring a distributed database cluster, and managing ongoing cluster operations. It handles all the complexities of scaling and partitions and re-partitions your data over more machine resources to meet your I/O performance requirements. It also automatically replicates your data across multiple Availability Zones (and automatically re-replicates in the case of disk or node failures) to meet stringent availability and durability requirements.    From our experience of running Amazon.com, we know that manageability is a critical requirement. We have seen many job postings from companies using NoSQL products that are looking for NoSQL database engineers to help scale their installations. We know from our Amazon experiences that once these clusters start growing, managing them becomes the same nightmare that running large RDBMS installations was. Because Amazon DynamoDB is a managed service, you won’t need to hire experts to manage your NoSQL installation—your developers can do it themselves.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Scalable&lt;/strong&gt;. Amazon DynamoDB is designed to scale the resources dedicated to a table to hundreds or even thousands of servers spread over multiple Availability Zones to meet your storage and throughput requirements. There are no pre-defined limits to the amount of data each table can store. Developers can store and retrieve any amount of data and DynamoDB will spread the data across more servers as the amount of data stored in your table grows.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Fast&lt;/strong&gt;. Amazon DynamoDB provides high throughput at very low latency. It is also built on Solid State Drives to help optimize for high performance even at high scale. Moreover, by not indexing all attributes, the cost of read and write operations is low as write operations involve updating only the primary key index thereby reducing the latency of both read and write operations. An application running in EC2 will typically see average service-side latencies in the single-digit millisecond range for a 1KB object. Most importantly, DynamoDB latencies are predictable. Even as datasets grow, latencies remain stable due to the distributed nature of DynamoDB's data placement and request routing algorithms.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Durable and Highly Available&lt;/strong&gt;. Amazon DynamoDB replicates its data over at least 3 different data centers so that the system can continue to operate and serve data even under complex failure scenarios.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Flexible&lt;/strong&gt;. Amazon DynamoDB is an extremely flexible system that does not force its users into a particular data model or a particular consistency model. DynamoDB tables do not have a fixed schema but instead allow each data item to have any number of attributes, including multi-valued attributes. Developers can optionally use stronger consistency models when accessing the database, trading off some performance and availability for a simpler model. They can also take advantage of the atomic increment/decrement functionality of DynamoDB for counters.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Low cost&lt;/strong&gt;. Amazon DynamoDB’s pricing is simple and predictable: Storage is $1 per GB per month. Requests are priced based on how much capacity is reserved: $0.01 per hour for every 10 units of Write Capacity and $0.01 per hour for every 50 units of Read Capacity. A unit of Read (or Write) Capacity equals one read (or write) per second of capacity for items up to 1KB in size. If you use eventually consistent reads, you can achieve twice as many reads per second for a given amount of Read Capacity. Larger items will require additional throughput capacity.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;


&lt;p&gt;In the current release, customers will have the choice of using two types of keys for primary index querying: Simple Hash Keys and Composite Hash Key / Range Keys:&lt;/p&gt;

&lt;p&gt;Simple Hash Key gives DynamoDB the Distributed Hash Table abstraction. The key is hashed over the different partitions to optimize workload distribution. For more background on this please read the original Dynamo paper.&lt;/p&gt;

&lt;p&gt;Composite Hash Key with Range Key allows the developer to create a primary key that is the composite of two attributes, a “hash attribute” and a “range attribute.” When querying against a composite key, the hash attribute needs to be uniquely matched but a range operation can be specified for the range attribute: e.g. all orders from Werner in the past 24 hours, all log entries from server 16 with clients IP addresses on subnet 192.168.1.0&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Performance Predictability in DynamoDB&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;In addition to taking the best ideas of Dynamo and SimpleDB, we have added new functionality to provide even greater performance predictability.&lt;/p&gt;

&lt;p&gt;Cloud-based systems have invented solutions to ensure fairness and present their customers with uniform performance, so that no burst load from any customer should adversely impact others. This is a great approach and makes for many happy customers, but often does not give a single customer the ability to ask for higher throughput if they need it.&lt;/p&gt;

&lt;p&gt;As satisfied as engineers can be with the simplicity of cloud-based solutions, they would love to specify the request throughput they need and let the system reconfigure itself to meet their requirements. Without this ability, engineers often have to carefully manage caching systems to ensure they can achieve low-latency and predictable performance as their workloads scale. This introduces complexity that takes away some of the simplicity of using cloud-based solutions.&lt;/p&gt;

&lt;p&gt;The number of applications that need this type of performance predictability is increasing: online gaming, social graphs applications, online advertising, and real-time analytics to name a few. AWS customers are building increasingly sophisticated applications that could benefit from a database that can give them fast, predictable performance that exactly matches their needs.&lt;/p&gt;

&lt;p&gt;Amazon DynamoDB’s answer to this problem is “Provisioned Throughput.” Customers can now specify the request throughput capacity they require for a given table. Behind the scenes, DynamoDB will allocate sufficient resources to the table to predictably achieve this throughput with low-latency performance. Throughput reservations are elastic, so customers can increase or decrease the throughput capacity of a table on-demand using the AWS Management Console or the DynamoDB APIs. CloudWatch metrics enable customers to make informed decisions about the right amount of throughput to dedicate to a particular table. Customers using the service tell us that it enables them to achieve the appropriate amount of control over scaling and performance while maintaining simplicity. Rather than adding server infrastructure and re-partitioning their data, they simply change a value in the management console and DynamoDB takes care of the rest.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Summary&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Amazon DynamoDB is designed to maintain predictably high performance and to be highly cost efficient for workloads of any scale, from the smallest to the largest internet-scale applications. You can get started with Amazon DynamoDB using a free tier that enables 40 million of requests per month free of charge. Additional request capacity is priced at cost-efficiently hourly rates as low as $.01 per hour for 10 units of Write Capacity or 50 strongly consistent units of Read Capacity (if you use eventually consistent reads you can get twice the throughput at the same cost, or the same read throughput at half the cost) Also, replicated solid state disk (SSD) storage is $1 per GB per month. Our low request pricing is designed to meet the needs of typical database workloads that perform large numbers of reads and writes against every GB of data stored.&lt;/p&gt;

&lt;p&gt;To learn more about Amazon DynamoDB its functionality, APIs, use cases, and service pricing, please visit the detail page at &lt;a href=&quot;http://aws.amazon.com/DynamoDB&quot;&gt;aws.amazon.com/DynamoDB&lt;/a&gt; and also the &lt;a href=&quot;http://docs.amazonwebservices.com/amazondynamodb/latest/developerguide/&quot;&gt;Developer Guide&lt;/a&gt;. I am excited to see the years of experience with systems such as Amazon Dynamo result in an innovative database service that can be broadly used by all our customers.&lt;/p&gt;
</description>
	 	</item>
	 	
	 	<item>
	   		<title>Countdown to What is Next in AWS</title>
	   		<link>http://www.allthingsdistributed.com/2012/01/countdown-aws-what-next.html</link>
	   		<guid>http://www.allthingsdistributed.com/2012/01/countdown-aws-what-next.html</guid>
	   		<pubDate>Sat, 14 Jan 2012 03:00:00 EST</pubDate>
	   		<description>&lt;p&gt;Join me at 9AM PST on Wednesday January 18, 2012 to find out what is next in the AWS Cloud. &lt;a href=&quot;http://aws.amazon.com/register-livestream-cloud&quot;&gt;Registration required&lt;/a&gt;.&lt;/p&gt;
</description>
	 	</item>
	 	
	 	<item>
	   		<title>Expanding the Cloud – Introducing the AWS South America (Sao Paulo) Region</title>
	   		<link>http://www.allthingsdistributed.com/2011/12/aws-south-america-sao-paolo-region.html</link>
	   		<guid>http://www.allthingsdistributed.com/2011/12/aws-south-america-sao-paolo-region.html</guid>
	   		<pubDate>Thu, 15 Dec 2011 14:00:00 EST</pubDate>
	   		<description>&lt;p&gt;&lt;img  src=&quot;/images/sa.jpg&quot; width=&quot;200&quot; height=&quot;267&quot; style=&quot;float: right; margin: 20px 0 20px 20px;&quot; /&gt;&lt;/p&gt;

&lt;p&gt;Today, Amazon Web Services is expanding its worldwide coverage with the launch of a new AWS Region in Sao Paulo, Brazil. This new Region has been highly requested by companies worldwide, and it provides low-latency access to AWS services for those who target customers in South America.&lt;/p&gt;

&lt;p&gt;South America is one of the fastest growing economic regions in the world. In particular, South American IT-oriented companies are seeing very rapid growth. Case in point: over the past 10 years IT has risen to become 7% of the GDP in Brazil. With the launch of the South America (Sao Paolo) Region, AWS now provides companies large and small with infrastructure that allows them to get to market faster while reducing their costs which enables them to focus on delivering value, instead of wasting time on non-differentiating tasks.&lt;/p&gt;

&lt;p&gt;Local companies have not been the only ones to frequently ask us for a South American Region, but also companies from outside South America who would like to start delivering their products and services to the South American market. Many of these firms have wanted to enter this market for years but had refrained due to the daunting task of acquiring local hosting or datacenter capacity. These companies can now benefit from the fact that the new Sao Paulo Region is similar to all other AWS Regions, which enables software developed for other Regions to be quickly deployed in South America as well.&lt;/p&gt;

&lt;p&gt;Several prominent South American customers have been using AWS since the early days. The new Sao Paulo Region provides better latency to South America, which enables AWS customers to deliver higher performance services to their South American end-users. Additionally, it allows them to keep their data inside of Brazil. In the words of Guilherme Horn, the CEO of &lt;a href=&quot;https://www.orama.com.br/&quot;&gt;ÓRAMA&lt;/a&gt;, a Brazilian financial services firm and &lt;a href=&quot;http://aws.amazon.com/solutions/case-studies/orama/&quot;&gt;AWS customer&lt;/a&gt;: “The opening of the South America Sao Paulo Region will enable greater flexibility in developing new services as well as guarantee that we will always be compliant to the needs of the regulations of the financial markets.”&lt;/p&gt;

&lt;p&gt;You can learn more about our growing global infrastructure footprint at &lt;a href=&quot;http://aws.amazon.com/about-aws/globalinfrastructure&quot;&gt;http://aws.amazon.com/about-aws/globalinfrastructure&lt;/a&gt;. Please also visit the &lt;a href=&quot;http://aws.typepad.com/aws/2011/12/now-open-south-america-sao-paulo-region-ec2-s3-and-lots-more.html&quot;&gt;AWS developer blog&lt;/a&gt; for more great stories from our South American customers.&lt;/p&gt;
</description>
	 	</item>
	 	
	 	<item>
	   		<title>Expanding the Cloud - Introducing Amazon ElastiCache</title>
	   		<link>http://www.allthingsdistributed.com/2011/08/amazon-elasticache.html</link>
	   		<guid>http://www.allthingsdistributed.com/2011/08/amazon-elasticache.html</guid>
	   		<pubDate>Tue, 23 Aug 2011 12:40:58 EST</pubDate>
	   		<description>&lt;p&gt;Today AWS has launched &lt;a href=&quot;http://aws.amazon.com/elasticache&quot;&gt;Amazon ElastiCache&lt;/a&gt;, a new service that makes it easy to add distributed in-memory caching to any application. Amazon ElastiCache handles the complexity of creating, scaling and managing an in-memory cache to free up brainpower for more differentiating activities. There are many success stories about the effectiveness of caching in many different scenarios; next to helping applications achieving fast and predictable performance, it often protects databases from requests bursts and brownouts under overload conditions. Systems that make extensive use of caching almost all report a significant reduction in the cost of their database tier. Given the widespread use of caching in many of the applications in the AWS Cloud, a caching service had been high on the request list of our customers.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/images/crowd100.jpg&quot; width=&quot;650&quot;/&gt;&lt;/p&gt;

&lt;p&gt;Caching has become a standard component in many applications to achieve a fast and predictable performance, but maintaining a collection of cache servers in a reliable and scalable manner is not a simple task. These efforts clearly fall into the category of &quot;operational muck&quot;, but given the widespread usage of caching, maintenance of cache servers is no longer a differentiator and everyone will have to uptake it as the &quot;costs of doing business&quot;. Amazon ElastiCache takes away many of the headaches of deploying, operating and scaling the caching infrastructure. A Cache Cluster, which is a set of collaborating Cache Nodes, can be started in minutes. Scaling the total memory in the Cache Cluster is under complete control of the customers as Caching Nodes can be added and deleted on demand. Amazon Cloudwatch can be used to get detailed metrics about the performance of the Cache Nodes. Amazon ElastiCache automatically detects and replaces failed Cache Nodes to protect the cluster from those failure scenarios. Access to the Cache Cluster is controlled using Cache Security Groups giving customers full control over which application components can access which Cache Cluster.&lt;/p&gt;

&lt;p&gt;Amazon ElastiCache is compliant with &lt;a href=&quot;http://memcached.org&quot;&gt;Memcached&lt;/a&gt;, which makes it easy for developers who are already familiar with that system to start using the service immediately. Existing applications, tools and libraries that are using a Memcached environment can simply switch over to using Amazon ElastiCache without much effort.&lt;/p&gt;

&lt;p&gt;For more details on Amazon ElastiCache visit the &lt;a href=&quot;http://aws.amazon.com/elasticache&quot;&gt;detail page&lt;/a&gt; of the service. For more hands-on information and to get started right away, see &lt;a href=&quot;http://aws.typepad.com/aws/2011/08/amazon-elasticache-distributed-in-memory-caching.html&quot;&gt;Jeff Barr's posting&lt;/a&gt; on the AWS Developer Blog. Please note that Amazon ElastiCache is currently available in the US East (Virginia) Region. It will be available in other AWS Regions in the coming months.&lt;/p&gt;
</description>
	 	</item>
	 	
	 	<item>
	   		<title>Job Openings in AWS - Senior Leader in Database Services</title>
	   		<link>http://www.allthingsdistributed.com/2011/08/AWS-jobs-senior-leader-database-services.html</link>
	   		<guid>http://www.allthingsdistributed.com/2011/08/AWS-jobs-senior-leader-database-services.html</guid>
	   		<pubDate>Sat, 20 Aug 2011 05:40:58 EST</pubDate>
	   		<description>&lt;p&gt;There are some great job openings within Amazon Web Services. I will try to highlight some of those in coming weeks. This week it is an opening for senior leaders with AWS Database Services.&lt;/p&gt;

&lt;p&gt;AWS Database Services is responsible for setting the database strategy and delivering distributed structured storage services to our AWS customers. This team is constantly rethinking the assumptions behind how traditional databases were built and constantly working on building the right database architectures suited for the Cloud environment. The database services organization is looking for senior leaders who will be able to hire and lead a large software development team that is responsible for designing and running services that are at the cutting edge of distributed database technology that helps our customers to build scalable database-driven applications in the cloud and have a significant bottom-line impact on our business.&lt;/p&gt;

&lt;p&gt;The ideal candidate will be someone who has built and ran large scale distributed systems and/or databases. She (or he) will be able to reason about the standard tradeoffs in building large scale distributed databases and is capable of guiding the team to make these tradeoffs.&lt;/p&gt;

&lt;p&gt;For more information: &lt;a href=&quot;http://www.amazon.com/gp/jobs/141602/ref=j_sr_1_t?ie=UTF8&amp;amp;category=*&amp;amp;location=*&amp;amp;keywords=senior%20manager%20simpledb&amp;amp;page=1&quot;&gt;Head of Software Development&lt;/a&gt; &lt;/p&gt;
</description>
	 	</item>
	 	
	 	<item>
	   		<title>Driving down the cost of Big-Data analytics</title>
	   		<link>http://www.allthingsdistributed.com/2011/08/amazon-emr-on-ec2-spot-instances.html</link>
	   		<guid>http://www.allthingsdistributed.com/2011/08/amazon-emr-on-ec2-spot-instances.html</guid>
	   		<pubDate>Fri, 19 Aug 2011 09:00:50 EST</pubDate>
	   		<description>&lt;p&gt;The &lt;a href=&quot;http://aws.amazon.com/elasticmapreduce/&quot;&gt;Amazon Elastic MapReduce&lt;/a&gt; (EMR) team announced today the ability to seamlessly use &lt;a href=&quot;http://aws.amazon.com/ec2/spot-instances/&quot;&gt;Amazon EC2 Spot Instances&lt;/a&gt; with their service, significantly driving down the cost of data analytics in the cloud. Many of our Big-Data customers already saw a big drop in their AWS bill last month when the cost of incoming bandwidth was &lt;a href=&quot;http://www.allthingsdistributed.com/2011/06/aws_bandwith_price_drop_july11.html&quot;&gt;dropped to $0.00&lt;/a&gt;. Now, given that historically customers using Spot Instances have seen cost saving up to 66% over On-Demand Instance prices, Amazon EMR customers are poised to achieve even greater cost savings.&lt;/p&gt;

&lt;p&gt;Analyzing vast amounts of data is critical for companies looking to incorporate customer insights into their business, including building recommendation engines or optimizing customer targeting. &lt;a href=&quot;http://hadoop.apache.org/&quot;&gt;Hadoop&lt;/a&gt; is quickly becoming the preferred tool for this type of large scale data analytics. However, Hadoop users often waste significant intellectual bandwidth on managing clusters and running Hadoop jobs rather than focusing on creating value through analytics. &lt;a href=&quot;http://aws.amazon.com/elasticmapreduce/&quot;&gt;Amazon Elastic MapReduce&lt;/a&gt; takes away much of this muck by providing a hosted Hadoop framework that enables businesses, researchers, data analysts, and developers to easily and efficiently spin up resizable clusters for distributed processing of large data sets.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/images/ElasticMapReduceFlow.png&quot; style=&quot;text-align: center; display: block; margin: 20px auto 20px;&quot; /&gt;&lt;/p&gt;

&lt;p&gt;An interesting observation is that data analytics is no longer the purview of large enterprises. Every young business launching today knows they must integrate data collection and analytics from the start. In order to compete in today’s market, these companies must  have a deep understanding of their customers’ behavior, allowing them to continuously improve how they serve them. Launching a business with a minimally viable product and then rapidly iterating in the direction that customers lead them is becoming a standard approach to success. However, this cannot be done without efficient, scalable data analytics. Many of these startups are using Hadoop for data processing and Amazon Elastic MapReduce is the ideal environment for them: it provides instant scalability and lets them focus on analytics while EMR handles the hassle of running the various Hadoop components. Given the initial shoestring budget of many of these new companies, driving down the overall cost of analytics using Spot Instances is a huge benefit.&lt;/p&gt;

&lt;p&gt;There are three categories of instances in an Amazon EMR cluster: 1) the &lt;em&gt;Master Instance Group&lt;/em&gt; which contains the Hadoop Master Node that schedules the various tasks, 2) the &lt;em&gt;Core Instance Group&lt;/em&gt; which contains instances that both store the data to be analyzed and run map and reduce tasks, and 3) the &lt;em&gt;Task Instance Group&lt;/em&gt; which only runs map and the reduce tasks.  For each instance group, you can decide to use On-Demand Instances (possibly from your Reserved Instances pool) or Spot Instances. If you choose to use Spot Instances you provide the bid price you are willing to pay for each instance in that group. If the current Spot Price is below the bid price, the Instance Group will launch. The instance groups in which Spot Instances are appropriate depends on the use case. For example, for data-critical workloads you might decide to run only the Task Group on Spot Instances, with the Core Group on On-Demand, while if you are performing application testing you may decide to run all Instance Groups using Spot Instances.&lt;/p&gt;

&lt;p&gt;If you want a quick introduction on how to get started with mixing Spot Instances with On-Demand Instances in an Amazon EMR cluster, watch this &lt;a href=&quot;http://d2bshspm9sjaf0.cloudfront.net/videos/elasticmapreduce/usingspotwithemr/index.html&quot;&gt;Getting Started Video&lt;/a&gt;. More details can be found in the &lt;a href=&quot;http://docs.amazonwebservices.com/ElasticMapReduce/latest/DeveloperGuide/SpotInstances&quot;&gt;Spot Instances Section&lt;/a&gt; of the Amazon Elastic MapReduce &lt;a href=&quot;http://docs.amazonwebservices.com/ElasticMapReduce/latest/DeveloperGuide&quot;&gt;Developer Guide&lt;/a&gt;. The posting on the &lt;a href=&quot;http://aws.typepad.com/aws/2011/08/run-amazon-elastic-mapreduce-on-ec2-spot-instances.html&quot;&gt;AWS developer blog&lt;/a&gt; also has some more background.&lt;/p&gt;
</description>
	 	</item>
	 	
	 	<item>
	   		<title>No Server Required - Jekyll &amp; Amazon S3</title>
	   		<link>http://www.allthingsdistributed.com/2011/08/Jekyll-amazon-s3.html</link>
	   		<guid>http://www.allthingsdistributed.com/2011/08/Jekyll-amazon-s3.html</guid>
	   		<pubDate>Thu, 18 Aug 2011 04:40:58 EST</pubDate>
	   		<description>&lt;p&gt;As some of you may remember I was pretty excited when &lt;a href=&quot;http://aws.amazon.com/s3&quot;&gt;Amazon Simple Storage Service&lt;/a&gt; (S3) released its website feature such that I could serve this &lt;a href=&quot;http://www.allthingsdistributed.com/2011/02/website_amazon_s3.html&quot;&gt;weblog completely from S3&lt;/a&gt;. If you have a largely static site you can rely on the enormous power of S3 to make serving your content highly scalable and storing it extremely durable. Amazon S3 is much more than just storage; the network and distributed systems infrastructure to ensure that content can be served fast and at high rates without customers impacting each other, is amazing. Just dropping your website in an S3 bucket brings all that power to you.&lt;/p&gt;

&lt;p&gt;And it is not just purely static websites. The increasing sophistication of client-side JavaScript has redefined what dynamic means; where in the past dynamic content would be mainly server generated, today much content is served statically with JavaScript on the client side doing the dynamic modifications. A good example is the comments section on this blog; a few lines of JavaScript and these pages have a dynamic nature with comments, trackbacks and social media discussion showing up as they happen.&lt;/p&gt;

&lt;p&gt;But while this blog happily runs out of S3, the process of creating and updating the content still required a server to run my Moveable Type installation and hold the database. I took my time to figure out what weblog CMS I was going to use to free me from having to run a server. Of course the easiest would have been to just install Wordpress on a Amazon EC2 micro instance and use a plugin to convert wordpress php to static pages and then sync that to S3. But I really want a setup that allows me to thinker with the blog where ever I am (e.g. at 30,000 feet). Ideally for me my blog content would sit in DropBox and I just run a converter to generate a version of the website whenever I want, regardless which laptop I have with me.  This left me with two top choices: Cactus and Jekyll.&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://github.com/koenbok/Cactus&quot;&gt;Cactus&lt;/a&gt; is a static website generator developed by &lt;a href=&quot;http://www.linkedin.com/in/koenbok&quot;&gt;Koen Bok&lt;/a&gt; of &lt;a href=&quot;http://www.madebysofa.com/&quot;&gt;Made By Sofa&lt;/a&gt; (recently &lt;a href=&quot;http://www.madebysofa.com/blog/facebook-acquires-sofa/&quot;&gt;acquired by Facebook&lt;/a&gt;). It is simple and elegant, as you would expect from someone who has won several design awards. It is written in Python and makes use of the &lt;a href=&quot;https://docs.djangoproject.com/en/dev/ref/templates/builtins/?from=olddocs&quot;&gt;Django templates&lt;/a&gt;, which makes it  very powerful. Cactus had my preference as learning more about Django was still on my todo list. Although there are some good examples that come with Cactus is still early days and there is not much of a community using it. Combine that with the generic power of Django templates and my task lists for figuring out each of the pieces for my blog was substantial. I decided to let it rest for a moment (sorry Koen) and get back to it later when I can more easily step in the shoes of others.&lt;/p&gt;

&lt;p&gt;&lt;img alt=&quot;Jekyll&quot; src=&quot;/images/jekyll.png&quot; style=&quot;text-align: center; display: block; margin: 20px auto 20px;&quot; /&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;http://jekyllrb.com/&quot;&gt;Jekyll&lt;/a&gt; also is a static website generator. It has been developed by &lt;a href=&quot;http://tom.preston-werner.com/&quot;&gt;Tom Preston-Werner&lt;/a&gt; of &lt;a href=&quot;https://github.com/&quot;&gt;GitHub&lt;/a&gt; fame. It is in daily use to generate much of the GitHub pages and a whole &lt;a href=&quot;https://github.com/mojombo/jekyll/wiki/Sites&quot;&gt;series of weblogs&lt;/a&gt;. Next to that there is a very active community developing &lt;a href=&quot;https://github.com/mojombo/jekyll/wiki/Plugins&quot;&gt;plugins&lt;/a&gt; and &lt;a href=&quot;http://rfelix.com/2010/01/19/jekyll-extensions-minus-equal-pain/&quot;&gt;extensions&lt;/a&gt; which address a number of things that I want to do with the blog in the future. Jekyll in written in Ruby and uses &lt;a href=&quot;http://yaml.org/&quot;&gt;YAML&lt;/a&gt; for metadata management and uses the &lt;a href=&quot;http://www.liquidmarkup.org/&quot;&gt;Liquid template engine&lt;/a&gt; to manipulate the content. Let there be no mistake: Jekyll is not a polished high-end dashboard driven CMS, it is best described by TPW's original charge: &lt;a href=&quot;http://tom.preston-werner.com/2008/11/17/blogging-like-a-hacker.html&quot;&gt;Blogging like a Hacker&lt;/a&gt;. Which suits me just fine.&lt;/p&gt;

&lt;p&gt;I have now for the most part replicated the way that my blog was generated in MT but now using Jekyll. I am still using the same layout and css I used with MT, as I prefer to make one change at the time: design comes next. I have regenerated all pages since 2005, the pages before that can be found in the &quot;/historical&quot; section. There are a number of pages in the &quot;categories&quot; section that have not been regenerated as according to the website statistics not many of those were accessed.&lt;/p&gt;

&lt;p&gt;My templates and blog posts are now located in &lt;a href=&quot;https://www.dropbox.com&quot;&gt;DropBox&lt;/a&gt; and thus locally cached at each machine I use. I simple have to run Jekyll to generate a version of the site and &lt;a href=&quot;http://s3tools.org/s3tools&quot;&gt;s3cmd&lt;/a&gt; takes care of the rest.&lt;/p&gt;

&lt;p&gt;In the coming days I will cleanup the templates and put them in GitHub for others to reuse. I will also submit my convertor to transform an MT installation using SQLite into input for Jekyll.&lt;/p&gt;

&lt;p&gt;I am grateful to Matt Mullenweg for the magnificent Wordpress, it is not your fault I didn't want to run a server, to Koen Bok for the elegant Cactus, I am sure to come back to it when I have more guts and time, and to Tom Preston-Werner for enabling me to Blog like a Hacker.&lt;/p&gt;

&lt;p&gt;No Server Required. Amazon S3 FTW!&lt;/p&gt;
</description>
	 	</item>
	 	
	 	<item>
	   		<title>Expanding the Cloud - The AWS GovCloud (US) Region</title>
	   		<link>http://www.allthingsdistributed.com/2011/08/aws_govcloud_region.html</link>
	   		<guid>http://www.allthingsdistributed.com/2011/08/aws_govcloud_region.html</guid>
	   		<pubDate>Tue, 16 Aug 2011 23:40:58 EST</pubDate>
	   		<description>&lt;p&gt;Today AWS announced the launch of the AWS GovCloud (US) Region. This new region, which is located on the West Coast of the US, helps US government agencies and contractors move more of their workloads to the cloud by implementing a number of US government-specific regulatory requirements.&lt;/p&gt;

&lt;p&gt;The concept of regions gives AWS customers control over the placement of their resources and services. Next to GovCloud (US) there are five general purpose regions; two in the US (one on the west coast and one on the east coast), one in the EU (in Ireland) and two in APAC (in Singapore and Tokyo). There are different considerations when deciding where to allocate resources with latency and cost being the two obvious ones, but compliance sometimes plays an important role as well. For example a number of our European customers are subject to data residency requirements when it comes to PII data and they use the EU Region to meet to those requirements.&lt;/p&gt;

&lt;p&gt;Our government customers sometimes have an additional layer of regulatory requirements given that they at times deal with highly sensitive information, such as defense-related data. These customers are satisfied with the general security controls and procedures in AWS but in these more sensitive cases they often need assurances that only personnel that meet certain requirements, e.g. citizenship or permanent residency, can  access  their data. AWS GovCloud (US) implements specific requirements of the US government such that agencies at the federal, state and local levels can use the AWS cloud for their more sensitive workloads.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Cloud First&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The US Federal Cloud Computing Strategy lays out a “Cloud First” strategy which compels US federal agencies to consider Cloud Computing first as the target for their IT operations:&lt;/p&gt;

&lt;blockquote&gt;&lt;p&gt;&lt;em&gt;To harness the benefits of cloud computing, we have instituted a Cloud First policy. This policy is intended to accelerate the pace at which the government will realize the value of cloud computing by requiring agencies to evaluate safe, secure cloud computing options before making any new investments&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;By leveraging shared infrastructure and economies of scale, cloud computing presents a compelling business model for Federal leadership  Organizations will be able to measure and pay for only the IT resources they consume, increase or decrease their usage to match requirements and budget constraints, and leverage the shared underlying capacity of IT resources via a network  Resources needed  to support mission critical capabilities can be provisioned more rapidly and with minimal overhead and routine provider interaction.&lt;/em&gt;&lt;/p&gt;&lt;/blockquote&gt;

&lt;p&gt;Given the current economic climate, reducing cost within the US federal government is essential -- and an aggressive move to cloud will have a substantial positive impact on the governments IT budget. The move to the cloud is projected by 2015 see a reduction of 30% in IT infrastructure costs, which amounts to $7.2 billion. The application of the Cloud First strategy across all agencies will see many cost savings similar to what the GSA saw when they moved their main portal to the cloud: a savings of $1.7M on a yearly basis while greatly improving uptime and maintainability.&lt;/p&gt;

&lt;p&gt;With AWS’s strategy of continuous price reduction as additional economies of scale are achieved, many of these cost saving may become even more substantial without the agencies have to do anything.&lt;/p&gt;

&lt;p&gt;Many US federal agencies are already migrating existing IT infrastructure onto the cloud using Amazon Web Services. The Cloud First strategy is most visible with new Federal IT programs, which are all designed to be “Cloud Ready”; many of these applications are launching on AWS from the start, and a number can be found on the AWS Federal use case list.&lt;/p&gt;

&lt;p&gt;There were however a number of programs that really could benefit from the Cloud but which had unique regulatory requirements, such as ITAR, that blocked migration to AWS. ITAR is the International Traffic in Arms Regulatory framework which stipulates for example that data must be stored in an environment where physical and logical access is restricted to US Persons. There is no formal ITAR certification process, but a review of the ITAR compliance program for AWS GovCloud (US) has been conducted and resulted in a favorable letter of attestation with respect to the stated ITAR objects. This clears the path for agencies that have IT programs that need to be ITAR-compliant to start using AWS GovCloud (US) for these applications.&lt;/p&gt;

&lt;p&gt;This new region, like all other AWS regions, provides FISMA Moderate controls and supports existing AWS security controls and certifications such as SAS-70, ISO 27001 and PCI DSS Level 1.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Government and Big Data&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;One particular early use case for AWS GovCloud (US) will be massive data processing and analytics. Several agencies of very different parts of the government have needs for data analytics that really put the Big in Big-Data, sometimes several orders of magnitude larger than commonly found in industry. Examples here are certain agencies that work on national security and those that work on economic recovery; their incoming data streams are exploding in size and their needs for collecting, storing, organizing, analyzing and sharing are changing rapidly. It is very difficult for an on-premise IT infrastructure to effectively address the needs of these agencies and the time scales at which they need to operate. The scalability, flexibility and the elasticity of AWS makes it an ideal environment for the agencies to run their analytics.&lt;/p&gt;

&lt;p&gt;Often the data streams that they operate on are not classified in nature, but the combination and aggregation of these streams using complex new algorithms may fall for example under the controls of ITAR. AWS GovCloud (US) will be used by several of these agencies to help them with their Bigger-than-Big-Data needs.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;More information&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;As with all AWS services and regions, information on GovCloud is publicly available on the AWS website, However, given the restrictive nature of this new AWS Region, customers will need to sign an AWS GovCloud Enterprise agreement that requires a manual step beyond the usual self-service signup process. To make use of the services in this region, customers will use the Amazon Virtual Private Cloud (VPC) to organize their AWS resources.&lt;/p&gt;

&lt;p&gt;As the name of the region already suggests, we do not envision that over time GovCloud will address only the needs of the US Government and contractors. We are certainly interested in understanding whether there are opportunities in other governments with respect to their specific regulatory requirements that could be solved by a specialized region.&lt;/p&gt;

&lt;p&gt;For more details on the AWS GovCloud  (US) visit the Federal Government section of the AWS website and the posting on the AWS developer blog.&lt;/p&gt;
</description>
	 	</item>
	 	
	</channel>
</rss>
