
Database Developments - Startupping Forums
by Lost at 4:50 am EST, Dec 15, 2007

The single biggest challenge in scaling an Internet service is the database. In my experience, the way things generally work is like this: you put up an Internet service using a single instance of a basic database, like MySQL or BerkeleyDB. Everything runs off that single database. As you grow, the database becomes overloaded, and you begin an endless cycle of scaling work. First, you split the single database into multiple, unrelated databases. You also need to worry about reliability, so you start looking at things like hot backups. Then you look at adding read-only copies of your databases, and with those you have to worry about keeping things in sync and other related hassles. And of course you start implementing in-memory caches using something like memcached, which are great but have their own issues as well (cold caches on restarts being the biggest I can think of offhand). All of this takes a lot of work, a lot of expertise, and a lot of maintenance.
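To make the cold-cache issue concrete, here is a minimal sketch of the usual read-through pattern. It is an illustration only: SlowDatabase and CacheAside are hypothetical names, and a plain dict stands in for memcached so the example runs without any server.

```python
class SlowDatabase:
    """Hypothetical stand-in for a single MySQL/BerkeleyDB instance."""

    def __init__(self, rows):
        self._rows = dict(rows)
        self.reads = 0  # counts reads that actually reach the database

    def get(self, key):
        self.reads += 1
        return self._rows.get(key)


class CacheAside:
    """Check the cache first; fall back to the database on a miss."""

    def __init__(self, db):
        self._db = db
        self._cache = {}  # assumption: a dict standing in for memcached

    def get(self, key):
        if key in self._cache:
            return self._cache[key]       # cache hit: no database read
        value = self._db.get(key)         # cache miss: go to the database
        if value is not None:
            self._cache[key] = value      # populate the cache for next time
        return value

    def restart(self):
        """Simulate a cache restart: everything in memory is lost."""
        self._cache.clear()


db = SlowDatabase({"user:1": "alice", "user:2": "bob"})
cache = CacheAside(db)

for _ in range(3):
    cache.get("user:1")
reads_warm = db.reads            # only the first lookup reached the database

cache.restart()                  # the cache is now cold
cache.get("user:1")
reads_after_restart = db.reads   # the same key had to be re-read
```

While the cache is warm, repeated lookups never touch the database. After a restart every key misses once, so the full read load lands on the database at the same moment, which is exactly when it is least welcome.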

Two items relating to databases caught my eye this week. The first was the benchmarking of a RAID consisting of Solid State Drives, or SSDs. An SSD is basically a chunk of non-volatile memory in a package with a disk-drive interface. They're designed to replace standard hard drives with something closer to the performance of RAM. While SSDs have been in development for a few years, I think they're really starting to become interesting for use in databases. They provide orders-of-magnitude speedups over hard drives in some areas, like seek performance. They're not perfect, but if I were in the situation where I needed to scale a database immediately, I'd definitely look at using an SSD.
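A rough back-of-the-envelope calculation shows why seek performance matters so much for databases. The latencies below are illustrative assumptions, not figures from the benchmark mentioned above: about 10 ms per random access for a hard drive and about 0.1 ms for an SSD.

```python
# Assumed, illustrative latencies in microseconds (not measured values).
HDD_RANDOM_ACCESS_US = 10_000   # ~10 ms: seek plus rotational delay
SSD_RANDOM_ACCESS_US = 100      # ~0.1 ms: no moving parts

MICROSECONDS_PER_SECOND = 1_000_000


def random_reads_per_second(latency_us):
    """Upper bound on serial random reads per second for one device."""
    return MICROSECONDS_PER_SECOND // latency_us


hdd_iops = random_reads_per_second(HDD_RANDOM_ACCESS_US)
ssd_iops = random_reads_per_second(SSD_RANDOM_ACCESS_US)
speedup = ssd_iops // hdd_iops

print(f"HDD: ~{hdd_iops} random reads/s")
print(f"SSD: ~{ssd_iops} random reads/s")
print(f"Speedup: ~{speedup}x")
```

Under these assumptions a single hard drive tops out at about a hundred random reads per second, which is the workload an index lookup on a busy database tends to generate. Sequential throughput tells a much less dramatic story, which is one reason SSDs are not a cure-all.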

The other item was the announcement of Amazon's SimpleDB. This is a new web service that goes along with the existing S3 and EC2 services. SimpleDB is pitched as a robust, simple, scalable database system. If it offers high performance and high reliability, as they claim, then this could be a very big deal. In combination with the other services, it would eliminate the vast majority of scaling issues facing an Internet service. It's only in limited beta right now, but once it goes live (and works as advertised), I think I'd be hard pressed not to recommend that a new startup host its entire site using Amazon. This further reduces the cost structure associated with an Internet startup (both in terms of money and in terms of talent needed) and lowers the barrier to entry for new services. Like I said, a very big deal.


 
 