What databases do the World Wide Web’s biggest sites run on? [closed]

Facebook.com

  • MySQL with MyRocks. Used to store user info and social activities such as likes, comments, and shares.
  • Hive (data warehouse for Hadoop; supports tables and a SQL variant called HiveQL). Used for “simple summarization jobs, business intelligence and machine learning and many other applications”.
  • Cassandra (Multi-dimensional, distributed key-value store). Currently used for Facebook’s private messaging.

As of the 2009 sources listed below, Facebook was running 610 (soon to be 1000) Hadoop nodes in a single cluster, with Hive as the datastore. Facebook has open-sourced both Hive and Cassandra.
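
To give a rough idea of what a "simple summarization job" looks like, here is a minimal sketch of a SQL-style aggregate. It uses Python's built-in sqlite3 module as a stand-in, not Hive itself, and the `activity` table, its columns, and the sample rows are all made up for illustration. A HiveQL query of this kind would have a similar `GROUP BY` shape, but it would run as distributed jobs over data in Hadoop rather than in a local database.

```python
import sqlite3

# Hypothetical activity log; names and rows are illustrative only.
rows = [
    ("alice", "like"),
    ("bob", "comment"),
    ("alice", "share"),
    ("carol", "like"),
    ("bob", "like"),
]

# In-memory SQLite database standing in for a warehouse table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE activity (user_name TEXT, action TEXT)")
conn.executemany("INSERT INTO activity VALUES (?, ?)", rows)

# Count events per action type: the basic shape of a summarization query.
summary = conn.execute(
    "SELECT action, COUNT(*) AS n "
    "FROM activity GROUP BY action ORDER BY action"
).fetchall()

for action, n in summary:
    print(action, n)

conn.close()
```

The point is only the query shape (group, then aggregate); nothing here reflects Facebook's actual schema or workloads.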

Facebook stats:

  • More than 200 million active users
  • More than 100 million users log on to Facebook at least once each day
  • More than 30 million users update their statuses at least once each day
  • Average user has 120 friends on the site

Sources:

  • http://www.dbms2.com/2009/05/11/facebook-hadoop-and-hive/
  • http://www.facebook.com/note.php?note_id=89508453919
  • http://www.facebook.com/press/info.php?statistics
  • http://hadoop.apache.org/hive/
  • http://wiki.apache.org/hadoop/Hive/Design
  • http://www.facebook.com/note.php?note_id=24413138919
  • https://code.facebook.com/posts/190251048047090/myrocks-a-space-and-write-optimized-mysql-database
