Large-scale data processing: HBase vs Cassandra [closed]

As a Cassandra developer, I’m better at answering the other side of the question: Cassandra scales better. Cassandra is known to scale to over 400 nodes in a cluster; when Facebook deployed Messaging on top of HBase they had to shard it across 100-node HBase sub-clusters. Cassandra supports hundreds, even thousands of ColumnFamilies. “HBase currently …

SQL versus NoSQL (speed)

People who use NoSQL tend to choose it specifically because it fits their use cases. Freed from the usual RDBMS table relationships and constraints, and from ACID guarantees on the data, it is much easier to make it run a lot faster. Consider Twitter, which uses NoSQL because a user only does very limited things on site, …

Best data store for billions of rows

Storing ~3.5 TB of data, inserting about 1K/sec 24×7, and querying at an unspecified rate is possible with SQL Server, but there are more questions: What availability requirement do you have for this? 99.999% uptime, or is 95% enough? What reliability requirement do you have? Does missing an insert cost you $1M? What recoverability …
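To make the figures in this excerpt concrete, here is a rough back-of-the-envelope sketch. It assumes "1K/sec" means 1,000 inserts per second and uses a 365-day year; the function names are hypothetical and only illustrate the arithmetic.

```python
# Back-of-the-envelope arithmetic for the insert rate and uptime
# figures quoted above. Assumes "1K/sec" = 1,000 inserts per second.
SECONDS_PER_DAY = 24 * 60 * 60
MINUTES_PER_YEAR = 365 * 24 * 60


def rows_per_day(inserts_per_second):
    """Rows written per day at a constant, round-the-clock insert rate."""
    return inserts_per_second * SECONDS_PER_DAY


def allowed_downtime_minutes_per_year(availability_percent):
    """Minutes of downtime per year that an uptime target still permits."""
    return MINUTES_PER_YEAR * (1 - availability_percent / 100)


print(rows_per_day(1000))  # rows per day at 1K inserts/sec
print(round(allowed_downtime_minutes_per_year(99.999), 2))  # minutes/year
print(round(allowed_downtime_minutes_per_year(95) / (24 * 60), 2))  # days/year
```

At this rate a table gains roughly 86 million rows a day, and the gap between the two uptime targets is the gap between about five minutes and about eighteen days of downtime per year, which is why the availability question matters before choosing a store.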

MongoDB with Redis

Redis and MongoDB can be used together with good results. A company well known for running MongoDB and Redis (along with MySQL and Sphinx) is Craigslist. See this presentation from Jeremy Zawodny. MongoDB is interesting for persistent, document-oriented data indexed in various ways. Redis is more interesting for volatile data, or latency-sensitive semi-persistent data. …

NoSQL Crash Course/Tutorial [closed]

In its most basic form, NoSQL is really no more than a way of storing objects using some sort of key/value pairing system. I assume you already use this all the time. For instance, in JavaScript you can create an object named foo and then do foo['myobj'] = myobj; to store stuff in the object. …
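The same key/value idea can be sketched in Python with a plain dict. This is only an in-memory illustration of the concept described above, not how any particular NoSQL product is implemented; the `KeyValueStore` class and its method names are made up for the example.

```python
class KeyValueStore:
    """A toy in-memory key/value store backed by a dict (illustrative only)."""

    def __init__(self):
        self._data = {}

    def put(self, key, value):
        # Store any object under a key, like foo['myobj'] = myobj.
        self._data[key] = value

    def get(self, key, default=None):
        # Look the object up by key; return `default` if the key is absent.
        return self._data.get(key, default)

    def delete(self, key):
        # Remove the key if present; silently ignore unknown keys.
        self._data.pop(key, None)


store = KeyValueStore()
store.put("myobj", {"name": "example", "tags": ["a", "b"]})
print(store.get("myobj"))
print(store.get("missing"))
```

Note that the stored value is an arbitrary object with no fixed schema, and the only way to reach it is by its key.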

Cassandra port usage – how are the ports used?

@Schildmeijer is largely right; however, port 7001 is still used with TLS-encrypted internode communication. So my complete list for current versions of Cassandra would be: 7199 – JMX (was 8080 pre Cassandra 0.8.xx); 7000 – Internode communication (not used if TLS enabled); 7001 – TLS Internode communication (used if TLS enabled); 9160 – …
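As a minimal sketch, the ports named in this excerpt can be captured in a small lookup and checked for reachability, for example when writing firewall rules. Only the three ports fully described above are included; the helper names `internode_port` and `is_port_open` are hypothetical, and the reachability check uses the standard library's `socket.create_connection`.

```python
import socket

# Ports as listed in the excerpt above (the truncated entry is left out).
CASSANDRA_PORTS = {
    7199: "JMX (was 8080 before Cassandra 0.8.xx)",
    7000: "internode communication (not used if TLS is enabled)",
    7001: "TLS internode communication (used if TLS is enabled)",
}


def internode_port(tls_enabled):
    """Return the internode port in use, per the list above."""
    return 7001 if tls_enabled else 7000


def is_port_open(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


for port, purpose in sorted(CASSANDRA_PORTS.items()):
    print(port, purpose)
```

Calling `is_port_open("127.0.0.1", internode_port(tls_enabled=True))` on a node would then report whether the TLS internode port accepts connections; the host here is just a placeholder.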
