How does Dynamo handle consistency and availability? What are the issues that must be resolved?
- Dynamo: Amazon’s Highly Available Key-value Store
Why and how does Dynamo modify regular consistent hashing? How does this affect replication?
- Dynamo: Amazon’s Highly Available Key-value Store
How does Dynamo handle shopping cart inconsistency?
- Dynamo: Amazon’s Highly Available Key-value Store
Describe the BigTable data model. How does it handle persistence and availability?
- The BigTable data model is a multi-dimensional sorted map indexed by row key, column key, and timestamp. Row keys are arbitrary strings, currently up to 64KB in size, although 10-100 bytes is typical [1]. Every read or write under a single row key is atomic, which simplifies reasoning about concurrent updates to the same row. Column keys are grouped into column families, and the data in one family is usually of the same type. The column family is the basic unit of access control, and disk and memory accounting are also performed at the column-family level. Lastly, timestamps index each version of a cell. Because BigTable can hold many versions of the same data, the timestamp makes it possible to retrieve a specific version.
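The map described above can be sketched in a few lines of Python. This is only a toy, in-memory illustration of the (row key, column key, timestamp) indexing, not BigTable's actual implementation. The class name `TinyTable` and its methods are hypothetical, and the sample row and column names are illustrative values in the style of a web-page table.

```python
from collections import defaultdict


class TinyTable:
    """Toy model of a map: (row key, column key, timestamp) -> value."""

    def __init__(self):
        # row key -> "family:qualifier" column key -> {timestamp: value}
        self._rows = defaultdict(lambda: defaultdict(dict))

    def put(self, row, column, timestamp, value):
        """Store one version of a cell."""
        self._rows[row][column][timestamp] = value

    def get(self, row, column, timestamp=None):
        """Return the newest version at or before `timestamp`.

        With timestamp=None, return the newest version overall.
        Returns None if no matching version exists.
        """
        versions = self._rows.get(row, {}).get(column, {})
        candidates = [t for t in versions if timestamp is None or t <= timestamp]
        return versions[max(candidates)] if candidates else None

    def scan(self, start_row, end_row):
        """Yield (row, columns) in sorted row-key order, start_row <= row < end_row."""
        for row in sorted(self._rows):
            if start_row <= row < end_row:
                yield row, {c: dict(v) for c, v in self._rows[row].items()}


table = TinyTable()
table.put("com.cnn.www", "contents:", 3, "<html>v1")
table.put("com.cnn.www", "contents:", 5, "<html>v2")
latest = table.get("com.cnn.www", "contents:")                 # newest version
older = table.get("com.cnn.www", "contents:", timestamp=4)     # version as of t=4
```

Keeping the versions of a cell keyed by timestamp is what lets a reader ask for "the value as of time t", and iterating rows in sorted key order is what makes range scans over adjacent row keys natural in this model.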
Why did Facebook develop Cassandra? What features of older systems did it combine?
- Cassandra - A Decentralized Structured Storage System
What is a good approach to data modeling for Cassandra? Why is this different from relational methods?
- http://www.datastax.com/dev/tutorials
Discuss the evolution of query slices in Cassandra. How exactly do they relate to the underlying database and the query language? Include a discussion of 'with compact storage.'
Why did Netflix change its architecture? Describe the rationale for each change.
- http://www.infoq.com/presentations/Keeping-Movies-Running-Amid-Thunderstorms
How does Netflix handle rapidly changing demand for services? Give a scenario.
- http://www.infoq.com/presentations/Keeping-Movies-Running-Amid-Thunderstorms
What is the 'thundering herd'? How is it handled?
- http://www.infoq.com/presentations/Keeping-Movies-Running-Amid-Thunderstorms
Describe the strategy, operation, and issues of replication and partitioning in Cassandra.