12 Comments

The final matrix is totally on point, Vivek!

Thank you, Raul 🙌

Very insightful 👍!!

Revisiting the article, I found out I had forgotten to like it :D

Thanks for this one!

Informative & succinct, Vivek!

So, does this mean that in sharding we just have a replica of the data, while in partitioning the dataset is divided?

When we shard but don’t partition (the 3rd quadrant of the matrix), each server holds a complete replica of the data. It depends entirely on your business requirements: Quadrant 3 is more suitable when the data fits on a single database server but read traffic is high and you need database availability.

Designing Data-Intensive Applications - Pg.199

Terminological confusion

What we call a partition here is called a shard in MongoDB, Elasticsearch, and SolrCloud; it’s known as a region in HBase, a tablet in Bigtable, a vnode in Cassandra and Riak, and a vBucket in Couchbase. However, partitioning is the most established term, so we’ll stick with that.

It’s very insightful 😇

Really confused about the difference. Would it be correct to say that database sharding is database replication? But if I say that, it would be wrong, because a shard might contain records that are not available in another shard.

Yes, replication cannot be called sharding.
