Amazon Aurora is an AWS service for relational databases. As part of this paper, we will look into the decisions that led to a scalable…
Paper Notes: Bitcask – A Log-Structured Hash Table for Fast Key/Value Data
As part of this post, I will cover the research paper for Bitcask and do a code walkthrough of an implementation that I wrote using…
Paper Notes: SWIM (Scalable Weakly-consistent Infection-style Process Group Membership Protocol)
In any distributed system comprising multiple nodes, one question that we need to answer time and again is which nodes in our system are…
Paper Notes: Bigtable – A Distributed Storage System for Structured Data
Bigtable is a distributed storage solution developed at Google for storing structured data in a scalable manner. Multiple products (at the time the paper was written)…
Paper Notes: Real-time Data Infrastructure at Uber
An application like Uber might seem simple at first glance, but it does a great job of hiding complexity in order to provide a great…
Paper Notes: Amazon DynamoDB – A Scalable, Predictably Performant, and Fully Managed NoSQL Database Service
Some time back, I wrote a post on Dynamo, which is Amazon’s key-value store. This paper is not about that. This paper was presented at a…
Paper Notes: WiscKey – Separating Keys from Values in SSD-conscious Storage
During the last post about LSM trees, we discussed how awesome they are for write-heavy storage engines. But we also got a glimpse into why…
Paper Notes: Zookeeper – Wait-free coordination for Internet-scale systems
In a distributed computing environment, servers often need to work with a shared set of attributes, such as locks and system configurations. In order to…
Paper Notes: Dynamo – Amazon’s Highly Available Key-value Store
Dynamo is one of the most influential papers in the domain of distributed storage technologies. It has inspired multiple open-source databases such as Cassandra,…
Paper Notes: Cassandra – A Decentralized Structured Storage System
Cassandra is a distributed storage system that focuses on providing a highly scalable and available service for storing very large amounts of data. As part…