Spanner is a scalable & globally distributed database built at Google. It is the first database to replicate data globally while providing consistent distributed transaction…
Paper Notes: Kora – A Cloud-Native Event Streaming Platform For Kafka
With growing demand for data, robust solutions for handling large-scale data streaming has become essential for organizations. In the cloud-native world, getting both scalable and…
Paper Notes: Megastore- Providing Scalable, Highly Available Storage for Interactive Services
Megastore is a storage system built at Google that provides best of the both database worlds. It provides scalability of NoSQL along with strong consistency…
Paper Notes: Spark – Cluster Computing with Working Sets
In one of the previous posts, we looked into how MapReduce is used to perform large scale computations on large-scale data using commodity hardware. MapReduce…
Paper Notes: Distributed Transactions at Scale in Amazon DynamoDB
NoSQL databases come up with lot of good things such as high-availability, high-scalability and cloud-scale performance. But providing transaction support that doesn’t leaves the data…
Paper Notes: MapReduce – Simplified Data Processing on Large Clusters
MapReduce is another paradigm-shift similar to Google file system in the domain of distributed computing. It is a programming model for processing large sets of…
Paper Notes: The Google File System
Google file system(GFS) is a foundational paper in the domain of distributed storage. Built at Google, GFS is a scalable distributed file system for data…
Paper Notes: Firestore – The NoSQL Serverless Database for the Application Developer
In the world of application development, most of the computation is performed on the edge. Though for certain use cases such as syncing user data…
Paper Notes: Amazon Aurora – Design Considerations for High Throughput Cloud-Native Relational Databases
Amazon Aurora is an AWS service for relational database. As part of this paper, we will look into the decisions that led to a scalable…
Paper Notes: Bitcask – A Log-Structured Hash Table for Fast Key/Value Data
As part of this post, I will cover the research paper for Bitcask and do a code walkthrough of an implementation that I wrote using…