Database Research & Development

  • Home
  • NoSQL
    • NoSQL
    • Cassandra
  • Databases
    • Database Theory
    • Database Designing
    • SQL Server Coding Standards
    • SQL Server
    • PostgreSQL
    • MySQL
    • Greenplum
    • Linux
  • Interviews
    • SQL Server Interviews
    • MySQL Interviews
    • SQL Puzzles
  • DBA Scripts
    • SQL Server DBA Scripts
    • PostgreSQL DBA Scripts
    • MySQL DBA Scripts
    • Greenplum DBA Scripts
  • Home
  • Blog Archives !
  • (: Laugh@dbrnd :)
  • Contact Me !
sqlserverinterviews
Home 2016 April NoSQL: Key concepts of the Cassandra Architecture (Day 1)

NoSQL: Key concepts of the Cassandra Architecture (Day 1)

This article is half-done without your Comment! *** Please share your thoughts via Comment ***

As I started to learn and post about the Apache Cassandra so in this post, I am going to share some of the important Key concepts of the Cassandra Architecture.

Data Replication:

Cassandra supports high availability by implementing the Data Replication.
One logical database is spread among the multiple Nodes of the Cluster, and it creates different replicas of the Nodes.
If one Node goes down, another Node is available with the data, so it avoids a single point of failure.

Consistent Hashing Algorithm:

Actually, there are primary two problems with the distributed database system, The first is, every time determining a Node with the specific data set and The second is, require to reduce the data movement when adding or removing Nodes.
The Consistent Hashing Algorithm achieves this problem by storing Cassandra row keys to physical nodes.

Data Consistency:

As data is replicated across the different nodes, it ensures that data is synchronised across all the replicas. Before any operation on the nodes, it checks the last update value and version of the data.
It also provides the tunable consistency in which user can determine the consistency level.

Gossip Protocol:

Cassandra is using Gossip Protocol in which all Nodes discover information about the other nodes by exchanging information with each other.
Because of the network traffic, Nodes cannot exchange the information of all nodes, but it can exchange information maximum of 3 nodes.

Snitches and Replication Strategies:

Cassandra uses snitches and replication strategies to determine how data is replicated across all data centres and nodes.

The Snitches determine the how each node uses near to each other in the ring and Replication strategies this information to determine location information for each copy of the data.

Bloom Filters:

A bloom filter tests the existence of a data structure, and it checks that items are available in the set or not. It also reduces CPU I/O operation because it is swift.

Merkle Tree:

It is one type of hash tree, and it uses for finding the differences between the Nodes.

SSTable:

A Sorted String Table is ordered key-value map which is storing large sorted data.

Write Back Cache:

A Write Back Cache is only for writing operation in which write operations can use the dedicated cache.

Memtable:

A memtable is residing in the memory and manage the data of Write Back Cache.

Cassandra Keyspace:

It is similar to RDBMS schema or database. It is container of all types of data and objects.

Cassandra Column Family:

It is similar to RDBMS table and it contains column and data related information.

Apr 27, 2016Anvesh Patel
NoSQL: What is a Cassandra Query Language (CQL)?NoSQL: Cassandra Architecture - Read and Write operations in The Ring (Day 2)
Anvesh Patel
Anvesh Patel

Database Engineer

April 27, 2016 Cassandra, NoSQLAnvesh Patel, Cassandra, Column Store, database, database research and development, dbrnd, Document Store, Gossip Protocol, Graph Store, Key Value Store, Memtable, Merkle Tree, NoSQL, Not Only SQL, SSTable, Unstrucutred
About Me!

I'm Anvesh Patel, a Database Engineer certified by Oracle and IBM. I'm working as a Database Architect, Database Optimizer, Database Administrator, Database Developer. Providing the best articles and solutions for different problems in the best manner through my blogs is my passion. I have more than six years of experience with various RDBMS products like MSSQL Server, PostgreSQL, MySQL, Greenplum and currently learning and doing research on BIGData and NoSQL technology. -- Hyderabad, India.

About DBRND !

dbrnd

This is a personal blog (www.dbrnd.com).

Any views or opinions represented in this blog are personal and belong solely to the blog owner and do not represent those of people, institutions or organizations that the owner may or may not be associated with in professional or personal capacity, unless explicitly stated.

Feel free to challenge me, disagree with me, or tell me I’m completely nuts in the comments section of each blog entry, but I reserve the right to delete any comment for any reason whatsoever (abusive, profane, rude, or anonymous comments) - so keep it polite.

The content of this website is protected by copyright. No portion of this website may be copied or replicated in any form without the written consent of the website owner.

Recent Comments !
  • Anvesh Patel { Sure will do... } – May 27, 12:43 PM
  • Anvesh Patel { Great... } – May 27, 12:41 PM
  • Anvesh Patel { Great... } – May 27, 12:39 PM
  • Anvesh Patel { Great... } – May 27, 12:36 PM
  • Anvesh Patel { Great... } – May 27, 12:28 PM
  • Anvesh Patel { Great... } – May 27, 12:27 PM
  • Anvesh Patel { Great... } – May 27, 12:16 PM
  • Older »
Follow Me !
  • facebook
  • linkedin
  • twitter
  • youtube
  • google
  • flickr
© 2015 – 2019 All rights reserved. Database Research & Development (dbrnd.com)
Posting....