Overview
Apache Cassandra (https://cassandra.apache.org) is a distributed, linearly scalable, highly available NoSQL column-family database designed for large-scale write-heavy and geo-distributed deployments. It uses a peer-to-peer architecture, the Gossip protocol, and SSTable storage, and supports tunable consistency with no single point of failure.
Core features and highlights
- Linear scalability: strong horizontal scaling to smoothly handle PB-scale data and high-concurrency workloads.
- High availability and fault tolerance: multi-replica, multi-datacenter replication and automatic failover ensure continuous service availability.
- Tunable consistency: configurable from strong to eventual consistency to balance performance and correctness.
- Rich ecosystem: supports
CQLand integrates well with Spark, Kafka, Hadoop, and more.
Use cases and target users
- Typical use cases: IoT time-series data, logs and metrics storage, social platforms, real-time analytics, and mixed online transaction/analytic workloads.
- Target users: backend architects, database administrators, SREs, and big data engineering teams.