What is the primary purpose of a router in a scalable database design?
To perform joins between shards.
To route client requests to the appropriate shard.
To store backup configurations for shards.
To hash keys for partitioning data.
What is a "shard" in the context of databases?
A method of encrypting sensitive data.
A backup server in a replica set.
A horizontal partition of a database.
A tool for monitoring database traffic.
What is a major advantage of sharding in database systems?
It eliminates the need for backups.
It increases both scalability and redundancy.
It simplifies complex SQL queries.
It reduces the need for a router.
What is a replica set in MongoDB?
A set of configuration servers.
A collection of shards distributed across regions.
A primary server and its associated secondary servers for redundancy.
A group of shards that handle write-heavy traffic.
How does MongoDB ensure high availability when a primary server fails?
Traffic is rerouted to the config server.
Secondary servers automatically elect a new primary.
The router assigns new shards dynamically.
The primary server is rebooted automatically.
What does "eventual consistency" mean in the context of Cassandra?
Data is always immediately consistent across all nodes.
Data might take time to propagate across nodes, leading to temporary inconsistencies.
Data must be manually synchronized across nodes.
Consistency is not guaranteed due to system failures.
What is the "celebrity problem" in sharded databases?
High traffic on specific shards causes uneven load distribution.
Celebrities’ data is stored differently for security reasons.
All shards are overloaded with high read and write requests.
Certain shards fail to replicate data efficiently.
What is re-sharding, and why is it challenging?
Redistributing data when adding or removing shards, which requires fault-tolerant methods.
Creating backups for new shards added to the system.
Replicating data across all shards during traffic spikes.
Distributing write-heavy traffic across existing shards.
What is a key advantage of using a normalized database schema?
It minimizes storage space and simplifies updates.
It allows faster reads with fewer database hits.
It duplicates data for high performance.
It eliminates the need for joins between tables.
Why might denormalized data be preferred in a NoSQL database?
It reduces redundancy and improves update efficiency..
It allows for single-query retrieval of all necessary data.
It enforces strict schema rules.
It is easier to scale horizontally.
What should you prioritize when designing a horizontally scalable database?
Using as many joins as possible for efficiency.
Designing around simple key-value lookups to avoid joins.
Avoiding hash functions for data distribution.
Relying heavily on a single primary server for consistency.
Which database systems are examples of NoSQL?
PostgreSQL and MySQL.
MongoDB, Cassandra, and DynamoDB.
Oracle and SQLite.
Redis and HBase and SQLite
In what scenarios might you choose to use normalized data over denormalized data in a NoSQL database?
When updates are frequent and space efficiency is important.
When scaling horizontally is a top priority.
When high traffic requires fewer database hits.
When you need to perform complex joins efficiently.