Data Lakes and Big Data Systems

What is a "data lake"?

Which cloud service is commonly used to implement a data lake?

What is the primary purpose of AWS Glue in the context of a data lake?

What tool allows SQL queries directly on data stored in Amazon S3?

How does Redshift Spectrum enhance the capabilities of Amazon Redshift?

Why is partitioning data important in a data lake?

What is a typical partitioning strategy for storing log data?

How should you approach data lake architecture from a system design perspective?

What is one advantage of using off-the-shelf tools like AWS Glue and Amazon Athena?