Hive, Spark, Presto for Interactive Queries on Big Data

Traditional relational database systems can not be efficiently used to analyze data with large volume and different formats, i.e. big data. Apache Hadoop is one of the first open-source tools that provides a distributed data storage system and resource manager. The space of big data processing has b...

Full description

Bibliographic Details
Main Author: Gureev, Nikita
Format: Others
Language:English
Published: KTH, Skolan för elektroteknik och datavetenskap (EECS) 2018
Subjects:
SQL
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-234927