Large Scale ETL Design, Optimization and Implementation Based On Spark and AWS Platform

Nowadays, the amount of data generated by users of Internet products is increasing exponentially: for instance, clickstream data from millions of users of a website application, geospatial information from GIS-based apps on Android and iPhone, or sensor data from cars or any electronic equipment,...

Bibliographic Details
Main Author: Zhu, Di
Format: Others
Language: English
Published: KTH, Skolan för informations- och kommunikationsteknik (ICT) 2017
Subjects:
ETL
AWS
Online Access: http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-215702