A Data-driven, High-performance and Intelligent CyberInfrastructure to Advance Spatial Sciences

abstract: In the field of Geographic Information Science (GIScience), we have witnessed the unprecedented data deluge brought about by the rapid advancement of high-resolution data observing technologies. For example, with the advancement of Earth Observation (EO) technologies, a massive amount of E...

Full description

Bibliographic Details
Other Authors: Shao, Hu (Author)
Format: Doctoral Thesis
Language:English
Published: 2018
Subjects:
Online Access:http://hdl.handle.net/2286/R.I.51779
id ndltd-asu.edu-item-51779
record_format oai_dc
spelling ndltd-asu.edu-item-517792019-02-02T03:01:23Z A Data-driven, High-performance and Intelligent CyberInfrastructure to Advance Spatial Sciences abstract: In the field of Geographic Information Science (GIScience), we have witnessed the unprecedented data deluge brought about by the rapid advancement of high-resolution data observing technologies. For example, with the advancement of Earth Observation (EO) technologies, a massive amount of EO data including remote sensing data and other sensor observation data about earthquake, climate, ocean, hydrology, volcano, glacier, etc., are being collected on a daily basis by a wide range of organizations. In addition to the observation data, human-generated data including microblogs, photos, consumption records, evaluations, unstructured webpages and other Volunteered Geographical Information (VGI) are incessantly generated and shared on the Internet. Meanwhile, the emerging cyberinfrastructure rapidly increases our capacity for handling such massive data with regard to data collection and management, data integration and interoperability, data transmission and visualization, high-performance computing, etc. Cyberinfrastructure (CI) consists of computing systems, data storage systems, advanced instruments and data repositories, visualization environments, and people, all linked together by software and high-performance networks to improve research productivity and enable breakthroughs that are not otherwise possible. The Geospatial CI (GCI, or CyberGIS), as the synthesis of CI and GIScience has inherent advantages in enabling computationally intensive spatial analysis and modeling (SAM) and collaborative geospatial problem solving and decision making. This dissertation is dedicated to addressing several critical issues and improving the performance of existing methodologies and systems in the field of CyberGIS. My dissertation will include three parts: The first part is focused on developing methodologies to help public researchers find appropriate open geo-spatial datasets from millions of records provided by thousands of organizations scattered around the world efficiently and effectively. Machine learning and semantic search methods will be utilized in this research. The second part develops an interoperable and replicable geoprocessing service by synthesizing the high-performance computing (HPC) environment, the core spatial statistic/analysis algorithms from the widely adopted open source python package – Python Spatial Analysis Library (PySAL), and rich datasets acquired from the first research. The third part is dedicated to studying optimization strategies for feature data transmission and visualization. This study is intended for solving the performance issue in large feature data transmission through the Internet and visualization on the client (browser) side. Taken together, the three parts constitute an endeavor towards the methodological improvement and implementation practice of the data-driven, high-performance and intelligent CI to advance spatial sciences. Dissertation/Thesis Shao, Hu (Author) Li, Wenwen (Advisor) Rey, Sergio (Advisor) Maciejewski, Ross (Committee member) Arizona State University (Publisher) Geography eng 132 pages Doctoral Dissertation Geography 2018 Doctoral Dissertation http://hdl.handle.net/2286/R.I.51779 http://rightsstatements.org/vocab/InC/1.0/ 2018
collection NDLTD
language English
format Doctoral Thesis
sources NDLTD
topic Geography
spellingShingle Geography
A Data-driven, High-performance and Intelligent CyberInfrastructure to Advance Spatial Sciences
description abstract: In the field of Geographic Information Science (GIScience), we have witnessed the unprecedented data deluge brought about by the rapid advancement of high-resolution data observing technologies. For example, with the advancement of Earth Observation (EO) technologies, a massive amount of EO data including remote sensing data and other sensor observation data about earthquake, climate, ocean, hydrology, volcano, glacier, etc., are being collected on a daily basis by a wide range of organizations. In addition to the observation data, human-generated data including microblogs, photos, consumption records, evaluations, unstructured webpages and other Volunteered Geographical Information (VGI) are incessantly generated and shared on the Internet. Meanwhile, the emerging cyberinfrastructure rapidly increases our capacity for handling such massive data with regard to data collection and management, data integration and interoperability, data transmission and visualization, high-performance computing, etc. Cyberinfrastructure (CI) consists of computing systems, data storage systems, advanced instruments and data repositories, visualization environments, and people, all linked together by software and high-performance networks to improve research productivity and enable breakthroughs that are not otherwise possible. The Geospatial CI (GCI, or CyberGIS), as the synthesis of CI and GIScience has inherent advantages in enabling computationally intensive spatial analysis and modeling (SAM) and collaborative geospatial problem solving and decision making. This dissertation is dedicated to addressing several critical issues and improving the performance of existing methodologies and systems in the field of CyberGIS. My dissertation will include three parts: The first part is focused on developing methodologies to help public researchers find appropriate open geo-spatial datasets from millions of records provided by thousands of organizations scattered around the world efficiently and effectively. Machine learning and semantic search methods will be utilized in this research. The second part develops an interoperable and replicable geoprocessing service by synthesizing the high-performance computing (HPC) environment, the core spatial statistic/analysis algorithms from the widely adopted open source python package – Python Spatial Analysis Library (PySAL), and rich datasets acquired from the first research. The third part is dedicated to studying optimization strategies for feature data transmission and visualization. This study is intended for solving the performance issue in large feature data transmission through the Internet and visualization on the client (browser) side. Taken together, the three parts constitute an endeavor towards the methodological improvement and implementation practice of the data-driven, high-performance and intelligent CI to advance spatial sciences. === Dissertation/Thesis === Doctoral Dissertation Geography 2018
author2 Shao, Hu (Author)
author_facet Shao, Hu (Author)
title A Data-driven, High-performance and Intelligent CyberInfrastructure to Advance Spatial Sciences
title_short A Data-driven, High-performance and Intelligent CyberInfrastructure to Advance Spatial Sciences
title_full A Data-driven, High-performance and Intelligent CyberInfrastructure to Advance Spatial Sciences
title_fullStr A Data-driven, High-performance and Intelligent CyberInfrastructure to Advance Spatial Sciences
title_full_unstemmed A Data-driven, High-performance and Intelligent CyberInfrastructure to Advance Spatial Sciences
title_sort data-driven, high-performance and intelligent cyberinfrastructure to advance spatial sciences
publishDate 2018
url http://hdl.handle.net/2286/R.I.51779
_version_ 1718970079469633536