GLOBAL DATA SPATIALLY INTERRELATE SYSTEM FOR SCIENTIFIC BIG DATA SPATIAL-SEAMLESS SHARING
|
|
|
- Logan Morrison
- 10 years ago
- Views:
Transcription
1 GLOBAL DATA SPATIALLY INTERRELATE SYSTEM FOR SCIENTIFIC BIG DATA SPATIAL-SEAMLESS SHARING Jieqing Yu a, Lixin WU b, a, c*, Yizhou Yang c, Xie Lei d, Wang He d a School of Environment Science and Spatial Informatics, China University of Mining and Technology, Xuzhou, China ([email protected]) b IoT (Mine Perception) Centre, China University of Mining and Technology, Xuzhou, China; [email protected] c Institute of Geo-informatics and Digital Mine Research, Northeastern University, Shenyang, China d College of Geoscience and Surveying Engineering, China University of Mining and Technology (Beijing), Beijing, China Commission: IV KEY WORDS: data sharing, global data spatially interrelate system, earth system spatial grid, SDOG-ESSG, big data management ABSTRACT: A good data sharing system with spatial-seamless services will prevent the scientists from tedious, boring, and time consuming work of spatial transformation, and hence encourage the usage of the scientific data, and increase the scientific innovation. Having been adopted as the framework of Earth datasets by Group on Earth Observation (GEO), Earth System Spatial Grid (ESSG) is potential to be the spatial reference of the Earth datasets. Based on the implementation of ESSG, SDOG-ESSG, a data sharing system named global data spatially interrelate system (GASE) was design to make the data sharing spatial-seamless. The architecture of GASE was introduced. The implementation of the two key components, V-Pools, and interrelating engine, and the prototype is presented. Any dataset is firstly resampled into SDOG-ESSG, and is divided into small blocks, and then are mapped into hierarchical system of the distributed file system in V-Pools, which together makes the data serving at a uniform spatial reference and at a high efficiency. Besides, the datasets from different data centres are interrelated by the interrelating engine at the uniform spatial reference of SDOG- ESSG, which enables the system to sharing the open datasets in the internet spatial-seamless. 1. INTRODUCTION Most of scientific researches rely on scientific data, especially, the Earth System Science (ESS). As the development of Earth observation technology, more and more scientific data are obtained for scientific research. However, without a data sharing system, these data are nothing. As Carlson [1] said, a perfect data-sharing system is science's unobtainium. A variety of data centres and data sharing platforms, have been built in the past decades, e.g., NASA Earth Observing System Data and Information System (EOSDIS) [2], USGS Earth Resources Observation and Science Centre (EROS) [3], NOAA The Comprehensive Large Array-data Stewardship System (CLASS) [4], Infrastructure for Spatial Information in Europe (INSPIRE) [5], World Data System (WDS) [6], Global Earth Observation System of Systems (GEOSS) [7], Google earth, and Tianditu [8]. NASA EOSDIS, USGS EROS, and NOAA CLASS are the three most frequently used data resources for earth scientists around the world perhaps. All of them are very big data centres that distribute a vast of earth observation data to the public. WDS is a new version of world data centre under The International Council for Science (ICSU). It collects datasets from its member organizations based on the so called data policy [9], and offers a uniform portal to access the data from different members through the internationally recognized standards and the common infrastructure built by the members. INSPIRE is a project founded by European Union (EU) in 2007 to build an infrastructure to share the environmental spatial information across Europe based on OGC standards. Google earth is software platform developed by Google TM Corporation to sharing global image online. Tianditu is an online geographic information sharing platform developed by National Administration of Surveying, Mapping and Geo-information of China. As seen, the concerning of the data centres and data sharing platforms are the data sharing policies, common infrastructure, uniform portal, and online data services. Few of them pay much attention to a uniform spatial reference. As known, there are hundreds of spatial references, e.g., UTM, Gauss-Kruger, and WGS-84, in use simultaneously for different datasets. Scientists have to download a set of datasets from data centres, and then make many spatial transformations on the datasets for further data analysis manually in order to make a research. However, the spatial transformation is tedious, boring, and time consuming. It will sometime be a problem for non-specialist in GIS, which may discourage the users from the usage of the data, and hence reduce the scientific innovation. It will obviously benefit the scientists a lot if a spatial-seamless data sharing system with no spatial transformation is needed. Earth System Spatial Grid (ESSG) proposed by Wu and Yu [10] has recently been established as the framework of Earth datasets by GEO [11], who is a governmental and international level organization intending to build a Global Earth Observation System of Systems (GEOSS) to share and interrelate the worldwide datasets freely and globally. Thus, ESSG could be the uniform spatial reference for data sharing in GEOSS. Based on ESSG, Wu [12] proposed a new mode of data sharing, named Global Data Spatial Interrelate (GASE), for global scientific data sharing. The GASE is a typical spatial-seamless data sharing system. This paper will make an detail introduction to GASE, including the architecture and implementation of the GASE. doi: /isprsarchives-xl
2 2. ARCHITECTURE OF GLOBAL DATA SPATIALLY INTERRELATE SYSTEM 2.1 EARTH SYSTEM SPATIAL GRID FOR SCIENTIFIC DATA REPRESENTATION Earth System Spatial Grid (ESSG) is an exhaustive coverage, hierarchical structure, approximate sized, definite frame, unique coding and multiple granularities three-dimensional (3D) global grid system intended for Earth System Science research. Yu [13, 14] had given out an implementation of ESSG, named SDOG-ESSG, based on Spheroid degenerated-octree grid with refinement on radius. It is achieved by a spheroid degenerated-octree subdivision on a large reference spheroid whose radius is the twice of the average Earth radius, and then by a radial bisection on the produced grids. A coupled degenerated-z curve (CDZ-curve) was used to fill the SDOG-ESSG to get a unique code for each grid [13, 14]. The large reference spheroid, the features of exhaustive coverage, hierarchical structure, approximate sized, multiple granularities, and the unique code make the SDOG-ESSG being an effective spatial reference for Earth datasets and hence being a suggested reference for scientific data exchange by GEO. Fig. 1 shows an example of SDOG-ESSG and its CDZ-curve. (a) reference sphere (b) SDOG Fig. 2 Data representation in SDOG-ESSG 2.2 GLOBAL DATA SPATIALLY INTERRELATE SYSTEM BASED ON SDOG-ESSG Based on SDOG-ESSG, a Global Data Spatially Interrelate System (GASE) for global datasets spatial-seamless sharing is proposed by Wu [12]. GASE mainly consists of V-Pools, ESSG engine, interrelating engine, and tools engine (Fig. 3). V-Pools is a large distributive database system that keeps a vast of datasets for GASE. All datasets are transformed into SDOG-ESSG triples before being flushed into the V-Pools. The interrelating engine interrelates the various datasets from the data centers seamless in a unified spatial reference. It provides remote scientific data finding, and spatial transformation services to the ESSG engine if the dataset required is not found in the V-Pools. ESSG engine gets data either from interrelating engine or from V-Pools depending on whether the data is in the V-Pools, and then provides data services to the data users or tools engine. The tools engine provides online analysis service to the data users, such as online visualization, if further analysis is required. Among these components, V-Pools is the key component to manage the dataset in a uniform spatial reference and to decide the performance of data service, while interrelating engine is key component that makes the data service spatial-seamless and provide a variety of datasets to the users. Both of them are vital to the GASE. We will then discuss them in detail in the following. (c) SDOG-ESSG (d) CDZ-curve Fig.1 SDOG-based Earth System Spatial Grid (SDOG-ESSG) and its CDZ-curve. The grid element in SDOG-ESSG is not only a 3D voxel in the space of planet Earth, but also a 3D container for the spatial data [12]. Any dataset in the space can be considered as information that is attached to the grid. From the method of SDOG-ESSG encoding, the information of location, shape and granularity for a grid is embodied in the code. In other words, the grid code contains all the spatial information needed. Thus, a triple (C, T, A) model [15] is enough to represent the temporal-spatial objects (phenomenon) based on SDOG-ESSG (Fig. 2). In the triple model, C refers to the grid code of SDOG-ESSG, T refers to the time stamp for the temporal datasets, and A is the attribute values of the temporal-spatial objects (phenomenon). Referring to the resolutions of datasets and application demands, any dataset can be resampled or re-gridded into SDOG-ESSG at a certain granularity level, so as to get the value of spatial C and attribute A for the elements of triple model [12]. Fig.3 Architecture of GASE. GASE is mainly composed of V-Pools, ESSG engine, interrelating engine and tools engine. V-Pools is a data archiving system that manages a vast of datasets. ESSG engine is data providing system to the tools engine and the data users. Tools engine is a collection of tool sets that provide further process service on the retrieval data to the users. Interrelating engine is a data finding, updating and automatically spatial transforming system that gives supplements to the V-Pools. doi: /isprsarchives-xl
3 3. MANAGEMENT OF BIG DATA IN THE V-POOLS Any dataset is resampled into SDOG-ESSG before being flushed into the V-Pools. The methods of resampling dataset into SDOG-ESSG is presented in [14, 15]. The biggest problem for data management in V-Pools is the data volume. The data volume in the V-pools will be very huge and hence be far beyond the ability of a single data server. Advance storage technologies, such as distributive storage and cloudbased storage, is required to manage the big data. Fortunately, we can take full advantages of SDOG-ESSG to build a scalable, distributive or even cloud-based big data managing system. The main idea of managing the big data in V-Pools based SDOG-ESSG is to build a hierarchical system for the big data and then to divide each dataset into blocks so as to take full ability of the IOs. 3.1 HIERARCHICAL SYSTEM AND DATA PARTITION IN V-POOLS As known from the method of grid subdivision, SDOG-ESSG is a hierarchical grid system with each grid encoded uniquely, which makes itself extremely suitable for big data management in distributive environment. In SDOG-ESSG, grids of low level can be grouped into a grid of high level, and grids of high level can be grouped into a grid of higher level, and so forth. Thus, a hierarchical tree can be built according to the grouping strategy. In the hierarchical tree, grids of different level are located on the nodes of different depth. Each grid owns one single parent and a number of children, and can be labeled by a unique code that indicates the extent, granularity and location of the grid. The extent of the grid could then be used for the indexing of a block of voxel data, such as triples. As known, any dataset has a limitation on the resolution. So, according to the resolution we can attach any dataset to the corresponding depth of the hierarchical tree in which the granularity of the grid is no smaller than the resolution of the dataset. For example, datasets with resolution at 1Km 1Km can be attached to the node grid at level 6 whose granularity is about 50Km 50Km. Fig. 4 gives an illustration on the hierarchical tree for big data management. The whole tree is composed of a vast of nodes of different depths. The root of the hierarchical tree is set to be the globe. The grid of SDOG-ESSG at level k d is mapped into the node at depth k, where d is a number no smaller than 1 and refers to the difference of grid level between nodes of two adjacent depth. Each node grid is given a code indicated by C n, where n is the level of the node grid. Dataset is firstly resampled or re-grid into SDOG-ESSG at the level corresponding to its resolution, and then is divided into small blocks according to the extent of the node grid that it is going to be attached to. The level of the node grid (n d ) can be jointly decided by the resampling level (n re ), the minimum difference of level (D min ) and maximum difference of level (D max ) between the resampled datasets and the node grid. The relation is shown in the following, nre Dmin nd nre D max, Formula 1 where, D D d - 1. max - min The way of attaching the block to the node-grid is to take the code of node grid as the main part of the block code. The code of block consists of theme code, node grid code, and time stamp. Theme code (A) refers to the identification of the dataset. The theme code should be unique in the GASE system, e.g., the attribute classification code based on the industry standards. Time stamp (T) is a string that records the date or the time when the dataset is produced or observed in the forms of 'yyyymmddhh', such as ' ' and ' '. The three components are joined together by a separator character like '_' in sequence. For example, ' Temp01_ _ ' refers to the dataset of 'Temp01' produced on January 1, 2000 with its spatial extent within the boundary of grid code ' '. SDOG A i _C nd _T 1 A i _C nd _T 2 A 2 _C nd _T 1 A 2 _C nd _T 2 C d C d A 1 _C nd _T 1 A 1 _C nd _T 2 T 1 T 2 A i _C nd _T j C kd C kd C nd T j A 1 _C nd _T 1 A 2 _C nd _T j Fig.4 Hierarchical system for big data management based on SDOG-ESSG. The globe is divided into grids of different level by the method of degenerated octree subdivision. Grids of the same level consist of the nodes of the same depth of the hierarchical tree. On the other hand, datasets are divided into small blocks by the method of SDOG-ESSG subdivision. Each block is attached to the corresponding node grid, and jointly coded by dataset identification (A), node gird code (C) and time stamp (T) separated by a special character like '_'. The hierarchical system is then mapped into the file system or the distributed file system. doi: /isprsarchives-xl
4 3.2 DATA STORAGE IN THE V-POOLS The hierarchical tree is a logical way of managing the big data based on SDOG-ESSG, and has to be mapped into the database. Here, a mixed way of relation database and file system is used to store the big data in V-Pools. The relation database is used to store the metadata, e.g., the difference of grid level (d) between nodes of two adjacent depth, the minimum difference of level (D min ) and maximum difference of level (D max ) between the resampled datasets and the node grid, resampling level of each dataset, boundary box of each dataset, and any other useful metadata of each dataset. The file system is used to store the block data and its hierarchical index. Each block data is written into a file that records the value of each grid in sequence. The key problem of the storage is to build a hierarchical index for the block data. However, the hierarchical index, which acts as a hierarchical folder system, can be made by mapping the hierarchical tree into the file system. In the hierarchical folder system, each folder refers to a node of the hierarchical tree, and is named by the grid code. The block data attached to node grid is put into the folder referring to the node. For example, the folder ' ' is attached to the node grid ' ', and may contain both sub-folder ' ' and data block ' ', where ' ' is the father of the father grid of ' '. Thus, the way to find the data block is to construct a full path from the root of the hierarchical tree to the attached node grid, e. g., '\10\1026\102652\ \*_ _*.tif' for the data block ' '. This hierarchical folder system can be easily extended to cluster. As known, the efficiency of data accessing is a key measurement to evaluate the performance of a big data management system. However, the bottleneck always lies on the data accessing on IOs. An efficient way is to make the data accessing taken from different disks at the same time. Considering that the data accessed at a time is always spatially adjacent, the data nearby needs to be flushed into different disks so as to explore the full ability of IO. As argued above, any dataset is divided into small blocks. If these blocks are distributed into different disks, the ability of disks can be explored fully. According to coding system of SDOG-ESSG, the code of the d th level child of a grid owns a suffix of d-digit ranging from 0 to 8 d -1. A good balance on the data distribution will be achieved, if the blocks are assigned into numbered disks according to the last d digitals of the code. For example, data block with a code ended by a suffix of '15' is attached to the disk '15'. Thus, most of the disks will work at the same time, which will be no doubt to enhance the ability of data accessing, if a large extent of data is requested. Even though, there is a limitation on the capacity of a single cluster. A much larger distribution system which is open and scaleable is required for V-Pools. Fig. 5 give an illustration on the architecture of V-Pools. It is composed of various data houses that are interconnected to by Internet. Each data house may either be a cluster, or a voluntary storage node contributed by any institute. It is responsible for several datasets independently. If it is a cluster, the datasets will be distributed uniformly on the storage nodes following the strategy of blocks partitioning and the assignment of round robin argued above; otherwise, the whole dataset will be flushed into a single node after blocks partitioning. Therefore, the full path for the data block ' ' may be 'A2\Node22\10\1026\102652\ \*_ _*.tif', where the paths of data house and storage nodes are included. V-Pools Data house A 1 A 2 A i Nodes YY YY A 1 _XX02_T j A 2 _XX02_T j A 2 _XXYY_T j A 1 _XX01_T j A 1 _XXYY_T j A 2 _XX02_T j Blocks A 1 _XXX02_T j A 2 _XXX02_T j A 2 _XXXYY_T j A 1 _XXX01_T j A 1 _XXXYY_T j A 2 _XXX02_T j A i _XXYY_T j A i _XXYY_T j Fig. 5 V-Pools is composed of many data house, each of which may either be a cluster or a single node on the Internet, and is responsible for several dataset independently. The datasets are divided into blocks which are mapped into the storage nodes according to round robin strategy. 3.3 DATA RETRIEVAL IN THE V-POOLS The only thing that the users need to do to get the desirable data from this system is to submit a request that contains the information of dataset descriptions, time range, resolution, and spatial extent to the system. The system will handle it automatically, and then return the exact datasets or the production in a unified spatial reference. No more attention is need to be paid on the spatial transformation, which will save the data users a lot of time and frees the data users from boring work in spatial transformation. Dataset descriptions are used to determinate the identification of the dataset that the user requests. The work is usually done by ESSG engine. Time range contains the information of time span that the user concerns. It is transformed into two time stamps in the ESSG engine in advance. Resolution is the key information to decide the resampling level of the dataset requested, and the level of node grid that the dataset attached doi: /isprsarchives-xl
5 to. Spatial extent is divided into small grids at the node level. Once the node grids and their codes are obtained, a full path that indicates the location of the block data can be generated for further retrieving by the identification of the dataset, the node grid codes and their father grids' codes. The work of searching the exact data file in the full path can be done by the searching expressions which are generated in advance in the memory to avoid extra searching in the file system. The expression is a mathematical expression that defines the range of block code using the node-grid codes, theme code and two time stamp, such as '> Temp01_ _ and < Temp01_ _ ', and '> Temp01_ _ and < Temp01_ _ '. The final result can be gotten by the execution of the searching expressions in the full path. 4. INTERRELATING ENGINE Any data center has a limitation on its data resource. There is is a huge data resource in the internet where everybody can contribute their datasets. If a data sharing platform can interrelate their system to the open data resource in the internet, it will no doubt be more attractive. So far, a lot of data sharing system has been developed to try to provide data service upon multiply data centres organizations in the internet, such GEOSS, INSPIRE, and WDS. However, what they do is to interrelate datasets from different data centres through a uniform data sharing policy, a well-recognized international standard, a common infrastructure and a uniform portal. Little attention has been paid to interrelate the datasets in a uniform spatial reference. Referring to above problem, a component of spatial data interrelating, named interrelating engine, is designed to interrelate the datasets from different data centres in a uniform spatial reference in GASE. It is composed of three components (Fig. 6), including the components of updating, searching and conversion. Updating component is responsible for updating the V-Pools data from remote data centres in case of any dataset is updated. It is a little similar to software updating, who queries the server periodically and then updates itself when there is a new version available. The difference lies that the focus of the software updates is software itself while that of the updating component in interrelating engine is open datasets. The updating component queries the data centre periodically to test whether there is a new version of the dataset. If a new version is found, the component will download the data automatically, and then summit the new version dataset to conversion component. The conversion component will resample the datasets of various spatial references into a uniform spatial reference, which is SDOG-ESSG, and then divide them into blocks which are to be stored in V-Pools. Searching component is a component to discover and download new dataset from the internet through standard protocols, such as OGC catalog service-web (CSW) and web coverage service (WCS). The dataset downloaded is also submitted to conversion components, and then return the resampled data blocks to ESSG engine and V-Pools at the same time. 5. A PROTOTYPE FOR GASE A prototype for GASE has been developed with C++ and Java language, Qt, GDAL open source library. It was built upon a six-node cluster (Fig.7), where four of them were used for storage nodes (V-Pools), one for ESSG server and the rest one for interrelating engine. The six nodes were all connected by a router which was connect to the Internet. We took the datasets of global 30m DEM as experiment data to implement the V-Pools and ESSG server. D min, D max and d were set to 5, 7 and 3, respectively. The volume for global 30 DEM is about 500TB. It was resampled into SDOG-ESSG at the level of 19, and divided into blocks at the level of 12. Block with node codes ended by '0' and '4' were stored in node '0' and '4', and that ended by '1' and '5' were stored in node 1, and so on. Our test showed that the retrieval of data is convenient and high efficient. The retrieval of Xuzhou region DEM is near real time. Fig. 8 shows the interface of GASE and the return DEM of Xuzhou region. Interrelating engine Updating V-Pools Conversion Remote data centers Searching ESSG engine Fig.6 Architecture of interrelating engine. It is mainly composed of updating, searching and conversion components. NASA developed a language of IIMSAQL to help the users to obtain data from its EOSDIS system. Thus, based on IIMSAQL we implemented an interrelating engine, by which datasets from NASA EOSDIS can be obtained and automatically updated in the form of the uniform spatial reference of SDOG-ESSG. Fig. 9 shows the interface of interrelating engine which updates the datasets 'ADVANCED MICROWAVE SOUNDING UN17-A (AMSU-A) SWATH FROM NOAA-17' into V-Pools every day automatically in the spatial reference of SDOG-ESSG. Interrelating engine ESSG server V-Pools Fig.7 The environments of the GASE prototype. 6. SUMMARIES AND REMARKS Internet Client Data sharing is vital to earth science research. However, a good data sharing system with spatial-seamless services is also very important to scientists, which will prevent the scientists from tedious, boring, and time consuming work of spatial transformation, and hence encourage the usage of the scientific data, and increase the scientific innovation. Having doi: /isprsarchives-xl
6 been adopted as the framework of Earth datasets by GEO, which is a governmental and international level organization intending to share the worldwide datasets freely and globally, ESSG is potential to be the spatial reference of the Earth datasets. Based on the implementation of ESSG, named SDOG-ESSG, a data sharing system named global data spatially interrelate system (GASE) was design to make the data sharing spatial-seamless. Fig.8 Retrieving DEM in Xuzhou region from GASE. Fig.9 Updating of the dataset 'ADVANCED MICROWAVE SOUNDING UN17-A (AMSU-A) SWATH FROM NOAA- 17' into V-Pools automatically in the spatial reference of SDOG-ESSG. The system was composed of V-Pools, ESSG server, interrelating engine and tools engine, among which V-Pools and interrelating engine are the most important two in the system. The V-Pools is responsible for the big data management at the uniform spatial reference of SDOG-ESSG, and offers a high performance accessing to the datasets. The dataset is firstly resampled into SDOG-ESSG, and is divided into small blocks, and then are mapped into hierarchical system of the distributed file system, which together makes the data serving at a uniform spatial reference and at a high efficiency. The interrelating engine interrelated the datasets from various data centres at the uniform spatial reference of SDOG-ESSG, and provides data updating and downloading from the data centres automatically. The component enable the system to sharing the open datasets in the internet spatialseamless. ACKNOWLEDGEMENT This study is funded by National Natural Science Foundation of China (No ; ), and the Fundamental Research Funds for the Central Universities (No. 2013QNB10). REFERENCE 1. Carlson D., A lesson in sharing [J]. Nature, 2011, 469: NASA EOSDIS [EB/OL], [ USGS EROS [EB/OL], [ NOAA CLASS [EB/OL], [ INSPIRE [EB/OL], [ WDS [EB/OL], [ GEOSS [EB/OL], [ Tianditu [EB/OL], [ ICSU, WDS Data Policy [EB/OL]. [ ] 10. Wu Lixin, Yu Jieqing. Earth System Spatial Grid and Its Application Mode [J]. Geography and Geo-information Science, 2012, 28(01): GEO, GEO WORK PLAN [EB/OL], [ Lixin WU, Jieqing Yu, Yizhou Yang, et al. Spatial Big Data Organization, Access and Visualization with ESSG [C]. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences. 2013, XL-4/W2: Yu Jieqing, Wu Lixin. Adaptable Spheroid Degenerated- Octree Grid and Its Coding Method [J]. Geography and Geo-Information Science, 2012, 28(1): Yu Jieqing. SDOG-based Earth System Spatial Grid and Its Application and Three Dimensional Modeling [D]. Beijing Normal University, Yu Jieqing, Wu Lixin, Zi Guojie, et al. SDOG-based multi-scale 3D modeling and visualization on global lithosphere[j]. Sci China Earth Sci, 2012, 55(6): doi: /isprsarchives-xl
RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE
RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE WANG Jizhou, LI Chengming Institute of GIS, Chinese Academy of Surveying and Mapping No.16, Road Beitaiping, District Haidian, Beijing, P.R.China,
The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets
The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets!! Large data collections appear in many scientific domains like climate studies.!! Users and
IC05 Introduction on Networks &Visualization Nov. 2009. <[email protected]>
IC05 Introduction on Networks &Visualization Nov. 2009 Overview 1. Networks Introduction Networks across disciplines Properties Models 2. Visualization InfoVis Data exploration
An Esri White Paper June 2011 ArcGIS for INSPIRE
An Esri White Paper June 2011 ArcGIS for INSPIRE Esri, 380 New York St., Redlands, CA 92373-8100 USA TEL 909-793-2853 FAX 909-793-5953 E-MAIL [email protected] WEB esri.com Copyright 2011 Esri All rights reserved.
Cloud-based Geospatial Data services and analysis
Cloud-based Geospatial Data services and analysis Xuezhi Wang Scientific Data Center Computer Network Information Center Chinese Academy of Sciences 2014-08-25 Outlines 1 Introduction of Geospatial Data
Visualization Method of Trajectory Data Based on GML, KML
Visualization Method of Trajectory Data Based on GML, KML Junhuai Li, Jinqin Wang, Lei Yu, Rui Qi, and Jing Zhang School of Computer Science & Engineering, Xi'an University of Technology, Xi'an 710048,
Clustering through Decision Tree Construction in Geology
Nonlinear Analysis: Modelling and Control, 2001, v. 6, No. 2, 29-41 Clustering through Decision Tree Construction in Geology Received: 22.10.2001 Accepted: 31.10.2001 A. Juozapavičius, V. Rapševičius Faculty
Make search become the internal function of Internet
Make search become the internal function of Internet Wang Liang 1, Guo Yi-Ping 2, Fang Ming 3 1, 3 (Department of Control Science and Control Engineer, Huazhong University of Science and Technology, WuHan,
13.2 THE INTEGRATED DATA VIEWER A WEB-ENABLED APPLICATION FOR SCIENTIFIC ANALYSIS AND VISUALIZATION
13.2 THE INTEGRATED DATA VIEWER A WEB-ENABLED APPLICATION FOR SCIENTIFIC ANALYSIS AND VISUALIZATION Don Murray*, Jeff McWhirter, Stuart Wier, Steve Emmerson Unidata Program Center, Boulder, Colorado 1.
PART 1. Representations of atmospheric phenomena
PART 1 Representations of atmospheric phenomena Atmospheric data meet all of the criteria for big data : they are large (high volume), generated or captured frequently (high velocity), and represent a
The ORIENTGATE data platform
Seminar on Proposed and Revised set of indicators June 4-5, 2014 - Belgrade (Serbia) The ORIENTGATE data platform WP2, Action 2.4 Alessandra Nuzzo, Sandro Fiore, Giovanni Aloisio Scientific Computing and
Improving Data Mining of Multi-dimension Objects Using a Hybrid Database and Visualization System
Improving Data Mining of Multi-dimension Objects Using a Hybrid Database and Visualization System Yan Xia, Anthony Tung Shuen Ho School of Electrical and Electronic Engineering Nanyang Technological University,
PERSONALIZED WEB MAP CUSTOMIZED SERVICE
CO-436 PERSONALIZED WEB MAP CUSTOMIZED SERVICE CHEN Y.(1), WU Z.(1), YE H.(2) (1) Zhengzhou Institute of Surveying and Mapping, ZHENGZHOU, CHINA ; (2) North China Institute of Water Conservancy and Hydroelectric
GeoCloud Project Report GEOSS Clearinghouse
GeoCloud Project Report GEOSS Clearinghouse Qunying Huang, Doug Nebert, Chaowei Yang, Kai Liu 2011.12.06 Description of Application GEOSS clearinghouse is a FGDC, GEO, and NASA project that connects directly
Draft Martin Doerr ICS-FORTH, Heraklion, Crete Oct 4, 2001
A comparison of the OpenGIS TM Abstract Specification with the CIDOC CRM 3.2 Draft Martin Doerr ICS-FORTH, Heraklion, Crete Oct 4, 2001 1 Introduction This Mapping has the purpose to identify, if the OpenGIS
The USGS Landsat Big Data Challenge
The USGS Landsat Big Data Challenge Brian Sauer Engineering and Development USGS EROS [email protected] U.S. Department of the Interior U.S. Geological Survey USGS EROS and Landsat 2 Data Utility and Exploitation
A FRAMEWORK OF WEB-BASED SERVICE SYSTEM FOR SATELLITE IMAGES AUTOMATIC ORTHORECTIFICATION
A FRAMEWORK OF WEB-BASED SERVICE SYSTEM FOR SATELLITE IMAGES AUTOMATIC ORTHORECTIFICATION Jiaojiao Tian, Xinming Tang, Huabin Wang Key Laboratory of Geo-informatics of State Bureau of Surveying and Mapping,
D 2.2.3 EUOSME: European Open Source Metadata Editor (revised 2010-12-20)
Project start date: 01 May 2009 Acronym: EuroGEOSS Project title: EuroGEOSS, a European Theme: FP7-ENV-2008-1: Environment (including climate change) Theme title: ENV.2008.4.1.1.1: European Environment
Cloud-based Infrastructures. Serving INSPIRE needs
Cloud-based Infrastructures Serving INSPIRE needs INSPIRE Conference 2014 Workshop Sessions Benoit BAURENS, AKKA Technologies (F) Claudio LUCCHESE, CNR (I) June 16th, 2014 This content by the InGeoCloudS
http://www.paper.edu.cn
5 10 15 20 25 30 35 A platform for massive railway information data storage # SHAN Xu 1, WANG Genying 1, LIU Lin 2** (1. Key Laboratory of Communication and Information Systems, Beijing Municipal Commission
ArcGIS Framework Plug-In: Extending the ArcGIS Desktop for ANSI Standard Framework Data to Support Government Decision Making
ArcGIS Framework Plug-In: Extending the ArcGIS Desktop for ANSI Standard Framework Data to Support Government Decision Making Final Project Report October 28, 2008 Agreement Number Organization Project
Web Map Service Architecture for Topographic Data in Finland
Web Map Service Architecture for Topographic Data in Finland Teemu Sipilä National Land Survey of Finland Abstract. Since 2012 National Land Survey of Finland has been renewing its web map services and
Data Grids. Lidan Wang April 5, 2007
Data Grids Lidan Wang April 5, 2007 Outline Data-intensive applications Challenges in data access, integration and management in Grid setting Grid services for these data-intensive application Architectural
The THREDDS Data Repository: for Long Term Data Storage and Access
8B.7 The THREDDS Data Repository: for Long Term Data Storage and Access Anne Wilson, Thomas Baltzer, John Caron Unidata Program Center, UCAR, Boulder, CO 1 INTRODUCTION In order to better manage ever increasing
Facts about Visualization Pipelines, applicable to VisIt and ParaView
Facts about Visualization Pipelines, applicable to VisIt and ParaView March 2013 Jean M. Favre, CSCS Agenda Visualization pipelines Motivation by examples VTK Data Streaming Visualization Pipelines: Introduction
Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan
Metadata for Data Discovery: The NERC Data Catalogue Service Steve Donegan Introduction NERC, Science and Data Centres NERC Discovery Metadata The Data Catalogue Service NERC Data Services Case study:
EUMETSAT EO Portal. End User Image Access using OGC WMS/WCS services. EUM/OPS/VWG/10/0095 Issue <1> <14/01/2010> Slide: 1
EUMETSAT EO Portal End User Image Access using OGC WMS/WCS services Slide: 1 Overview Introduction: status of data access and visualization EUMETSAT datasets Architecture Web Map Service implementation
Oklahoma s Open Source Spatial Data Clearinghouse: OKMaps
Oklahoma s Open Source Spatial Data Clearinghouse: OKMaps Presented by: Mike Sharp State Geographic Information Coordinator Oklahoma Office of Geographic Information MAGIC 2014 Symposium April 28-May1,
UK Location Programme
Location Information Interoperability Board Data Publisher How To Guide Understand the background to establishing an INSPIRE View Service using GeoServer DOCUMENT CONTROL Change Summary Version Date Author/Editor
ON DEVELOPING SPATIAL DATA INFRASTRUCTURE OF CHINA
ON DEVELOPING SPATIAL DATA INFRASTRUCTURE OF CHINA Deren Li, Huayi Wu LIESMARS, Wuhan University Luoyu Road 129, Wuhan, P.R. China, 430079 Email: [email protected], [email protected] Commission
Applying GIS in seismic hazard assessment and data integration for disaster management
Applying GIS in seismic hazard assessment and data integration for disaster management Rumiana Vatseva, Dimcho Solakov, Emilia Tcherkezova, Stela Simeonova, Petya Trifonova National Institute of Geophysics,
HydroDesktop Overview
HydroDesktop Overview 1. Initial Objectives HydroDesktop (formerly referred to as HIS Desktop) is a new component of the HIS project intended to address the problem of how to obtain, organize and manage
Agile Retrieval of Big Data with. EarthServer. ECMWF Visualization Week, Reading, 2015-sep-29
Agile Retrieval of Big Data with EarthServer ECMWF Visualization Week, Reading, 2015-sep-29 Peter Baumann Jacobs University rasdaman GmbH [email protected] [co-funded by EU through EarthServer, PublicaMundi]
GeoNetwork, The Open Source Solution for the interoperable management of geospatial metadata
GeoNetwork, The Open Source Solution for the interoperable management of geospatial metadata Ing. Emanuele Tajariol, GeoSolutions Ing. Simone Giannecchini, GeoSolutions GeoSolutions GeoSolutions GeoNetwork
An architecture for open and scalable WebGIS
An architecture for open and scalable WebGIS Aleksandar Milosavljević, Leonid Stoimenov, Slobodanka Djordjević-Kajan CG&GIS Lab, Department of Computer Science Faculty of Electronic Engineering, University
Large Scale Spatial Data Management on Mobile Phone data set Using Exploratory Data Analysis
Large Scale Spatial Data Management on Mobile Phone data set Using Exploratory Data Analysis K.S.Pandiyan, Dr.S.Sivakumar Research scholar, Professor, Department of Geology, Anna University, Chennai Abstract
CityGML goes to Broadway
CityGML goes to Broadway Thomas H. Kolbe, Barbara Burger, Berit Cantzler Chair of Geoinformatics [email protected] September 11, 2015 Photogrammetric Week 2015, Stuttgart The New York City Open Data
TOWARDS AN AUTOMATED HEALING OF 3D URBAN MODELS
TOWARDS AN AUTOMATED HEALING OF 3D URBAN MODELS J. Bogdahn a, V. Coors b a University of Strathclyde, Dept. of Electronic and Electrical Engineering, 16 Richmond Street, Glasgow G1 1XQ UK - [email protected]
GIS Initiative: Developing an atmospheric data model for GIS. Olga Wilhelmi (ESIG), Jennifer Boehnert (RAP/ESIG) and Terri Betancourt (RAP)
GIS Initiative: Developing an atmospheric data model for GIS Olga Wilhelmi (ESIG), Jennifer Boehnert (RAP/ESIG) and Terri Betancourt (RAP) Unidata seminar August 30, 2004 Presentation Outline Overview
CONCEPTUAL MODEL OF MULTI-AGENT BUSINESS COLLABORATION BASED ON CLOUD WORKFLOW
CONCEPTUAL MODEL OF MULTI-AGENT BUSINESS COLLABORATION BASED ON CLOUD WORKFLOW 1 XINQIN GAO, 2 MINGSHUN YANG, 3 YONG LIU, 4 XIAOLI HOU School of Mechanical and Precision Instrument Engineering, Xi'an University
Remote Sensitive Image Stations and Grid Services
International Journal of Grid and Distributed Computing 23 Remote Sensing Images Data Integration Based on the Agent Service Binge Cui, Chuanmin Wang, Qiang Wang College of Information Science and Engineering,
Study on Architecture and Implementation of Port Logistics Information Service Platform Based on Cloud Computing 1
, pp. 331-342 http://dx.doi.org/10.14257/ijfgcn.2015.8.2.27 Study on Architecture and Implementation of Port Logistics Information Service Platform Based on Cloud Computing 1 Changming Li, Jie Shen and
EnterpriseLink Benefits
EnterpriseLink Benefits GGY AXIS 5001 Yonge Street Suite 1300 Toronto, ON M2N 6P6 Phone: 416-250-6777 Toll free: 1-877-GGY-AXIS Fax: 416-250-6776 Email: [email protected] Web: www.ggy.com Table of Contents
Sanjeev Kumar. contribute
RESEARCH ISSUES IN DATAA MINING Sanjeev Kumar I.A.S.R.I., Library Avenue, Pusa, New Delhi-110012 [email protected] 1. Introduction The field of data mining and knowledgee discovery is emerging as a
DESIGN PATTERNS OF WEB MAPS. Bin Li Department of Geography Central Michigan University Mount Pleasant, MI 48858 USA (517) 774-1165 bin.li@cmich.
DESIGN PATTERNS OF WEB MAPS Bin Li Department of Geography Central Michigan University Mount Pleasant, MI 48858 USA (517) 774-1165 [email protected] Abstract Web maps have reached the level of depth and
Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007
Data Management in an International Data Grid Project Timur Chabuk 04/09/2007 Intro LHC opened in 2005 several Petabytes of data per year data created at CERN distributed to Regional Centers all over the
DEVELOPMENT OF THE PLANETARY CARTOGRAPHY WEB-SITE WITH OPEN SOURCE CONTENT MANAGEMENT SYSTEM
CO-131 DEVELOPMENT OF THE PLANETARY CARTOGRAPHY WEB-SITE WITH OPEN SOURCE CONTENT MANAGEMENT SYSTEM ROZHNEV I. Moscow State University of Geodesy and Cartography, PUSHKINO, RUSSIAN FEDERATION Considerable
How To Use Hadoop For Gis
2013 Esri International User Conference July 8 12, 2013 San Diego, California Technical Workshop Big Data: Using ArcGIS with Apache Hadoop David Kaiser Erik Hoel Offering 1330 Esri UC2013. Technical Workshop.
Application of GIS Analyzes with Cloud Computing
ion of GIS Analyzes with Cloud Computing Sevket BEDIROGLU, Volkan YILDIRIM and Selcuk ERBAS (Turkey) Key words: GIS, Cloud Computing, Spatial Cloud Computing SUMMARY Usage of Cloud Computing (CC) has been
Concept of Cache in web proxies
Concept of Cache in web proxies Chan Kit Wai and Somasundaram Meiyappan 1. Introduction Caching is an effective performance enhancing technique that has been used in computer systems for decades. However,
The Status of Geospatial Information Management in China
The Status of Geospatial Information Management in China Submitted by the National Administration of Surveying, Mapping and Geoinformation of China 1. Administration System The National Administration
Scalable Internet Services and Load Balancing
Scalable Services and Load Balancing Kai Shen Services brings ubiquitous connection based applications/services accessible to online users through Applications can be designed and launched quickly and
Abstract. Introduction
Data Replication and Data Sharing Integrating Heterogeneous Spatial Databases Mark Stoakes and Katherine Irwin Professional Services, Safe Software Inc. Abstract Spatial data warehouses are becoming more
Web-based spatio-temporal visualization and analysis of the Siberian Earth System Science Cluster (SIB-ESS-C)
Web-based spatio-temporal visualization and analysis of the Siberian Earth System Science Cluster (SIB-ESS-C) Roman Gerlach Supervisor: Prof. C. Schmullius (Dept. of Geography, Friedrich-Schiller-University
In Memory Accelerator for MongoDB
In Memory Accelerator for MongoDB Yakov Zhdanov, Director R&D GridGain Systems GridGain: In Memory Computing Leader 5 years in production 100s of customers & users Starts every 10 secs worldwide Over 15,000,000
Hydrogeological Data Visualization
Conference of Junior Researchers in Civil Engineering 209 Hydrogeological Data Visualization Boglárka Sárközi BME Department of Photogrammetry and Geoinformatics, e-mail: [email protected] Abstract
Scala Storage Scale-Out Clustered Storage White Paper
White Paper Scala Storage Scale-Out Clustered Storage White Paper Chapter 1 Introduction... 3 Capacity - Explosive Growth of Unstructured Data... 3 Performance - Cluster Computing... 3 Chapter 2 Current
GIS Data Conversion. GIS maps are digital not analog. Getting the Map into the Computer
Getting the Map into the Computer Getting the Map into the Computer Getting Started with Geographic Information Systems Chapter 4 4.1 Analog-to-Digital Maps 4.2 Finding Existing Map Data 4.3 Digitizing
Chapter 7. Using Hadoop Cluster and MapReduce
Chapter 7 Using Hadoop Cluster and MapReduce Modeling and Prototyping of RMS for QoS Oriented Grid Page 152 7. Using Hadoop Cluster and MapReduce for Big Data Problems The size of the databases used in
INTEROPERABLE IMAGE DATA ACCESS THROUGH ARCGIS SERVER
INTEROPERABLE IMAGE DATA ACCESS THROUGH ARCGIS SERVER Qian Liu Environmental Systems Research Institute 380 New York Street Redlands, CA92373, U.S.A - [email protected] KEY WORDS: OGC, Standard, Interoperability,
Handling Heterogeneous EO Datasets via the Web Coverage Processing Service
Handling Heterogeneous EO Datasets via the Web Coverage Processing Service Piero Campalani a*, Simone Mantovani b, Alan Beccati a, a a a Jacobs University Bremen (DE) / b MEEO Srl (IT) * {[email protected]
Spatial Data Infrastructure. A Collaborative Network
Spatial Data Infrastructure A Collaborative Network Solving Challenges Takes Collaboration Social challenges, environmental issues, and economic downturns all take cooperation to solve. Working together
The Challenges of Integrating Structured and Unstructured Data
LANDMARK TECHNICAL PAPER 1 LANDMARK TECHNICAL PAPER The Challenges of Integrating Structured and Unstructured Data By Jeffrey W. Pferd, PhD, Sr. Vice President Strategic Consulting Practice at Petris Presented
GIS Databases With focused on ArcSDE
Linköpings universitet / IDA / Div. for human-centered systems GIS Databases With focused on ArcSDE Imad Abugessaisa [email protected] 20071004 1 GIS and SDBMS Geographical data is spatial data whose
EED Task Order. Contract: NNG10HP02C Contractor: Raytheon Task Type:
EED Task Order Title: Studies CMR Phase 0 No-Cost Extension Task Number: 9 Rev 15 Originator: Marinelli Effective Date: Dec 11, 2013 ESDIS POC: Marinelli Task Estimate Cost and Maximum Available Fee Estimate
NASA's Strategy and Activities in Server Side Analytics
NASA's Strategy and Activities in Server Side Analytics Tsengdar Lee, Ph.D. High-end Computing Program Manager NASA Headquarters Presented at the ESGF/UVCDAT Conference Lawrence Livermore National Laboratory
Open Data. Open Data. Share and map your data with ArcGIS. technology.
Open Data Share and map your data with ArcGIS Open Data technology. Publishing data as map services is one of the most valuable ways to communicate and collaborate. Maps enable an immediate understanding
1. Domain Name System
1.1 Domain Name System (DNS) 1. Domain Name System To identify an entity, the Internet uses the IP address, which uniquely identifies the connection of a host to the Internet. However, people prefer to
BIG DATA VISUALIZATION. Team Impossible Peter Vilim, Sruthi Mayuram Krithivasan, Matt Burrough, and Ismini Lourentzou
BIG DATA VISUALIZATION Team Impossible Peter Vilim, Sruthi Mayuram Krithivasan, Matt Burrough, and Ismini Lourentzou Let s begin with a story Let s explore Yahoo s data! Dora the Data Explorer has a new
Data Integration Strategies
Data Integration Strategies International Delta Roundtable November 2007 U.S. Department of the Interior U.S. Geological Survey USGS Data Integration Vision and Drivers USGS Science Strategy: Science in
Workload Characterization and Analysis of Storage and Bandwidth Needs of LEAD Workspace
Workload Characterization and Analysis of Storage and Bandwidth Needs of LEAD Workspace Beth Plale Indiana University [email protected] LEAD TR 001, V3.0 V3.0 dated January 24, 2007 V2.0 dated August
Distributed Database for Environmental Data Integration
Distributed Database for Environmental Data Integration A. Amato', V. Di Lecce2, and V. Piuri 3 II Engineering Faculty of Politecnico di Bari - Italy 2 DIASS, Politecnico di Bari, Italy 3Dept Information
