Meta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring



Similar documents
Metadata Technique with E-government for Malaysian Universities

A DATA WAREHOUSE SOLUTION FOR E-GOVERNMENT

E-government Supported by Data warehouse Techniques for Higher education: Case study Malaysian universities

IST722 Data Warehousing

Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University

Data Warehousing Systems: Foundations and Architectures

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

A Critical Review of Data Warehouse

A Design and implementation of a data warehouse for research administration universities

A Survey on Data Warehouse Architecture

Data Warehouse: Introduction

Data Warehousing and Data Mining in Business Applications

DATA WAREHOUSING AND OLAP TECHNOLOGY

A Knowledge Management Framework Using Business Intelligence Solutions

Hybrid Support Systems: a Business Intelligence Approach

Part 22. Data Warehousing

An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of

Enterprise Solutions. Data Warehouse & Business Intelligence Chapter-8

E-Governance in Higher Education: Concept and Role of Data Warehousing Techniques

Data Warehousing and OLAP Technology for Knowledge Discovery

14. Data Warehousing & Data Mining

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA

Turkish Journal of Engineering, Science and Technology

A Model-based Software Architecture for XML Data and Metadata Integration in Data Warehouse Systems

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP ( 28

DATA WAREHOUSING APPLICATIONS: AN ANALYTICAL TOOL FOR DECISION SUPPORT SYSTEM

CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University

Deriving Business Intelligence from Unstructured Data

SAS BI Course Content; Introduction to DWH / BI Concepts

The Integration of Agent Technology and Data Warehouse into Executive Banking Information System (EBIS) Architecture

Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006

Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach

Data Search. Searching and Finding information in Unstructured and Structured Data Sources

Fluency With Information Technology CSE100/IMT100

Advanced Data Management Technologies

Introduction to Data Warehousing. Ms Swapnil Shrivastava

Paper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram

THE QUALITY OF DATA AND METADATA IN A DATAWAREHOUSE

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

Student Performance Analytics using Data Warehouse in E-Governance System

Data Warehouse Overview. Srini Rengarajan

Overview. DW Source Integration, Tools, and Architecture. End User Applications (EUA) EUA Concepts. DW Front End Tools. Source Integration

LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES

Week 3 lecture slides

An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

2074 : Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000

Data Warehousing and Data Mining

An Overview of Database management System, Data warehousing and Data Mining

Data Warehousing: Data Models and OLAP operations. By Kishore Jaladi

SAS Business Intelligence Online Training

Data Warehousing. Jens Teubner, TU Dortmund Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1

An Approach for Facilating Knowledge Data Warehouse

The Role of Metadata for Effective Data Warehouse

Data Mart/Warehouse: Progress and Vision

Dimensional Modeling for Data Warehouse

OLAP & DATA MINING CS561-SPRING 2012 WPI, MOHAMED ELTABAKH

OLAP Theory-English version

Lection 3-4 WAREHOUSING

Business Intelligence in E-Learning

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

The Design and the Implementation of an HEALTH CARE STATISTICS DATA WAREHOUSE Dr. Sreèko Natek, assistant professor, Nova Vizija,

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

Migrating a Discoverer System to Oracle Business Intelligence Enterprise Edition

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Why Business Intelligence

Understanding Data Warehousing. [by Alex Kriegel]

Model-Driven Data Warehousing

META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE

Data Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina

Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers

Building Cubes and Analyzing Data using Oracle OLAP 11g

Data Warehousing. Overview, Terminology, and Research Issues. Joachim Hammer. Joachim Hammer

CHAPTER 4 Data Warehouse Architecture

Research on Airport Data Warehouse Architecture

Subject Description Form

INTERACTIVE DECISION SUPPORT SYSTEM BASED ON ANALYSIS AND SYNTHESIS OF DATA - DATA WAREHOUSE

Business Intelligence : a primer

Gradient An EII Solution From Infosys

The Impact Of Organization Changes On Business Intelligence Projects

Data Mining for Successful Healthcare Organizations

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

The Benefits of Data Modeling in Data Warehousing

Presented by: Jose Chinchilla, MCITP

Data W a Ware r house house and and OLAP II Week 6 1

COURSE OUTLINE. Track 1 Advanced Data Modeling, Analysis and Design

Transcription:

www.ijcsi.org 78 Meta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring Mohammed Mohammed 1 Mohammed Anad 2 Anwar Mzher 3 Ahmed Hasson 4 2 faculty of education of pure scienses Thi-Qar University 3 Foundation of Technical Education (FTE) Nasiriyah, Iraq Abstract E-Government Monitor (egovmon) is used to ensure the new online services for e-government (egov). To effectively address the real needs of the citizens, businesses and governmental agencies. A system to monitor this development can give a better understanding of how to build good online service for citizens and enterprises. Moreover egovmon is used many tools to help to enhance the quality of public web sites. Thus it needs better understand for e-government s data and information with quick respond for their queries. But still there are more issues such as distributed data, structure data and design data. However, Meta Data in data warehouse (DW) uses to explain the data from itself because of that it called data about data. This paper proposes data warehouse egovmon architecture to get better realize of data by using data warehouse techniques and business intelligence (BI) tools such as ETL, DBMS, OLAP, DM and OLAM. egovmon with BI tools make the information more understandable with good quality of data to improve the performance for e-government service system. Also metadata technique and data mart small storages technique are structured and distributed e-government information properly. Keywords: E-government, egovmon, Data warehouse, Data warehouse, ETL, DBMS, Metadata, Data mart and OLAP, DM and OLAM. Introduction E-government provides online service to citizen in anytime and anywhere through information communication technology (ICT) communication technique, information technology (IT) interface technique and internet website technique [1]. E-Gov has four such as Presence, interaction, transaction and transformation. E-Government goals to make the sharing among government and citizens (G2C), government and business enterprises (G2B), government and employees (G2E) and interagency relationships (G2G) more open, well-located, transparent, bi-directional, and less expensive [2]. E-Government Monitor provides e-government s online services with effectively address to real needs of the citizens, businesses and governmental agencies. Therefore monitor system can give a better understanding of how to build good online service for citizens and enterprises by using open source tools to help to improve the quality of public web sites [3]. Data warehouse is huge of repository uses to store all the clear and clean data and information from multiply and heterogeneous data source. According to Inmon, he is father of data warehouse. That he said data warehouse is a subject-oriented, non-volatile, integrated, timevariant repository which collect all data and information from different source and save it in common warehouse [4]. Data warehouse compare of e-government services quality and stored the results in four observatories: Accessibility, Transparency, Efficiency and Impact (ATEI). This way helps governments provide better services to citizens and improve interactions with business and industry with better government management. Business Intelligence (DWBI) is collection of

www.ijcsi.org 79 technologies, applications and practices that use to gather combine and analyze huge of data and information from data warehouse and get useful knowledge and information from it. BI has many tools and techniques for business which use improve their works such as: ETL: There are many companies develop ETL tools, such as Oracle and SAP. ETL is extract, transform and load that mean first the data extracts from multi databases, storage and documents then transfer this data into suitable form for new common warehouse by using special rules and tables. Finally the fit data will be load to the warehouse to be saved. ETL has not only these pervious benefits but it can take out clean and meaningful data from data sources then load it up to ROLAP or MOLAP in the data warehouse [5]. DBMS: It s one of the technologies that need to support present and future as well. Moreover, it is the main method for other techniques that uses for making decisions. DBMS in data warehouse it stores the data structurally inside warehouse by using many way of structure such as star schema structure with de-normalized dimension tables and snowflake schema structure with normalized dimension tables. These kinds of structure give the ability to make multidimensional view for information. The proper structure help to access to data easily and fast respond for queries [6]. Metadata: It s the way to structure the information to provide explanation and description to create more flexible use or manage for an information resource. Data about data is the second name for metadata because it gives illustrates the data from the data itself [7]. There are many types of metadata but the main three types. First descriptive metadata explains major resource such as title, author, abstract, keywords and references. Second structural metadata shows ability to use the aims with each other to, for example, how pages are used to write a book. Third administrative metadata manages a resource by using benefits of information, such as when and how file was made and which person allow to access this file [8]. Data Mart: according to Inmon data mart is subset of repository receives its data and information from common warehouse but it is dependent from it [9]. In other way data mart is small storage of data build to save the data and information for similar departments. The reason of data mart is to provide easier access of data and fast respond for queries. Moreover, data mart increases the departments performance [10]. OLAP, DM & OLAM: Online analytical process is one of DW tools use to analyze the information with multidimensional view in OLAP cube. Therefore it gives information three dimensions. [11].on the other hand data mining (DM) tool uses to mining the data by extract knowledge from amount of data [12]. Online Analytical mining is tool created by using OLAP and DM techniques for obtain on information which will be suitable for decision support system [13]. DSS: Decision support system uses to choose the better decision for company, which depends on its information, knowledge, reports and analytic data that comes from OLAP, DM and OLAM. This system helps decision maker to get better understand of available decisions [14]. Moreover DSS makes qualitative and quantitative for data by apply many of its models and techniques, therefore provides decision makers with useful information and solves unstructured and semistructured information [15]. Thus, all of these tools give the ability for egovmon to get high quality of data and information to provide better understand for e- government online services. Moreover it make suitable structure for information and right way of distribute data depend on government s departments. Relative Works Data Warehouse Solution for E-Government Monitor The E-government monitor task will enhance large-scale web to standard of egov services to the citizen. Therefore egov services of ATEI going to be evaluated in automat way, in which a large amount of evaluation data will be produced. The constructed data warehouse needs to be met the increasing data storage demand while providing a high performance for the analysis of ATEI. Thus, it proposed a distributed data warehouse architecture which is outlined in Figure 1.

www.ijcsi.org 80 Egov Mon DW is data warehouse located in the center, which has different storages of data, such as the repository for Northern Europe (NE), Central Europe (CE) and Southern Europe (SE) and so on, and each repository can be physically distributed on different servers and geographically in different places, but these separate data repositories function as one logical data warehouse by queries. For resources Right Time ETL (RiTE) collect and extract the data from several databases of different countries, such as France, Greece, Italy, Germany and so on. Nevertheless, there is one ETL for each country s database even if the country is big or small [14]. E-government metadata Governments start use e-government services to provide public with better services and to share public data. But this both succeed depend on data governance and data quality [15]. On one hand data governance depend on the trust and on the other hand metadata is often exploited measure data quality to allow combination of services that government provided online for citizen and to guarantee compliance to system. Metadata inside e-government repository has many different levels, such as term level uses to explain the meaning of a term, structure level uses to save groups of terms and reused it later and implementation level which has containing terms and structure implemented in e-services that works individually. The metadata model for e-governance depends on governmental function as an individual domain for example (name, person, and address). This domain has implemented within three structured layers (term-structure and implementation) which Figure 1: egovmon Data Warehouse Architecture usually going to be developed in bottom-up or top-down [16]. Metadata for information sharing in E-government Sharing government s data and information between agencies leads to enhance the e-services by getting knowledge from this information. Recently a lot of countries start to give more attention about information sharing technique and try to improve it in there egov system [17]. But still all of this cares there is a weakness in information sharing and interaction among e- government agencies in several types of G2G, G2E and G2C [18]. Even if there are high quality of information sharing there is difficult to understand what they shared from other departments. Thus metadata gives ability to get better understand from data or information itself by structure them in the right place then discover the hidden information inside the data [19]. Data Warehouse Tool for E- government Online Services Recently, analyzed and processed information use to achieve a high level of matching of user needs. Because it s difficult to the user to deal with huge of data and to get better knowledge and understand for it. Therefore it leads us to the biggest gap in traditional database technologies because database can t be suitable for user requirement from information. Thus, it s so important to create e-government online services supported by data warehouse techniques and tools, as shown in figure2.

www.ijcsi.org 81 Important part in data warehouse system of e- government is that front-end tool layer, because it provides e-government with great analysis and application tools. These tools are worth for e- government s information. Thus, data warehouse is not only huge storage of data. Inside the frontend tools many of reporting tools, analysis tools, mining tools, data warehouse application market development tools, as well as query tools. All of these tools run by the users to provide multidimensional data query to them also make analysis operations available in order to create decisions easily. Data warehouse techniques with e-government can achieve staff requirements fast with easy analysis for their complex of queries and show the understand result. Therefore, that will create strong decision making for e- government staff [20]. In aim to compare between data warehouse and database depend on analysis feature, it s obvious that data warehouse has good ability to deal with huge of integrated information, analysis and will be extracted inherent, in order to excavate from deep level information [21]. Also data warehouse has huge space of repository can use to develop information services for e-government. In the data warehouse e-government system, by the data warehouse and data mart further extracted provide users with data analysis and data mining functions, where the data warehouse has many subjects of detailed time data and summarized data, these data has collected from different government sources. In general data mart saves the summarized information and detailed Figure2. Architecture of e-government data warehouse information for one subject domain for each government department, where the data might be a subset of one data warehouse [22]. Metadata and data mart techniques for egovmon data warehouse system Metadata technique E-government monitor data and information extracted then loaded into common warehouse by RiTE for each database. The government data and information in data warehouse repository are clean and clear because of right time ETL. However, this clear data are structured inside warehouse database by DBMS. There are two ways to structure data inside warehouse tables, such as denormalized dimension tables by star schema or normalized dimension tables through snowflake schema. Data structure in tables give the ablity for easier access for data and fast respond for user queries. But the egovmon data will be store randomly in the data warehouse. Therefore, metadata uses to give perfect structure for e-government data and information by gives details about term of e-government monitor data and save this data as group of terms then containing and structure this information. Thus, Metadata techniques will increase data quality for egovmon data and it will give better understand and explanation for the data itself. Data Mart technique Data warehouse repository is very huge storage; it can save all current e-government monitor data and information with historical data in one common place. Thus, that will make some of

www.ijcsi.org 82 delay for answer staff queries from whole of this among of data, information, documents and reports. However, data mart technique uses subset of storage store the data depend on egovmon s departments. Moreover, data mart store summarized information and derails information for one subject in egov Mon. that make easy analyzing and manning for data also with high-speed of reply for inquiries and high performance by using data warehouse front-end tools, such as OLAP, DM & OLAM. This shows in Figure3. Figure3. Architecture of egovmon data warehouse Conclusion In recent times, data warehouse is replacing database in a lot of computer science technologies not only because of data warehouse tools and the huge storage, but also because of analysis, understandable, mining and fast access for government data. Therefore, our paper created architecture for e-government monitor supported with data warehouse tools and technologies. This paper is used metadata to illustrate data terms and structure data depend of term of groups to obtain more understandable for egovmon data, also to give high details about their data. As well as data mart is used as independent sub storage to keep egovmon department information in one small repository with high mining and analyzed. Therefore, data mart has stored summary information and details information which will be apply to swift access for egovmon data and information with rapidly respond for their staff inquiries. However, decision making one of the big gaps in e- government departments. Because it gives government s staff difficult to make a good decision. Additionally, till now there is no use for one of data warehouse tool to solve this issue. Thus we suggest as future studies, using decision support system like a data warehouse tool to fix this problem. Reference [1]A. Al-Hashmi, A. Darem,. Understanding Phases of E-government Project.2009. [2] e-government Primer. infodev World Bank, 2009. [3]eGovMon - egovernment Monitor. Available from http://www.egovmon.no/en as of 2012-08-7. [4] W. H. Inmon, Building the data warehouse: Wiley-India, 2001. [5]X. Liu and X. Luo. A Data Warehouse Solution for E-government. IJRRAS 4 (1). July 2010 [6]W., McKnight. Choosing a DBMS for Data Warehousing.white paper 2002. [7] Li, S., Yang, Z. & Liu, Q. Research of Metadata Based Digital Educational Resource Sharing. International Conference on Computer Science and Software Engineering 2008. [8]Understanding Metadata. Booklet published by National Information Standards Organization in 2004.

www.ijcsi.org 83 [9] W.H.. INMON, Building the Data Warehouse, 4 th edition, John Wiley & Sons, New York(2005). [10] M.Paulraj and P. Sivaprakasam. Functional Behavior Pattern for Data Mart Based on Attribute Relativity. IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 4, No 1, July 2012. [11] F. Christanto, W. Utomo and E. Sediyono. The Process of Data Tabulation Using Data Warehouse and OLAP Technology to Sales Analysis at Distributor Company. IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 4, No 1, July 2012. [12] A. Sair, B. Erraha, M. Elkyal and S. Loudcher. Prediction in OLAP Cube. IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 4, No 1, July 2012. [13] J. Han. OLAP Mining: An Integration of OLAP with Data Mining. In Proceedings of the 7th IFIP Conference on Data Semantics, Leysin, Switzerland, October 1997. [14]X. Liu and X. Luo. A Data Warehouse Solution for E-government. IJRRAS 4 (1). July 2010. [15] T. Margaritopoulus, M. Margaritopoulus, I. Mavridis and A. Manitsaris. A Conceptual Framework for Metadata Quality Assessment. Proceedings of the International Conference on Dublin Core and Metadata applications, 2008. [20]X. Liu, D. Li. Data Warehouse-Based Personalized Information Service Scheme in E- Government. 2009 Seventh ACIS International Conference on Software Engineering Research, Management and Applications. [21] Y. Cahen, W. Yin. Data Warehouse and Enterprise Information Service, Information Science, 2001, Vol.19 (1): 63. [22]X. Liu, D. Li. Data Warehouse-Based Personalized Information Service Scheme in E- Government. 2009 Seventh ACIS International Conference on Software Engineering Research, Management and Applications. [16] P. Myrseth, J. Stang and V. Dalberg. A data quality framework applied to e-government metadata. A prerequsite to establish governance of interoperable e-services. 2011 IEEE. [17] X. Lü, (2007). Distributed Secure Information Sharing Model for E-Government in China. [18] F. Jing, and Z. Pengzhu, (2009). A Field Study of G2G Information Sharing in Chinese Context Based on the Layered Behavioral Model. [19] D., Coleman, A., Hughes, W., Perry. The Role of Data Governance to Relieve Information Sharing Impairments in the Federal Government. 2009 World Congress on Computer Science and Information Engineering.