A Feasibility Study of an Approach to Extend Research Footprints

Size: px
Start display at page:

Download "A Feasibility Study of an Approach to Extend Research Footprints"

Transcription

1 The Workshops of the Thirtieth AAAI Conference on Artificial Intelligence Scholarly Big Data: AI Perspectives, Challenges, and Ideas: Technical Report WS A Feasibility Study of an Approach to Extend Research Footprints Francisco Osuna, Bhanukiran Gurijala, Patricia Esparza, Monika Akbar, and Ann Gates Department of Computer Science and Cyber-ShARE Center of Excellence University of Texas at El Paso, El Paso, Texas, USA fjosuna, bgurijala, pesparza3, makbar, agates@utep.edu Abstract Funding agencies and the National Academies of Science, Engineering, and Medicine have been promoting the importance of interdisciplinary research (IDR). Supporting team-based IDR requires the ability to discover the expertise needed to solve complex problems. Many universities have adopted expertise systems, which includes the presentation of keywords or concepts to identify experts. The efforts at University of Texas at El Paso (UTEP) have focused on building communities of practice that support diverse faculty who have an affinity for a particular topic and facilitate the ability to identify researchers with diverse expertise, knowledge, and skills who can contribute to new initiatives on campus. Our premise is that the university can facilitate the identification of potential contributors to communities of practice by correlating their associated ontologies to the concepts associated with researchers publications and proposal submissions. This paper presents the results of a preliminary study to examine the feasibility of the approach. Introduction There has been an increased emphasis on interdisciplinary research (IDR) and activities that support interactions needed to solve problems that cross disciplinary boundaries and advance education and research (Committee on Key Challenge Areas for Convergence and Health, Board on Life Sciences, Division on Earth and Life Studies, & National Research Council, 2014; Cooke & Hilton, 2015; Stokols, Hall, Taylor, & Moser, 2008)). Indeed, today s scientific and social challenges are complex and require engaging individuals who can contribute different perspectives, experiences, knowledge, and skills. One challenge in supporting IDR opportunities is the ability to identify researchers who have expertise, or even peripheral knowledge or interest, to contribute to an initiative, considering that discoveries often occur at the boundaries of different disciplines. There are a number of differ- Copyright 2015, Association for the Advancement of Artificial Intelligence ( All rights reserved. ent expertise systems, e.g., Influuent, Vivo, Team Science Toolkit. In this paper, we define an expertise system as a web-based system that publishes expertise and resources at an institution or across institutions through a distributed network. The purpose of the paper is to present results of a study that evaluates how key concepts extracted from publications and proposal submissions can be used to identify potential membership in communities of practice (CoPs), i.e., groups of people with a shared domain of interest. The effort supports UTEP s long-term goal of developing CoPs and associated ontologies to extend the researcher s footprint, i.e., concepts that define a researcher s interests based on his or her research activities. The paper presents a Background section that compares various expertise systems. The next section describes the methodology used to extract the concepts associated with publications and proposal submissions and assess their alignment with CoPs. The approach was applied to publications and proposals of over 90 researchers. The paper presents a case study that analyzes the results for three of those researchers. The researchers have unique experiences, e.g., one highly focused in a particular area of research and others who work across areas with involvement in one or more CoP(s). The paper ends with a summary. Background Efforts to connect people across disciplines and institutions have centered on the adoption of expertise systems. We briefly describe three exemplars: Vivo, which supports research and other creative collaborations within an institution and across institutions; Influuent, which supports collaborations with the University of Texas (UT) System institutions; and the Team Science Toolkit, which supports the practice and study of team science. Vivo (Krafft et al., 2010) provides a portal built on Semantic Web technologies to support the acquisition and management of data. Member institutions represent the structure of their data through their own ontology which 684

2 can then be mapped to the VIVO Web ontology. Influuent (The University of Texas System Office of Public Affairs, 2015), was developed in collaboration with Elsevier. Influuent populates the expertise of University of Texas (UT) System researchers through analysis of their publications in Scopus, an abstract and citation database. Influuent displays the researcher s fingerprint with the option to view collaborators and their associated departments and institutions. The Team Science Toolkit (Vogel et al., 2013) allows people to register their contact information, expertise statement, and keywords. The website provides resources to help users manage, support, and conduct team-based research. UTEP developed the Expertise Connector (EC) system (Expertise Connector, 2015) using Semantic Web technologies to support collaborations by helping users identify and search for experts that shape scholarly and educational research at the university. The portal also aims to align potential grants with communities. The source of expertise for faculty and professional staff profiles is Digital Measures (DM), an information system that allows faculty members to store information related to their professional activities and accomplishments for annual, tenure, and promotion evaluations (Digital Measures, 2015). Recognizing that expertise comes from multiple sources, UTEP extended DM to include links to personal websites, curriculum vita, associated centers, research stories, communities (described later), and social networks. Research stories are fed from the Communications Office on a daily basis. EC provides a portal to CoP (Wenger, 1998) that connect people who can learn from others with similar interests through joint activities (virtual or in-person) that target all or part of the membership. The portal supports the ability to share information by linking related efforts and resources. Each community has a set of keywords associated with it. We are in the process of relating an ontology with each CoP that captures concepts and relationships to which members can associate. Methodology This section describes the methodology used to conduct the feasibility study. It is organized around the following steps: data acquisition, footprint generation, community-ofpractice matching, and the case study. Data Acquisition The study utilized titles and abstracts from two data sources that document publications and proposal submissions of faculty members within the university. Researchers publication data were retrieved from DM. The source of proposal submissions was UTEP s ORSP. Because the intent of the effort is to identify potential interest in a CoP, this study does not make a distinction between types of publications, the status of the proposal, i.e., whether it is pending, funded, or not funded, or the level of contribution to such artifacts. Other data used in the study were the EC keywords describing research expertise identified by the researcher, as well as ontologies associated with the communities of practice. In the study, we targeted three CoPs: Smart Cities, Cyber Security, and Undergraduate Research. It is important to note that researchers who self-describe keywords may consider broader audiences of the expertise system, or those from their discipline. In the former case, keywords or concepts would more likely be general, while in the latter case, they would be more discipline specific. Footprint Generation Expertise systems typically use publications to define a footprint. This paper evaluates the use of proposal submissions to define the footprint with the aim of ultimately including CoP membership. The methodology extracted keywords from the titles and abstracts of researcher s publications and submitted proposals. Later, these keywords were used to identify prevailing concepts in the research activities. The following subsections detail each step. Keyword Extraction For each faculty member, two different footprints were created using the set of keywords extracted from the titles and abstracts of publication records and proposals as described above. The extraction was done with the Rapid Automatic Keyword Extraction (RAKE) algorithm (Berry & Kogan, 2010). RAKE is an unsupervised and domainindependent method for extracting keywords from individual document. RAKE was chosen for its simplicity and efficiency in automatically extracting keywords in a single pass providing advantages in high volume collections by freeing up computing resources for other analytical methods. Parsed data were cleaned using stop-lists of keywords that presented noise, e.g. professor, teacher, paper, or results. Only keywords of length three or greater were considered as valid keywords. This work also considered phrases containing at most six keywords. Concept Identification The set of keywords and phrases resulting from the extraction contained a large number of keywords at different levels of abstraction. To gain meaningful insight into the research areas of a faculty member, it was not sufficient to depend solely on the extracted keywords. Thus, the next step was to use the keywords to create concept-based footprints for both publications and proposal submissions. A concept is a high-level abstraction of related keywords that aid in the identification of notions not necessarily explicitly described in a document. There are a number of approaches available for deriving meaning from keywords 685

3 including clustering (Jain, Murty, & Flynn, 1999), topic modeling (Blei, 2012), semantic similarity (Jiang & Conrath, 1997), concept identification (Bower & Trabasso, 1964), and document summarization (Carbonell & Goldstein, 1998). This research uses the AlchemyAPI (AlchemyAPI, 2015) Web service to derive concepts from a set of given keywords. If the number of users is scaled up, the potential cost of AlchemyAPI, which is software-as-aservice, could limit the amount of information processed because of constraints on requests. For the purposes of this study, such limitations did not hinder the computation or analyses. The natural language processing REST Web service was used because it is specifically geared for semantic textual analysis. Given a set of keywords, it provides a set of concepts identified in the keywords along with their relevance score. The relevance score describes the importance of each concept in ranges from 0 to 1, where a higher relevance score indicates more significant concept. This step results in two sets of concepts from two types of activities (i.e., publications and proposals) for each faculty. Community-of-Practice Matching This step focused on evaluating how key concepts extracted from publications and proposal submissions could be used to identify potential membership in communities of practice. In the study, the concepts associated with a CoP s ontology were used to identify the number of matches between the terms belonging to each of the concept-based footprints. The ontologies were translated into a flat list of concepts, which gave equal weight to each term. The list of matched concepts was ordered by the frequency of concepts appearing in different resources in descending order. For this study, a higher match of concepts between the CoP ontology and a researcher s footprint denotes a closer affinity of the researcher to that CoP. Case Study We conducted a case study on three of the over 90 researchers examined using the methodology described in this section. The case study addressed the following research question: How effective are a researcher s publications and proposal submissions in identifying a researcher s alignment with a community of practice? The study also examined the differences in keywords or concepts between those stored in EC, which is populated by researchers, and those stored in Influuent, which is populated by Scopus. The study examined researchers from three disciplinary areas: Civil Engineering (Faculty A), Electrical and Computer Engineering (Faculty B), and Anthropology (Faculty C). The analysis was anonymized to preserve privacy. Results Each case includes two figures. The first figure shows the top concepts identified in DM publications and proposals submissions through ORSP. The concepts are presented along the X-axis and frequency of the concepts is presented along the Y-axis. The second figure shows the percentage of matches for concepts and keywords of EC, Influuent records, publications, and proposals for the three CoPs. The keywords marked EC are the researcher-defined keywords from EC. The remaining concepts are labeled as Influuent (originating from Scopus), DM Pubs (originating from DM publications), and ORSP Props (originating from ORSP). Case 1: Faculty A Faculty A is member of the Department of Civil Engineering. The self-described keywords of this faculty from the EC included cross-border transportation, freight and transportation logistics, intelligent transportation systems, traffic engineering, transportation engineering, and transportation planning. Influuent identified the following set of top concepts based on his/her publication data: highway systems, travel time, neural networks, genetic algorithms, traffic signals, global positioning system, commercial vehicles, trucks, rapid transit, and costs. Among the concepts identified in the faculty s publications from DM are Neural network, Interstate highway system, and Public transport (Figure 1a). Analysis of the proposals linked to Faculty A revealed a different set of concepts (Figure 1b). Similar to the publication footprint, Transportation planning and Freeway are dominant concepts, linked to at least three proposals. The proposal footprint also identified contextual information such as location (e.g., Texas), as well as different research interests (e.g., Higher education). In particular, the keyword Texas is relevant because of the extensive work that the researcher conducts in the state regarding transportation. (a) Publications (b) Proposals Figure 1: Concepts extracted for Faculty A. 686

4 Figure 3: Matching research footprints of Faculty A across communities of practices. Observations: Faculty A is a member of the Smart Cities CoP. As shown in Figure 3, the expertise systems (i.e., Expertise and Influuent) were able to match Faculty A with this CoP. More than 30% of the EC keywords (selfdescribed) were a match with the Smart Cities CoP ontology. In terms of publications, DM identified more matches than Influuent (more than 20% matches related to Smart Cities compared to less than 15% matches). Proposals submission through ORSP revealed diverse concepts related to Smart Cities. This suggests that the publications and proposals of this faculty cover areas that contain concepts related to Smart Cities. As presented in this Figure 2, although none of the EC keywords of Faculty A matched with the Cyber-Security CoP, the publications (i.e., Influuent, DM Pubs) indicated there might be some areas of shared interest. DM Pubs in particular identified concepts that align with this CoP (e.g., Computer simulation, Intelligent agent, and Control system) making Faculty A a potential candidate for collaboration with the Cyber-Security community. In terms of the Undergraduate Research CoP, the DM Pubs and ORSP Props were able to identify a possible alignment. Note that, while EC keywords show a high percentage of matches with the Smart Cities CoP (more than 30%), it fails to deliver any match with the two other CoPs. This could be due to the fact that the keywords are selfdescribed by the faculty, hence, more likely to match with the CoPs where the faculty chose to become a member. Case 2: Faculty B Faculty B is a member of the Department of Electrical and Computer Engineering. In EC, Faculty B identified his/her research interests in the following areas: Control systems, Cyber-physical systems, Electric power and energy systems, Hyperspectral remote sensing, Remote sensing, Signal processing, and Machine learning. Influuent identified the top concepts appearing in the publications of this faculty as Imagery, Remote sensing, Factorization, Parameter estimation, Image analysis, Pixels, Data reduction, Bathymetry, Imaging techniques, and Optical engineering. (a) Publications (b) Proposals Figure 2: Concepts extracted for Faculty B. DM publications of Faculty B (Figure 2a) identified concepts related to imaging (e.g., Hyperspectral imaging, Image Processing, Multi-spectral image), Numerical analysis, and Machine learning. Figure 2b shows results of similar analysis on proposal data. When compared to publications data, proposals identified similar concepts. Because the faculty is new to the university, he/she does not have a deep proposal record. Observations: Faculty B is a member of the Cyber- Security CoP. Indeed, in Figure 4, we observe that Cyber- Security concepts match with both the publication and proposal footprint of this faculty. This area was also identified by the EC and Influuent profile. Figure 4: Matching research footprints of Faculty B across Although this faculty is not a member of the Smart Cities CoP, the EC keywords and publications indicate a possible alignment of concepts between Faculty B and Smart Cities CoP. The exception in this case is the proposals, as there was no match between any concepts appearing in the proposals of Faculty B and Smart Cities CoP. A small set of concepts related to Undergraduate Research was identified in the publications of this faculty through Influuent and DM (less than 5%). This suggests that the faculty is covering some areas of Undergraduate Research in his/her research activities that appear in the publications. 687

5 Case 3: Faculty C Faculty C is from the Sociology & Anthropology program. Some of the research interests identified by Faculty C in EC are: Anthropology, Borders, Community engagement, Culture, Health, Human rights, Immigration, and Society. Top concepts identified in Influuent based on the publications of Faculty C includes United States of America, Mexico, Immigration, Anthropology, Illegality, Labor, Border region, Call center, and Linguistics. with that of Smart Cities CoP (more than 12%). Less concepts of this CoP matched when EC keywords and publications concepts were considered. EC keywords of this faculty had the most matches with the Cyber-Security CoP. Influuent concepts yielded the least number of matches with this CoP. Concepts and keywords related to Undergraduate Research were identified by both EC and publications. While the concepts from proposals of Faculty C resulted in more matches with Smart Cities CoP, the EC keywords indicated interest towards the Cyber-Security CoP. The analysis suggests that Faculty C can potentially contribute to all the three CoPs considered in this study. Related Work (a) Publications (b) Proposals Figure 5: Concepts extracted for Faculty C. DM publications of Faculty C revealed concepts related to Sociology (e.g., Culture, Human Rights), Anthropology, Psychology and concepts related to immigration and border (Figure 5a). The concepts are in alignment with the listed keywords in EC and Influuent-detected concepts. However, the ORSP Props detected a different set of concepts that are more related to local and regional issues (e.g., Rio Grande, Irrigation, Water) (Figure 5)). Although many of the concepts are related to water, neither the EC keywords nor the concepts appearing in the publications (i.e., Influuent and DM) was able to identify an area of interest that addresses regional water issues. Further investigation on the timeline of proposals identifying water issues reveal that it is indeed a new research interests of this faculty. Observations: Faculty C is not a member of any of the CoPs considered in this study. Figure 6 shows some of the concepts appearing in the proposal of this faculty matches Figure 6: Matching research footprints of Faculty C across communities of practices. Closely related work includes Cross-domain Topic Modeling (Tang, Wu, Sun, & Su, 2012), which uses research publications to address sparse connection, complementary expertise, and topic skewness challenges involved in recommending interdisciplinary collaborations. Similar work includes a recommendation algorithm for scientific articles based on both content and users rating (Wang & Blei, 2011). This work combines collaborative filtering based on latent factor models to recommend articles to a particular user from other users libraries and content analysis based on probabilistic topic modeling for recommending unrated articles. Probabilistic topic models such as Latent Dirichlet Allocation are designed to discover and annotate vast unstructured collections of documents to infer hidden thematic information (Blei, 2012). The Language Model Approach (Tomokiyo & Hurst, 2003) combines the extraction of candidate keyphrases and their ranking. Maui (Medelyan, 2015) automatically determines main topics in documents by extraction of keywords from text with or without use of a reference to a controlled vocabulary. One of the closest work addressing keyword extraction is CiteTextRank (Gollapalli & Caragea, 2014), a graph-based algorithm for keyword extraction using document s content and context within a citation network. Summary Identifying researchers who can support IDR opportunities should be extended beyond identifying expertise. One should consider peripheral knowledge, or affinity to a topic. This paper presents a preliminary study to examine the effects of relating knowledge from disparate data sources to identify potential membership in communities of practice. The study showed that Influuent and DM Pubs yielded different concepts (e.g., Figure 3, Undergraduate Research). This may be partially due to data sources of our preliminaries observation. Scopus (source of Influuent data) does not capture all the publications of a researcher. 688

6 The algorithms used for concept extraction by Influuent may also contribute to the differences. Self-described keywords given by EC show the variability in how researchers described themselves. Using concept abstraction provides a richer set of researcher footprints. The study also raises questions. Additional investigations are needed to understand the differences in Influuent concepts and the concepts identified through DM Pubs and ORSP Props. Another important aspect is how to classify the keywords based on the needs of the user. For example, a user looking for researchers in cancer research would want the concepts at a high level of abstraction than those who want to know who does research on a particular type of precursor cell, e.g., B-lymphoid. Another potential classification is those who are on the periphery of a discipline, as shown in the work for those who have interests in undergraduate research. The latter supports the ability to extend CoPs by identifying researchers who have the potential to contribute to a community e.g., Faculty B who is not part of the Smart City CoP, but has publications that align well with this community. The approach also supports the efforts of UTEP s ORSP to convene researchers in particular areas to build collaborations. Future work includes comparing different approaches for concept identification (e.g., keywords vs. sentence level parsing), investigating a statistical approach for assigning weights to the concepts extracted from different sources, and extending the ontologies to support emerging communities and IDR opportunities. Our long-term goal is to extend the researcher s footprint. As more researchers are associated with a CoP, they will be able to use its ontology to refine and directly associate with the community s keywords, enriching their footprint and the community s ontology. This will facilitate the university s ability to identify collaborators and expertise. EC can be extended to identify funding opportunities associated with the communities. Acknowledgements This work is supported in part by the National Science Foundation (NSF) grants HRD and DUE # Any opinions, findings, and conclusions or recommendations expressed in this paper are those of the author(s) and do not necessarily reflect the views of the NSF. References AlchemyAPI. (2015). Alchemy API. Retrieved October 23, 2015, from Berry, M. W., & Kogan, J. (2010). Text mining: Applications and theory John Wiley & Sons. Blei, D. M. (2012). Probabilistic topic models. Communications of the ACM, 55(4), Bower, G. H., & Trabasso, T. R. (1964). Concept identification. Studies in Mathematical Psychology, Carbonell, J., & Goldstein, J. (1998). The use of MMR, diversitybased reranking for reordering documents and producing summaries. Paper presented at the Proceedings of the 21st Annual International ACM SIGIR, pp Committee on Key Challenge Areas for Convergence and Health, Board on Life Sciences, Division on Earth and Life Studies, & National Research Council. (2014). Convergence: Facilitating transdisciplinary integration of life sciences, physical sciences, engineering, and beyond The National Academies Press. Cooke, N. J., & Hilton, M. L. (2015). Enhancing the effectiveness of team science National Academies Press. Digital Measures. (2015). Digital measures. Retrieved October 30, 2015, from Expertise Connector. (2015). Expertise connector. Retrieved October 30, 2015, from Gollapalli, S. D., & Caragea, C. (2014). Extracting keyphrases from research papers using citation networks. Paper presented at the Proceedings of the 28th AAAI, pp Jain, A. K., Murty, M. N., & Flynn, P. J. (1999). Data clustering: A review. ACM Computing Surveys (CSUR), 31(3), Jiang, J. J., & Conrath, D. W. (1997). Semantic similarity based on corpus statistics and lexical taxonomy. arxiv Preprint Cmp- Lg/ Krafft, D. B., Cappadona, N. A., Caruso, B., Corson-Rikert, J., Devare, M., Lowe, B. J., et al. (2010). Vivo: Enabling national networking of scientists. Medelyan, A. (2015). Maui - multi-purpose automatic topic indexing. Retrieved December 04, 2015, from Stokols, D., Hall, K. L., Taylor, B. K., & Moser, R. P. (2008). The science of team science: Overview of the field and introduction to the supplement. American Journal of Preventive Medicine, 35(2), S77-S89. Tang, J., Wu, S., Sun, J., & Su, H. (2012). Cross-domain collaboration recommendation. Paper presented at the Proceedings of the 18th ACM SIGKDD, pp The University of Texas System Office of Public Affairs. (2015). UT system launches free online database to connect industry with thousands of world-class researchers. Retrieved October 29, 2015, from Tomokiyo, T., & Hurst, M. (2003). A language model approach to keyphrase extraction. Paper presented at the Proceedings of the ACL 2003 Workshop on Multiword Expressions: Analysis, Acquisition and Treatment-Volume 18, pp Vogel, A. L., Hall, K. L., Fiore, S. M., Klein, J. T., Bennett, L. M., Gadlin, H., et al. (2013). The team science toolkit: Enhancing research collaboration through online knowledge sharing. American Journal of Preventive Medicine, 45(6), Wang, C., & Blei, D. M. (2011). Collaborative topic modeling for recommending scientific articles. Paper presented at the Proceedings of the 17th ACM SIGKDD, pp Wenger, E. (1998). Communities of practice: Learning as a social system. Systems Thinker, 9(5),

Term extraction for user profiling: evaluation by the user

Term extraction for user profiling: evaluation by the user Term extraction for user profiling: evaluation by the user Suzan Verberne 1, Maya Sappelli 1,2, Wessel Kraaij 1,2 1 Institute for Computing and Information Sciences, Radboud University Nijmegen 2 TNO,

More information

An Introduction to Data Mining

An Introduction to Data Mining An Introduction to Intel Beijing wei.heng@intel.com January 17, 2014 Outline 1 DW Overview What is Notable Application of Conference, Software and Applications Major Process in 2 Major Tasks in Detail

More information

Research of Postal Data mining system based on big data

Research of Postal Data mining system based on big data 3rd International Conference on Mechatronics, Robotics and Automation (ICMRA 2015) Research of Postal Data mining system based on big data Xia Hu 1, Yanfeng Jin 1, Fan Wang 1 1 Shi Jiazhuang Post & Telecommunication

More information

Using Artificial Intelligence to Manage Big Data for Litigation

Using Artificial Intelligence to Manage Big Data for Litigation FEBRUARY 3 5, 2015 / THE HILTON NEW YORK Using Artificial Intelligence to Manage Big Data for Litigation Understanding Artificial Intelligence to Make better decisions Improve the process Allay the fear

More information

Keyphrase Extraction for Scholarly Big Data

Keyphrase Extraction for Scholarly Big Data Keyphrase Extraction for Scholarly Big Data Cornelia Caragea Computer Science and Engineering University of North Texas July 10, 2015 Scholarly Big Data Large number of scholarly documents on the Web PubMed

More information

Identifying Focus, Techniques and Domain of Scientific Papers

Identifying Focus, Techniques and Domain of Scientific Papers Identifying Focus, Techniques and Domain of Scientific Papers Sonal Gupta Department of Computer Science Stanford University Stanford, CA 94305 sonal@cs.stanford.edu Christopher D. Manning Department of

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association

More information

MLg. Big Data and Its Implication to Research Methodologies and Funding. Cornelia Caragea TARDIS 2014. November 7, 2014. Machine Learning Group

MLg. Big Data and Its Implication to Research Methodologies and Funding. Cornelia Caragea TARDIS 2014. November 7, 2014. Machine Learning Group Big Data and Its Implication to Research Methodologies and Funding Cornelia Caragea TARDIS 2014 November 7, 2014 UNT Computer Science and Engineering Data Everywhere Lots of data is being collected and

More information

IT services for analyses of various data samples

IT services for analyses of various data samples IT services for analyses of various data samples Ján Paralič, František Babič, Martin Sarnovský, Peter Butka, Cecília Havrilová, Miroslava Muchová, Michal Puheim, Martin Mikula, Gabriel Tutoky Technical

More information

The CS Principles Project 1

The CS Principles Project 1 The CS Principles Project 1 Owen Astrachan, Duke University Amy Briggs, Middlebury College Abstract The Computer Science Principles project is part of a national effort to reach a wide and diverse audience

More information

Overcoming the Technical and Policy Constraints That Limit Large-Scale Data Integration

Overcoming the Technical and Policy Constraints That Limit Large-Scale Data Integration Overcoming the Technical and Policy Constraints That Limit Large-Scale Data Integration Revised Proposal from The National Academies Summary An NRC-appointed committee will plan and organize a cross-disciplinary

More information

SUBJECT TABLES METHODOLOGY

SUBJECT TABLES METHODOLOGY SUBJECT TABLES METHODOLOGY Version 0.3 Last updated: 28 March 2011 Copyright 2010 QS Intelligence Unit (a division of QS Quacquarelli Symonds Ltd) Contents Background... 3 Subject Disciplines Considered...

More information

Learning outcomes. Knowledge and understanding. Competence and skills

Learning outcomes. Knowledge and understanding. Competence and skills Syllabus Master s Programme in Statistics and Data Mining 120 ECTS Credits Aim The rapid growth of databases provides scientists and business people with vast new resources. This programme meets the challenges

More information

ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY

ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY Yu. A. Zagorulko, O. I. Borovikova, S. V. Bulgakov, E. A. Sidorova 1 A.P.Ershov s Institute

More information

Delivering Smart Answers!

Delivering Smart Answers! Companion for SharePoint Topic Analyst Companion for SharePoint All Your Information Enterprise-ready Enrich SharePoint, your central place for document and workflow management, not only with an improved

More information

Introduction to Pattern Recognition

Introduction to Pattern Recognition Introduction to Pattern Recognition Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr CS 551, Spring 2009 CS 551, Spring 2009 c 2009, Selim Aksoy (Bilkent University)

More information

72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD

72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD 72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD Paulo Gottgtroy Auckland University of Technology Paulo.gottgtroy@aut.ac.nz Abstract This paper is

More information

Chapter Managing Knowledge in the Digital Firm

Chapter Managing Knowledge in the Digital Firm Chapter Managing Knowledge in the Digital Firm Essay Questions: 1. What is knowledge management? Briefly outline the knowledge management chain. 2. Identify the three major types of knowledge management

More information

A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks

A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks Text Analytics World, Boston, 2013 Lars Hard, CTO Agenda Difficult text analytics tasks Feature extraction Bio-inspired

More information

Statistics for BIG data

Statistics for BIG data Statistics for BIG data Statistics for Big Data: Are Statisticians Ready? Dennis Lin Department of Statistics The Pennsylvania State University John Jordan and Dennis K.J. Lin (ICSA-Bulletine 2014) Before

More information

Data Isn't Everything

Data Isn't Everything June 17, 2015 Innovate Forward Data Isn't Everything The Challenges of Big Data, Advanced Analytics, and Advance Computation Devices for Transportation Agencies. Using Data to Support Mission, Administration,

More information

Connecting library content using data mining and text analytics on structured and unstructured data

Connecting library content using data mining and text analytics on structured and unstructured data Submitted on: May 5, 2013 Connecting library content using data mining and text analytics on structured and unstructured data Chee Kiam Lim Technology and Innovation, National Library Board, Singapore.

More information

IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper

IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper CAST-2015 provides an opportunity for researchers, academicians, scientists and

More information

A Statistical Text Mining Method for Patent Analysis

A Statistical Text Mining Method for Patent Analysis A Statistical Text Mining Method for Patent Analysis Department of Statistics Cheongju University, shjun@cju.ac.kr Abstract Most text data from diverse document databases are unsuitable for analytical

More information

Data Mining Solutions for the Business Environment

Data Mining Solutions for the Business Environment Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania ruxandra_stefania.petre@yahoo.com Over

More information

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop ISSN: 2454-2377, October 2015 Big Data and Hadoop Simmi Bagga 1 Satinder Kaur 2 1 Assistant Professor, Sant Hira Dass Kanya MahaVidyalaya, Kala Sanghian, Distt Kpt. INDIA E-mail: simmibagga12@gmail.com

More information

COURSE RECOMMENDER SYSTEM IN E-LEARNING

COURSE RECOMMENDER SYSTEM IN E-LEARNING International Journal of Computer Science and Communication Vol. 3, No. 1, January-June 2012, pp. 159-164 COURSE RECOMMENDER SYSTEM IN E-LEARNING Sunita B Aher 1, Lobo L.M.R.J. 2 1 M.E. (CSE)-II, Walchand

More information

Semantic Search in Portals using Ontologies

Semantic Search in Portals using Ontologies Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br

More information

How To Create A Data Science System

How To Create A Data Science System Enhance Collaboration and Data Sharing for Faster Decisions and Improved Mission Outcome Richard Breakiron Senior Director, Cyber Solutions Rbreakiron@vion.com Office: 571-353-6127 / Cell: 803-443-8002

More information

SWIFT: A Text-mining Workbench for Systematic Review

SWIFT: A Text-mining Workbench for Systematic Review SWIFT: A Text-mining Workbench for Systematic Review Ruchir Shah, PhD Sciome LLC NTP Board of Scientific Counselors Meeting June 16, 2015 Large Literature Corpus: An Ever Increasing Challenge Systematic

More information

SPATIAL DATA CLASSIFICATION AND DATA MINING

SPATIAL DATA CLASSIFICATION AND DATA MINING , pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal

More information

Latent Dirichlet Markov Allocation for Sentiment Analysis

Latent Dirichlet Markov Allocation for Sentiment Analysis Latent Dirichlet Markov Allocation for Sentiment Analysis Ayoub Bagheri Isfahan University of Technology, Isfahan, Iran Intelligent Database, Data Mining and Bioinformatics Lab, Electrical and Computer

More information

How To Make Sense Of Data With Altilia

How To Make Sense Of Data With Altilia HOW TO MAKE SENSE OF BIG DATA TO BETTER DRIVE BUSINESS PROCESSES, IMPROVE DECISION-MAKING, AND SUCCESSFULLY COMPETE IN TODAY S MARKETS. ALTILIA turns Big Data into Smart Data and enables businesses to

More information

2015 Workshops for Professors

2015 Workshops for Professors SAS Education Grow with us Offered by the SAS Global Academic Program Supporting teaching, learning and research in higher education 2015 Workshops for Professors 1 Workshops for Professors As the market

More information

Semantic Concept Based Retrieval of Software Bug Report with Feedback

Semantic Concept Based Retrieval of Software Bug Report with Feedback Semantic Concept Based Retrieval of Software Bug Report with Feedback Tao Zhang, Byungjeong Lee, Hanjoon Kim, Jaeho Lee, Sooyong Kang, and Ilhoon Shin Abstract Mining software bugs provides a way to develop

More information

Sustaining Privacy Protection in Personalized Web Search with Temporal Behavior

Sustaining Privacy Protection in Personalized Web Search with Temporal Behavior Sustaining Privacy Protection in Personalized Web Search with Temporal Behavior N.Jagatheshwaran 1 R.Menaka 2 1 Final B.Tech (IT), jagatheshwaran.n@gmail.com, Velalar College of Engineering and Technology,

More information

Qualitative Corporate Dashboards for Corporate Monitoring Peng Jia and Miklos A. Vasarhelyi 1

Qualitative Corporate Dashboards for Corporate Monitoring Peng Jia and Miklos A. Vasarhelyi 1 Qualitative Corporate Dashboards for Corporate Monitoring Peng Jia and Miklos A. Vasarhelyi 1 Introduction Electronic Commerce 2 is accelerating dramatically changes in the business process. Electronic

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014 RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer

More information

Teaching Computational Thinking using Cloud Computing: By A/P Tan Tin Wee

Teaching Computational Thinking using Cloud Computing: By A/P Tan Tin Wee Teaching Computational Thinking using Cloud Computing: By A/P Tan Tin Wee Technology in Pedagogy, No. 8, April 2012 Written by Kiruthika Ragupathi (kiruthika@nus.edu.sg) Computational thinking is an emerging

More information

Animation. Intelligence. Business. Computer. Areas of Focus. Master of Science Degree Program

Animation. Intelligence. Business. Computer. Areas of Focus. Master of Science Degree Program Business Intelligence Computer Animation Master of Science Degree Program The Bachelor explosive of growth Science of Degree from the Program Internet, social networks, business networks, as well as the

More information

Using LSI for Implementing Document Management Systems Turning unstructured data from a liability to an asset.

Using LSI for Implementing Document Management Systems Turning unstructured data from a liability to an asset. White Paper Using LSI for Implementing Document Management Systems Turning unstructured data from a liability to an asset. Using LSI for Implementing Document Management Systems By Mike Harrison, Director,

More information

A STUDY OF DATA MINING ACTIVITIES FOR MARKET RESEARCH

A STUDY OF DATA MINING ACTIVITIES FOR MARKET RESEARCH 205 A STUDY OF DATA MINING ACTIVITIES FOR MARKET RESEARCH ABSTRACT MR. HEMANT KUMAR*; DR. SARMISTHA SARMA** *Assistant Professor, Department of Information Technology (IT), Institute of Innovation in Technology

More information

CiteSeer x in the Cloud

CiteSeer x in the Cloud Published in the 2nd USENIX Workshop on Hot Topics in Cloud Computing 2010 CiteSeer x in the Cloud Pradeep B. Teregowda Pennsylvania State University C. Lee Giles Pennsylvania State University Bhuvan Urgaonkar

More information

Kate Gleason College of Engineering Concept Paper: Proposal for a Ph.D. in Engineering

Kate Gleason College of Engineering Concept Paper: Proposal for a Ph.D. in Engineering Program Goals Kate Gleason College of Engineering Concept Paper: Proposal for a Ph.D. in Engineering The primary goal of the proposed Ph.D. in Engineering program is to expand the research enterprise at

More information

RFI Summary: Executive Summary

RFI Summary: Executive Summary RFI Summary: Executive Summary On February 20, 2013, the NIH issued a Request for Information titled Training Needs In Response to Big Data to Knowledge (BD2K) Initiative. The response was large, with

More information

Machine Learning: Overview

Machine Learning: Overview Machine Learning: Overview Why Learning? Learning is a core of property of being intelligent. Hence Machine learning is a core subarea of Artificial Intelligence. There is a need for programs to behave

More information

Collaborations between Official Statistics and Academia in the Era of Big Data

Collaborations between Official Statistics and Academia in the Era of Big Data Collaborations between Official Statistics and Academia in the Era of Big Data World Statistics Day October 20-21, 2015 Budapest Vijay Nair University of Michigan Past-President of ISI vnn@umich.edu What

More information

Comparison of K-means and Backpropagation Data Mining Algorithms

Comparison of K-means and Backpropagation Data Mining Algorithms Comparison of K-means and Backpropagation Data Mining Algorithms Nitu Mathuriya, Dr. Ashish Bansal Abstract Data mining has got more and more mature as a field of basic research in computer science and

More information

A Survey on Product Aspect Ranking

A Survey on Product Aspect Ranking A Survey on Product Aspect Ranking Charushila Patil 1, Prof. P. M. Chawan 2, Priyamvada Chauhan 3, Sonali Wankhede 4 M. Tech Student, Department of Computer Engineering and IT, VJTI College, Mumbai, Maharashtra,

More information

UNDERSTAND YOUR CLIENTS BETTER WITH DATA How Data-Driven Decision Making Improves the Way Advisors Do Business

UNDERSTAND YOUR CLIENTS BETTER WITH DATA How Data-Driven Decision Making Improves the Way Advisors Do Business UNDERSTAND YOUR CLIENTS BETTER WITH DATA How Data-Driven Decision Making Improves the Way Advisors Do Business Executive Summary Financial advisors have long been charged with knowing the investors they

More information

Jobsket ATS. Empowering your recruitment process

Jobsket ATS. Empowering your recruitment process Jobsket ATS Empowering your recruitment process WELCOME TO JOBSKET ATS Jobsket ATS is a recruitment and talent acquisition software package built on top of innovation. Our software improves recruitment

More information

USABILITY OF A FILIPINO LANGUAGE TOOLS WEBSITE

USABILITY OF A FILIPINO LANGUAGE TOOLS WEBSITE USABILITY OF A FILIPINO LANGUAGE TOOLS WEBSITE Ria A. Sagum, MCS Department of Computer Science, College of Computer and Information Sciences Polytechnic University of the Philippines, Manila, Philippines

More information

DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support

DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support Rok Rupnik, Matjaž Kukar, Marko Bajec, Marjan Krisper University of Ljubljana, Faculty of Computer and Information

More information

What Is Keio University Research?

What Is Keio University Research? Case Study: Expanding collaboration and seeking new horizons in research Keio University Executive Summary Keio University aimed to build on its core research strengths to meet the changing needs of society.

More information

Important dimensions of knowledge Knowledge is a firm asset: Knowledge has different forms Knowledge has a location Knowledge is situational Wisdom:

Important dimensions of knowledge Knowledge is a firm asset: Knowledge has different forms Knowledge has a location Knowledge is situational Wisdom: Southern Company Electricity Generators uses Content Management System (CMS). Important dimensions of knowledge: Knowledge is a firm asset: Intangible. Creation of knowledge from data, information, requires

More information

SECURE AND TRUSTWORTHY CYBERSPACE (SaTC)

SECURE AND TRUSTWORTHY CYBERSPACE (SaTC) SECURE AND TRUSTWORTHY CYBERSPACE (SaTC) Overview The Secure and Trustworthy Cyberspace (SaTC) investment is aimed at building a cybersecure society and providing a strong competitive edge in the Nation

More information

Spatio-Temporal Patterns of Passengers Interests at London Tube Stations

Spatio-Temporal Patterns of Passengers Interests at London Tube Stations Spatio-Temporal Patterns of Passengers Interests at London Tube Stations Juntao Lai *1, Tao Cheng 1, Guy Lansley 2 1 SpaceTimeLab for Big Data Analytics, Department of Civil, Environmental &Geomatic Engineering,

More information

Effective Data Retrieval Mechanism Using AML within the Web Based Join Framework

Effective Data Retrieval Mechanism Using AML within the Web Based Join Framework Effective Data Retrieval Mechanism Using AML within the Web Based Join Framework Usha Nandini D 1, Anish Gracias J 2 1 ushaduraisamy@yahoo.co.in 2 anishgracias@gmail.com Abstract A vast amount of assorted

More information

From Stored Knowledge to Smart Knowledge

From Stored Knowledge to Smart Knowledge From Stored Knowledge to Smart Knowledge The British Library s Content Strategy 2013 2015 From Stored Knowledge to Smart Knowledge: The British Library s Content Strategy 2013 2015 Introduction The British

More information

Patent Big Data Analysis by R Data Language for Technology Management

Patent Big Data Analysis by R Data Language for Technology Management , pp. 69-78 http://dx.doi.org/10.14257/ijseia.2016.10.1.08 Patent Big Data Analysis by R Data Language for Technology Management Sunghae Jun * Department of Statistics, Cheongju University, 360-764, Korea

More information

Automatic Annotation Wrapper Generation and Mining Web Database Search Result

Automatic Annotation Wrapper Generation and Mining Web Database Search Result Automatic Annotation Wrapper Generation and Mining Web Database Search Result V.Yogam 1, K.Umamaheswari 2 1 PG student, ME Software Engineering, Anna University (BIT campus), Trichy, Tamil nadu, India

More information

Master of Science in Health Information Technology Degree Curriculum

Master of Science in Health Information Technology Degree Curriculum Master of Science in Health Information Technology Degree Curriculum Core courses: 8 courses Total Credit from Core Courses = 24 Core Courses Course Name HRS Pre-Req Choose MIS 525 or CIS 564: 1 MIS 525

More information

Patient Centered Healthcare Informatics

Patient Centered Healthcare Informatics 1 Patient Centered Healthcare Informatics Christopher C. Yang Abstract The healthcare system is undergoing a transformation from reactive care to proactive and preventive care. Patients or health consumers

More information

Web Database Integration

Web Database Integration Web Database Integration Wei Liu School of Information Renmin University of China Beijing, 100872, China gue2@ruc.edu.cn Xiaofeng Meng School of Information Renmin University of China Beijing, 100872,

More information

Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines

Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines , 22-24 October, 2014, San Francisco, USA Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines Baosheng Yin, Wei Wang, Ruixue Lu, Yang Yang Abstract With the increasing

More information

Big Data with Rough Set Using Map- Reduce

Big Data with Rough Set Using Map- Reduce Big Data with Rough Set Using Map- Reduce Mr.G.Lenin 1, Mr. A. Raj Ganesh 2, Mr. S. Vanarasan 3 Assistant Professor, Department of CSE, Podhigai College of Engineering & Technology, Tirupattur, Tamilnadu,

More information

Enhancing On-Line Conferencing Ba with Human-Machine Interaction CorMap Analysis

Enhancing On-Line Conferencing Ba with Human-Machine Interaction CorMap Analysis 62 International Journal of Knowledge and Systems Science, 1(2), 62-70, April-June 2010 Enhancing On-Line Conferencing Ba with Human-Machine Interaction CorMap Analysis Bin Luo, Chinese Academy of Sciences,

More information

Collecting Polish German Parallel Corpora in the Internet

Collecting Polish German Parallel Corpora in the Internet Proceedings of the International Multiconference on ISSN 1896 7094 Computer Science and Information Technology, pp. 285 292 2007 PIPS Collecting Polish German Parallel Corpora in the Internet Monika Rosińska

More information

Web-Based Educational Resources for Learning and Online Teaching 1

Web-Based Educational Resources for Learning and Online Teaching 1 Web-Based Educational Resources for Learning and Online Teaching 1 Web-Based Educational Resources for Learning and Online Teaching in Higher Education: The MERLOT Project Emrah Orhun Professor Computer

More information

Insight for Informed Decisions

Insight for Informed Decisions Insight for Informed Decisions NORC at the University of Chicago is an independent research institution that delivers reliable data and rigorous analysis to guide critical programmatic, business, and policy

More information

Text Mining - Scope and Applications

Text Mining - Scope and Applications Journal of Computer Science and Applications. ISSN 2231-1270 Volume 5, Number 2 (2013), pp. 51-55 International Research Publication House http://www.irphouse.com Text Mining - Scope and Applications Miss

More information

Integrated Information Services (IIS) Strategic Plan

Integrated Information Services (IIS) Strategic Plan Integrated Information Services (IIS) Strategic Plan Preamble Integrated Information Services (IIS) supports UCAR/NCAR/UCP efforts to both manage, preserve, and provide access to its scholarship for the

More information

Query term suggestion in academic search

Query term suggestion in academic search Query term suggestion in academic search Suzan Verberne 1, Maya Sappelli 1,2, and Wessel Kraaij 2,1 1. Institute for Computing and Information Sciences, Radboud University Nijmegen 2. TNO, Delft Abstract.

More information

Web 3.0 image search: a World First

Web 3.0 image search: a World First Web 3.0 image search: a World First The digital age has provided a virtually free worldwide digital distribution infrastructure through the internet. Many areas of commerce, government and academia have

More information

An interdisciplinary model for analytics education

An interdisciplinary model for analytics education An interdisciplinary model for analytics education Raffaella Settimi, PhD School of Computing, DePaul University Drew Conway s Data Science Venn Diagram http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram

More information

Find the signal in the noise

Find the signal in the noise Find the signal in the noise Electronic Health Records: The challenge The adoption of Electronic Health Records (EHRs) in the USA is rapidly increasing, due to the Health Information Technology and Clinical

More information

This software agent helps industry professionals review compliance case investigations, find resolutions, and improve decision making.

This software agent helps industry professionals review compliance case investigations, find resolutions, and improve decision making. Lost in a sea of data? Facing an external audit? Or just wondering how you re going meet the challenges of the next regulatory law? When you need fast, dependable support and company-specific solutions

More information

Healthcare Measurement Analysis Using Data mining Techniques

Healthcare Measurement Analysis Using Data mining Techniques www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 03 Issue 07 July, 2014 Page No. 7058-7064 Healthcare Measurement Analysis Using Data mining Techniques 1 Dr.A.Shaik

More information

Text Mining: The state of the art and the challenges

Text Mining: The state of the art and the challenges Text Mining: The state of the art and the challenges Ah-Hwee Tan Kent Ridge Digital Labs 21 Heng Mui Keng Terrace Singapore 119613 Email: ahhwee@krdl.org.sg Abstract Text mining, also known as text data

More information

Chapter 11 MANAGING KNOWLEDGE

Chapter 11 MANAGING KNOWLEDGE MANAGING THE DIGITAL FIRM, 12 TH EDITION Learning Objectives Chapter 11 MANAGING KNOWLEDGE VIDEO CASES Case 1: L'Oréal: Knowledge Management Using Microsoft SharePoint Case 2: IdeaScale Crowdsourcing:

More information

A Knowledge Management Framework Using Business Intelligence Solutions

A Knowledge Management Framework Using Business Intelligence Solutions www.ijcsi.org 102 A Knowledge Management Framework Using Business Intelligence Solutions Marwa Gadu 1 and Prof. Dr. Nashaat El-Khameesy 2 1 Computer and Information Systems Department, Sadat Academy For

More information

The First Online 3D Epigraphic Library: The University of Florida Digital Epigraphy and Archaeology Project

The First Online 3D Epigraphic Library: The University of Florida Digital Epigraphy and Archaeology Project Seminar on Dec 19 th Abstracts & speaker information The First Online 3D Epigraphic Library: The University of Florida Digital Epigraphy and Archaeology Project Eleni Bozia (USA) Angelos Barmpoutis (USA)

More information

Which universities lead and lag? Toward university rankings based on scholarly output

Which universities lead and lag? Toward university rankings based on scholarly output Which universities lead and lag? Toward university rankings based on scholarly output Daniel Ramage and Christopher D. Manning Computer Science Department Stanford University Stanford, California 94305

More information

ONLINE RESUME PARSING SYSTEM USING TEXT ANALYTICS

ONLINE RESUME PARSING SYSTEM USING TEXT ANALYTICS ONLINE RESUME PARSING SYSTEM USING TEXT ANALYTICS Divyanshu Chandola 1, Aditya Garg 2, Ankit Maurya 3, Amit Kushwaha 4 1 Student, Department of Information Technology, ABES Engineering College, Uttar Pradesh,

More information

Facebook Friend Suggestion Eytan Daniyalzade and Tim Lipus

Facebook Friend Suggestion Eytan Daniyalzade and Tim Lipus Facebook Friend Suggestion Eytan Daniyalzade and Tim Lipus 1. Introduction Facebook is a social networking website with an open platform that enables developers to extract and utilize user information

More information

The Scientific Data Mining Process

The Scientific Data Mining Process Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In

More information

INNOVATION WITH IMPACT. Creating a Culture for Scholarly and Systematic Innovation in Engineering Education

INNOVATION WITH IMPACT. Creating a Culture for Scholarly and Systematic Innovation in Engineering Education INNOVATION WITH IMPACT Creating a Culture for Scholarly and Systematic Innovation in Engineering Education june 1, 2012 All Rights Reserved Copyright 2012 by American Society for Engineering Education

More information

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON Overview * Introduction * Multiple faces of Big Data * Challenges of Big Data * Cloud Computing

More information

An Engagement Model for Learning: Providing a Framework to Identify Technology Services

An Engagement Model for Learning: Providing a Framework to Identify Technology Services Interdisciplinary Journal of Knowledge and Learning Objects Volume 3, 2007 An Engagement Model for Learning: Providing a Framework to Identify Technology Services I.T. Hawryszkiewycz Department of Information

More information

IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH

IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH Kalinka Mihaylova Kaloyanova St. Kliment Ohridski University of Sofia, Faculty of Mathematics and Informatics Sofia 1164, Bulgaria

More information

FIVE YEAR REVIEWS OF HEALTH SCIENCES ORGANIZED RESEARCH UNITS UNIVERSITY OF CALIFORNIA, SAN DIEGO Supplement to UCSD ORU Policy & Procedures, May 2010

FIVE YEAR REVIEWS OF HEALTH SCIENCES ORGANIZED RESEARCH UNITS UNIVERSITY OF CALIFORNIA, SAN DIEGO Supplement to UCSD ORU Policy & Procedures, May 2010 FIVE YEAR REVIEWS OF HEALTH SCIENCES ORGANIZED RESEARCH UNITS UNIVERSITY OF CALIFORNIA, SAN DIEGO Supplement to UCSD ORU Policy & Procedures, May 2010 An Organized Research Unit (ORU) is a non-permanent

More information

Search and Data Mining: Techniques. Introduction Anna Yarygina Boris Novikov

Search and Data Mining: Techniques. Introduction Anna Yarygina Boris Novikov Search and Data Mining: Techniques Introduction Anna Yarygina Boris Novikov Data Analytics: Conference Sections Fundamentals for data analytics Mechanisms and features Big Data Huge data Target analytics

More information

Machine Learning and Data Mining. Fundamentals, robotics, recognition

Machine Learning and Data Mining. Fundamentals, robotics, recognition Machine Learning and Data Mining Fundamentals, robotics, recognition Machine Learning, Data Mining, Knowledge Discovery in Data Bases Their mutual relations Data Mining, Knowledge Discovery in Databases,

More information

Improving Decision Making and Managing Knowledge

Improving Decision Making and Managing Knowledge Improving Decision Making and Managing Knowledge Decision Making and Information Systems Information Requirements of Key Decision-Making Groups in a Firm Senior managers, middle managers, operational managers,

More information

How To Write A Summary Of A Review

How To Write A Summary Of A Review PRODUCT REVIEW RANKING SUMMARIZATION N.P.Vadivukkarasi, Research Scholar, Department of Computer Science, Kongu Arts and Science College, Erode. Dr. B. Jayanthi M.C.A., M.Phil., Ph.D., Associate Professor,

More information

Doctor of Philosophy in Informatics

Doctor of Philosophy in Informatics Doctor of Philosophy in Informatics 2014 Handbook Indiana University established the School of Informatics and Computing as a place where innovative multidisciplinary programs could thrive, a program where

More information

THE ROLE OF KNOWLEDGE MANAGEMENT SYSTEM IN SCHOOL: PERCEPTION OF APPLICATIONS AND BENEFITS

THE ROLE OF KNOWLEDGE MANAGEMENT SYSTEM IN SCHOOL: PERCEPTION OF APPLICATIONS AND BENEFITS THE ROLE OF KNOWLEDGE MANAGEMENT SYSTEM IN SCHOOL: PERCEPTION OF APPLICATIONS AND BENEFITS YOHANNES KURNIAWAN Bina Nusantara University, Department of Information Systems, Jakarta 11480, Indonesia E-mail:

More information

A Proposal for the use of Artificial Intelligence in Spend-Analytics

A Proposal for the use of Artificial Intelligence in Spend-Analytics A Proposal for the use of Artificial Intelligence in Spend-Analytics Mark Bishop, Sebastian Danicic, John Howroyd and Andrew Martin Our core team Mark Bishop PhD studied Cybernetics and Computer Science

More information

Value of. Clinical and Business Data Analytics for. Healthcare Payers NOUS INFOSYSTEMS LEVERAGING INTELLECT

Value of. Clinical and Business Data Analytics for. Healthcare Payers NOUS INFOSYSTEMS LEVERAGING INTELLECT Value of Clinical and Business Data Analytics for Healthcare Payers NOUS INFOSYSTEMS LEVERAGING INTELLECT Abstract As there is a growing need for analysis, be it for meeting complex of regulatory requirements,

More information

SURVEY REPORT DATA SCIENCE SOCIETY 2014

SURVEY REPORT DATA SCIENCE SOCIETY 2014 SURVEY REPORT DATA SCIENCE SOCIETY 2014 TABLE OF CONTENTS Contents About the Initiative 1 Report Summary 2 Participants Info 3 Participants Expertise 6 Suggested Discussion Topics 7 Selected Responses

More information