BS COMPUTER SCIENCE BEST THESIS



Similar documents
PRACTICAL DATA MINING IN A LARGE UTILITY COMPANY

MS1b Statistical Data Mining

M The Nucleus M The Cytoskeleton M Cell Structure and Dynamics

How To Use Neural Networks In Data Mining

A Review of Data Mining Techniques

Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA

Data Mining and Neural Networks in Stata

Biochemistry. Entrance Requirements. Requirements for Honours Programs. 148 Bishop s University 2015/2016

Masters in Information Technology

Data Mining Applications in Higher Education

CS Master Level Courses and Areas COURSE DESCRIPTIONS. CSCI 521 Real-Time Systems. CSCI 522 High Performance Computing

International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November ISSN

INTERNATIONAL JOURNAL FOR ENGINEERING APPLICATIONS AND TECHNOLOGY DATA MINING IN HEALTHCARE SECTOR.

IT and CRM A basic CRM model Data source & gathering system Database system Data warehouse Information delivery system Information users

National Cancer Institute

Doctor of Philosophy in Computer Science

EFFICIENT DATA PRE-PROCESSING FOR DATA MINING

Call 2014: High throughput screening of therapeutic molecules and rare diseases

Data mining on the EPIRARE survey data

Integrated Data Mining Strategy for Effective Metabolomic Data Analysis

Pharmacology skills for drug discovery. Why is pharmacology important?

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology

Machine Learning/Data Mining for Cancer Genomics

How To Get A Computer Science Degree At Appalachian State

Masters in Human Computer Interaction

Masters in Advanced Computer Science

Methods of Grading S/N Style of grading Percentage Score 1 Attendance, class work and assignment 10 2 Test 20 3 Examination 70 Total 100

Masters in Artificial Intelligence

BIOL 3030 Biochemistry Spring 2015

Masters in Computing and Information Technology

Masters in Networks and Distributed Systems

Interactive Visual Data Analysis in the Times of Big Data

Data Mining and Machine Learning in Bioinformatics

Graduate and Postdoctoral Affairs School of Biomedical Sciences College of Medicine. Graduate Certificate. Metabolic & Nutritional Medicine

USING SELF-ORGANIZING MAPS FOR INFORMATION VISUALIZATION AND KNOWLEDGE DISCOVERY IN COMPLEX GEOSPATIAL DATASETS

CITY UNIVERSITY OF HONG KONG 香 港 城 市 大 學. Self-Organizing Map: Visualization and Data Handling 自 組 織 神 經 網 絡 : 可 視 化 和 數 據 處 理

Pushdown automata. Informatics 2A: Lecture 9. Alex Simpson. 3 October, School of Informatics University of Edinburgh als@inf.ed.ac.

CHEMISTRY. Real. Amazing. Program Goals and Learning Outcomes. Preparation for Graduate School. Requirements for the Chemistry Major (71-72 credits)

A leader in the development and application of information technology to prevent and treat disease.

Data Mining Algorithms Part 1. Dejan Sarka

Master Specialization in Knowledge Engineering

DATA MINING TECHNIQUES AND APPLICATIONS

Foundations of Business Intelligence: Databases and Information Management

CHURN PREDICTION IN MOBILE TELECOM SYSTEM USING DATA MINING TECHNIQUES

OTHER KNOWLEDGE CAPTURE TECHNIQUES

IC05 Introduction on Networks &Visualization Nov

Department of Food and Nutrition

Abdullah Mohammed Abdullah Khamis

A Framework of User-Driven Data Analytics in the Cloud for Course Management

CHEM 451 BIOCHEMISTRY I. SUNY Cortland Fall 2010

A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks

Working with telecommunications

Ph.D. in Bioinformatics and Computational Biology Degree Requirements

In this presentation, you will be introduced to data mining and the relationship with meaningful use.

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD

CS Standards Crosswalk: CSTA K-12 Computer Science Standards and Oracle Java Programming (2014)

Likelihood of Cancer

Department of Health Sciences

. Learn the number of classes and the structure of each class using similarity between unlabeled training patterns

Obtaining Knowledge. Lecture 7 Methods of Scientific Observation and Analysis in Behavioral Psychology and Neuropsychology.

SYLLABUS. Path-217. Haneline MT, Meeker WC. Introduction to Public Health for Chiropractors. Sudbury, MA: Jones and Bartlett Publishers, 2011.

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

Introduction to Data Mining

Knowledge Discovery and Data Mining

REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf])

Using MATLAB: Bioinformatics Toolbox for Life Sciences

DEGREE PLAN INSTRUCTIONS FOR COMPUTER ENGINEERING

Principles of Data Mining by Hand&Mannila&Smyth

Prediction of Cancer Count through Artificial Neural Networks Using Incidence and Mortality Cancer Statistics Dataset for Cancer Control Organizations

Software Development Training Camp 1 (0-3) Prerequisite : Program development skill enhancement camp, at least 48 person-hours.

Mass Frontier Version 7.0

CTC Technology Readiness Levels

DATA MINING - SELECTED TOPICS

University of Glasgow - Programme Structure Summary C1G MSc Bioinformatics, Polyomics and Systems Biology

DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7

Course Curriculum for Master Degree in Medical Laboratory Sciences/Hematology and Blood Banking

BIOSCIENCES COURSE TITLE AWARD

life science data mining

Data Mining On Diabetics

Methodology for Emulating Self Organizing Maps for Visualization of Large Datasets

Visualization of large data sets using MDS combined with LVQ.

Gerard Mc Nulty Systems Optimisation Ltd BA.,B.A.I.,C.Eng.,F.I.E.I

MultiQuant Software 2.0 for Targeted Protein / Peptide Quantification

is a knowledge based expert decision support tool for predicting the metabolic fate of chemicals in mammals.

Chapter 6 - Enhancing Business Intelligence Using Information Systems

Nursing 113. Pharmacology Principles

BIOLOGICAL SCIENCES REQUIREMENTS [63 75 UNITS]

Master of Science in Biochemistry (Molecular Medicine Option).

Course Curriculum for Master Degree in Medical Laboratory Sciences/Clinical Biochemistry

Pipeline Pilot Enterprise Server. Flexible Integration of Disparate Data and Applications. Capture and Deployment of Best Practices

Biomedical Informatics: Its Scientific Evolution and Future Promise. What is Biomedical Informatics?

School of Computer Science

Transcription:

2014 Meren, Gil Troy P. Adviser: Vincent Peter C. Magboo, M.D., M.Sc. BOSOM Calculator: A Breast Cancer Outcome Survival Online Measurement Calculator using Data Mining and Predictive Modeling on SEER data Illnesses of high mortality rate such as breast cancer elicit questions related to the patient s time left to live. The common methods used to arrive at an estimate include comparing the patient s health condition to previous medical records and treatments, referral to statistically-computed survival rates based from historical records, or consulting another breast cancer expert. The application of data mining on medical records to create predictive models for cancer survivability has been proven to hold significant accuracy by numerous scientific and applied researches throughout the years. Agrawal et al. s Lung Cancer Outcome Calculator provides a framework for developing a predicted survival calculator for different cancers based on a patient s health condition. This research aims to develop the Breast Cancer Outcome - Survival Online Measurement Calculator (BOSOM Calculator), an online application that takes a patient s clinical cancer data to give a predicted cancer survival based on a dataset from the Surveillance, Epidemiology, and End Results Program (SEER).

2013 Mendoza, John Althom Adviser: Ma. Sheila A. Magboo, M.Sc. Prediction of Polyketide product from Module Organization of Enzymes Using Cumulative Tanimoto Fragment Scores Polyketide is a major class of natural products possessing several pharmacological properties. Perfroming wet laboratory experiments to discover a functional polyketide is costly and difficult because of its trial-and-error nature. However, the analogous biosynthesis of theses metabolites to fatty acides makes the resulting compound predictable. Through the use of information technology, a stand-alone computational tool -Predyketide - is crated to observe the resulting structure per elongation, and to allow prediction and visualization of the most possible natural product compuound. The list of all known building blocks (starter and extender) used in the system is gathered from ASMPKS, another polyketiderelated system. With these functionalities, this application can help in the discovery of new drugs requiring lesser time and effort.

2012 Ghany, Mark Lester Adviser: Geoffrey A. Solano, M.Sc. Visualization of Multivariate Health Data Using Self-Organizing Maps Data that are multivariate in nature is a type of data that may contain subtle patterns. However, it is considered to be an obstacle in research most of the time since classical statistics may find it encumbering to analyze. However, computational statistics, a collab oration between computer science and statistics, offers a suite of algorithms that may be used to surpass obstacles such as this. The Self- Organizing Map and Data Visualization are examples of these. The Self-Organizing Map is an artificial neural network that employs a process to reduce multidimensional data into a low-dimensional representation while Data Visualization is a process that aims to give the human brain a visual representation of knowledge about certain data. SOM Visualize is a software that makes use of both processes. The tool enables users to input data and visualize several patterns such as clusters, associations, as well as a geographical representation that exist in the data. It may give several hypotheses that may be confirmed through other statistical tests and SOM Visualize has therefore enabled the possibility of analysis of multivariate data.

2011 Quinto, Elmannson Adviser: Vincent Peter C. Magboo, M.D., M.Sc. CMAN: Context-free Grammar Manipulator Many topics in Automata and Languages Theory including context-free languages involve tedious tasks that are prone to error when done manually. Conventional instructional methods have many drawbacks when used to teach the numerous dynamic processes found in computer science. Interactive software provides one possible solution to these limitations of traditional teaching and learning resources. One of these is CMAN: Context-free Grammar Manipulator, an interactive pedagogical tool to construct grammars, convert it to normal forms and view its corresponding Pushdown Automaton. It can also test a string for membership using Cocke-Younger-Kasami algorithm then get its leftmost and rightmost derivation. It also provides an interactive graphical environment for parse tree construction. Lectures and assessment examination is also provided for users quick reference.

2010 Mantaring, Mara Tricia Adviser: Ma. Sheila A. Magboo, M.Sc. Generic Pathway Generator (GPG) A metabolic pathway within an organism is a series of enzymatic reactions consuming certain metabolites and producing others. These metabolites do not only play a part in one metabolic pathway but is integrated in others as well thus forming a complex network of reactions [2]. A vast knowledge in both biochemistry and cell physiology is needed in the projection and analysis of this complex network of reactions. Due to the difficulty of understanding and analyzing these pathways, illustrations and graphical representations of the composition of the pathways are needed. The development of the Generic Pathway Generator (GPG) which generates different metabolic pathways helps the users to visualize and understand the relationship of the different enzymes and factors that affect the end products of the different pathways. The results generated by the system can be used to analyze the characteristics of the different organisms that resulted because of the factors in the pathways and the alteration of the different enzymes, genes and other inputs. The development of this pathway website can help the different researchers compile their studies (experiments and projects) and generate pathways based on them. The automation of compilation of their studies can enable users to easily update their experiments and projects. GPG also enables users to easily connect the different pathway studies and see the resulting pathway.