Ensemble Learning of Colorectal Cancer Survival Rates
|
|
|
- Jodie Anderson
- 10 years ago
- Views:
Transcription
1 Ensemble Learning of Colorectal Cancer Survival Rates Chris Roadknight School of Computing Science University of Nottingham Malaysia Campus Malaysia Uwe Aickelin School of Computing Science University of Nottingham John Scholefield Faculty of Medicine & Health Sciences The University of Nottingham Lindy Durrant Faculty of Medicine & Health Sciences The University of Nottingham Abstract In this paper, we describe a dataset relating to cellular and physical conditions of patients who are operated upon to remove colorectal tumours. This data provides a unique insight into immunological status at the point of tumour removal, tumour classification and post-operative survival. We build on existing research on clustering and machine learning facets of this data to demonstrate a role for an ensemble approach to highlighting patients with clearer prognosis parameters. Results for survival prediction using 3 different approaches are shown for a subset of the data which is most difficult to model. The performance of each model individually is compared with subsets of the data where some agreement is reached for multiple models. Significant improvements in model accuracy on an unseen test set can be achieved for patients where agreement between models is achieved. Keywords ensemble learning; anti-learning; colorectal cancer. I. INTRODUCTION Colorectal cancer is the third most commonly diagnosed cancer in the world. Colorectal cancers start in the lining of the bowel and grow into the muscle layers underneath then through the bowel wall [1]. TNM staging involves the Classification of Malignant Tumours Tumour (T). Size of the tumor and whether it has invaded nearby tissue Nodes (N). The extent to which regional lymph nodes involved Metastasis (M). This is the spread of a disease from one organ or part to another non-adjacent organ. 4 TNM stages (I,II,III,IV) are generated by combining these three indicator levels and are allied with increasing severity and decreasing survival rates. Treatment options include minor/major surgery, chemotherapy, radiotherapy but the correct treatment is heavily dependent on the unique features of the tumour which are summarised by the TNM staging. Choosing the correct treatment at this stage is crucial to both the patient s survival and quality of life. A major goal of this research is to automatically optimize the treatment plan based on the existing data. The data for this research was gathered by scientists and clinicians at City Hospital, Nottingham. The dataset we use here is made up of over 200 possible attributes for 462 patients. The attributes are generated by recording metrics at the time of tumour removal, these include: Physical data (age, sex etc) Immunological data (levels of various T Cell subsets) Biochemical data (levels of certain proteins) Retrospective data (post-operative survival statistics) Clinical data (Tumour location, size etc). In the research into the relationship between immune response and tumour staging there has been some support of the hypothesis that the adaptive immune response influences the behavior of human tumors. In situ analysis of tumorinfiltrating immune cells may therefore be a valuable prognostic tool in the treatment of colorectal cancer [2]. The immune and inflammation responses appear to have a role to play in the responses of patients to cancer [3] but the precise nature of this is still unclear. This research brings together earlier findings on the nature of the tumour biomarker dataset and builds an ensemble learning solution that offers improved predictive performance for a subset of patients.
2 II. BACKGROUND The dataset supplied is a biological dataset and as such has a rich complement of pre-processing issues. For instance, over 10% of the values are missing, with some attributes having over 40% missing values and some patients having over 30% missing values. Missing data poses a problem for most modelling techniques. One approach would be to remove every patient or every attribute with any missing data. This would remove a large number of entries, some of which only have a few missing values that are possibly insignificant. Another approach is to average the existing values for each attribute and to insert an average into the missing value space. The appropriate average may be the mean, median or mode depending on the profile of the data, because of this the handling of missing values attributes with a non-linear effect is particularly difficult. Much of the data takes the form of human analysis of biopsy samples stained for various markers. Rather than raw cell counts or measurements of protein levels we are presented with thresholded values. For instance, CD16 [4] is found on the surface of different types of cells such as natural killer, neutrophils, monocytes and macrophages. The data contains a simple 0 or 1 representing the existence or otherwise of a significant number of these cells rather than a count of the number of cells. This kind of manual inspection and simplification is true for most of the data and any modeling solution must work with this limitation. By using a combination of correlation coefficients and expert knowledge the data was reduced down to a set of ~50 attributes. This included removing several measurements that were hindsight dependent (ie. chemo or radio treatment) and correlated with TNM stage. (ie. Dukes stage). The survival of patients with TNM class 2 and 3 tumours, the middle grade categories, is the most difficult to predict, Single attribute relationships exist within the dataset but are not strong. Analysis of single attributes in relation to survival can yield predictions with accuracy of only 59%. III. PREPROCESSING A range of preprocessing methods were applied to the data beyond interpolating. Some patients had extensive missing values, while some attributes were very poorly sampled. There were also some patients who s operation was too recent or who had died of an unrelated cause before the success of the operation could be evaluated. An initial pass followed the following protocol: Removed patients with data for fewer than 50% attributes Removed attributes with data for fewer than 50% patients Removed Patients still alive but for less than 60 months post operation Removed patients who died before the 60 month survival threshold of unrelated reasons Removed TNM related attributes Removed post-operative attributes (eg. Radiotherapy) Removed derivable or correlated attributes Removed compound attributes Next an effort was made to convert non-linear, single attribute relationships into linear ones. In an effort to better represent attributes with non-linear effect we also converted attributes with non-linear effects into linear effects. FLIPL is a good example (figure 2), values of 0 and 3 have poorer prognosis for survival than 1 and 2, thereby giving a non-linear relationship between grade and survival. Merging the values for FLIPL of 0 and 3 into a new group 0 and 1 and 2 into another group 1 means that using the mean value for missing values now has a meaningful representation for the resulting modeling approach. Figure 2. Relationship between survival and FLIPL category A range of supervised [10] and unsupervised [11] method have been applied to this dataset both in predicting TNM classes from immune-histochemical data and predicting survival. The fact that an anti-learning method [10] appears to perform as well as any learning methods for most tasks suggests that there are complex behaviours at work within this dataset, coupled with its high dimensionality, leading to relatively poor prediction results. IV. SUPERVISED ENSEMBLE LEARNING OF SURVIVAL RATES The data used in this paper has detailed information about post-operative survival, so we can model how attribute values effect survival rates. For the purposes of this work a patient is deemed to have survived if they are still alive 5 years after their tumour removal. We initially looked at how linear transformation and attribute selection affected the ability to predict 5 year survival. Details of the neural network modeling approach are given later in this section but at this stage if we just look at the impact of this reduction in non-linearity. We see a much improved performance with survival prediction levels better than the 65% achieved by TNM alone (figure 3). We can also see that using the top 8 attributes selected by SVM attribute selection [18] provides the best performance.
3 age, these 8 also included T Cell levels for CD24 and Tstroma and chemicals such as Interleukin 17 (IL17) signalling the expression of certain genes such as P27. The SVM based attribute selection method uses recursive feature elimination where the SVM is trained multiple times and the attributes subsequently evaluated post-training. We have already shown that this dataset shows anti-learning properties [8], whereby predicting TNM stage from immunohistochemical markers is best tackled using an anti-learning approach. An anti-learning approach [9, 10], using the bottom 6 attributes from the SVM attribute selection approach (table 1) achieves a 61.3% prediction rate. This included indicators such as the patients sex and levels of antibodies like clxcl10 and cxcr4. Figure 3. Prediction accuracy with an increasing number of attributes with and without non-linearity removal Several of the attributes presented in the dataset pertain to the survival of the patients after their operation to remove the tumour. The number of months the patient has survived, whether they are still alive or not and how they died (if dead) are all available. Figure 4 shows survival curves for patients up to 6 months. The strong difference between survival rates in TNM stage 1 and 4 patients is apparent (ie. at 30 month the survival rate is approximately 95% and 5% for these 2 groups). The difference between patients with TNM stage 2 and 3 cancers is less apparent. After 30 months deaths from colorectal cancer for TNM stage 1 patients increase quite quickly, in percentage terms steeper than any other TNM class. These three approaches use different attributes to perform their prediction and there are many inconsistencies between the predictions. This can be used to our advantage by measuring performance, in an ensemble manner, on patients where 2 or all 3 of the models agree. This agreement will be with a subset of patients so will not apply to every patient, but the ability to give SOME patients a BETTER prediction of survival is extremely valuable. Conversely, where there isn t agreement the possibility of being less confident about a prediction could also be useful. Table 2 shows the individual performance of each model and also the performance when models agree. For example, when the simple TNM model and the neural network model for the top 8 attributes agree, we achieve a 79.2% prediction for survival. This can be viewed logically as: If Learned Neural model predicts a patient will survive 5 years AND Patient is TNM type 2 OR If Learned Neural model predicts a patient will not survive five years AND Patient is TNM type 3 THEN We can be ~80% confident that our prediction is correct ELSE Patient survival cannot be predicted with any reliable accuracy Figure 4. Survival Curves for patients at all 4 TNM stages The trained neural network used for this predictive task is shown in figure 5. Again focusing on just TNM stage 2 and 3 patients we attempted to predict survival using both AI techniques and the TNM stage itself. The term survival is somewhat subjective but for the purposes of this work we used 5 years as the threshold for survival. Using the TNM stage alone to predict survival gives an accuracy of 64.6% (155 correct from 240). This is achieved by stating that all type 2 tumour patients will survive and all type 3 will not. If we use the cell marker statistics with patient physical details (such as age and sex) we can achieve a slightly higher predictive performance of 65.4% on the unseen test by using the top 8 attributes (table 1) selected by SVM attribute selection [12] followed by a trained Neural network [4]. As well as the physical characteristic of
4 Table 2. Accuracy of models and subsets of patients by SVM attribute selection [12] The dataset can now be divided into 4 distinct subgroup, each with a corresponding survival rate: Figure 5. Structure and weights of a survival predicting artificial neural network. While the non-linearity of a neural network decision making is notoriously difficult to glean information from, the relative simplicity of this neural approach means equation synthesis [13] yields formulas with some qualitative value, achieving predictive values of ~60% where models agree 1. TNM 2 Patients predicted to survive by the 2. TNM 2 Patients predicted not to survive by the 3. TNM 3 Patients predicted to survive by the 4. TNM 3 Patients predicted not to survive by the One interesting observation about this is that survival of TNM 3 patients with a positive prognosis from the model have a better survival rate that TNM 2 patients with a negative prognosis. This is shown more clearly by plotting the 4 corresponding survival curves (fig. 6) Table 1. Top 8 and bottom 6 attributes as produced Figure 6. Survival curves for 4 distinct patient sets grouped on TNM stage and model prognosis
5 V. CONCLUSIONS We have presented results for a unique dataset based on the biochemical and factors associated with colorectal tumour patients. This dataset is limited in many ways, but extremely important nonetheless and modeling any relationships or features based on the dataset to hand is an urgent priority. Generally, whether attempting to predict TNM stages or survival, patients at TNM stage 1 and 4 have more clear indicators in the attribute set. TNM stage 2 and 3 provides a much more challenging prediction task, so much so that the TNM stage appears much less important when predicting survival for these 2 stages than other indicators. The dataset used here has several drawbacks when it comes to building robust cause-effect models. The 2 main issues are the missing values and the high dimensionality. By preprocessing the dataset to convert attributes that may have a non-linear effect into linear attributes missing values can be represented as means of all values with a more stable meaning. This pre-processing step also makes resulting machine learning approach more capable of reaching global minima. The second issue of high dimensionality is tackled by using an attribute selection approach first demonstrated on another cancer dataset. We not only use the most highly ranked attributes here but by using anti-learning on the lowest ranked attributes we can produce a another, distinct model of the data. When looking specifically at survival, using 3 unique predictive approaches allows us to compare predictions from all three approaches. The amount of differences between the 3 approaches could be inferred from the lack of relationships between TNM staging and the immunohistochemistry [8] and the lack of a relationship between unsupervised clusters and TNM stages [11]. So for the 240 patients, there was only agreement between 2 of the approaches for between 103 and 140 patients but when agreement occurred, predictive performance increased up to a maximum of 82.5%. This level of accuracy for predicting survival of patients with TNM stage 2 and 3 tumours is unprecedented, albeit on a small subset of patients. This approach offer an important opportunity for clinicians to improve prognosis for patients but also, any information gained by analysing the agreeing models can be fed back to researcher in their endeavours to improve treatment options at a cellular level. [6] I. Guyon, J. Weston, S. Barnhill and V. Vapnik (2002). Gene selection for cancer classification using support vector machines. Machine Learning. 46: [7] C.M. Roadknight, G.R. Balls, G.E. Mills and D Palmer-Brown, Modeling complex environmental data. IEEE Transactions on Neural Networks 8 (4), [8] C. Roadknight, U. Aickelin, G. Qiu, J. Scholefield and L. Durrant. Supervised Learning and Anti-learning of Colorectal Cancer Classes and Survival Rates from Cellular Biology Parameters. Systems, Man and Cybernetics 2012 [9] A. Kowalczyk, Classification of anti-learnable biological and synthetic data, Proc. PKDD, pp [10] Kowalczyk, Adam, et al. "Validation of Anti-learnable Signature in Classifcation of Response to Chemoradiotherapy in Esophageal Adenocarcinoma Patients." Proc. Intern. Symp. on Optimization and Systems Biology, OSB [11] C. Roadknight, U. Aickelin, A, Ladas, D. Soria J. Scholefield and L. Durrant, Biomarker Clustering of Colorectal Cancer Data to Complement Clinical Classification. Federated Conference on Computer Science and Information Systems. Wrocaw, Poland, 9-12 September, 2012 [12] I. Guyon, J. Weston, S. Barnhill and V. Vapnik (2002). Gene selection for cancer classification using support vector machines. Machine Learning. 46: [13] C.M. Roadknight, G.R. Balls, G.E. Mills and D Palmer-Brown, Modeling complex environmental data. IEEE Transactions on Neural Networks 8 (4), REFERENCES [1] S. Hamilton and L. Aaltonen. Pathology and Genetics of Tumors of the Digestive System. (New York: WHO Publications, 2000). [2] J. Galon et al. Type, density, and location of immune cells within human colorectal tumors predict clinical outcome. (2006) Science 313, [3] S. Grivennikov, F. Greten and M. Karin, Immunity, Inflammation, and Cancer, Cell, Volume 140, Issue 6, 2010, Pages [4] C. M. Bishop, Neural Networks for Pattern Recognition. (Clarendon Press, Oxford, 1995) [5] M. Hall et al (2009); The WEKA Data Mining Software: An Update; SIGKDD Explorations, Volume 11, Issue 1.
Report series: General cancer information
Fighting cancer with information Report series: General cancer information Eastern Cancer Registration and Information Centre ECRIC report series: General cancer information Cancer is a general term for
BIOINF 585 Fall 2015 Machine Learning for Systems Biology & Clinical Informatics http://www.ccmb.med.umich.edu/node/1376
Course Director: Dr. Kayvan Najarian (DCM&B, [email protected]) Lectures: Labs: Mondays and Wednesdays 9:00 AM -10:30 AM Rm. 2065 Palmer Commons Bldg. Wednesdays 10:30 AM 11:30 AM (alternate weeks) Rm.
EFFICIENT DATA PRE-PROCESSING FOR DATA MINING
EFFICIENT DATA PRE-PROCESSING FOR DATA MINING USING NEURAL NETWORKS JothiKumar.R 1, Sivabalan.R.V 2 1 Research scholar, Noorul Islam University, Nagercoil, India Assistant Professor, Adhiparasakthi College
Robert Bristow MD PhD FRCPC
Robert Bristow MD PhD FRCPC Clinician-Scientist and Professor, Radiation Oncology and Medical Biophysics, University of Toronto and Ontario Cancer Institute/ (UHN) Head, PMH-CFCRI Prostate Cancer Research
Data Mining Techniques for Prognosis in Pancreatic Cancer
Data Mining Techniques for Prognosis in Pancreatic Cancer by Stuart Floyd A Thesis Submitted to the Faculty of the WORCESTER POLYTECHNIC INSTITUE In partial fulfillment of the requirements for the Degree
Treating Melanoma S kin Cancer A Quick Guide
Treating Melanoma Skin Cancer A Quick Guide Contents This is a brief summary of the information on Treating melanoma skin cancer from our website. You will find more detailed information on the website.
The recommendations made throughout this book are by the National Health and Medical Research Council (NHMRC).
INTRODUCTION This book has been prepared for people with bowel cancer, their families and friends. The first section is for people with bowel cancer, and is intended to help you understand what bowel cancer
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
One out of every two men and one out of every three women will have some type of cancer at some point during their lifetime. 3
1. What is cancer? 2. What causes cancer?. What causes cancer? 3. Can cancer be prevented? The Facts One out of every two men and one out of every three women will have some type of cancer at some point
Lymph Nodes and Cancer What is the lymph system?
Lymph Nodes and Cancer What is the lymph system? Our bodies have a network of lymph vessels and lymph nodes. (Lymph is pronounced limf.) This network is a part of the body s immune system. It collects
Measures of Prognosis. Sukon Kanchanaraksa, PhD Johns Hopkins University
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
Medical Informatics II
Medical Informatics II Zlatko Trajanoski Institute for Genomics and Bioinformatics Graz University of Technology http://genome.tugraz.at [email protected] Medical Informatics II Introduction
Knowledge Discovery from patents using KMX Text Analytics
Knowledge Discovery from patents using KMX Text Analytics Dr. Anton Heijs [email protected] Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers
A Regression Approach for Forecasting Vendor Revenue in Telecommunication Industries
A Regression Approach for Forecasting Vendor Revenue in Telecommunication Industries Aida Mustapha *1, Farhana M. Fadzil #2 * Faculty of Computer Science and Information Technology, Universiti Tun Hussein
Targeting Specific Cell Signaling Pathways for the Treatment of Malignant Peritoneal Mesothelioma
The Use of Kinase Inhibitors: Translational Lab Results Targeting Specific Cell Signaling Pathways for the Treatment of Malignant Peritoneal Mesothelioma Sheelu Varghese, Ph.D. H. Richard Alexander, M.D.
Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data
CMPE 59H Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data Term Project Report Fatma Güney, Kübra Kalkan 1/15/2013 Keywords: Non-linear
Smoking and misuse of certain pain medicines can affect the risk of developing renal cell cancer.
Renal cell cancer Renal cell cancer is a disease in which malignant (cancer) cells form in tubules of the kidney. Renal cell cancer (also called kidney cancer or renal adenocarcinoma) is a disease in which
Appendix One. HER2-positive early breast cancer, its treatment and prognosis
Appendix One. HER2-positive early breast cancer, its treatment and prognosis Breast cancer and HER2/neu over-expression Health need is one of PHARMAC s nine decision criteria (http://www.pharmac.govt.nz/pdf/231205.pdf
Frequently Asked Questions About Ovarian Cancer
Media Contact: Gerri Gomez Howard Cell: 303-748-3933 [email protected] Frequently Asked Questions About Ovarian Cancer What is ovarian cancer? Ovarian cancer is a cancer that forms in tissues
Analysis of the colorectal tumor microenvironment using integrative bioinformatic tools
MLECNIK Bernhard & BINDEA Gabriela Analysis of the colorectal tumor microenvironment using integrative bioinformatic tools INSERM U872, Jérôme Galon Team15: Integrative Cancer Immunology Cordeliers Research
Malignant Lymphomas and Plasma Cell Myeloma
Malignant Lymphomas and Plasma Cell Myeloma Dr. Bruce F. Burns Dept. of Pathology and Lab Medicine Overview definitions - lymphoma lymphoproliferative disorder plasma cell myeloma pathogenesis - translocations
Data quality in Accounting Information Systems
Data quality in Accounting Information Systems Comparing Several Data Mining Techniques Erjon Zoto Department of Statistics and Applied Informatics Faculty of Economy, University of Tirana Tirana, Albania
Chapter 6. The stacking ensemble approach
82 This chapter proposes the stacking ensemble approach for combining different data mining classifiers to get better performance. Other combination techniques like voting, bagging etc are also described
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree S.Florence 1, N.G.Bhuvaneswari Amma 2, G.Annapoorani 3, K.Malathi 4 PG Scholar, Indian Institute of Information Technology, Srirangam,
The TV Series. www.healthybodyhealthymind.com INFORMATION TELEVISION NETWORK
The TV Series www.healthybodyhealthymind.com Produced By: INFORMATION TELEVISION NETWORK ONE PARK PLACE 621 NW 53RD ST BOCA RATON, FL 33428 1-800-INFO-ITV www.itvisus.com 2005 Information Television Network.
OBJECTIVES By the end of this segment, the community participant will be able to:
Cancer 101: Cancer Diagnosis and Staging Linda U. Krebs, RN, PhD, AOCN, FAAN OCEAN Native Navigators and the Cancer Continuum (NNACC) (NCMHD R24MD002811) Cancer 101: Diagnosis & Staging (Watanabe-Galloway
DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.
DATA MINING TECHNOLOGY Georgiana Marin 1 Abstract In terms of data processing, classical statistical models are restrictive; it requires hypotheses, the knowledge and experience of specialists, equations,
co-sponsored by the Health & Physical Education Department, the Health Services Office, and the Student Development Center
Cancer is a group of more than 100 related diseases. Normally, cells grow and divide to produce more cells to keep the body healthy. Sometimes, this process goes wrong. New cells form when the body doesn
Comparison of K-means and Backpropagation Data Mining Algorithms
Comparison of K-means and Backpropagation Data Mining Algorithms Nitu Mathuriya, Dr. Ashish Bansal Abstract Data mining has got more and more mature as a field of basic research in computer science and
MS1b Statistical Data Mining
MS1b Statistical Data Mining Yee Whye Teh Department of Statistics Oxford http://www.stats.ox.ac.uk/~teh/datamining.html Outline Administrivia and Introduction Course Structure Syllabus Introduction to
How To Use Neural Networks In Data Mining
International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and
Analysis of Prostate Cancer at Easter Connecticut Health Network Using Cancer Registry Data
The 2014 Cancer Program Annual Public Reporting of Outcomes/Annual Site Analysis Statistical Data from 2013 More than 70 percent of all newly diagnosed cancer patients are treated in the more than 1,500
Operations Research and Knowledge Modeling in Data Mining
Operations Research and Knowledge Modeling in Data Mining Masato KODA Graduate School of Systems and Information Engineering University of Tsukuba, Tsukuba Science City, Japan 305-8573 [email protected]
Mathematical Models of Supervised Learning and their Application to Medical Diagnosis
Genomic, Proteomic and Transcriptomic Lab High Performance Computing and Networking Institute National Research Council, Italy Mathematical Models of Supervised Learning and their Application to Medical
International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015
RESEARCH ARTICLE OPEN ACCESS Data Mining Technology for Efficient Network Security Management Ankit Naik [1], S.W. Ahmad [2] Student [1], Assistant Professor [2] Department of Computer Science and Engineering
Title: Immunohistochemical staining of radixin and moesin in prostatic adenocarcinoma
Author's response to reviews Title: Immunohistochemical staining of radixin and moesin in prostatic adenocarcinoma Authors: Tanner L Bartholow ([email protected]) Uma R Chandran ([email protected])
Manjeet Kaur Bhullar, Kiranbir Kaur Department of CSE, GNDU, Amritsar, Punjab, India
Volume 5, Issue 6, June 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Multiple Pheromone
International Journal of Software and Web Sciences (IJSWS) www.iasir.net
International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) ISSN (Print): 2279-0063 ISSN (Online): 2279-0071 International
The following information is only meant for people who have been diagnosed with advanced non-small cell
Important information for people with advanced non-small cell lung cancer The following information is only meant for people who have been diagnosed with advanced non-small cell lung cancer (NSCLC). NSCLC
Feature Selection using Integer and Binary coded Genetic Algorithm to improve the performance of SVM Classifier
Feature Selection using Integer and Binary coded Genetic Algorithm to improve the performance of SVM Classifier D.Nithya a, *, V.Suganya b,1, R.Saranya Irudaya Mary c,1 Abstract - This paper presents,
CS 2750 Machine Learning. Lecture 1. Machine Learning. http://www.cs.pitt.edu/~milos/courses/cs2750/ CS 2750 Machine Learning.
Lecture Machine Learning Milos Hauskrecht [email protected] 539 Sennott Square, x5 http://www.cs.pitt.edu/~milos/courses/cs75/ Administration Instructor: Milos Hauskrecht [email protected] 539 Sennott
Early Prostate Cancer: Questions and Answers. Key Points
CANCER FACTS N a t i o n a l C a n c e r I n s t i t u t e N a t i o n a l I n s t i t u t e s o f H e a l t h D e p a r t m e n t o f H e a l t h a n d H u m a n S e r v i c e s Early Prostate Cancer:
Fact sheet 10. Borderline ovarian tumours. The difficult cases. What is borderline ovarian cancer (BOC)?
For this reason, some doctors prefer the term borderline ovarian tumour rather than borderline ovarian cancer. Fact sheet 10 Borderline ovarian tumours We, Ovacome, are a support network for people affected
Lung cancer is not just one disease. There are two main types of lung cancer:
1. What is lung cancer? 2. How common is lung cancer? 3. What are the risk factors for lung cancer? 4. What are the signs and symptoms of lung cancer? 5. How is lung cancer diagnosed? 6. What are the available
SURVIVABILITY ANALYSIS OF PEDIATRIC LEUKAEMIC PATIENTS USING NEURAL NETWORK APPROACH
330 SURVIVABILITY ANALYSIS OF PEDIATRIC LEUKAEMIC PATIENTS USING NEURAL NETWORK APPROACH T. M. D.Saumya 1, T. Rupasinghe 2 and P. Abeysinghe 3 1 Department of Industrial Management, University of Kelaniya,
ENHANCED CONFIDENCE INTERPRETATIONS OF GP BASED ENSEMBLE MODELING RESULTS
ENHANCED CONFIDENCE INTERPRETATIONS OF GP BASED ENSEMBLE MODELING RESULTS Michael Affenzeller (a), Stephan M. Winkler (b), Stefan Forstenlechner (c), Gabriel Kronberger (d), Michael Kommenda (e), Stefan
How common is bowel cancer?
information Primary Care Society for Gastroenterology Bowel Cancer (1 of 6) How common is bowel cancer? Each year 35,000 people in Britain are diagnosed with cancer of the bowel, that is to say cancer
Breast Cancer. Presentation by Dr Mafunga
Breast Cancer Presentation by Dr Mafunga Breast cancer in the UK Breast cancer is the second most common cancer in women. Around 1 in 9 women will develop breast cancer It most commonly affects women over
Statistics for BIG data
Statistics for BIG data Statistics for Big Data: Are Statisticians Ready? Dennis Lin Department of Statistics The Pennsylvania State University John Jordan and Dennis K.J. Lin (ICSA-Bulletine 2014) Before
Visualization of Breast Cancer Data by SOM Component Planes
International Journal of Science and Technology Volume 3 No. 2, February, 2014 Visualization of Breast Cancer Data by SOM Component Planes P.Venkatesan. 1, M.Mullai 2 1 Department of Statistics,NIRT(Indian
An Introduction To Immunotherapy And The Promise Of Tissue Phenomics
An Introduction To Immunotherapy And The Promise Of Tissue Phenomics INSIDE: n The Potential of Immunotherapy n Towards an Understanding of Immunotherapy n Current Approaches to Immunotherapy n The Immunotherapy
A Local Stability of Mathematical Models for Cancer Treatment by Using Gene Therapy
International Journal of Modeling Optimization, Vol 5, No 3, June 2015 A Local Stability of Mathematical Models for Cancer Treatment by Using Gene Therapy Dwi Lestari Ratnasari Dwi Ambarwati Abstract This
Introduction to Pattern Recognition
Introduction to Pattern Recognition Selim Aksoy Department of Computer Engineering Bilkent University [email protected] CS 551, Spring 2009 CS 551, Spring 2009 c 2009, Selim Aksoy (Bilkent University)
specific B cells Humoral immunity lymphocytes antibodies B cells bone marrow Cell-mediated immunity: T cells antibodies proteins
Adaptive Immunity Chapter 17: Adaptive (specific) Immunity Bio 139 Dr. Amy Rogers Host defenses that are specific to a particular infectious agent Can be innate or genetic for humans as a group: most microbes
Lung Cancer & Mesothelioma 2011-2015
Lung Cancer & Mesothelioma 2011-2015 Annex G Mesothelioma 1. The vision for mesothelioma services is set out in the Mesothelioma Framework issued by DH on 26 February 2007 (supported by the British Thoracic
COMPARING NEURAL NETWORK ALGORITHM PERFORMANCE USING SPSS AND NEUROSOLUTIONS
COMPARING NEURAL NETWORK ALGORITHM PERFORMANCE USING SPSS AND NEUROSOLUTIONS AMJAD HARB and RASHID JAYOUSI Faculty of Computer Science, Al-Quds University, Jerusalem, Palestine Abstract This study exploits
ADVANCED MACHINE LEARNING. Introduction
1 1 Introduction Lecturer: Prof. Aude Billard ([email protected]) Teaching Assistants: Guillaume de Chambrier, Nadia Figueroa, Denys Lamotte, Nicola Sommer 2 2 Course Format Alternate between: Lectures
Support Program for Improving Graduate School Education Advanced Education Program for Integrated Clinical, Basic and Social Medicine
Support Program for Improving Graduate School Education Advanced Education Program for Integrated Clinical, Basic and Social Medicine January 27, 2009 Dear Professors (representative) of departments, Subject:
Scalable Developments for Big Data Analytics in Remote Sensing
Scalable Developments for Big Data Analytics in Remote Sensing Federated Systems and Data Division Research Group High Productivity Data Processing Dr.-Ing. Morris Riedel et al. Research Group Leader,
Management of Non-Small Cell Lung Cancer Guide for General Practitioners
Management of n-small Cell Lung Cancer Guide for General Practitioners Clinical Stage I Cancer only in one lobe of lung and
Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms
Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms Scott Pion and Lutz Hamel Abstract This paper presents the results of a series of analyses performed on direct mail
Mesothelioma: Questions and Answers
CANCER FACTS N a t i o n a l C a n c e r I n s t i t u t e N a t i o n a l I n s t i t u t e s o f H e a l t h D e p a r t m e n t o f H e a l t h a n d H u m a n S e r v i c e s Mesothelioma: Questions
Chapter 12 Discovering New Knowledge Data Mining
Chapter 12 Discovering New Knowledge Data Mining Becerra-Fernandez, et al. -- Knowledge Management 1/e -- 2004 Prentice Hall Additional material 2007 Dekai Wu Chapter Objectives Introduce the student to
The Ontario Cancer Registry moves to the 21 st Century
The Ontario Cancer Registry moves to the 21 st Century Rebuilding the OCR Public Health Ontario Grand Rounds Oct. 14, 2014 Diane Nishri, MSc Mary Jane King, MPH, CTR Outline 1. What is the Ontario Cancer
Learning Example. Machine learning and our focus. Another Example. An example: data (loan application) The data and the goal
Learning Example Chapter 18: Learning from Examples 22c:145 An emergency room in a hospital measures 17 variables (e.g., blood pressure, age, etc) of newly admitted patients. A decision is needed: whether
Social Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
Introduction Breast cancer is cancer that starts in the cells of the breast. Breast cancer happens mainly in women. But men can get it too.
Male Breast Cancer Introduction Breast cancer is cancer that starts in the cells of the breast. Breast cancer happens mainly in women. But men can get it too. Many people do not know that men can get breast
Applying Multiple Neural Networks on Large Scale Data
0 International Conference on Inforation and Electronics Engineering IPCSIT vol6 (0) (0) IACSIT Press, Singapore Applying Multiple Neural Networks on Large Scale Data Kritsanatt Boonkiatpong and Sukree
Azure Machine Learning, SQL Data Mining and R
Azure Machine Learning, SQL Data Mining and R Day-by-day Agenda Prerequisites No formal prerequisites. Basic knowledge of SQL Server Data Tools, Excel and any analytical experience helps. Best of all:
CyTOF2. Mass cytometry system. Unveil new cell types and function with high-parameter protein detection
CyTOF2 Mass cytometry system Unveil new cell types and function with high-parameter protein detection DISCOVER MORE. IMAGINE MORE. MASS CYTOMETRY. THE FUTURE OF CYTOMETRY TODAY. Mass cytometry resolves
BREAST CANCER PATHOLOGY
BREAST CANCER PATHOLOGY FACT SHEET Version 4, Aug 2013 This fact sheet was produced by Breast Cancer Network Australia with input from The Royal College of Pathologists of Australasia I m a nurse and know
Prostate Cancer. Treatments as unique as you are
Prostate Cancer Treatments as unique as you are UCLA Prostate Cancer Program Prostate cancer is the second most common cancer among men. The UCLA Prostate Cancer Program brings together the elements essential
Credit Card Fraud Detection Using Self Organised Map
International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 13 (2014), pp. 1343-1348 International Research Publications House http://www. irphouse.com Credit Card Fraud
ductal carcinoma in situ (DCIS)
Understanding ductal carcinoma in situ (DCIS) and deciding about treatment Understanding ductal carcinoma in situ (DCIS) and deciding about treatment Developed by National Breast and Ovarian Cancer Centre
Understanding ductal carcinoma in situ (DCIS) and deciding about treatment
Understanding ductal carcinoma in situ (DCIS) and deciding about treatment Developed by National Breast and Ovarian Cancer Centre Funded by the Australian Government Department of Health and Ageing Understanding
Effects of Herceptin on circulating tumor cells in HER2 positive early breast cancer
Effects of Herceptin on circulating tumor cells in HER2 positive early breast cancer J.-L. Zhang, Q. Yao, J.-H. Chen,Y. Wang, H. Wang, Q. Fan, R. Ling, J. Yi and L. Wang Xijing Hospital Vascular Endocrine
Web Document Clustering
Web Document Clustering Lab Project based on the MDL clustering suite http://www.cs.ccsu.edu/~markov/mdlclustering/ Zdravko Markov Computer Science Department Central Connecticut State University New Britain,
Machine Learning. Mausam (based on slides by Tom Mitchell, Oren Etzioni and Pedro Domingos)
Machine Learning Mausam (based on slides by Tom Mitchell, Oren Etzioni and Pedro Domingos) What Is Machine Learning? A computer program is said to learn from experience E with respect to some class of
2. IMPLEMENTATION. International Journal of Computer Applications (0975 8887) Volume 70 No.18, May 2013
Prediction of Market Capital for Trading Firms through Data Mining Techniques Aditya Nawani Department of Computer Science, Bharati Vidyapeeth s College of Engineering, New Delhi, India Himanshu Gupta
Colorectal Cancer Treatment
Scan for mobile link. Colorectal Cancer Treatment Colorectal cancer overview Colorectal cancer, also called large bowel cancer, is the term used to describe malignant tumors found in the colon and rectum.
ANALYSIS OF FEATURE SELECTION WITH CLASSFICATION: BREAST CANCER DATASETS
ANALYSIS OF FEATURE SELECTION WITH CLASSFICATION: BREAST CANCER DATASETS Abstract D.Lavanya * Department of Computer Science, Sri Padmavathi Mahila University Tirupati, Andhra Pradesh, 517501, India [email protected]
Chapter 13. The hospital-based cancer registry
Chapter 13. The hospital-based cancer registry J.L. Young California Tumor Registry, 1812 14th Street, Suite 200, Sacramento, CA 95814, USA Introduction The purposes of a hospital-based cancer registry
Oncos Therapeutics: ONCOS THERAPEUTICS Personalized Cancer Immunotherapy. March 2015. Antti Vuolanto, COO and co-founder
Oncos Therapeutics: Personalized Cancer Immunotherapy ONCOS THERAPEUTICS Personalized Cancer Immunotherapy March 2015 Antti Vuolanto, COO and co-founder 1 History of Oncos Therapeutics 2002 2007 2009 Research
Learning to Process Natural Language in Big Data Environment
CCF ADL 2015 Nanchang Oct 11, 2015 Learning to Process Natural Language in Big Data Environment Hang Li Noah s Ark Lab Huawei Technologies Part 1: Deep Learning - Present and Future Talk Outline Overview
Mesothelioma. 1995-2013, The Patient Education Institute, Inc. www.x-plain.com ocft0101 Last reviewed: 03/21/2013 1
Mesothelioma Introduction Mesothelioma is a type of cancer. It starts in the tissue that lines your lungs, stomach, heart, and other organs. This tissue is called mesothelium. Most people who get this
Forecasting Trade Direction and Size of Future Contracts Using Deep Belief Network
Forecasting Trade Direction and Size of Future Contracts Using Deep Belief Network Anthony Lai (aslai), MK Li (lilemon), Foon Wang Pong (ppong) Abstract Algorithmic trading, high frequency trading (HFT)
Data Mining On Diabetics
Data Mining On Diabetics Janani Sankari.M 1,Saravana priya.m 2 Assistant Professor 1,2 Department of Information Technology 1,Computer Engineering 2 Jeppiaar Engineering College,Chennai 1, D.Y.Patil College
Uses of Flow Cytometry
Uses of Flow Cytometry 1. Multicolour analysis... 2 2. Cell Cycle and Proliferation... 3 a. Analysis of Cellular DNA Content... 4 b. Cell Proliferation Assays... 5 3. Immunology... 6 4. Apoptosis... 7
MHI3000 Big Data Analytics for Health Care Final Project Report
MHI3000 Big Data Analytics for Health Care Final Project Report Zhongtian Fred Qiu (1002274530) http://gallery.azureml.net/details/81ddb2ab137046d4925584b5095ec7aa 1. Data pre-processing The data given
THREE DIMENSIONAL REPRESENTATION OF AMINO ACID CHARAC- TERISTICS
THREE DIMENSIONAL REPRESENTATION OF AMINO ACID CHARAC- TERISTICS O.U. Sezerman 1, R. Islamaj 2, E. Alpaydin 2 1 Laborotory of Computational Biology, Sabancı University, Istanbul, Turkey. 2 Computer Engineering
Radiotherapy in locally advanced & metastatic NSC lung cancer
Radiotherapy in locally advanced & metastatic NSC lung cancer Dr Raj Hegde. MD. FRANZCR Consultant Radiation Oncologist. William Buckland Radiotherapy Centre. Latrobe Regional Hospital. Locally advanced
New strategies in anticancer therapy
癌 症 診 療 指 引 簡 介 及 臨 床 應 用 New strategies in anticancer therapy 中 山 醫 學 大 學 附 設 醫 院 腫 瘤 內 科 蔡 明 宏 醫 師 2014/3/29 Anti-Cancer Therapy Surgical Treatment Radiotherapy Chemotherapy Target Therapy Supportive
Data Mining and Machine Learning in Bioinformatics
Data Mining and Machine Learning in Bioinformatics PRINCIPAL METHODS AND SUCCESSFUL APPLICATIONS Ruben Armañanzas http://mason.gmu.edu/~rarmanan Adapted from Iñaki Inza slides http://www.sc.ehu.es/isg
Classification algorithm in Data mining: An Overview
Classification algorithm in Data mining: An Overview S.Neelamegam #1, Dr.E.Ramaraj *2 #1 M.phil Scholar, Department of Computer Science and Engineering, Alagappa University, Karaikudi. *2 Professor, Department
