Automatic Text Analysis Using Drupal
|
|
|
- Robert Mosley
- 9 years ago
- Views:
Transcription
1 Automatic Text Analysis Using Drupal By Herman Chai Computer Engineering California Polytechnic State University, San Luis Obispo Advised by Dr. Foaad Khosmood June 14, 2013 Abstract Natural language processing (NLP) is a field of computer science that is concerned with the interpretation of human language by computers. NLP has a multitude of different applications in the fields of computer science, artificial intelligence, and linguistics. The Automatic Text Analysis Using Drupal project is intended to increase the availability of NLP tools that can be easily used by the general public. By integrating a front-end content management system like Drupal with different back-end NLP applications, users can receive automatic analysis of their text without knowledge of the system. 1
2 Contents 1 Introduction Automatic Text Analysis Using Drupal Background Drupal Natural Language Processing Features Selection and Machine Learning Project Description and Architecture Results Data Sample Run Conclusion
3 1 Introduction 1.1 Automatic Text Analysis Using Drupal The Automatic Text Analysis Using Drupal project is intended to increase the availability of NLP tools that can be easily used by the general public. By integrating a front-end content management system like Drupal with different back-end NLP applications, users can receive automatic analysis of their text without knowledge of the system. Using Drupal s module system, the Python Natural Language Toolkit (NLTK) and the Stanford Parser can be integrated and used in order to provide statistical analysis to user-supplied text. 2 Background 2.1 Drupal Drupal is an open source content management platform used in websites and applications. A standard installation of Drupal, known as Drupal core, contains basic features that include user account registration, menu management, system administration, and other features common to a content management platform. Using add-ons, known as modules, a user can modify the standard Drupal installation and add different features or themes to their website. The Drupal system is built upon the open source Web development platform known as a LAMP stack (Figure 1). LAMP is short for Linux, Apache, MySQL, and PHP. In this case, Drupal uses PHP to interface with a MySQL database that is hosted on an Apache server running on Linux. By combining the open source software a developer is able to create dynamic Web sites or other Web applications. Internet Browser Apache Module Drupal/ PHP SQL Linux Figure 1: Basic Drupal Architecture 3
4 Drupal is built and supported by an active community that contributes thousands of different themes and modules. These modules are developed using the PHP scripting language that is suited for web development. Drupal s module system is based on the concept of hooks which allow modules to interact with the Drupal core. A hook is a PHP function that has a defined set of parameters, a specified result type, and once implemented performs a specific action. To extend on the Drupal core, a module simply needs to implement any of the available hooks that are described in the Drupal developer documentation. 2.2 Natural Language Processing Natural language processing (NLP) is a field of computer science that is concerned with the interpretation of human language by computers. NLP has a multitude of different applications in the fields of computer science, artificial intelligence, and linguistics. Many of these applications require the use of statistics in characterizing text. These statistics can be obtained using NLP applications like the Python Natural Language Toolkit (NLTK) and the Stanford Parser. The NTLK is a package that can be downloaded and used in Python programs in order to work with human language data. It contains libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning. By using the NLTK, developers can characterize text in terms of statistics and from there use those statistics in other NLP applications. The Stanford Parser is a package that contains Java implementations of different natural language parsers. Parsers are NLP tools that generate parse trees of user-supplied text. A parse tree is an ordered tree that represents the grammatical structure of a sentence. The tree describes which groups of words go together and the subjects and objects of the sentence. 2.3 Features Selection and Machine Learning Machine learning is the construction of systems that can learn from or be trained with data. In relation to NLP, a machine learning system could be trained for a multitude of different applications. These could include authorship attribution which is the process of determining an author of a work based upon characteristics found in other works, speech recognition which is the adaptation of spoken word into text, or any number of other applications. Machine learning uses a process known as feature selection in order to select relevant features for the construction of the machine learning model and system. In this particular project, one of the parsers that can be found in the Stanford Parser package was trained with data known as the Penn Treebank. Training a parser with the Penn Treebank allows it to learn a collection of syntactically annotated data and construct a machine learning model. The parser would then be able to create parse trees of sentences based on the data it was trained with. The statistics obtained from this project could potentially be used as part of the feature selection process in an NLP application. The statistics could be used as characteristics for different works and used to train a machine learning system. 4
5 3 Project Description and Architecture Drupal Drupal Node Content + Text Analysis Drupal Node Content Temporary Node Content Output Module Temporary Text Analysis Output Shell Exec Node Content Python NLPStats Stanford Parser Parsed/ Tokenized Content Parse Tree Node Content NLTK Figure 2: Module Architecture The module developed for this project initially did all of its text analysis directly, however as development progressed the text analysis was outsourced to the NLPStats Python program in order to make use of the NLTK. The general architecture for the module (Figure 2) is relatively simple with the majority of the functionality being completed by the NLPStats program. The module implements hooks to pull the node content from the database then strips the HTML tags from the content and outputs it to a temporary text file. The module then calls the PHP shell_exec command to execute Python and run the NLPStats program. The NLPStats program takes in the temporary text file and other arguments and performs statistical analysis on the text using the NLTK. During this process it also passes the content of the text to the Stanford Parser which generates a parse tree of each sentence in the text. The results of the Stanford Parser are then returned to NLPStats and compiled into the rest of the results of the analysis. The results of the analysis are then output to a temporary text file that is read by the module. The module then returns the node content along with the results of the analysis back to Drupal to be displayed on the Web site. 4 Results 4.1 Data In its current state the project is able to analyze and output approximately 17 different features of the text, however there are several other features that could still be analyzed, many of which could be obtained by analyzed the parse trees generated by the Stanford Parser. Currently the 5
6 parse trees are only used for one of the features that are output to the user. The features can be grouped into three different types (Figure 3) that include the lengths of the text in terms of number of characters, words, syllables, etc., data obtained from analyzing the parse trees, and calculated readability measures using the lengths of the text. 4.2 Sample Run Type Parse Feature Characters (Tot, para, sent, word) Capitalized Words (Tot, para, sent, word) Type/Token Ratio Word (Tot, para, sent) Sentences (Tot, para) Paragraphs Syllables (Tot, para, sent, word) Numerics (Tot, para, sent, word) Vowels (Tot, para, sent, word) Punctuation (Tot, para, sent, word) Nodes (sent) Automated Index Coleman Liau Index Flesch Reading Ease Flesch-Kinkaid Grade Gunning Fog Index Lix Formula Figure 3: Feature List by Type To understand the usefulness and potential application of this project, we will compare two similar texts, John. F. Kennedy s inaugural address and Martin Luther King Jr. s I Have a Dream speech. These works serve as good examples for comparison since they are both speeches and were delivered in the same time period of the 1960 s. They are also of similar length with JFK s speech having a length of approximately 1,300 words and the I Have a Dream speech having a length of approximately 1,600 words. Also the content of the speeches although not quite the same are similar in some ways. Since both of these texts were meant to be spoken and not read, it is not surprising that there is little variation in many of the features like characters per word, syllables per word, and vowels per word. There is however a clear difference when looking at words per paragraph (Figure 4) and syllables per sentence (Figure 5) as well as certain readability measures. JFK s speech has fewer words per paragraph with an average of (49.25) and a max of (96) while MLK s has an average of and a max of 206. JFK s speech however has more syllables per sentence with an average of and a max of 112 while MLK s has an average of and a max of 97. JFK s speech has a Flesch-Kincaid grade of while MLK s has a grade of This shows a clear difference in the structure of the speeches. JFK has longer sentences containing more words however MLK has longer paragraphs containing more sentences and this can be seen just by looking at the two speeches. This also explains the difference in readability grade as 6
7 Number of Syllables Number of Words JFK s speech has more syllables per sentence those sentences will tend to be more complex. By using this data we can characterize different works and use those characterizations for different applications like authorship attribution. 250 Words Per Paragraph Comparison JFK MLK 50 0 Avg Words/Para Max Words/Para Figure 4: Words per Paragraph Comparison between the two speeches 120 Syllables Per Sentence Comparison JFK MLK 20 0 Avg Syll/Sent Max Syll/Sent Figure 5: Syllables per Sentence Comparison between the two speeches 5 Conclusion The Automatic Text Analysis Using Drupal system is an easy to use NLP tool that can quickly gather the statistical features of a document. This type of tool could be used in multiple applications such as authorship attribution and other NLP related machine learning systems. 7
8 With the development of this system I have learned to be able to adapt quickly in terms of using other programming languages and contributing to other programs. As a part of that is also learning to adapt to user expectations and requirements that can change as development progresses. I did not realize how important the field of NLP was to our current technologies as well as future technologies and I hope to be able to contribute to its growth with this project and continue to see it grow in the future. 8
USABILITY OF A FILIPINO LANGUAGE TOOLS WEBSITE
USABILITY OF A FILIPINO LANGUAGE TOOLS WEBSITE Ria A. Sagum, MCS Department of Computer Science, College of Computer and Information Sciences Polytechnic University of the Philippines, Manila, Philippines
UQC103S1 UFCE47-20-1. Systems Development. uqc103s/ufce47-20-1 PHP-mySQL 1
UQC103S1 UFCE47-20-1 Systems Development uqc103s/ufce47-20-1 PHP-mySQL 1 Who? Email: [email protected] Web Site www.cems.uwe.ac.uk/~jedawson www.cems.uwe.ac.uk/~jtwebb/uqc103s1/ uqc103s/ufce47-20-1 PHP-mySQL
A Framework-based Online Question Answering System. Oliver Scheuer, Dan Shen, Dietrich Klakow
A Framework-based Online Question Answering System Oliver Scheuer, Dan Shen, Dietrich Klakow Outline General Structure for Online QA System Problems in General Structure Framework-based Online QA system
31 Case Studies: Java Natural Language Tools Available on the Web
31 Case Studies: Java Natural Language Tools Available on the Web Chapter Objectives Chapter Contents This chapter provides a number of sources for open source and free atural language understanding software
Drupal CMS for marketing sites
Drupal CMS for marketing sites Intro Sample sites: End to End flow Folder Structure Project setup Content Folder Data Store (Drupal CMS) Importing/Exporting Content Database Migrations Backend Config Unit
Drupal Performance Tuning
Drupal Performance Tuning By Jeremy Zerr Website: http://www.jeremyzerr.com @jrzerr http://www.linkedin.com/in/jrzerr Overview Basics of Web App Systems Architecture General Web
Workshop on Using Open Source Content Management System Drupal to build Library Websites Hasina Afroz Auninda Rumy Saleque
Workshop on Using Open Source Content Management System Drupal to build Library Websites Hasina Afroz Auninda Rumy Saleque Funded by: INASP, UK October 7, 2012 Ayesha Abed Library http://library.bracu.ac.bd
E-Commerce: Designing And Creating An Online Store
E-Commerce: Designing And Creating An Online Store Introduction About Steve Green Ministries Solo Performance Artist for 19 Years. Released over 26 Records, Several Kids Movies, and Books. My History With
Accelerating and Evaluation of Syntactic Parsing in Natural Language Question Answering Systems
Accelerating and Evaluation of Syntactic Parsing in Natural Language Question Answering Systems cation systems. For example, NLP could be used in Question Answering (QA) systems to understand users natural
Natural Language to Relational Query by Using Parsing Compiler
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,
Open Source Content Management System for content development: a comparative study
Open Source Content Management System for content development: a comparative study D. P. Tripathi Assistant Librarian Biju Patnaik Central Library NIT Rourkela [email protected] Designing dynamic and
Trollhättan, Sweden. http://keryx.se/ http://twitter.com/itpastorn/ http://itpastorn.blogspot.com/
Trollhättan, Sweden Lars Gunther is a web developer, computer science teacher and a pastor, who lives in Trollhättan, Sweden. He is the lead editor of several courses for WaSP Interact and invited expert
How to Install and Setting Up Drupal
Drupal 101 Introduction to Drupal September 12, 2014 nerdsummit.org Rick Hood [email protected] [email protected] [email protected] www.drupal.org/user/54879 2011 - present
Special Topics in Computer Science
Special Topics in Computer Science NLP in a Nutshell CS492B Spring Semester 2009 Jong C. Park Computer Science Department Korea Advanced Institute of Science and Technology INTRODUCTION Jong C. Park, CS
NATURAL LANGUAGE QUERY PROCESSING USING PROBABILISTIC CONTEXT FREE GRAMMAR
NATURAL LANGUAGE QUERY PROCESSING USING PROBABILISTIC CONTEXT FREE GRAMMAR Arati K. Deshpande 1 and Prakash. R. Devale 2 1 Student and 2 Professor & Head, Department of Information Technology, Bharati
LAMP Server A Brief Overview
2012 LAMP Server A Brief Overview Daniel Eakins Santa Fe College CTS 2356 Advanced Administration 3/21/2012 Abstract LAMP is short for Linux, Apache, MySQL, and PHP. LAMP servers are typically not Windows
Document management and exchange system supporting education process
Document management and exchange system supporting education process Emil Egredzija, Bozidar Kovacic Information system development department, Information Technology Institute City of Rijeka Korzo 16,
Seamless Web Data Entry for SAS Applications D.J. Penix, Pinnacle Solutions, Indianapolis, IN
Seamless Web Data Entry for SAS Applications D.J. Penix, Pinnacle Solutions, Indianapolis, IN ABSTRACT For organizations that need to implement a robust data entry solution, options are somewhat limited
owncloud Architecture Overview
owncloud Architecture Overview Time to get control back Employees are using cloud-based services to share sensitive company data with vendors, customers, partners and each other. They are syncing data
Video Transcription in MediaMosa
Video Transcription in MediaMosa Proof of Concept Version 1.1 December 28, 2011 SURFnet/Kennisnet Innovatieprogramma Het SURFnet/ Kennisnet Innovatieprogramma wordt financieel mogelijk gemaakt door het
An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines)
An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines) James Clarke, Vivek Srikumar, Mark Sammons, Dan Roth Department of Computer Science, University of Illinois, Urbana-Champaign.
uilding a Branch Website using Wordpress
Building a branch website using WordPress Building uilding a Branch Website using Wordpress WordPress is a Free and Open Source (FOSS) Content Management System (CMS). It allows you to build websites without
CA4003 - Compiler Construction
CA4003 - Compiler Construction David Sinclair Overview This module will cover the compilation process, reading and parsing a structured language, storing it in an appropriate data structure, analysing
Open Source CRM. Michelle Murrain, Nonprofit Open Source Initiative March 27, 2008
Open Source CRM Michelle Murrain, Nonprofit Open Source Initiative March 27, 2008 What I ll cover today What is a CRM? Kinds of CRM Why Open Source CRM? Examples of Open Source CRMs How to choose a CRM
Abdullah Radwan. Target Job. Work Experience (9 Years)
Abdullah Radwan LAMP / Linux / PHP / Apache / Ruby / MySQL / ASP.NET / Web Developer Wordpress / Magento / Drupal / C# / Sql Server / HTML / HTML5 / CSS CSS3 / Javascript / jquery / Prototype / SEO Target
Deposit Identification Utility and Visualization Tool
Deposit Identification Utility and Visualization Tool Colorado School of Mines Field Session Summer 2014 David Alexander Jeremy Kerr Luke McPherson Introduction Newmont Mining Corporation was founded in
LAMP Quickstart for Red Hat Enterprise Linux 4
LAMP Quickstart for Red Hat Enterprise Linux 4 Dave Jaffe Dell Enterprise Marketing December 2005 Introduction A very common way to build web applications with a database backend is called a LAMP Stack,
Installing Drupal on Your Local Computer
Installing Drupal on Your Local Computer This tutorial will help you install Drupal on your own home computer and allow you to test and experiment building a Web site using this open source software. This
Firewall Builder Architecture Overview
Firewall Builder Architecture Overview Vadim Zaliva Vadim Kurland Abstract This document gives brief, high level overview of existing Firewall Builder architecture.
Introduction to Python for Text Analysis
Introduction to Python for Text Analysis Jennifer Pan Institute for Quantitative Social Science Harvard University (Political Science Methods Workshop, February 21 2014) *Much credit to Andy Hall and Learning
Short notes on webpage programming languages
Short notes on webpage programming languages What is HTML? HTML is a language for describing web pages. HTML stands for Hyper Text Markup Language HTML is a markup language A markup language is a set of
Syllabus INFO-GB-3322. Design and Development of Web and Mobile Applications (Especially for Start Ups)
Syllabus INFO-GB-3322 Design and Development of Web and Mobile Applications (Especially for Start Ups) Spring 2015 Stern School of Business Norman White, KMEC 8-88 Email: [email protected] Phone: 212-998
1992-2010 by Pearson Education, Inc. All Rights Reserved.
Key benefit of object-oriented programming is that the software is more understandable better organized and easier to maintain, modify and debug Significant because perhaps as much as 80 percent of software
Content Management System - Drupal. Vikrant Sawant ([email protected]) Legislative Data Center, California
Content Management System - Drupal Vikrant Sawant ([email protected]) Legislative Data Center, California National Association of Legislative Information Technology Raleigh, NC October 2013 What
Shallow Parsing with Apache UIMA
Shallow Parsing with Apache UIMA Graham Wilcock University of Helsinki Finland [email protected] Abstract Apache UIMA (Unstructured Information Management Architecture) is a framework for linguistic
Building a Question Classifier for a TREC-Style Question Answering System
Building a Question Classifier for a TREC-Style Question Answering System Richard May & Ari Steinberg Topic: Question Classification We define Question Classification (QC) here to be the task that, given
Content Management Systems: Drupal Vs Jahia
Content Management Systems: Drupal Vs Jahia Mrudula Talloju Department of Computing and Information Sciences Kansas State University Manhattan, KS 66502. [email protected] Abstract Content Management Systems
Survey Results: Requirements and Use Cases for Linguistic Linked Data
Survey Results: Requirements and Use Cases for Linguistic Linked Data 1 Introduction This survey was conducted by the FP7 Project LIDER (http://www.lider-project.eu/) as input into the W3C Community Group
Systems Analysis Input and Output 1. Input and Output
Systems Analysis Input and Output 1 Input and Output A course in information architecture or web design, complemented with work in Human-Computer Interaction, will help the analyst understand how to improve
Web Based Maltese Language Text to Speech Synthesiser
Web Based Maltese Language Text to Speech Synthesiser Buhagiar Ian & Micallef Paul Faculty of ICT, Department of Computer & Communications Engineering [email protected], [email protected] Abstract An important
IGW+ Certificate. I d e a l G r o u p i n W e b. International professional web design,
IGW+ Certificate I d e a l G r o u p i n W e b International professional web design, Programming, CRM, online office automation, complete security, Secured Ecommerce and web site maintenance educational
How To Write A Database In Java With A New Data Type In Itunes.Com
Dynamic generation of web service APIs and interfaces through compilation of JSON Marisa Yeung, Nayan Deep Vemula and Sandy Siththanandan Team 4 COEN 259 Compilers Professor Ming Hwa Wang Santa Clara University
Master Degree Project Ideas (Fall 2014) Proposed By Faculty Department of Information Systems College of Computer Sciences and Information Technology
Master Degree Project Ideas (Fall 2014) Proposed By Faculty Department of Information Systems College of Computer Sciences and Information Technology 1 P age Dr. Maruf Hasan MS CIS Program Potential Project
MAGENTO HOSTING Progressive Server Performance Improvements
MAGENTO HOSTING Progressive Server Performance Improvements Simple Helix, LLC 4092 Memorial Parkway Ste 202 Huntsville, AL 35802 [email protected] 1.866.963.0424 www.simplehelix.com 2 Table of Contents
How does Drupal 7 Work? Tess Flynn, KDØPQK www.deninet.com
How does Drupal 7 Work? Tess Flynn, KDØPQK www.deninet.com About the Author Bachelor of Computer Science Used Drupal since 4.7 Switched from self-built PHP CMS Current Job: Not in Drupal! But she d like
Pattern Insight Clone Detection
Pattern Insight Clone Detection TM The fastest, most effective way to discover all similar code segments What is Clone Detection? Pattern Insight Clone Detection is a powerful pattern discovery technology
Application note: SQL@CHIP Connecting the IPC@CHIP to a Database
Application note: SQL@CHIP Connecting the IPC@CHIP to a Database 1. Introduction This application note describes how to connect an IPC@CHIP to a database and exchange data between those. As there are no
Interactive Dynamic Information Extraction
Interactive Dynamic Information Extraction Kathrin Eichler, Holmer Hemsen, Markus Löckelt, Günter Neumann, and Norbert Reithinger Deutsches Forschungszentrum für Künstliche Intelligenz - DFKI, 66123 Saarbrücken
Open Source Content Management Software : A Comparative Analysis
7 th International CALIBER 2009 Open Source Content Management Software... Open Source Content Management Software : A Comparative Analysis Kaushal K Giri Kirti R Nirgude Abstract There are many web-authoring
High Level Design Distributed Network Traffic Controller
High Level Design Distributed Network Traffic Controller Revision Number: 1.0 Last date of revision: 2/2/05 22c:198 Johnson, Chadwick Hugh Change Record Revision Date Author Changes 1 Contents 1. Introduction
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 INTELLIGENT MULTIDIMENSIONAL DATABASE INTERFACE Mona Gharib Mohamed Reda Zahraa E. Mohamed Faculty of Science,
LEARNING DRUPAL. Instructor : Joshua Owusu-Ansah Company : e4solutions Com. Ltd.
LEARNING DRUPAL Instructor : Joshua Owusu-Ansah Company : e4solutions Com. Ltd. Background The Drupal project was started in 2000 by a student in Belgium named Dries Buytaert. The code was originally designed
IT3504: Web Development Techniques (Optional)
INTRODUCTION : Web Development Techniques (Optional) This is one of the three optional courses designed for Semester 3 of the Bachelor of Information Technology Degree program. This course on web development
Efficient Automated Build and Deployment Framework with Parallel Process
Efficient Automated Build and Deployment Framework with Parallel Process Prachee Kamboj 1, Lincy Mathews 2 Information Science and engineering Department, M. S. Ramaiah Institute of Technology, Bangalore,
IT3503 Web Development Techniques (Optional)
INTRODUCTION Web Development Techniques (Optional) This is one of the three optional courses designed for Semester 3 of the Bachelor of Information Technology Degree program. This course on web development
Compiler I: Syntax Analysis Human Thought
Course map Compiler I: Syntax Analysis Human Thought Abstract design Chapters 9, 12 H.L. Language & Operating Sys. Compiler Chapters 10-11 Virtual Machine Software hierarchy Translator Chapters 7-8 Assembly
Obtain Professional Master of Computing From EIC in London
Obtain Professional Master of Computing From EIC in London Professional Maser in E-Commerce In this program, you will learn to design, create, document and implement computer programs in three languages.
Syllabus INFO-UB-3322. Design and Development of Web and Mobile Applications (Especially for Start Ups)
Syllabus INFO-UB-3322 Design and Development of Web and Mobile Applications (Especially for Start Ups) Fall 2014 Stern School of Business Norman White, KMEC 8-88 Email: [email protected] Phone: 212-998
Installation Instructions
Installation Instructions 25 February 2014 SIAM AST Installation Instructions 2 Table of Contents Server Software Requirements... 3 Summary of the Installation Steps... 3 Application Access Levels... 3
Visualization of Semantic Windows with SciDB Integration
Visualization of Semantic Windows with SciDB Integration Hasan Tuna Icingir Department of Computer Science Brown University Providence, RI 02912 [email protected] February 6, 2013 Abstract Interactive Data
Construction of a Web Server with a Database Facility Using Open Source Software
Construction of a Web Server with a Database Facility Using Open Source Software Munkhjargal Purevdorj* and Tadashi OKADA** This report is about using an open source operating system for constructing a
ASSOCIATE IN ARTS DEGREE-60 UNITS
+ A Course of Study for a Major in Computer Science A.A. Degree & Certificate Programs The field of computer science leads to a variety of careers that all require core computer science skills. These skills
Simplifying Your IT Helpdesk with Request Tracker
Simplifying Your IT Helpdesk with Request Tracker Abstract Jay Lee Network / Systems Administrator Sali Kaceli Manager of Academic Computing Philadelphia Biblical University Information Technology Department
Bubble Code Review for Magento
User Guide Author: Version: Website: Support: Johann Reinke 1.1 https://www.bubbleshop.net [email protected] Table of Contents 1 Introducing Bubble Code Review... 3 1.1 Features... 3 1.2 Compatibility...
Drupal 6 to Drupal 7 Migration Worksheet
Drupal 6 to Drupal 7 Migration Worksheet Rationale for This Document An upgrade of a Drupal 6 website is a complex proposition. As a general rule of thumb, many professional Drupal development teams approach
CentOS Linux 5.2 and Apache 2.2 vs. Microsoft Windows Web Server 2008 and IIS 7.0 when Serving Static and PHP Content
Advances in Networks, Computing and Communications 6 92 CentOS Linux 5.2 and Apache 2.2 vs. Microsoft Windows Web Server 2008 and IIS 7.0 when Serving Static and PHP Content Abstract D.J.Moore and P.S.Dowland
TAACO Quick Start Guide: Windows 7 Kristopher Kyle and Scott Crossley 9-9- 2014
I. Required Programs TAACO Quick Start Guide: Windows 7 Kristopher Kyle and Scott Crossley 9-9- 2014 Before TAACO will work correctly, you must have Python 2.7, Natural Language Toolkit (NTLK), and NumPy
AUTOMATED CONFERENCE CD-ROM BUILDER AN OPEN SOURCE APPROACH Stefan Karastanev
International Journal "Information Technologies & Knowledge" Vol.5 / 2011 319 AUTOMATED CONFERENCE CD-ROM BUILDER AN OPEN SOURCE APPROACH Stefan Karastanev Abstract: This paper presents a new approach
owncloud Architecture Overview
owncloud Architecture Overview owncloud, Inc. 57 Bedford Street, Suite 102 Lexington, MA 02420 United States phone: +1 (877) 394-2030 www.owncloud.com/contact owncloud GmbH Schloßäckerstraße 26a 90443
Auditing Drupal sites for performance, content and optimal configuration
Auditing Drupal sites for performance, content and optimal configuration! drupal.org/project/site_audit 2014.10.18 - Pacific NW Drupal Summit Jon Peck Senior Engineer at Four Kitchens @FluxSauce - github.com/fluxsauce
Building Library Website using Drupal
Building Library Website using Drupal Building the Library Web Site "The Web is quickly becoming the world's fastest growing repository of data." [Tim Berners-Lee, W3C director and creator of the World
Vincent Gabriel. Summary. Experience. Senior Software Developer at Landmark Network [email protected]
Vincent Gabriel Senior Software Developer at Landmark Network [email protected] Summary Open Source Contributions: https://github.com/vinceg Results-oriented lead architect with a focus on delivering
Database Management System Choices. Introduction To Database Systems CSE 373 Spring 2013
Database Management System Choices Introduction To Database Systems CSE 373 Spring 2013 Outline Introduction PostgreSQL MySQL Microsoft SQL Server Choosing A DBMS NoSQL Introduction There a lot of options
Is Drupal secure? A high-level perspective on web vulnerabilities, Drupal s solutions, and how to maintain site security
Is Drupal secure? A high-level perspective on web vulnerabilities, Drupal s solutions, and how to maintain site security Presented 2009-05-29 by David Strauss Thinking Securely Security is a process, not
Software Architecture Document
Software Architecture Document Natural Language Processing Cell Version 1.0 Natural Language Processing Cell Software Architecture Document Version 1.0 1 1. Table of Contents 1. Table of Contents... 2
Markus Dickinson. Dept. of Linguistics, Indiana University Catapult Workshop Series; February 1, 2013
Markus Dickinson Dept. of Linguistics, Indiana University Catapult Workshop Series; February 1, 2013 1 / 34 Basic text analysis Before any sophisticated analysis, we want ways to get a sense of text data
IT services for analyses of various data samples
IT services for analyses of various data samples Ján Paralič, František Babič, Martin Sarnovský, Peter Butka, Cecília Havrilová, Miroslava Muchová, Michal Puheim, Martin Mikula, Gabriel Tutoky Technical
Programming Languages
Programming Languages Programming languages bridge the gap between people and machines; for that matter, they also bridge the gap among people who would like to share algorithms in a way that immediately
AngularJS, Javascript, Prototype-based OO concept, RESTful Design Pattern, GWT, HTML5, Database.
Position Code Role Skillsets Years of Experience TPM01 Senior Back End Application Developer Enterprise Architecture, Java EE, Java Spring Framework, Design Pattern, SOA, JPA, OSGI, Linux, Maven, Tomcat,
Open Source Technologies on Microsoft Azure
Open Source Technologies on Microsoft Azure A Survey @DChappellAssoc Copyright 2014 Chappell & Associates The Main Idea i Open source technologies are a fundamental part of Microsoft Azure The Big Questions
Semantic annotation of requirements for automatic UML class diagram generation
www.ijcsi.org 259 Semantic annotation of requirements for automatic UML class diagram generation Soumaya Amdouni 1, Wahiba Ben Abdessalem Karaa 2 and Sondes Bouabid 3 1 University of tunis High Institute
Content Management Software Drupal : Open Source Software to create library website
Content Management Software Drupal : Open Source Software to create library website S.Satish, Asst Library & Information Officer National Institute of Epidemiology (ICMR) R-127, Third Avenue, Tamil Nadu
Integrating NLTK with the Hadoop Map Reduce Framework 433-460 Human Language Technology Project
Integrating NLTK with the Hadoop Map Reduce Framework 433-460 Human Language Technology Project Paul Bone [email protected] June 2008 Contents 1 Introduction 1 2 Method 2 2.1 Hadoop and Python.........................
SEITE 1 THUNDER OPEN SOURCE CMS FOR PROFESSIONAL PUBLISHING
SEITE 1 THUNDER OPEN SOURCE CMS FOR PROFESSIONAL PUBLISHING SEITE 2 PART I. PUBLISHING TECHNOLOGY IS CHANGING SEITE 3 In the Beginning, Technology was an essential enabling asset for MASS COMMUNICATION
Real-time Streaming Analysis for Hadoop and Flume. Aaron Kimball odiago, inc. OSCON Data 2011
Real-time Streaming Analysis for Hadoop and Flume Aaron Kimball odiago, inc. OSCON Data 2011 The plan Background: Flume introduction The need for online analytics Introducing FlumeBase Demo! FlumeBase
NATURAL LANGUAGE TO SQL CONVERSION SYSTEM
International Journal of Computer Science Engineering and Information Technology Research (IJCSEITR) ISSN 2249-6831 Vol. 3, Issue 2, Jun 2013, 161-166 TJPRC Pvt. Ltd. NATURAL LANGUAGE TO SQL CONVERSION
Natural Language Processing in the EHR Lifecycle
Insight Driven Health Natural Language Processing in the EHR Lifecycle Cecil O. Lynch, MD, MS [email protected] Health & Public Service Outline Medical Data Landscape Value Proposition of NLP
THE EIGHT ADVANTAGES OF BEST- OF-BREED APPLICATIONS
WHITE PAPER THE EIGHT ADVANTAGES OF BEST- OF-BREED APPLICATIONS INTRODUCTION Until recently, field service organizations seeking to take advantage of today s mobile environment often found they had to
Integrated Network Vulnerability Scanning & Penetration Testing SAINTcorporation.com
SAINT Integrated Network Vulnerability Scanning and Penetration Testing www.saintcorporation.com Introduction While network vulnerability scanning is an important tool in proactive network security, penetration
Drupal Training Modules 2015
Drupal Training Modules 2015 Webikon.com Phone: +40-722-369674 E-mail: [email protected] Web: http://webikon.com Drupal Training Modules 1 / 8 About us Webikon is a Romanian company focused in consulting,
K@ A collaborative platform for knowledge management
White Paper K@ A collaborative platform for knowledge management Quinary SpA www.quinary.com via Pietrasanta 14 20141 Milano Italia t +39 02 3090 1500 f +39 02 3090 1501 Copyright 2004 Quinary SpA Index
Automatic Knowledge Base Construction Systems. Dr. Daisy Zhe Wang CISE Department University of Florida September 3th 2014
Automatic Knowledge Base Construction Systems Dr. Daisy Zhe Wang CISE Department University of Florida September 3th 2014 1 Text Contains Knowledge 2 Text Contains Automatically Extractable Knowledge 3
