Acquisition of User Profile for Domain Specific Personalized Access 1
|
|
- Solomon Jacobs
- 8 years ago
- Views:
Transcription
1 Acquisition of User Profile for Domain Specific Personalized Access 1 Plaban Kumar Bhowmick, Samiran Sarkar, Sudeshna Sarkar, Anupam Basu Department of Computer Science & Engineering, Indian Institute of Technology Kharagpur, India {plaban, samiran, sudeshna, anupam}@cse.iitkgp.ernet.in Abstract. The Internet is a large and ever increasing database of structured and unstructured data ranging from audio, video and text. As a consequence, often a huge number of references to web pages are returned by a standard search engine in response to a user s query. Finding information from this huge pool is tedious and time taking. It is necessary to have a model of the user s interest in order to identify relevant information personalized to his needs. In this paper, we describe our representation of the user s profile so that it can be used to identify relevant information for the user. We describe our system for acquiring and maintain the user s profile so that the system can adapt itself to shift in the interest of the user. For each user, the system maintains a separate user profile that needs to gradually saturate to the actual interest of the user. 1 Introduction The Internet is an enormously growing source of information belonging to varying domain. Finding relevant information for the user provided query from the huge pool of the Internet is a challenging task to address as the personalized search interface is truly lacking. In a general purpose search engine, the results returned for a search query is same for all the users irrespective of the background and interest of the users. For example, in response to a query reflection, a standard search engine returns results from varying domains. For example, among first 9 results returned by Google, only three results are related to the domain of physics. This is because the keyword reflection has various meanings in different context and holds different meaning for different user category. We have developed a system that can be used by school students to query the Internet to retrieve personalized information. The domain of our interest is school level topics. In this domain, different users belonging to different grades and abilities will have different requirements. The requirement of a class seven student is somewhat different from that of a tenth standard student. So the same set of documents may not be understandable to students of every level. Traditional search engines do 1 This research work is funded in part by Media Lab Asia, under the auspices of the Communication Empowerment Laboratory, IIT Kharagpur.
2 not address this issue. There is need to develop a system which will personalize the view of the global database depending on the personal preferences of the users. To deliver the appropriate set of documents to the user, the system needs the knowledge of the underlying domain and also the concepts of interest to the user and her level of knowledge. To achieve this, there is need for proper representation of the domain knowledge about the different subjects and also knowledge about the user s requirement. In this paper, we will be describing the representation of the domain knowledge and the use of the domain knowledge in modeling and acquiring the user s profile. In section 3, we describe different aspects of domain knowledge and its requirement in the process of acquiring the user interest. In section 4, we will be discussing about the model of the user interest that helps in providing the users with relevant information. 2 Related Work Retrieving personalized information from the information space of Internet is a broad area of research. Several research works look at the various aspects of personalized system. The research works differs in the way they represent the user profile, the adaptivity of the system. We call a personalized system to be adaptive if the system is able to tune itself according to requirement of the user. Again among the adaptive systems, the algorithm for learning the user s interest varies from system to system. The presence of domain knowledge also distinguishes between the systems. FAB[1] is a content based recommendation system where the user profile is represented as a list of keywords and is maintained by relevance feedback. SHIFT[2] provides the users with personalized information by looking at the aspect of user interest modeling. The user profile is represented by weighted vector of keywords and specified during the subscription procedure. No idea of adaptation is apparent here. ifweb[3] is a user model based intelligent agent that provides support for navigation of WWW and also for document search according to the need of the user. The user profiles are represented as weighted semantic networks. The nodes and relations between nodes are derived from the co-occurrence criteria of related terms in some documents. The user profile is updated by relevance feedback. SmartPush[4] is a personalized news delivery system which depends on the special type of content authoring. Each document is augmented with a metadata in an ontological form that describes the content. User profile is represented by the hierarchy of concepts or ontology and is created explicitly by the user or by choosing from a set of default profiles. The user profile and the metadata in each document are matched to decide on the relevance of the document. WebWatcher[5] assists users like a tour guide while the user is browsing the World Wide Web providing important suggestions in choosing relevant hyperlinks by analyzing past experiences. The user profile is represented as a list of keywords and is provided in the beginning of the tour. Each link is annotated with the interest of the user if she selects the link. But no adaptation is supported in case of user profile. Syskill & Webert[6] provides the facility of both search and recommendation.
3 The user profile is a set of non-related classes each of which represents the content of an index page. Each class contains a separate profile which is a list of boolean features. Letizia[7] is an agent that automates a browsing strategy consisting of a best-first search augmented by heuristics inferring user interest from browsing behavior. The observation process is passive in the sense that it sits idle when the user browses. It provides recommendation on demand. In [8], the user profile is represented as hierarchy of concepts and is adopted from a reference ontology. Each concept is assigned with ten documents the concepts from which are extracted to generate a super document vector for each concept. The user profile is maintained automatically by looking at the surfing behavior of the user. The surfed pages are then classified to the appropriate concept node by measuring the cosine similarity[9] between the document vector and vector assigned to concept node. The time spent on a page plays an important role in calculating the weights. All the systems described above are addressing the issue of personalization based on the ontology of general domain. The domain of our interest i.e., the domain of school topics has a well defined structure. Our ontology has been tuned to that structure. To identify relevant documents for a school student, a system needs not only to identify the important concepts in the domain, but also needs to consider whether the concept is easily understandable to a student given her background. The above mentioned systems are inadequate to fulfill this need completely. Our work describes how these issues can be handled by representing the domain knowledge and the user interest in a special way. 3 Domain Knowledge The knowledge representation database, ontology[10], is organized into a three level hierarchical structure as shown in Figure 1. Fig. 1. Structure of Ontology Topic-Subtopic Level: On the top level, the topics share a parent child relationship. This provides a way of generalization from a specific to a more general topic. The hierarchy of the topics is stored as an n-ary tree with the exception that a node may have multiple parents. This is because a subtopic may be placed under two or more
4 topics. For example, in the domain of biology, animal nutrition and plant nutrition are two subtopics of the topic nutrition. Concept Level: A topic consists of several concepts, which form the next level of the ontology and a concept may belong to one or more topics. A set of empirical relations can be defined among the concepts in a domain. We notice that if a concept is of significance in a document, it is usually the case that the document contains a number of references to related concepts. The breadth and depth of the ontology is used by the ranking algorithm because concepts that are directly and remotely connected to the concepts in the query are used for the calculation of the document scores. In fact the occurrence of related concepts is taken as a very strong indication of the relevance of the document. Pages that do not contain related concepts are suspect and may be spurious. The relations that are stored in the ontology become very important for this reason. In order to keep the system simple the relations must be broad and general. The relation list chosen must also cover most important forms of relations that occur so that the ranking process has a sufficiently good ontological web. For example, if a document contains material relevant to reflection in optics, it will have references to some of the related concepts like light, ray, mirror, lens, angle of incidence, etc. To capture the strength of a relation, we introduce the notion of distance between two concepts. This distance between two concepts is not symmetric. These distances have been devised and tuned experimentally for each domain. The types of relations in the context of a domain are explained in Table 1. The concepts in the domain are organized into a di-graph. The existence of an edge between two concepts in the digraph indicates that the concepts are related. Each edge is assigned a weight depending upon the relation by which two concepts are related by this edge. The weight is an indication of the strength of the relationship. Table 1. Inter concept relationships Relations Has Part Inherited From Has Prerequisite Functionally Related Part Of Procedure Is Caused By The relations in Table 1 provide a way of storing the structure of a domain without storing any information about a particular concept. This structure may be used as a conceptual co-occurrence function and shows which concepts can logically co-occur. These relations make it possible to find the concepts that are close to a particular concept and this information may be used in many ways. Keyword Level: A set of keywords of each domain for the concepts that the keywords are associated to. This list also contains the specificity index of each keyword with respect to each of the concepts that it is associated to. This specificity index
5 stores the likelihood of the keyword representing a particular concept. These keywords are used to extract concepts from documents and queries. The association of the keywords to the concepts has several advantages. Firstly, the different keywords having the same meaning are mapped to a common concept removing the synonymous ambiguity of keywords. 4 Model of User Interest There is a need to model the interest of the user in order to filter the web documents with respect to the need of the user. The students belonging to the same class have common set of interest that is defined by the curriculum. So, we have defined a set of group profiles that are the representation of the syllabus. The model of the users interest is captured in the form of user profile that can be derived from group profiles by default. But individual interest of a user can vary from the predefined group profiles. We define two types of attributes to model two different aspect of the user interest. 1. Domain knowledge specific attributes: This type of attributes tries to capture the interest of the user in terms of the knowledge of the domain. The same ontological structure as the domain knowledge is adopted in the representation of the domain knowledge specific attributes of the user profile. Each concept in the user profile is further annotated with scores revealing the interest value of the concepts. 2. Information presentational attributes: The format and the view a document largely depends on the personal preferences of the user. For example, the user may like to have images in the presented document or the user may like to view the document with his personal color preferences. So there is a need to personalize the presentational view of the document. These attributes are used by the transcoding module of the system during presentation of the document. 4.1 Creation and Maintenance of the User Profile The user profile is acquired in two phases. At the first phase, the user is asked explicitly to provide her initial profile as a goal. The user can also update the profile manually. Static scores are assigned to each concept in these concepts. The user may not be able to enumerate all his interests initially. So the user s browsing history is used to update her profile. The next phase (user profile acquisition) monitors the browsing behavior of the user and with the help of the content analysis scheme the concepts of the user s interest are discovered gradually. 4.2 Profile Editing and Monitoring Architecture In Figure 2, we present an architecture for the creation and automatic updating of the user profile.
6 Fig. 2. Profile editing and monitoring architecture 4.3 Static User Profile Creation Profile Editor: In Figure 3, we provide an interface that helps the student to create her profile consulting several group profiles. Fig. 3. User profile creation and updating interface The interface provides the following facilities: Choosing a predefined group profile. Adding a new topic into the profile from a predefined group profile. Adding a subset of concepts under a topic. The student can also update his profile statically. The following operations are provided for updating a user profile: Deleting a topic and all the concepts under the topic. Deleting a subset of concepts under a topic.
7 Adding new concepts from a topic. The concepts chosen by the user are high indicators of his interest. So, these concepts should get higher interest scores. We adopt a fixed scoring scheme to score these concepts: Score(C) = S, where C Є concept from the static user profile S = constant representing the fixed score 4.4 User Profile Acquisition We have identified four possible data sources listed below that can be used to learn the user s preferences as the user starts using the system. 1. Query history. 2. Usage log of the user. 3. Previous state of the user profile. 4. Content of the document scanned by the user Learning from Query Pattern The pattern in which the user places her query reveals much of the user s interest. The system monitors all queries placed by a user and periodically updates the score of each concept by looking at the frequency the concepts present in the queries. Thus we get a weighted list of concepts. From this we choose the concepts which have higher frequency associated with. Now we scan each concept in this filtered list. If a concept from this list does exist in the user profile, the score for this concept is increased with the help of scoring scheme discussed later. If the concept is new to the user profile, then the concept is just annotated with calculated score. For each concept appearing in the query find the related concepts that occur in the current user profile Score = (concept frequency + related concept score) /period Learning from Browsing Pattern The monitoring agent monitors the browsing pattern to capture the concepts of user s interest. The browsing pattern of the user is maintained in the form of usage log. The usage log is analyzed to obtain the Web Access Graph that represents the browsing graph of the user for a particular result provided to the user in the response to a query. User Log Analysis: There are some important clues from usage log that we can exploit: 1. The file accessed by the user in her session. 2. time that a user spends on a particular document. Access Graph: We have represented the browsing session of the user in a directed graph called Access Graph (AG). The graph is the trace of the navigation pattern of the user. Each node of this graph is the representation of the browsed page having the following fields: Time of access defined as follows
8 t a = time of access for the page The idiosyncrasies of the user access behavior should be kept in mind in calculating t a. The user may explore a link and return back without spending a sufficient amount of time. Again the user may keep a page opened for a long time while she is busy with some other work. So, two threshold values have to be set so that effect of these idiosyncrasies can be tackled. Here we set two threshold values, which limit the acceptance of a reasonable access time. The set of links with three types of labeling: o explored_fruitful. o explored_unfruitful o unexplored Concepts derived from anchor text of links: The set of all links in an accessed page is divided into three sets. Φ explored-fruitful = set of all concepts appearing in the explored and fruitful links. ξ explored-unfruitful = set of all concepts appearing in the explored but unfruitful links. Ψ unexplored = set of all concepts appearing in the unexplored links. ω ef (c) = static score * ß * (t a /length), c Є Φ explored- fruitful (1) ω eu (c) = static score * ø * (t a /length), c Є ξ explored-unfruitful (2) ω un (c) = - static score * ø * (t a /length), c Є Ψ unexplored where ß and ø are score emphasizing factors that are tuned empirically and ß> ø Final vector Ω ln (c) = w 1 * ω ef (c) + w 2 * ω eu (c) + w 3 * ω un (c) (4) Concepts derived from the content: Here we derive the interesting scores of the concepts that are present in a document. We call those concepts as Direct Interesting Concept (DIC) those are relevant to the domain in concern. We not only consider the frequencies of DICs to be the score of the concept but also the related concepts to a DIC that are present in the previous state of the user profile contributes in the score of DIC. We call these related concepts as Indirect Interesting Concept (IIC). The score for each concept are derived by the following formulas: Ω DIC = frequency(dic) (5) Ω DIC = Ω DIC + Σ i=1,n (1/d DIC-IIC * ω IIC ), where n = no. of related concepts. d DIC-IIC = distance of the relation between DIC and IIC ω IIC = weight IIC in the previous state of the user profile. Score Accumulation: To get the final scores of the concepts explored during the browsing of the results, the access graph is transformed into an Access Tree (AT). Here depth at which a particular document is accessed plays an important role as browsing of a page at higher depth increases the interestingness value of the page. We preprocess the graph before generating AT. Certain types of links are removed (3) (6)
9 because they do not contribute much in score accumulation process. The types of links that can be pruned are: The self referential links. The link that form a cycle. In both the cases, the links are converted into simple text. From the remaining graph between root page and each individual page we find the path of maximum length and ignore other links that falls in the duplicate path. The scores of concepts during one result browsing are derived by the following formula: Final interest vector = interest vector at root + (7) Σ i=1,n (depth i * interest vector at child i ) User Feedback This process of acquisition of the user profile is user feedback dependent. When the user is presented with a set of results, the user is explicitly asked to rank each of the result he has gone through {interesting, not interesting, ok}. The final interest vector and the final dislike vector is annotated with the explicit user feedback by the following expression: Final feedback vector = feedback_score* final interest vector Where feedback_score Є { 1.5(interesting, 0.5(not interesting), 1(ok)} Concept Age Monitoring We assume that a concept that has been referred infrequently in the past will be referred in the near future with lower probability. For that we have introduced the concept of aging. The age of a concept in the user profile increases when the user logs on to the system but the concept is not referred. The concepts with higher ages represent the concepts with lower interest with respect to the userhere we define an interest decay factor that depends on the age of the concept and number of sessions he has logged in. µ = (age of the concept)/(number of sessions) 3 Conclusion and Future Work The modeling of users interest is challenging task. The idiosyncrasies in the user behavior make the problem an order of magnitude harder. Here we have adopted a hybrid model of content based as well as access based approach. The presence of domain knowledge makes the process of acquisition of user interest a little bit simpler and robust also. There is need to draw comparison between the actual user interest and the acquired user profile. As a future work, we have to device a criterion by which we can estimate the time needed in the convergence of the acquired user inter-
10 est to the actual user interest. Again there should be evaluation criteria to show how close the acquired profile is to the actual user profile. References 1. M. Balabanovic and Yoav Shoham. FAB: Content Based Collaborative Reccomendation. In Communication of the ACM, Vol. 40 No. 3, Page 66-72, March Tak W. Yan and H. Garcia-Molina. SIFT: ATool for Wide-Area Information Dissemination. In Proceedings of the 1995 USENIX Technical Confernce, Pages , Fabio A. Asnicar, Carlo Tasso. ifweb: a Prototype of User Model-Based Intelligent Agent for Document Filtering and Navigation in the World Wide Web. Proceedings of the workshop "Adaptive Systems and User Modeling on the World Wide Web", Sixth International Conference on User Modeling, Chia Laguna, Sardinia, 2-5 June T. Kuki, S. Jokela, R. Sulonen and M. Turpeinen. Agents in Delivering Personalized Content Based on Semantic Metadata. In Proc AAAI Spring Symposium Workshop on Intelligent Agents in Cyberspace, pages 84-93, Stanford, USA, T. Joachims, D. Freitag, and T. Mitchell. WebWatcher: A Tour Guide for the World Wide Web. In Proc. IJCAI 97, August M. Pazzani, J. Muramatsu, and D. Billsus. Syskill & Webert: Identifying Interesting Web Sites. In Proc. 19 th National Conference on Artificial Intelligence, Henry Lieberman. Letizia: An agent that Assists Web Browsing. In Proc. International Conference on Artificial Intelligence, Montreal, Canada, August Alexander Preschner and Susan Gauch. Ontology Based Personalized Search. In Proc. 11th Intl. Conf. on Tools with Artificial Intelligence, pages , November Ricardo Baeza-Yates and Berthier Ribeiro-Neto. Modern Information retrieval. Addison Wesley Longman Publishing Co. 10. Michael Gruninger and Jintae Lee. Ontology Applicatios and Design. On Communications of the ACM, pages 39-41, February 2003/Vol. 45, No. 2.
Semantic Search in Portals using Ontologies
Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br
More informationSo today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02)
Internet Technology Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture No #39 Search Engines and Web Crawler :: Part 2 So today we
More informationOptimization of Search Results with Duplicate Page Elimination using Usage Data A. K. Sharma 1, Neelam Duhan 2 1, 2
Optimization of Search Results with Duplicate Page Elimination using Usage Data A. K. Sharma 1, Neelam Duhan 2 1, 2 Department of Computer Engineering, YMCA University of Science & Technology, Faridabad,
More informationSemantic Concept Based Retrieval of Software Bug Report with Feedback
Semantic Concept Based Retrieval of Software Bug Report with Feedback Tao Zhang, Byungjeong Lee, Hanjoon Kim, Jaeho Lee, Sooyong Kang, and Ilhoon Shin Abstract Mining software bugs provides a way to develop
More informationUniversität Augsburg. Institut für Informatik D-86135 Augsburg. Learning Scrutable User Models: Inducing Conceptual Descriptions. Martin E.
Universität Augsburg Learning Scrutable User Models: Inducing Conceptual Descriptions Martin E. Müller Report 2002-07 März 2002 Institut für Informatik D-86135 Augsburg Copyright c Martin E. Müller Institut
More informationUnderstanding Web personalization with Web Usage Mining and its Application: Recommender System
Understanding Web personalization with Web Usage Mining and its Application: Recommender System Manoj Swami 1, Prof. Manasi Kulkarni 2 1 M.Tech (Computer-NIMS), VJTI, Mumbai. 2 Department of Computer Technology,
More informationQDquaderni. UP-DRES User Profiling for a Dynamic REcommendation System E. Messina, D. Toscani, F. Archetti. university of milano bicocca
A01 084/01 university of milano bicocca QDquaderni department of informatics, systems and communication UP-DRES User Profiling for a Dynamic REcommendation System E. Messina, D. Toscani, F. Archetti research
More informationA Framework for Ontology-Based Knowledge Management System
A Framework for Ontology-Based Knowledge Management System Jiangning WU Institute of Systems Engineering, Dalian University of Technology, Dalian, 116024, China E-mail: jnwu@dlut.edu.cn Abstract Knowledge
More information1 o Semestre 2007/2008
Departamento de Engenharia Informática Instituto Superior Técnico 1 o Semestre 2007/2008 Outline 1 2 3 4 5 Outline 1 2 3 4 5 Exploiting Text How is text exploited? Two main directions Extraction Extraction
More informationSustaining Privacy Protection in Personalized Web Search with Temporal Behavior
Sustaining Privacy Protection in Personalized Web Search with Temporal Behavior N.Jagatheshwaran 1 R.Menaka 2 1 Final B.Tech (IT), jagatheshwaran.n@gmail.com, Velalar College of Engineering and Technology,
More informationData Mining in Web Search Engine Optimization and User Assisted Rank Results
Data Mining in Web Search Engine Optimization and User Assisted Rank Results Minky Jindal Institute of Technology and Management Gurgaon 122017, Haryana, India Nisha kharb Institute of Technology and Management
More informationWebWatcher: A Tour Guide for the World Wide Web. Dayne Freitag. Carnegie Mellon University. in intelligent agents.
WebWatcher: A Tour Guide for the World Wide Web Thorsten Joachims Universitat Dortmund Informatik-LS8 Baroper Str. 301 44221 Dortmund, Germany Dayne Freitag Carnegie Mellon University School of Computer
More informationAdaptive Translation between User s Vocabulary and Internet Queries
Adaptive Translation between User s Vocabulary and Internet Queries Agnieszka Indyka Piasecka 1 and Maciej Piasecki 2 1 Wroc law University of Technology, Information Science Department 2 Wroc law University
More informationInformation Visualization of Attributed Relational Data
Information Visualization of Attributed Relational Data Mao Lin Huang Department of Computer Systems Faculty of Information Technology University of Technology, Sydney PO Box 123 Broadway, NSW 2007 Australia
More informationPersona: A Contextualized and Personalized Web Search
0-7695-435-9/02 $7.00 (c) 2002 IEEE Persona: A Contextualized and Personalized Web Search Francisco Tanudjaja and Lik Mui Laboratory of Computer Science at MIT fcstanud@alum.mit.edu - lmui@lcs.mit.edu
More informationTraining Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object
Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object Anne Monceaux 1, Joanna Guss 1 1 EADS-CCR, Centreda 1, 4 Avenue Didier Daurat 31700 Blagnac France
More informationInverted files and dynamic signature files for optimisation of Web directories
s and dynamic signature files for optimisation of Web directories Fidel Cacheda, Angel Viña Department of Information and Communication Technologies Facultad de Informática, University of A Coruña Campus
More informationWeb Document Clustering
Web Document Clustering Lab Project based on the MDL clustering suite http://www.cs.ccsu.edu/~markov/mdlclustering/ Zdravko Markov Computer Science Department Central Connecticut State University New Britain,
More information2 AIMS: an Agent-based Intelligent Tool for Informational Support
Aroyo, L. & Dicheva, D. (2000). Domain and user knowledge in a web-based courseware engineering course, knowledge-based software engineering. In T. Hruska, M. Hashimoto (Eds.) Joint Conference knowledge-based
More informationReverse Engineering of Relational Databases to Ontologies: An Approach Based on an Analysis of HTML Forms
Reverse Engineering of Relational Databases to Ontologies: An Approach Based on an Analysis of HTML Forms Irina Astrova 1, Bela Stantic 2 1 Tallinn University of Technology, Ehitajate tee 5, 19086 Tallinn,
More informationA MACHINE LEARNING APPROACH TO FILTER UNWANTED MESSAGES FROM ONLINE SOCIAL NETWORKS
A MACHINE LEARNING APPROACH TO FILTER UNWANTED MESSAGES FROM ONLINE SOCIAL NETWORKS Charanma.P 1, P. Ganesh Kumar 2, 1 PG Scholar, 2 Assistant Professor,Department of Information Technology, Anna University
More informationHow To Cluster On A Search Engine
Volume 2, Issue 2, February 2012 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: A REVIEW ON QUERY CLUSTERING
More informationSearch engine ranking
Proceedings of the 7 th International Conference on Applied Informatics Eger, Hungary, January 28 31, 2007. Vol. 2. pp. 417 422. Search engine ranking Mária Princz Faculty of Technical Engineering, University
More informationIndex Terms Domain name, Firewall, Packet, Phishing, URL.
BDD for Implementation of Packet Filter Firewall and Detecting Phishing Websites Naresh Shende Vidyalankar Institute of Technology Prof. S. K. Shinde Lokmanya Tilak College of Engineering Abstract Packet
More informationProfile Based Personalized Web Search and Download Blocker
Profile Based Personalized Web Search and Download Blocker 1 K.Sheeba, 2 G.Kalaiarasi Dhanalakshmi Srinivasan College of Engineering and Technology, Mamallapuram, Chennai, Tamil nadu, India Email: 1 sheebaoec@gmail.com,
More informationWeb-based Multimedia Content Management System for Effective News Personalization on Interactive Broadcasting
Web-based Multimedia Content Management System for Effective News Personalization on Interactive Broadcasting S.N.CHEONG AZHAR K.M. M. HANMANDLU Faculty Of Engineering, Multimedia University, Jalan Multimedia,
More informationSelf Organizing Maps for Visualization of Categories
Self Organizing Maps for Visualization of Categories Julian Szymański 1 and Włodzisław Duch 2,3 1 Department of Computer Systems Architecture, Gdańsk University of Technology, Poland, julian.szymanski@eti.pg.gda.pl
More informationAn Open Platform for Collecting Domain Specific Web Pages and Extracting Information from Them
An Open Platform for Collecting Domain Specific Web Pages and Extracting Information from Them Vangelis Karkaletsis and Constantine D. Spyropoulos NCSR Demokritos, Institute of Informatics & Telecommunications,
More informationIntinno: A Web Integrated Digital Library and Learning Content Management System
Intinno: A Web Integrated Digital Library and Learning Content Management System Synopsis of the Thesis to be submitted in Partial Fulfillment of the Requirements for the Award of the Degree of Master
More informationExperiments in Web Page Classification for Semantic Web
Experiments in Web Page Classification for Semantic Web Asad Satti, Nick Cercone, Vlado Kešelj Faculty of Computer Science, Dalhousie University E-mail: {rashid,nick,vlado}@cs.dal.ca Abstract We address
More informationSearch and Information Retrieval
Search and Information Retrieval Search on the Web 1 is a daily activity for many people throughout the world Search and communication are most popular uses of the computer Applications involving search
More informationText Classification Using Symbolic Data Analysis
Text Classification Using Symbolic Data Analysis Sangeetha N 1 Lecturer, Dept. of Computer Science and Applications, St Aloysius College (Autonomous), Mangalore, Karnataka, India. 1 ABSTRACT: In the real
More informationAnnotea and Semantic Web Supported Collaboration
Annotea and Semantic Web Supported Collaboration Marja-Riitta Koivunen, Ph.D. Annotea project Abstract Like any other technology, the Semantic Web cannot succeed if the applications using it do not serve
More informationA Hybrid Approach for Ontology Integration
A Hybrid Approach for Ontology Integration Ahmed Alasoud Volker Haarslev Nematollaah Shiri Concordia University Concordia University Concordia University 1455 De Maisonneuve Blvd. West 1455 De Maisonneuve
More informationSearch Result Optimization using Annotators
Search Result Optimization using Annotators Vishal A. Kamble 1, Amit B. Chougule 2 1 Department of Computer Science and Engineering, D Y Patil College of engineering, Kolhapur, Maharashtra, India 2 Professor,
More informationBuilding A Smart Academic Advising System Using Association Rule Mining
Building A Smart Academic Advising System Using Association Rule Mining Raed Shatnawi +962795285056 raedamin@just.edu.jo Qutaibah Althebyan +962796536277 qaalthebyan@just.edu.jo Baraq Ghalib & Mohammed
More informationUtilising Ontology-based Modelling for Learning Content Management
Utilising -based Modelling for Learning Content Management Claus Pahl, Muhammad Javed, Yalemisew M. Abgaz Centre for Next Generation Localization (CNGL), School of Computing, Dublin City University, Dublin
More informationThe 2006 IEEE / WIC / ACM International Conference on Web Intelligence Hong Kong, China
WISE: Hierarchical Soft Clustering of Web Page Search based on Web Content Mining Techniques Ricardo Campos 1, 2 Gaël Dias 2 Célia Nunes 2 1 Instituto Politécnico de Tomar Tomar, Portugal 2 Centre of Human
More informationWeb Mining using Artificial Ant Colonies : A Survey
Web Mining using Artificial Ant Colonies : A Survey Richa Gupta Department of Computer Science University of Delhi ABSTRACT : Web mining has been very crucial to any organization as it provides useful
More informationData Discovery on the Information Highway
Data Discovery on the Information Highway Susan Gauch Introduction Information overload on the Web Many possible search engines Need intelligent help to select best information sources customize results
More informationWeb Mining. Margherita Berardi LACAM. Dipartimento di Informatica Università degli Studi di Bari berardi@di.uniba.it
Web Mining Margherita Berardi LACAM Dipartimento di Informatica Università degli Studi di Bari berardi@di.uniba.it Bari, 24 Aprile 2003 Overview Introduction Knowledge discovery from text (Web Content
More informationLDA Based Security in Personalized Web Search
LDA Based Security in Personalized Web Search R. Dhivya 1 / PG Scholar, B. Vinodhini 2 /Assistant Professor, S. Karthik 3 /Prof & Dean Department of Computer Science & Engineering SNS College of Technology
More informationAssociation rules for improving website effectiveness: case analysis
Association rules for improving website effectiveness: case analysis Maja Dimitrijević, The Higher Technical School of Professional Studies, Novi Sad, Serbia, dimitrijevic@vtsns.edu.rs Tanja Krunić, The
More informationUsing LSI for Implementing Document Management Systems Turning unstructured data from a liability to an asset.
White Paper Using LSI for Implementing Document Management Systems Turning unstructured data from a liability to an asset. Using LSI for Implementing Document Management Systems By Mike Harrison, Director,
More informationResearch of Postal Data mining system based on big data
3rd International Conference on Mechatronics, Robotics and Automation (ICMRA 2015) Research of Postal Data mining system based on big data Xia Hu 1, Yanfeng Jin 1, Fan Wang 1 1 Shi Jiazhuang Post & Telecommunication
More informationAdaptive Probing: A Monitoring-Based Probing Approach for Fault Localization in Networks
Adaptive Probing: A Monitoring-Based Probing Approach for Fault Localization in Networks Akshay Kumar *, R. K. Ghosh, Maitreya Natu *Student author Indian Institute of Technology, Kanpur, India Tata Research
More informationA Framework for the Delivery of Personalized Adaptive Content
A Framework for the Delivery of Personalized Adaptive Content Colm Howlin CCKF Limited Dublin, Ireland colm.howlin@cckf-it.com Danny Lynch CCKF Limited Dublin, Ireland colm.howlin@cckf-it.com Abstract
More informationQuery Recommendation employing Query Logs in Search Optimization
1917 Query Recommendation employing Query Logs in Search Optimization Neha Singh Department of Computer Science, Shri Siddhi Vinayak Group of Institutions, Bareilly Email: singh26.neha@gmail.com Dr Manish
More informationMining Text Data: An Introduction
Bölüm 10. Metin ve WEB Madenciliği http://ceng.gazi.edu.tr/~ozdemir Mining Text Data: An Introduction Data Mining / Knowledge Discovery Structured Data Multimedia Free Text Hypertext HomeLoan ( Frank Rizzo
More information131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10
1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom
More informationImportance of Domain Knowledge in Web Recommender Systems
Importance of Domain Knowledge in Web Recommender Systems Saloni Aggarwal Student UIET, Panjab University Chandigarh, India Veenu Mangat Assistant Professor UIET, Panjab University Chandigarh, India ABSTRACT
More informationExtending a Web Browser with Client-Side Mining
Extending a Web Browser with Client-Side Mining Hongjun Lu, Qiong Luo, Yeuk Kiu Shun Hong Kong University of Science and Technology Department of Computer Science Clear Water Bay, Kowloon Hong Kong, China
More informationPersonalization of Web Search With Protected Privacy
Personalization of Web Search With Protected Privacy S.S DIVYA, R.RUBINI,P.EZHIL Final year, Information Technology,KarpagaVinayaga College Engineering and Technology, Kanchipuram [D.t] Final year, Information
More informationONTOLOGY-BASED GENERIC TEMPLATE FOR RETAIL ANALYTICS
ONTOLOGY-BASED GENERIC TEMPLATE FOR RETAIL ANALYTICS Kenneth I. Davou 1 and Rosnah Idrus 2 1 Universiti Sains Malaysia (USM), Malaysia, kid11_ttm071@student.usm.my 2 Universiti Sains Malaysia (USM), Malaysia,
More informationAN EFFICIENT APPROACH TO PERFORM PRE-PROCESSING
AN EFFIIENT APPROAH TO PERFORM PRE-PROESSING S. Prince Mary Research Scholar, Sathyabama University, hennai- 119 princemary26@gmail.com E. Baburaj Department of omputer Science & Engineering, Sun Engineering
More informationAutomated Collaborative Filtering Applications for Online Recruitment Services
Automated Collaborative Filtering Applications for Online Recruitment Services Rachael Rafter, Keith Bradley, Barry Smyth Smart Media Institute, Department of Computer Science, University College Dublin,
More informationAnalysis of Social Media Streams
Fakultätsname 24 Fachrichtung 24 Institutsname 24, Professur 24 Analysis of Social Media Streams Florian Weidner Dresden, 21.01.2014 Outline 1.Introduction 2.Social Media Streams Clustering Summarization
More informationEmail Spam Detection Using Customized SimHash Function
International Journal of Research Studies in Computer Science and Engineering (IJRSCSE) Volume 1, Issue 8, December 2014, PP 35-40 ISSN 2349-4840 (Print) & ISSN 2349-4859 (Online) www.arcjournals.org Email
More informationBlog Post Extraction Using Title Finding
Blog Post Extraction Using Title Finding Linhai Song 1, 2, Xueqi Cheng 1, Yan Guo 1, Bo Wu 1, 2, Yu Wang 1, 2 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing 2 Graduate School
More informationA COMBINED TEXT MINING METHOD TO IMPROVE DOCUMENT MANAGEMENT IN CONSTRUCTION PROJECTS
A COMBINED TEXT MINING METHOD TO IMPROVE DOCUMENT MANAGEMENT IN CONSTRUCTION PROJECTS Caldas, Carlos H. 1 and Soibelman, L. 2 ABSTRACT Information is an important element of project delivery processes.
More informationVisCG: Creating an Eclipse Call Graph Visualization Plug-in. Kenta Hasui, Undergraduate Student at Vassar College Class of 2015
VisCG: Creating an Eclipse Call Graph Visualization Plug-in Kenta Hasui, Undergraduate Student at Vassar College Class of 2015 Abstract Call graphs are a useful tool for understanding software; however,
More informationA Semantically Enriched Competency Management System to Support the Analysis of a Web-based Research Network
A Semantically Enriched Competency Management System to Support the Analysis of a Web-based Research Network Paola Velardi University of Roma La Sapienza Italy velardi@di.uniroma1.it Alessandro Cucchiarelli
More informationA Framework of Personalized Intelligent Document and Information Management System
A Framework of Personalized Intelligent and Information Management System Xien Fan Department of Computer Science, College of Staten Island, City University of New York, Staten Island, NY 10314, USA Fang
More informationEffective User Navigation in Dynamic Website
Effective User Navigation in Dynamic Website Ms.S.Nithya Assistant Professor, Department of Information Technology Christ College of Engineering and Technology Puducherry, India Ms.K.Durga,Ms.A.Preeti,Ms.V.Saranya
More informationIntegrating User Data and Collaborative Filtering in a Web Recommendation System
Integrating User Data and Collaborative Filtering in a Web Recommendation System Paolo Buono, Maria Francesca Costabile, Stefano Guida, Antonio Piccinno, Giuseppe Tesoro Dipartimento di Informatica, Università
More informationAn Intelligent Matching System for the Products of Small Business/Manufactures with the Celebrities
An Intelligent Matching System for the Products of Small Business/Manufactures with the Celebrities Junho Jeong 1, Yunsik Son 2, Seokhoon Ko 1 and Seman Oh 1 1 Dept. of Computer Engineering, Dongguk University,
More informationKEYWORD SEARCH IN RELATIONAL DATABASES
KEYWORD SEARCH IN RELATIONAL DATABASES N.Divya Bharathi 1 1 PG Scholar, Department of Computer Science and Engineering, ABSTRACT Adhiyamaan College of Engineering, Hosur, (India). Data mining refers to
More informationNNMi120 Network Node Manager i Software 9.x Essentials
NNMi120 Network Node Manager i Software 9.x Essentials Instructor-Led Training For versions 9.0 9.2 OVERVIEW This course is designed for those Network and/or System administrators tasked with the installation,
More informationFlorida International University - University of Miami TRECVID 2014
Florida International University - University of Miami TRECVID 2014 Miguel Gavidia 3, Tarek Sayed 1, Yilin Yan 1, Quisha Zhu 1, Mei-Ling Shyu 1, Shu-Ching Chen 2, Hsin-Yu Ha 2, Ming Ma 1, Winnie Chen 4,
More informationA UPS Framework for Providing Privacy Protection in Personalized Web Search
A UPS Framework for Providing Privacy Protection in Personalized Web Search V. Sai kumar 1, P.N.V.S. Pavan Kumar 2 PG Scholar, Dept. of CSE, G Pulla Reddy Engineering College, Kurnool, Andhra Pradesh,
More informationPerformance evaluation of Web Information Retrieval Systems and its application to e-business
Performance evaluation of Web Information Retrieval Systems and its application to e-business Fidel Cacheda, Angel Viña Departament of Information and Comunications Technologies Facultad de Informática,
More informationWEB SITE OPTIMIZATION THROUGH MINING USER NAVIGATIONAL PATTERNS
WEB SITE OPTIMIZATION THROUGH MINING USER NAVIGATIONAL PATTERNS Biswajit Biswal Oracle Corporation biswajit.biswal@oracle.com ABSTRACT With the World Wide Web (www) s ubiquity increase and the rapid development
More informationFacilitating Knowledge Intelligence Using ANTOM with a Case Study of Learning Religion
Facilitating Knowledge Intelligence Using ANTOM with a Case Study of Learning Religion Herbert Y.C. Lee 1, Kim Man Lui 1 and Eric Tsui 2 1 Marvel Digital Ltd., Hong Kong {Herbert.lee,kimman.lui}@marvel.com.hk
More informationKOINOTITES: A Web Usage Mining Tool for Personalization
KOINOTITES: A Web Usage Mining Tool for Personalization Dimitrios Pierrakos Inst. of Informatics and Telecommunications, dpie@iit.demokritos.gr Georgios Paliouras Inst. of Informatics and Telecommunications,
More informationA Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks
A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks Text Analytics World, Boston, 2013 Lars Hard, CTO Agenda Difficult text analytics tasks Feature extraction Bio-inspired
More informationChapter-1 : Introduction 1 CHAPTER - 1. Introduction
Chapter-1 : Introduction 1 CHAPTER - 1 Introduction This thesis presents design of a new Model of the Meta-Search Engine for getting optimized search results. The focus is on new dimension of internet
More informationA FUZZY BASED APPROACH TO TEXT MINING AND DOCUMENT CLUSTERING
A FUZZY BASED APPROACH TO TEXT MINING AND DOCUMENT CLUSTERING Sumit Goswami 1 and Mayank Singh Shishodia 2 1 Indian Institute of Technology-Kharagpur, Kharagpur, India sumit_13@yahoo.com 2 School of Computer
More informationInternational Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 INTELLIGENT MULTIDIMENSIONAL DATABASE INTERFACE Mona Gharib Mohamed Reda Zahraa E. Mohamed Faculty of Science,
More informationDesign and Implementation of Domain based Semantic Hidden Web Crawler
Design and Implementation of Domain based Semantic Hidden Web Crawler Manvi Department of Computer Engineering YMCA University of Science & Technology Faridabad, India Ashutosh Dixit Department of Computer
More informationA Survey on Web Mining From Web Server Log
A Survey on Web Mining From Web Server Log Ripal Patel 1, Mr. Krunal Panchal 2, Mr. Dushyantsinh Rathod 3 1 M.E., 2,3 Assistant Professor, 1,2,3 computer Engineering Department, 1,2 L J Institute of Engineering
More informationI. INTRODUCTION NOESIS ONTOLOGIES SEMANTICS AND ANNOTATION
Noesis: A Semantic Search Engine and Resource Aggregator for Atmospheric Science Sunil Movva, Rahul Ramachandran, Xiang Li, Phani Cherukuri, Sara Graves Information Technology and Systems Center University
More informationDynamical Clustering of Personalized Web Search Results
Dynamical Clustering of Personalized Web Search Results Xuehua Shen CS Dept, UIUC xshen@cs.uiuc.edu Hong Cheng CS Dept, UIUC hcheng3@uiuc.edu Abstract Most current search engines present the user a ranked
More informationNatural Language Updates to Databases through Dialogue
Natural Language Updates to Databases through Dialogue Michael Minock Department of Computing Science Umeå University, Sweden Abstract. This paper reopens the long dormant topic of natural language updates
More informationNetwork Big Data: Facing and Tackling the Complexities Xiaolong Jin
Network Big Data: Facing and Tackling the Complexities Xiaolong Jin CAS Key Laboratory of Network Data Science & Technology Institute of Computing Technology Chinese Academy of Sciences (CAS) 2015-08-10
More informationChapter 6. Attracting Buyers with Search, Semantic, and Recommendation Technology
Attracting Buyers with Search, Semantic, and Recommendation Technology Learning Objectives Using Search Technology for Business Success Organic Search and Search Engine Optimization Recommendation Engines
More informationSemantically Enhanced Web Personalization Approaches and Techniques
Semantically Enhanced Web Personalization Approaches and Techniques Dario Vuljani, Lidia Rovan, Mirta Baranovi Faculty of Electrical Engineering and Computing, University of Zagreb Unska 3, HR-10000 Zagreb,
More informationRemote support for lab activities in educational institutions
Remote support for lab activities in educational institutions Marco Mari 1, Agostino Poggi 1, Michele Tomaiuolo 1 1 Università di Parma, Dipartimento di Ingegneria dell'informazione 43100 Parma Italy {poggi,mari,tomamic}@ce.unipr.it,
More informationHELP DESK SYSTEMS. Using CaseBased Reasoning
HELP DESK SYSTEMS Using CaseBased Reasoning Topics Covered Today What is Help-Desk? Components of HelpDesk Systems Types Of HelpDesk Systems Used Need for CBR in HelpDesk Systems GE Helpdesk using ReMind
More informationData Mining for Web Personalization
3 Data Mining for Web Personalization Bamshad Mobasher Center for Web Intelligence School of Computer Science, Telecommunication, and Information Systems DePaul University, Chicago, Illinois, USA mobasher@cs.depaul.edu
More informationA CLIENT-ORIENTATED DYNAMIC WEB SERVER. Cristina Hava Muntean, Jennifer McManis, John Murphy 1 and Liam Murphy 2. Abstract
A CLIENT-ORIENTATED DYNAMIC WEB SERVER Cristina Hava Muntean, Jennifer McManis, John Murphy 1 and Liam Murphy 2 Abstract The cost of computer systems has decreased continuously in recent years, leading
More informationOntology-Based Filtering Mechanisms for Web Usage Patterns Retrieval
Ontology-Based Filtering Mechanisms for Web Usage Patterns Retrieval Mariângela Vanzin, Karin Becker, and Duncan Dubugras Alcoba Ruiz Faculdade de Informática - Pontifícia Universidade Católica do Rio
More informationSEARCH ENGINE WITH PARALLEL PROCESSING AND INCREMENTAL K-MEANS FOR FAST SEARCH AND RETRIEVAL
SEARCH ENGINE WITH PARALLEL PROCESSING AND INCREMENTAL K-MEANS FOR FAST SEARCH AND RETRIEVAL Krishna Kiran Kattamuri 1 and Rupa Chiramdasu 2 Department of Computer Science Engineering, VVIT, Guntur, India
More informationGraph Mining and Social Network Analysis
Graph Mining and Social Network Analysis Data Mining and Text Mining (UIC 583 @ Politecnico di Milano) References Jiawei Han and Micheline Kamber, "Data Mining: Concepts and Techniques", The Morgan Kaufmann
More informationINTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & TECHNOLOGY (IJCET)
INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & TECHNOLOGY (IJCET) International Journal of Computer Engineering and Technology (IJCET), ISSN 0976 6367(Print), ISSN 0976 6367(Print) ISSN 0976 6375(Online)
More informationNOVEL APPROCH FOR OFT BASED WEB DOMAIN PREDICTION
Volume 3, No. 7, July 2012 Journal of Global Research in Computer Science RESEARCH ARTICAL Available Online at www.jgrcs.info NOVEL APPROCH FOR OFT BASED WEB DOMAIN PREDICTION A. Niky Singhai 1, B. Prof
More informationQualitative Corporate Dashboards for Corporate Monitoring Peng Jia and Miklos A. Vasarhelyi 1
Qualitative Corporate Dashboards for Corporate Monitoring Peng Jia and Miklos A. Vasarhelyi 1 Introduction Electronic Commerce 2 is accelerating dramatically changes in the business process. Electronic
More informationData Quality Mining: Employing Classifiers for Assuring consistent Datasets
Data Quality Mining: Employing Classifiers for Assuring consistent Datasets Fabian Grüning Carl von Ossietzky Universität Oldenburg, Germany, fabian.gruening@informatik.uni-oldenburg.de Abstract: Independent
More informationAutomatic Annotation Wrapper Generation and Mining Web Database Search Result
Automatic Annotation Wrapper Generation and Mining Web Database Search Result V.Yogam 1, K.Umamaheswari 2 1 PG student, ME Software Engineering, Anna University (BIT campus), Trichy, Tamil nadu, India
More informationAWERProcedia Information Technology & Computer Science
AWERProcedia Information Technology & Computer Science Vol 03 (2013) 1157-1162 3 rd World Conference on Information Technology (WCIT-2012) Webification of Software Development: General Outline and the
More informationA HYBRID RULE BASED FUZZY-NEURAL EXPERT SYSTEM FOR PASSIVE NETWORK MONITORING
A HYBRID RULE BASED FUZZY-NEURAL EXPERT SYSTEM FOR PASSIVE NETWORK MONITORING AZRUDDIN AHMAD, GOBITHASAN RUDRUSAMY, RAHMAT BUDIARTO, AZMAN SAMSUDIN, SURESRAWAN RAMADASS. Network Research Group School of
More information