COMPUTER SUPPORT OF SEMANTIC TEXT ANALYSIS OF A TECHNICAL SPECIFICATION ON DESIGNING SOFTWARE. Alla Zaboleeva-Zotova, Yulia Orlova




International Book Series "Information Science and Computing"

Abstract: This work is devoted to the development of a computer-aided system for semantic text analysis of a technical specification. The purpose of the work is to increase the efficiency of software engineering by automating the semantic text analysis of a technical specification. A technique for the text analysis of a technical specification is proposed and investigated; an expanded fuzzy attribute grammar of a technical specification, intended to formalize a limited subset of the Russian language, is constructed for the analysis of the sentences of the specification text; the stylistic features of the technical specification as a class of documents are considered; and recommendations on preparing the text of a technical specification for automated processing are formulated. The computer-aided system of semantic text analysis of a technical specification is described. The system consists of the following subsystems: preliminary text processing, syntactic and semantic analysis and construction of software models, storage of documents, and interface.

Keywords: natural language, semantic text analysis, technical specification.

ACM Classification Keywords: I.2.7 Natural Language Processing

Conference: The paper is selected from the Second International Conference "Intelligent Information and Engineering Systems" INFOS 2009, Varna, Bulgaria, June-July 2009

Introduction

Software design currently includes the development of requirements or a technical specification (TS), a system or technical project (TP), programming or design engineering, trial operation, support and improvement. It is necessary to take into account how all basic parts of the software design process depend on the toolkit, the technologies and the organization of work. The majority of work in the area of CAD systems is directed at creating and improving toolkits for automating the design process.
A significant contribution to the development of CAD systems was made by V.I.Averchenkov, G.S.Altshuller, A.V.Andrejchikov, N.P.Buslenko, V.P.Bykov, B.S.Voinov, G.D.Volkova, V.Gasparsky, J.K.Jones, J.Dickson, M.F.Zaripov, V.A.Kamayev, K.V.Kumunzhiev, V.M.Kurejchik, P.M.Mazurkin, I.Muller, I.P.Norenkov, I.J.Petrova, A.I.Polovinkin, A.F.Pohilko, J.M.Solomentsev, F.Hanzen, P.Hill, A.Holl and others. Most well-known commercial software products used in software design are intended mainly for visualizing the intermediate and final results of the design process. Some of them fully automate the last design stages: code generation, creation of reporting and accompanying documentation, etc. The problem of automating the initial stage of design, that is, the formation and analysis of the text of the technical project, therefore remains open. This is due to the extraordinary complexity of synthesizing and analysing the semantics of technical text, which requires methods of artificial intelligence, applied linguistics, psychology, etc. However, it is possible to approach this goal by singling out some small subtasks that are quite solvable by known translation methods. From the above it can be concluded that the problem of creating tools for automating the design process is topical [1]. The main idea of the direction being developed is the realization of unified design procedures that equally meet the requirements of the expert designer and the requirements of software modelling technology.

The purpose of this work is to increase the efficiency of software engineering by automating the semantic text analysis of a technical specification. To achieve this purpose it is necessary to solve the following tasks:
1. To analyse the software engineering process and models of semantic text analysis;
2. To develop a technique for the text analysis of a technical specification;
3. To develop and investigate a semantic model of the text of a technical specification;
4. To develop algorithmic support for the analysis of the text of a technical specification and the automatic construction of software models;
5. To implement the developed formalisms, technique and algorithms as a system for automating the initial stage of software design.

A Technique Of The Text Analysis Of A Technical Specification

A technique for the analysis of the text of a technical specification is proposed and investigated; a fuzzy attribute grammar of a technical specification, intended to formalize a limited subset of Russian, is constructed for the analysis of the sentences of the specification text; the stylistic features of the technical specification as a class of documents are considered; and recommendations on preparing the text of a technical specification for automated processing are formulated. The technique consists of three stages: semantic text processing, creation of the frame structure, and creation of data flow diagrams of the system described in the technical specification (see Figure 1).

Figure 1: Technique Of The Text Analysis Of A Technical Specification. (Figure not reproduced. It relates three levels of formalization to their realization: natural language, described by the fuzzy attribute grammar and realized as the text of a technical specification in limited natural language; internal representation, described by the frame structure and realized by the creation of frames; and the formal model, described by data flow diagrams of the general structure and of the functions carried out, and realized by the creation of diagrams.)

To implement the first stage of the technique, a semantic model of the text of a technical specification has been developed, which includes the requirements formulated as a document in limited natural language; for the second stage, a frame structure that serves as the internal representation of the requirements; and for the third stage, a model of the software as a description of the requirements in the graphic language of Data Flow Diagrams. The semantic model of the text of a technical specification contains the developed expanded fuzzy attribute grammar over the frame structure of the formal document "Technical specification", which allows the contents of the TS to be represented most fully. The expanded fuzzy attribute grammar, necessary for the automated analysis of the text of a technical specification, is defined as:

$AG = \langle N, T, P, S, B, F, A, D(A) \rangle$,

where $N$ is a finite set of non-terminal symbols; $T$ is a set of terminal symbols disjoint from $N$; $P$ is a finite set of rules; $S$ is a distinguished symbol from $N$, called the initial symbol; $B$ is the set of linguistic variables $\beta_{k,i}$ corresponding to the terminal symbols $T$ (variable $i$ at level $k$); $F$ is the set of membership functions $f_{k,i}$ that determine the degree of membership $m_{k,i}$ of the linguistic variables $\beta_{k,i}$; $A$ is the set of attributes, $A = A_{sin} \cup A_{sem}$, where $A_{sin}$ are syntactic attributes and $A_{sem}$ are semantic attributes; $D(A)$ is a finite set of semantic actions. A fragment of the grammar is presented in Table 1.
Table 1: Fragment of the developed fuzzy attribute grammar over the frame structure of a technical specification
(each linguistic variable or non-terminal is followed by its rule and semantic actions)

β1 <list of incoming data flows>
    <incoming data flow name> :: 'Name' <incoming data flow description> :: 'Contents' <list of incoming data flows> | ε
<incoming data flow description>
    The text containing "entrance" or "entrance data" :: 'Clause'
β1,2 <incoming data flow>
    <incoming data flow> :: Frame Data Flow = Creation, Input = Giving
    [<Number of data units>] :: Slot AMOUNT OF DATA = Giving
    [<Type of data>] :: Slot TYPE OF DATA = Giving
    <the Name of incoming data flow> :: Slot NAME OF INCOMING DATA FLOW = Giving
β2 <function specification>
    <function type> <name of the functions list> :: 'Name' <function description> :: Frame FUNCTION = Creation ; <List of functions> | ε
β2,1 <function type>
    «main» «basic» «additional»
β2,2 <function description>
    <Name of function> :: 'Name', Slot NAME OF FUNCTION = Giving <List of incoming data flow> <List of outcoming data flow>

The linguistic variables from the set $B = \{\beta_{k,i}\}$ used for the analysis of the text of a technical specification are described by the following quintuple: $\beta_{k,i} = \langle \beta, T(\beta), U, G, M \rangle$, where
$\beta$ is the name of the linguistic variable (basis for development, purpose of development, technical requirements to the program product, stages and development cycles, etc.);
$T(\beta)$ are the language expressions. For linguistic variables of the top level these are the linguistic variables corresponding to the terminals of the right part of a rule. For linguistic variables of the bottom level these are fuzzy variables, that is, expressions of a natural language;
$U$ is the set of all possible values, $T(\beta) \subseteq U$;
$G$ are the rules of the morphological and syntactic description of language expressions, which determine the syntactic attributes $A_{sin}$;

$M$ is a semantic rule for linguistic variables, which is induced by the morphological and syntactic rules, since the sense of a term in $T$ is in part determined by its syntactic tree, and by the semantic attributes $A_{sem}$.

The connections between rules are translated into the language of fuzzy mathematics: connections are represented by fuzzy relations, predicates and rules, and a sequence of transformations of these relations is treated as a process of fuzzy inference. Linguistic variables of the top level are compound, that is, they include linguistic variables of the bottom level. Because of this it is possible to construct a tree of linguistic variables and to establish the dependences between them. The membership functions from the set $F = \{f_{k,i}\}$ of the linguistic variables $\{\beta_{k,i}\}$ are necessary for constructing a fuzzy inference. In particular, each rule of the grammar from the set $P$ is put in correspondence with a membership function $f_{k,i}$. This dual system of substitutions is used to calculate the sense of a linguistic variable.

In practice, the grammar of a technical specification is used to split the initial text of the document into sections and to process the sections most important for our problem. This requires precise observance of the structure of the document. A technical specification is a structured text consisting of a sequence of preset sections. The frame structure of the technical specification is represented as the tuple

$\langle N, F, I, O \rangle$,

where $N$ is the name of the system, $F$ is the vector of system functions, $I$ is the vector of incoming data flows, and $O$ is the vector of outgoing data flows. If $F = \langle F_1, F_2, \ldots, F_k \rangle$, then

$F_i = \langle N^F, I^F, D^F, G^F, H^F, O^F \rangle$,

where $N^F$ is the name of the function, $I^F$ is the vector of incoming data flows of the function, $D^F$ is the name of the action carried out by the function, $G^F$ is the subject of the function's action, $H^F$ are the restrictions on the function, and $O^F$ is the vector of outgoing data flows of the function. Let us denote a data flow by $DF$ (Data Flow); then the elements of $I$, $O$, $I^F$ and $O^F$ are described by:

$DF = \langle N^{DF}, D^{DF}, T^{DF}, C^{DF} \rangle$,

where $N^{DF}$ is the data flow name, $D^{DF}$ is the data flow direction, $T^{DF}$ is the data type in the flow, and $C^{DF}$ is the number of data units per frame.
The proposed model is represented as a frame model with a-kind-of links (see Figure 2).

Figure 2: Frame network (figure not reproduced)
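The frame structure described above can be sketched as plain data classes. This is only an illustrative sketch, not the authors' implementation (the paper states the system was written in C#): the class names, field names, the "in"/"out" encoding of the flow direction and the order-processing example are all assumptions introduced here.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class DataFlow:
    """Data flow frame: name, direction, data type, data units per frame."""
    name: str
    direction: str        # "in" or "out"; this encoding is an assumption
    data_type: str = ""
    units: int = 1


@dataclass
class Function:
    """Function frame: name, inputs, action, subject, restrictions, outputs."""
    name: str
    inputs: List[DataFlow] = field(default_factory=list)
    action: str = ""
    subject: str = ""
    restrictions: List[str] = field(default_factory=list)
    outputs: List[DataFlow] = field(default_factory=list)


@dataclass
class SystemFrame:
    """Top-level frame: system name, functions, incoming and outgoing flows."""
    name: str
    functions: List[Function] = field(default_factory=list)
    inputs: List[DataFlow] = field(default_factory=list)
    outputs: List[DataFlow] = field(default_factory=list)


# Hypothetical example: one function of an order-processing system.
order = DataFlow("order", "in", "record")
invoice = DataFlow("invoice", "out", "document")
billing = Function("billing", inputs=[order], action="issue",
                   subject="invoice", outputs=[invoice])
system = SystemFrame("order processing", functions=[billing],
                     inputs=[order], outputs=[invoice])
```

Keeping the function-level inputs and outputs as the same `DataFlow` type as the system-level ones mirrors the statement that all four vectors are described by one data flow frame.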

Computer-Aided System Of Semantic Text Analysis Of A Technical Specification

The computer-aided system of semantic text analysis of a technical specification consists of the following subsystems: preliminary text processing, syntactic and semantic analysis and construction of software models, storage of documents, and interface (see Figure 3).

Figure 3: Architecture of the computer-aided system of semantic text analysis of a technical specification (figure not reproduced)

Preliminary text processing is needed to split a technical specification into separate lexemes. The input of the subsystem is the text of a technical specification in limited natural language; the output is the tables of sections, sentences and lexemes of the specification. The results can be presented either as the corresponding tables or as a tree of sections. After the first stage, work is done not with the whole text of the technical specification but with its parts, divided into sections. In the course of processing, the specification is split repeatedly into finer and finer sections, then into separate sentences (preserving the section structure) and into lexemes, with an indication of the sentence to which each lexeme belongs. Preliminary text processing is carried out by a finite automaton. While the automaton works, the symbols arriving at its input are collected in a buffer. In certain states of the automaton, the current contents of the buffer are written to one of the tables, after which the buffer is emptied. The automaton runs until it reaches a final state.
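The buffer-and-flush behaviour of the finite automaton can be sketched as follows. This is a simplified sketch under stated assumptions, not the authors' automaton: it has only two states (`WORD` and `GAP`), fills only the sentence and lexeme tables (section handling is omitted), and treats `.`, `!` and `?` as sentence terminators, so numbers such as "4.2" would be split. The function name `split_text` is hypothetical.

```python
def split_text(text):
    """Split text into a sentence table and a lexeme table.

    Returns (sentences, lexemes), where lexemes is a list of
    (sentence_index, lexeme) pairs.
    """
    sentences = []   # table of sentences
    lexemes = []     # table of lexemes with the index of their sentence
    buffer = []      # characters of the lexeme being read
    current = []     # lexemes of the sentence being read
    state = "GAP"    # automaton state: "GAP" (between words) or "WORD"

    def flush_lexeme():
        # Write the buffer to the lexeme table, then empty the buffer.
        word = "".join(buffer)
        lexemes.append((len(sentences), word))
        current.append(word)
        buffer.clear()

    def close_sentence():
        # Write the collected lexemes to the sentence table.
        if current:
            sentences.append(" ".join(current))
            current.clear()

    for ch in text + "\n":  # trailing newline guarantees a final flush
        if ch.isalnum():
            kind = "letter"
        elif ch in ".!?":
            kind = "end"
        else:
            kind = "other"

        # Transition WORD -> GAP: the buffer holds a complete lexeme.
        if state == "WORD" and kind != "letter":
            flush_lexeme()
            state = "GAP"

        if kind == "letter":
            buffer.append(ch)
            state = "WORD"
        elif kind == "end":
            close_sentence()

    close_sentence()  # keep text that lacks a final terminator
    return sentences, lexemes


sentences, lexemes = split_text("The system reads data. It prints a report!")
```

A fuller version would add states for section headings so that the section table and the tree of sections could be built in the same pass.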

After that, the resulting tables are passed to the input of the subsystem of syntactic and semantic analysis. The semantic analysis of the text is based on the developed grammar of the text of a technical specification. Rules of the top level serve for the analysis of top-level sections. The rules for the analysis of sections consist of two parts: the first part serves for the analysis of the section name; the second part serves for the analysis of the text contained in the section. The symbols of the grammar possess syntactic attributes. The attributes of non-terminal symbols specify the names of the frames or slots in which the information obtained during further analysis should be placed. Syntactic attributes of the text can additionally be specified in the attributes of terminal symbols. Words are compared during analysis taking their morphology into account. Syntactic and morphological analysis are performed only when they are necessary, which considerably reduces the time of semantic analysis. Let us consider a fragment of the developed attribute grammar in XML format:

    <global-rule id="section42" comment="Section 4.2. Requirements to functional characteristics">
      <rule><ruleref uri="#section42name"/><ruleref uri="#section42x"/></rule>
    </global-rule>
    <global-rule id="section42name" sectionpart="name" comment="Heading of the unit 4.2.">
      <rule>
        <clause clausetype="uncertain"/>
        <rule type="or"><words contains="functions"/><words contains="functional characteristics"/></rule>
      </rule>
    </global-rule>
    <global-rule id="section42x" frame="FunctionFrame" frameslot="Function" comment="Function">
      <rule><ruleref uri="#section42xname"/><ruleref uri="#section42xcontent"/></rule>
    </global-rule>
    <global-rule id="section42xcontent" sectionpart="content" comment="Inputs and outputs of function">
      <rule><ruleref uri="#section42xinputs" minoccurs="0"/><ruleref uri="#section42xoutputs" minoccurs="0"/></rule>
    </global-rule>
    <global-rule id="section42xinputs" comment="Inputs of function">
      <rule>
        <sentence/><clause/>
        <rule type="or"><words contains="inputs"/><words contains="entrance data"/></rule>
        <ruleref uri="#input" maxoccurs="unbounded"/>
      </rule>
    </global-rule>

The morphological and syntactic modules used in the program are third-party modules. If a rule of the grammar contains a terminal that has a syntactic attribute, the mechanism of syntactic analysis is started for the current sentences [2]. After the parse tree has been created, construction of the frame description of the technical specification begins. For this purpose the information on frames and slot names contained in the attributes of the grammar symbols is used. The resulting frame structure contains the significant information about the system: data about the inputs and outputs of the system, its functions and restrictions. Inputs and outputs are also identified for each function. This makes it possible to obtain, on the basis of the frame structure, the data flow diagrams of the system described in the technical specification. The subsystem "Construction of data flow diagrams" constructs and orders the graph of data flows and also creates the figures of the data flow diagrams in Microsoft Office Visio.
To construct the data flows, the functions whose inputs coincide with system inputs are found first. The functions for which all input data are available are placed on the first level of the diagram, and their inputs are connected to the system inputs. Next, the functions whose inputs coincide with the outputs of the functions obtained at the previous step are found. They are placed on the following level, and their inputs are connected to the outputs of the functions on the previous levels and to the system inputs. The algorithm continues until all functions have been placed on the diagram. After that, the function outputs are connected to the corresponding system outputs.
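The level-by-level placement just described can be sketched as a simple layering loop. This is a minimal sketch, assuming that flows are identified by name and that a function can be placed once every one of its inputs is either a system input or an output of an already placed function. The function name `assign_levels`, the dictionary layout and the error raised for functions that can never be placed are assumptions introduced here, not part of the original algorithm description.

```python
def assign_levels(system_inputs, functions):
    """Group functions into diagram levels.

    functions maps a function name to a pair (inputs, outputs),
    each a set of data flow names. Returns a list of levels,
    each level being a sorted list of function names.
    """
    available = set(system_inputs)   # flows that can feed a function so far
    remaining = dict(functions)      # functions not yet placed
    levels = []

    while remaining:
        # Functions whose inputs are all available go on the current level.
        level = sorted(
            name for name, (inputs, _) in remaining.items()
            if set(inputs) <= available
        )
        if not level:
            # Assumption: report inconsistent specifications instead of looping.
            raise ValueError(f"cannot place functions: {sorted(remaining)}")

        # Outputs of the placed functions become available to later levels.
        for name in level:
            _, outputs = remaining.pop(name)
            available |= set(outputs)
        levels.append(level)

    return levels


# Hypothetical example with three functions and one system input.
levels = assign_levels(
    {"order"},
    {
        "validate": ({"order"}, {"valid_order"}),
        "price": ({"order"}, {"total"}),
        "invoice": ({"valid_order", "total"}, {"invoice"}),
    },
)
```

The explicit error case matters in practice: a specification in which a function's input is produced by nothing would otherwise make the "until all functions are placed" loop run forever.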

The computer-aided system of semantic text analysis of a technical specification is developed on the Microsoft .NET Framework 2.0 platform (development language C#) using the integrated development environment Visual Studio 2005.

Scientific Novelty

The scientific novelty consists in the following: a technique of text analysis of a technical specification at the initial stages of software engineering has been developed, including a semantic model of the text of a technical specification, the transformation of the content of the text into the frame structure, and the construction of a model of the software on its basis.

Practical Value

The practical value of the work is that the development and introduction of the suggested technique raises the quality of software engineering by automating the routine human work of extracting useful information from standard documents and presenting it as software models.

Conclusions and Future Work

Software design differs little from design in other areas of science and technology, so the results of this work can be extended to other areas of human knowledge. The prospects thus opened increase the relevance of this work.

Bibliography

1. Ramsay, A. Computer-aided syntactic description of language systems / A. Ramsay // Computational linguistics. An international handbook on computer-oriented language research and applications. - Boston: Walter de Gruyter, 1989. - P. 204-218.
2. Reyle, U. Natural language parsing and linguistic theories / U. Reyle. - Berlin: Rohrer Dordrecht, 1998. - 625 p.
3. Tools Development For Computer Aided Software Engineering Based On Technical Specification's Text Analysis / A. Zaboleeva-Zotova, Y. Orlova // Interactive Systems And Technologies: The Problems Of Human-Computer Interaction: Proc. of the Int. Conf., Ulyanovsk.

Authors' Information

Alla V. Zaboleeva-Zotova, PhD, professor; CAD department, Volgograd State Technical University, Lenin av., 28, Volgograd, Russia; e-mail: zabzot@vstu.ru

Yulia A. Orlova, PhD; CAD department, Volgograd State Technical University, Lenin av., 28, Volgograd, Russia; e-mail: yulia.orlova@gmail.com