Technology White Paper Evolution of Media Servers for IMS/NGN Services Media Servers (MS) will play a key role in future IMS/NGN networks. More advanced services require more and more complete interaction with the user. Consequently, the Media Servers or Media Resource Platforms that are deployed in today s networks need to be enhanced. IMS, first designed for mobile networks, is now being promoted as a converged solution for both fixed and mobile networks. IMS-compliant Media Servers as defined by IMS standards, aim to provide a decoupled and flexible architecture, thanks to an optional split of the Media Resource Function Control (MRFC) and the Media Resource Function Processor (MRFP). The objective of IMS-compliant media resource functions is to allow media resources to be shared among various applications to facilitate the rapid creation of services. This paper focuses on the evolution of Media Servers. These have moved from stand-alone devices catering to dedicated applications towards flexible devices capable of voice/video/web services integrated with multimedia end devices. Furthermore, Media Servers are poised to play a significant role in triple-play services. They will serve requests from consumer s settop boxes and mobile multimedia terminals/browsers for additional video play-out and streaming. The paper also looks at some advanced aspects of managed customer interaction and the applications of Media Servers in mobile networks. The required protocols and extensions to existing protocols to achieve such advanced services are addressed, too. The paper concludes with a summary of the positioning of Alcatel s offering, including the Alcatel 8688 MRF and 8788 MRP.
Introduction.......................................................1 Applications and Media Servers....................................1 IP Multimedia Subsystems and Media Servers......................2 Charging............................................................4 Efficient utilization of Media Servers...............................4 Alcatel s Media Resource Processing Products......................5 Alcatel solutions......................................................5 Conclusion.........................................................5 Abbreviations.....................................................5 References........................................................6 Acknowledgement.................................................6 Biography.........................................................6 ALCATEL
Introduction Traditional service providers are moving away from Time Division Multiplex (TDM)-based circuit switched networks and transitioning towards packet-based networks for scalability, efficiency, and flexibility. Service providers have relied upon TDM technologies with SS7 signaling to provide Intelligent Network (IN) services such as prepaid calling, network announcements, and other services. All these services require some form of media such as announcements, updates to customer accounts, etc. that need to be served/played to the end users. Computer-Telephony Integration (CTI) has introduced a new set of services that can address a wide range of markets/applications. Next-generation services are rapidly evolving supporting voice, video, and data multi-media servicebearing capability with Quality of service (QoS) assurances. A Media Server (MS) is a network entity that may be shared across several applications and provides common media resources such as Text-To-Speech (TTS), Automatic Speech recognition, Video Conferencing, etc. A Next Generation Network (NGN) [1] is defined as a packet network in which transport elements such as switches and routers are logically/physically separated from the service/session control plane supporting multimedia services. The prime characteristics of NGN services are: a) ubiquitous, real-time, multi-media communications, b) more personal intelligence distributed throughout the network, c) more simplicity for users and personal service customization and management. As a result, the NGN enabling such advanced services will have more distributed intelligence among the network elements. So the NGNs, due to their flexibility and an opportunity to rapidly develop new applications/services, are the key enablers for the service providers to attract and/or retain the most lucrative customers for revenue generation. The NGNs are rapidly converging over packet-based IP networks, resulting in more efficiency and flexibility in sharing network resources. and H.323 are the emerging and dominant protocols for signaling and call control. The telecommunications industry is rapidly moving towards voice and video over IP technologies. As a result, the Media Servers that incorporate such flexible protocols can also be the core of broadcast for commercial insertion, program playback, and any combination of recent applications. So media server applications are continuously expanding beyond traditional just limited to IN applications such as Prompt and Collect, Interactive, Voice Response (IVR) or simple announcements. The IP Multimedia Sub-System (IMS) of the 3GPP provides a framework for enhanced and distributed service delivery over IP, independent of access technologies. The IMS acts as a platform for delivering any conceivable multimedia service and provides a generic architecture, which contains several common elements that can be shared among several applications. As a result, applications can be efficiently developed and rapidly delivered to the end users by sharing the common elements/features such as user-profiles, billing, security, etc. IMS-compliant Media Servers will comprise the Media Resource Function Control (MRFC) and Media Resource Function Processor (MRFP) that provide a decoupled and flexible architecture. They can also serve requests from consumer s IPTV set-top boxes and mobile multimedia terminals/browsers for additional video play-out and streaming. As a result, the Media Servers are poised to play a significant role in triple-play services architecture (TPSA). This paper gives details of a number of key aspects in the evolution of Media Servers and what they can do in the context of IMS/NGN services. Emerging protocols and standards that fuel that evolution are discussed briefly, along with their impact on media server architecture. Applications and Media Servers Today s service providers are striving for differentiators in a highly competitive market by providing new services Figure 1: Applications and their media needs DTMF Detection and Generation Annoucement Audio/Video Audio/Video one/playback Bridging Transcoding Play/ Record Automatic speech recognition Music on Hold Text to Speech Annoucement Server Contact Center Calling Card & Prepaid Enterprise collaboration application IP-Centrex Audio & Video Conferencing Interactive Voice Response apps Directory services Location based services Voice mail and Unified messaging Network hosted Games Intelligent call routing Voice Portal Voice activated calling Call Parking & Forward 1 ALCATEL
reducing operational costs. To help achieve those goals, traditional telecommunications service providers are moving away from Time Division Multiplex (TDM)-based circuit switched networks and transitioning towards packet-based networks for scalability, efficiency, and flexibility. This transition will help enable advanced services that merge voice, video, and data. With the advent of IN applications, it has become imperative to have dedicated media serving network elements for applications such as announcements or Interactive Voice Response systems. So voice can be considered as the primary medium for early Media Servers. Figure 1 depicts major current applications in the telecommunications services that utilize some form of media serving capability. In most of the applications shown in the table, announcements are played as deemed necessary at any point from session initiation to its closure. For example, in a contact center, the enduser may be transferred to another agent in mid-call with greetings or music playing during the transfer. In all such services, a Media Server is required, streaming media in the form of audio or video depending on the receiving terminal capabilities. Converting text to audio on-the-fly for playback to callers is known as Text-To-Speech (TTS). It is also a key feature of current Media Servers. Web technologies based on HTTP and XML have enabled the delivery of hypermedia. XML-based extensions such as VoiceXML (vxml) [2] and CCXML [3] have become industry standards for specifying telephony control, conference control, and voice user interfaces. Fundamentally, a voice browser fetches the required VXML page for interpretation from a Web Application Server (W) as depicted in Figure 3 and can prompt the Media Server to stream audio to the end user. Another important feature that a Media Server can provide is bridging. With bridging, several audio and video sessions can be mixed appropriately such that video/audio conferencing is enabled for the end-users. Figure 2: Voice browser for applications Telephone Carrier Voice XML Interpreter Figure 3: Application of Media Servers in video Setop box (RTP mixer) HTTP 555-1212 Consumer answers Corporation Triple-play services have now become a common offering from several service providers. As a result, streaming video clips for either self-service (or automated call-attendant) branding or interactive TV have become a reality. Figure 3 depicts one such application, where the end users can interact with TV; this serves as an illustration of beyond simple audio streaming applications. In other words, the scope of Media Servers is not limited to traditional applications such as Prompt and collect, Interactive Voice Response (IVR), or simple announcements. Click on remote: Call our operators now 1 st leg of the audio call Media Servers play a crucial role in developing such NGN services. Figure 4 depicts an NGN with generic Application Servers and their corresponding Media Servers. The firstgeneration IP Media Servers are directly incorporated into the Application Servers (), leading to a tightly coupled Media Server with the. This approach is effective for porting legacy applications to IP environments quickly. However, the tight coupling of MSs with s leads to reduced flexibility in sharing media resources across the service control plane. As a result, next generation architectures have predominantly separated the IP Media Server from the application. The initial trend has been to develop HTTPbased markup language control, such as Call Control XML (CCXML), and Voice- XML. This decoupling requires some signaling for interaction between s and Media Servers and is discussed later. Consumer or Corporate Web site Media Enter your phone number 555-1212 Includes information on page viewed Network Call routing and Find available agent Ring agent desktop, with CTI Thanks for calling. Can I answer any questions about the mobile phone you re interested in? IP Multimedia Subsystems and Media Servers The IMS architecture of 3GPP [4] provides a robust and flexible way of providing services over an IP (packet) network. The key value of the IMS ALCATEL 2
Figure 4: Transition towards flexible/decoupled IP Media Servers from Application Servers the MRFC and the MRFP. In addition, MRFC is also responsible for sending accounting information to charging/billing elements. Application Server Application Server The MRFP provides resources as instructed by the MRFC, and its major functions are Native VoIP Application IP Media Server Voice XML & CCXML IP Media Server Control Protocol IP Media Server a) serving media streams for announcements, etc., b) mixing of incoming media streams, c) media stream processing such as transcoding. PSTN Gateway Border Controller Packet Network In all cases of Application Server control, all session control requests passed between the Application Server and the MRFC are sent via the S-CSCF using the ISC interface and the interface of the Mr reference point. MRFC addresses are made known via peer-to-peer arrangements within the IM CN subsystem. IMS Release 6 does not explicitly specify the interface between and MRFC; however, most implementations have a (dotted line) interface between MRFC and. PSTN VoIP Figure 5: Layered IMS architecture Application Layer Session Control Layer Connectivity Layer HSS Diameter CSCF IP Network MGW MRF PSTN/PLMN Even though H.248 has been recommended as the protocol at the Mp interface between MRFC and MRFP, this has not yet been widely adopted by the industry. protocol extensions that give the applications fine-grained control of the media and call control services are currently under development. These help establish connections and interface specifications implemented by XML. But unlike VoiceXML and CCXML, there is as yet no widespread industry acceptance of a standard/universal media control protocol. has gained ground in the industry and has MGCF been recently catching up with its predecessor, H.323 and its variants. H.248 Basic network media services can be provided through the Netann scheme as outlined in : basic announcements, IVR, and limited bridging [5]. For more advanced features and mid-call trigger control, architecture stems from the fact that different applications/services can share a common set of elements such as user-profiles, authentication/authorization, billing, and media resources across elements, as shown in Figure 5. This is in contrast to the current (legacy) architectures, where the applications are built in a monolithic fashion and the abovementioned common elements are implemented separately for each application, leading to inefficient resource utilization. Figure 6: Simplified view of the MRF in the IMS network and the relevant interfaces ISC As per 3GPP IMS Release 6, Figure 6 below shows the interfaces of MRF, where the MRF can be split into a Media Resource Function Controller (MRFC) and a Media Resource Function Processor (MRFP). The MRFC supports services such as conferencing, announcements, or bearer transcoding. Ideally, as per IMS, the MRFC will process the messages received via S-CSCF and uses Media gateway Control Protocol (H.248 MEGACO) instructions to control the MRFP. However, efforts are under way to promote a /XML-based protocol between S-CSCF Mr MRFC MRFP Mp 3 ALCATEL
work is under progress on XML/-based protocols to provide a mechanism to control sessions in a Media Server and the associated objects (e.g., audio clips). Such a control description can be carried over INFO messages similar to VXML but with additional features. For Video, Real Time streaming Protocol (RTSP) has been the de-facto IP-based stream/media controller. For further information, the reader is referred to the working groups of the IETF. Charging The IMS architecture supports both offline and online charging systems. Offline charging invokes the collection of call data records (CDRs) at the Charging Collector Function (CCF) after the session is torn down. On the other hand, online charging stipulates that the Application Server contact the online charging function and, based on the policies established, the session/call is allowed to proceed to access the resources. The ECF in turn can select either a) immediate event charging or b) event charging with unit reservation, depending on the type of resource and policies in place. In the former case, the ECF deducts the appropriate credit from the user s account and grants the requesting ACRs from MRF/. In the latter case, the ECF sends back the allowed time or data volume that can be served to the requesting end user and updates the credit available to the user. This model is suited to a dynamic session where the amount of resources cannot be pre-determined. An IMScomplaint MS should be able to execute the above functionalities. Figure 8: Online charging Figure 7: Billing and off-line charging interaction in the IMS Billing System MRFC CDRs Billing System MRFC CCF RF CDRs S-CSCF CCF RF ACRs S-CSCF ACRs Figure 7 below depicts the interaction of various elements with the CCF. As a result, the MRF interacts with the CCF for any charging events that may have to be passed on to the CCF. As the IMS session traverses and utilizes resources such as media resources, the pertinent elements will generate appropriate Accounting Requests (ACRs) using Diameter. These ACRs will be appropriately processed by the CCF, thereby generating Call Data Records (CDRs) for the billing system. It should be noted that ACRs can be of two types depending on whether they are IMS-session related or IMS-event related. The former relates to the session depending on the point of ACR generation during the session (start, during, stop). Non-session related ACRs are called events, and they trigger the CCF to generate corresponding CDRs. On the other hand, session ACRs cause the CCF to update corresponding CDRs. The MRFC sends session ACRs. Online charging, on the other hand requires a different approach. Figure 8 shows the IMS online charging elements. Note that only the elements that are relevant to this paper are depicted. When a UE requests something (resources), the /MRFC contacts the Event Charging Function (ECF) through the Ro interface before delivering the service to the user. Efficient utilization of Media Servers provides the versatility to allow distributed application server components to deliver advanced intelligent services to remote clients. As a result, several key benefits such as increased scalability, fault tolerance, and load balancing capabilities can be obtained. However, control of distributed media resources remains an open issue. Most often media resource control is dictated by the inherent characteristics of a network deployment and/or the specific media needs of an application. To gain efficiency and enhance utilization of media resources, dynamic policies have to be deployed. These policies can be determined by: a) network load conditions, b) user subscription profiles including QoS, and c) service provider policies. A fundamental requirement for increasing efficiency and sharing media resources is that both the and the MRFC keep track of the usage of media resources, including capacity and capabilities. Current Media Servers lack support of such capabilities, even though and CSCF entities do implement some form of Service Capability Interaction Management (SCIM). The ability to trigger services is a key feature of an IMS network. There are three potential trigger handling areas: user preferences, network policy/preference, and service orchestration. Of these, user preferences are well defined in IMS as the ALCATEL 4
ability to base a service selection on preferences indicated in the Initial Filter Criteria (IFC). For example, a INVITE may be directed to an application server () that provides class features for that user. Network policies and service capabilities have not been well defined in IMS, and both have been referred to loosely as SCIM. Network policies allow the operator to express a routing preference to a server based on events or states in the network (e.g., server outage or the user s location in the network). This is different from service orchestration, which is a component specialized in invoking services and handling the events and data flow between them. Incorporation of SCIM in the and CSCF will allow efficient utilization of media resources. Alcatel s Media Resource Processing Products Alcatel solutions Alcatel s 8686 Specialized Resource Point (SRP), dedicated to TDM environments and extensively used in intelligent networks, has evolved into Alcatel 8688 Media Resource Function, which is positioned as a Media Server for NGN and IMS services. It has a large number of flexible features that make it a versatile Media Server providing voice and video processing. It supports call acceptance and processing via standard IP signaling () and uses RTP media processing. Media services include voice prompts, menus, and data (DTMF or speech) collection (prompt & collect), halfduplex speech recording, and the ability to bridge or unbridge calls during transfer. It is integrated in numerous Alcatel services. The Alcatel 8788 Media Resource Platform is particularly suitable for the applications involving contact centers and answering corporate market needs. The most important functions of the MRP are those that permit operation of the platform in a multi-tenant environment: a) policy management for port allocation and resource allocation on a per tenant basis, b) tenant-based real-time and historical reporting. This capability is a key benefit for service providers, allowing them to easily partition a customer s application and act as a hosted application service that generates multiple revenue streams. Enhanced call-routing that involves call qualification, call queuing, and call transfer provides the MRP 8788 (that subsumes the MRF 8688) with the capability to offer service providers for the corporate market a unique opportunity to create several applications that are variations of contact centers and a powerful mechanism for revenue generation Conclusion The convergence towards a packet-based IMS architecture enables service providers to share and efficiently utilize media resources. Common network elements handling various applications can leverage on the shareability and scalability properties of IMS architecture. Various functions and features that are needed for achieving such flexibility and IMS-compliance for Media Servers are described in this paper. The IMS-complaint Media Servers can be efficiently shared across the applications and therefore help increase revenue across NGNs, mainly stemming from the sharing of common elements and applications. With a full range of Media Servers, including the 8688 MRF and the 8788 MRP, Alcatel covers the needs of NGN and IMS networks for fixed and mobile operators. It also offers a comprehensive migration strategy from today s circuit-switched to tomorrow s packet-switched networks and complements yesterday s voice processing by video to provide state-of-theart voice and video processing. Glossary of terms and abbreviations: Session Initiation Protocol RTP Real-time Transport Protocol VXML Voice Extended Meta Language MRF Media Resource Function MRFC Media Resource Function Controller MRFP Media Resource Function Processor CSCF Call Session Control Function CDR Call Data Records MRP Media Resource Platform TDM Time Division Multiplexing Application Server IN Intelligent Networks CCXML Call Control XML MS Media Server SCIM Service Control Interaction Management References [1] http://www.itu.int/itu-t/studygroups/index.html, Study group 13. [2] http://www.voicexml.org [3] http://www.xml.org [4] IP Multimedia Subsystems, 3GPP TS 22.228, 3GPP [5] E. Burger et. al., Basic Network Media Services, draftburger-sipping-netann-11.txt, work in Progress, IETF. Acknowledgement The authors are thankful to Robert Hemmerich for his valuable contribution. 5 ALCATEL
Girish Chiruvolu is a product line manager for Alcatel, North America (NA) for the MRP 8788 Media Servers in Voice over IP solutions. In the recent past, he was a research scientist at Alcatel CTO, NA and has worked on several projects involving IP, Ethernet, traffic modeling, QoS, and Multimedia applications. He holds a Ph.D. in Computer Science. A product line manager for IMS-based products and solutions for Alcatel FSD, NA, John Kaippallimalil has extensive expertise in software engineering, network architecture, and standardization of next-generation communications networks. Jacek Jedruszek has been working for Alcatel since 1984. After participating in the development of a variety of software projects, he joined product management in 2000 with responsibility for the Media Resource Function and Specialized Resource Point (SRP). ALCATEL 6
Alcatel and the Alcatel logo are registered trademarks of Alcatel. All other trademarks are the property of their respective owners. Alcatel assumes no responsibility for the accuracy of the information presented, which is subject to change without notice. 12 2005 Alcatel. All rights reserved. 3GQ 10001 0030 TQZZA Ed.01