FPGA-based Router Virtualization: A Power Perspective
|
|
|
- Tamsin Barber
- 10 years ago
- Views:
Transcription
1 2012 IEEE th IEEE International 26th International Parallel Parallel and Distributed and Distributed Processing Processing Symposium Symposium Workshops Workshops & PhD Forum FPGA-based Router Virtualization: A Power Perspective Thilan Ganegedara, Viktor K. Prasanna Ming Hsieh Dept. of Electrical Engineering University of Southern California Los Angeles, CA {ganegeda, prasanna}@usc.edu Abstract Both Internet and semiconductor technology have advanced dramatically over the past decade. These advancements have made great impact on the conventional Internet infrastructure where networking equipment is dedicated on a per network basis. Router virtualization allows a single hardware router to serve packets from multiple networks while ensuring the same throughput and Quality of Service (QoS) guaranteed originally. In this paper, we study the effect of router virtualization, from a power consumption perspective, on the widely used Field Programmable Gate Array (FPGA) platform. An analytical model is proposed to estimate Layer-3 power consumption under different virtual router configurations. The analytical model is verified using post place-and-route results obtained using stateof-the-art FPGA and the models stand accurate with only a ±3% maximum error. Low power FPGA families are explored inthisworktohighlightthebenefitsofusingsuchplatformsin networking environments. Our experimental results show that by virtualizing, power savings proportional to the number of virtual networks can be achieved compared with non-virtualized routers. I. INTRODUCTION Over the past four decades, the Internet has grown from a network of few tens of nodesto a massiveweb of over 5 billion nodes [12]. This immense and continuous growth is constantly calling for high-bandwidth networking equipment in order to keep up with the increasing demands. With the advancements in semiconductor technology, satisfying these demands has not been a major challenge. However, the proliferation of networking devices(routers, switches, etc.) is posing a challenge in a different dimension- Power. The conventional approach has been to assign dedicated networking equipment onapernetworkbasis.thishasresultedinapoorlyutilized Internet infrastructure where the network equipment operates full time, however the duty-cycle is low (especially at the edge-network level). Recently, router virtualization [1] was introduced to mitigate the administration issues of maintaining multiple physical networking equipment, to ease the management tasks. However, there are many more additional benefits in virtualizing a router. In this work, we are highlighting the benefits from a power consumption perspective. Router virtualization can be described as the consolidation of multiple physical routers to a single shared hardware platform. This process of virtualization must be transparent to the user in the sense that before and after the process, the user should not experience any difference in the service received from the Internet Service Provider (ISP). From the equipment standpoint, this translates to ensuring the throughput and latency requirements guaranteed originally. This is a challenging task since maintaining multiple routing tables while forwarding packets correctly on a given amount of hardware is not straightforward. Field Programmable Gate Arrays(FPGA) become an ideal candidate for such applications, mainly due to its reconfigurability, high performance and the abundant resources and parallelism provided[19],[14]. Using the memory and logic resources available on FPGA, it is possible to build virtualized routers that can host several virtual networks on a single chip. In [13] it is shown that when router power taken into account, network layer (Layer-3) operation consumes nearly 62% of the total power. There have been many efforts on reducing Layer-3 power consumption of routers[20],[8],[10]. With virtualization, the power consumption of routers needs to be redefined. Sharing of the router platform allows the static powertobesharedamongthelookupenginesandgivemore throughput per unit energy spent. Further, modern FPGAs come with various features that help one build architectures for low power applications. These features can be exploited to build virtualized forwarding engines that are power efficient compared with the existing Internet infrastructure. In order to evaluate these power benefits quantitatively, we provide analytical models to estimate power for different virtual router scenarios and compare the benefits achieved by using each model. We give a comprehensive comparison of virtualized vs. non-virtualized routers from a power standpoint. In addition, we also compare the main two virtualization schemes to show the advantages and disadvantages of using each approach. The benefits of using low power features of FPGA is highlighted from both power and throughput view points.finallywerefinethepowermodelsthatweproposeby identifying the representative values for FPGA based routers. We summarize our contributions in this work as follows: An accurate analytical model to estimate power savings achieved using router virtualization(section IV) Exploration of low power FPGAs to achieve greater power benefits in networking applications(section VI) Detailed power comparison of non-virtualized vs. virtualized schemes on state-of-the-art FPGA(Section VI-A) Power efficiency comparison for different virtual router configurations(section VI-B) /12 $ IEEE DOI /IPDPSW
2 A. Background II. BACKGROUND AND RELATED WORK 1) Networking on FPGA: Various platforms are being used to build router architectures. FPGA, Ternary Content Addressable Memory (TCAM), Application Specific Integrated Circuit (ASIC) and Network Processing Unit (NPU) are the most prominent. As mentioned in Section I, FPGAs are extensively used in networking applications for IP (Internet Protocol) forwarding, firewalls, Network Intrusion Detection Systems (NIDS), etc. [19], [14]. Their reconfigurability and high performance capabilities are the most desirable features for networking applications. Further, the huge amount of logic and memory resources make them suitable for compute and memory intensive applications. Recently, there have been efforts towards low power FPGAs and device families targeted for such applications are available. These platforms have various architectural and algorithmic means by which, the power consumed by the device can be reduced. In networking applications, the throughput, resource(logic and/or memory) consumption, latency of operation and the power consumed are the critical driving parameters. Using FPGA, it is possible to build architectures that meet most, if not all, these demands. Pipelining improves the performance while reducing the latency. Various algorithmic techniques can be employed in order to build resource efficient forwarding solutions and they can be easily mapped on to the reconfigurable FPGA fabric. Low power network equipment has gained much attraction recently with the interest in green networks. FPGAs provide various architectural and algorithmic features with which, significant power benefits can be achieved(by slightly sacrificing the throughput). 2) Router Virtualization: Router virtualization is an emerging research area that has attracted interest in both industrial and academic community. The main advantage of router virtualization is that all the networking equipment can be brought into a single administrative domain which makes the management tasks much easier. In addition to this, several other benefits such as, reduction in equipment cost and space arealso prominent.in thecontextof thiswork,weconsider virtualization of the data plane of the router. Control plane virtualization can be achieved simply by adopting the existing Operating System (OS) virtualization techniques. However, data plane virtualization requires more careful consideration since multiple factors such as throughput, resource limitation, power, etc. come into picture. From an industry standpoint, deployment of router virtualization can be seen in the Juniper J series [9] routers and Cisco s Catalyst-6500 router[2],[3]. In the research community, two main categories of virtualization techniques can be found:1)separateand2)merged.asthenamessuggest,in the separate case[18], each virtual network gets its own lookup enginewhereasinthemergedcase[5],[6],[11],[4],[17],all the virtual networks share a single lookup engine using some merging technique. The merging process exploits the structural similarity of tries to reduce the addition of new nodes by Merged flow in Merged flow in Distributor Virtualized router: Merged A shared router serving K virtual networks Virtualized router: Separate Logical router 1 Logical router 2 Logical router K-1 Fig. 1. Merged(top) and separate(bottom) router virtualization approaches increasing node sharing. This leads to resource efficiency. The two router virtualization approaches are depicted in Figure 1. B. Related Work As mentioned in Section II-A, there are several techniques that implement router virtualization. However, for this work, we do not focus on a particular technique but evaluate two different classes of virtualization techniques, merged and separate. We generalize these two techniques in such a way that any given technique can be modeled by adjusting the parametersoftheproposedmodel.thefocusofmostofthe router virtualization techniques proposed in the literature are focused on reducing the memory requirement to store the routing tables[5],[6],[11],[4],[17]. The power benefits of virtualization is not studied nor quantified. Improving the power efficiency of networking equipment is critical. With the constantly growing size and demands of the Internet, cooling of equipment has become a major issue. Several solutions have been proposed to improve the power efficiency of networking equipment. Both TCAM and algorithmic solutions exist in literature. In[7],[8], various trie partitioning methodsareusedtoreducethepipelinedepthaswellasper stage memory requirement, to reduce the power consumed per lookup. Memory balancing is integrated with these solutions to further enhance the power efficiency. TCAMs are known to be power hungry due to its massively parallel search. However, by properly organizing the TCAMs (with the aid of some algorithmic techniques), the power consumption of TCAM can also be reduced. In[20], the authors propose a load balancing scheme for multi-chip TCAM based IP lookup. By controlling the TCAM entries triggered by a lookup, power efficiency is achieved. IPStash [10] is an alternative solution for TCAM and is based on a memory architecturethat is similar to set associative memory. By appropriately mapping the routing table to the set associative memory and using controlled prefix expansion the authors achieve 35% power savings compared to state-of-the-art TCAM solutions. Merged flow out Merged flow out
3 III. NOTATIONS AND ASSUMPTIONS Todenotethevariousschemes,weuseasetofabbreviations andtheyareasfollows: NV: Non-virtualized(conventional) VS: Virtualized separate approach VM: Virtualized merged approach TABLE I NOTATIONS AND SYMBOLS Description No. of virtual networks Virtualrouter i Lookuppipeline i No. of stages per pipeline Memorysizeofstage j ofpipeline i Logicslicesinstage j ofpipeline i Powerconsumed Device(FPGA/ASIC/etc.) Leakagepower Utilization ofvirtualrouter i Merging efficiency Symbol K VR i P i N M i,j L i,j P( ) D P L µ i α In this study, we make several assumptions to simplify the model we propose. However, it should be noted that these assumptions may be altered depending on the considered application. Assumption1: Network traffic is uniformly distributed amongthe K virtualrouters.in otherwords, µ i = 1/K for i = 0,1,...,K 1 Assumption2: Allroutingtablesareofsamesize.Anupper bound value is assumed considering real life edge-level routing table(10000 prefixes) to simulate a worst case scenario. This translatesto M i,j = M k,j for i,k = 0,1,...,K 1 and j = 0,1,...,N. Assumption 3: In the case of non-virtualized and virtualseparate(described in Section II-B), packets belonging to different virtual networks are assumed to be properly distributed among the virtual router instances and the packet distributor energy is considered negligible. Assumption 4: Merging of routing tables is generalized to make our analysis generic. Merging efficiency is defined as the amount of node overlap in a given level or equivalently: α = number of common nodes total number of nodes IV. ROUTER MODELS FOR POWER ESTIMATION ThepowermodelingdoneinthisworkisforLayer-3lookup operation of a router. We consider three main types of routers: 1) Non-virtualized, 2) Virtualized-Separate and 3) Virtualized- Merged. We provide comprehensive models to estimate the power consumption of a router in these three different scenarios. For this work, we consider linear pipelined lookup architectures only. Hence, we consider tree/trie structures for IPlookupwhicharemappedontothestagesofapipeline. We consider three main components that contribute to the power consumption of a router s data plane operation. Leakage power represents the static power dissipation while power consumed by logic and memory accounts for dynamic power. Thestaticpowerisproportionaltotheareaofthedeviceused, while dynamic power highly depends on the clock frequency, the type(logic, memory, etc.) and amount of resources used. Hence, we first show the resource consumption for each setup and translate that to the power consumption on a per resource type basis. When the router is not serving any packets, the logic or memoryresourcescanbesenttoanidlemode.hence,during the off period of the duty cycle, the dynamic power can be assumed to be zero, but the static power is dissipated constantly since the device has to be operating despite the duty cycle. Turning off the logic and memory resources can be effectively done using flags(boolean values indicating whether service is required or not) and clock gating, respectively. A. Non-virtualized A non-virtualized router is the conventional approach in networking. Network equipment is dedicated for individual networks and the utilization of each equipment is fairly low due to the behavior of the edge-network users. The resource consumptionisexpressedineq.1.thedevice D hererefers tothechiponwhichthelookupengineisimplemented.since we have multiple equipment, multiple devices are required, hence, the static power consumed increases proportional to the number of networks. For dynamic power, we introduce the utilization for a fair comparison. As stated in Assumption 1, we assume a uniform distribution of packets across the virtual networks. If required, more complex distributions can be modeled by appropriately changing the µ i values. Power consumed in the non-virtualized case is expressed in Eq. 2. R NV = (D + (L i,j +M i,j )) (1) P NV = (P L +µ i (P(L i,j )+P(M i,j ))) (2) B. Virtualized-separate The virtualized-separate is very similar to the nonvirtualizedcase,exceptforthefactthatnowwehaveasingle shared platform hosting all the virtual routers. Hence, the static power dissipation is nearly broughtdown by a factor of K. However, the dynamic power consumption remains the same with its correlation to the utilization. The resource utilization and the power models are expressed in Eq. 3 and Eq. 4 respectively. R V S = D+ (L i,j +M i,j ) (3) P V S = P L + µ i ( (P(L i,j )+P(M i,j ))) (4)
4 It should be noted that the separate approach has its disadvantages just as in the non-virtualized case. The number of separate lookup instances that can be implemented on a given device is limited by the available resources. Hence, the scalability of the separate virtualization approach dictated by the platform used. However, from a power perspective, we have fine grained control over the resources and temporarily turnoff theresourcesthatarenotbeingused,whileusinga single device. C. Virtualized-merged This approach is radically different from the previous two cases. Here, the multiple virtual routing tables are merged (using some table merging technique), to produced a single lookup tree. The incoming packet stream, consisting of packets from different virtual networks, is sent through the lookup engine and based on the virtual network identifier (VNID), the router loads the corresponding routing table data and forwards the packet. Hence, the router hardware is time-shared among the virtual networks (in the case of separate router virtualization, the hardware was space-shared). The resource utilization andpower modelis givenin equationseq.5and Eq. 6 respectively. Since there is only one lookup pipeline, we usetheindex 0forthepipelineinsteadofusing i. R VM = D + P V M = P L + (L 0,j +α (P(L 0,j )+P(α M i,j ) (5) M i,j )) (6) In the case of merged, the scalability limitation has two aspects. First one is the resource limitation. The purpose of merging is to reduce the overall memory requirement. However, as we merge multiple routing tables, the total size of memory required to store the merged lookup tree may exceed the memory available on the device. This is one aspect. Thesecondaspectisthatwhenwemergetworoutingtables, the lookup engine has to be able to sustain the required throughputs of the two virtual networks, even in the worst case. When multiple such routing tables are merged, the throughput is shared among the virtual networks, hence at some point, the lookup engine may fail to sustain the required throughput. These two are the major limitations in the merged approach. However, the merged approach is more scalable than the separate approach considering the resource consumption. V. VIRTUAL ROUTERS ON FPGA In the previous section, we discussed how to model the powerconsumptionofavirtualrouteronfpga.wenowfocus on implementing these different architectures on state-of-theart FPGA. For these experiments, we consider a Xilinx Virtex 6 platform (XC6VLX760) under two speed grade scenarios: 1) speed grade -2 for high performanceand 2) speed grade -1L for lower power. This device was chosen considering its onboard resources, listed in Table II. In order to support multiple virtual networks, having abundant on-chip resources, mainly Block RAM (Random Access Memory), distributed RAM and I/O(Input/Output) pins, is critical. TABLE II VIRTEX 6 XC6VLX760 DEVICE SPECS Resource Logic Cells Amount 758K Max. distributed RAM 8 Mb Block RAM 26 Mb Max. I/O pins 1200 In the proposed model, we consider three main contributors for power: static, logic and memory. We initially identify the representative values and/or functions for these two components(p L, P(L i,j )and P(M i,j ))ontheaforementionedtwo platforms. For all our power calculations, we use the Xilinx XPower Analyzer(XPA) and XPower Estimator(XPE) tools. Thesetoolsprovideameansbywhichagivendesigncanbe evaluated from a power standpoint at resource type level and at different operational frequencies. A. Static power The static power is the minimum power required to keep the device powered up with no switching. Even though static powerdoesnotdependonthefrequencyatwhichthedevice operates, it is proportional to the area of the device, process technology, and the operating temperature(which affects the leakage current). Various circuit optimization techniques can beadoptedtoreducethiscomponentandweseesuchdeployments in the low power FPGA devices. The main distinction in a high-performance and low power variants is the supply current, which is significantly lower(2000 ma difference) in the low power FPGAs. In our case, we examined the static power dissipation of thedeviceunderthetwospeedgradesandtheresultsareas follows: Speedgrade-2: 4.5±5%W Speedgrade-1L: 3.1±5%W Thevariationisbasedontheamountofresourcesused(or equivalently area covered by the used resources). We observed amaximumof ±5%deviationinourapplicationandthevalue may vary depending on the resource consumption. B. Power consumed by memory Two types of memories exist on FPGA. Distributed RAM andblockram(bram).eventhoughbothtypesofmemories maybe used in our applications, for simplicity, we assume onlybramisused.onthedeviceweareconsidering,26mb of BRAM is available. However, BRAM(on Xilinx devices) is organized into 36 Kb blocks (contains two independent 18 Kb blocks). Hence, despite how small the amount of memoryrequired,abramblockhastobeassignedtoserve the purpose. Therefore, BRAM power is determined by the numberofblocksusedratherthan thetotalsizeof memory
5 The other determining factors are 1) operating frequency, 2) dutycycle,3)writerate,and4)bitwidthofreadoutdata.we assumedawriterateof 1%(lowupdaterate)and 18bitwide data for the comparison. The effect of bit width was negligible compared with the effect of other parameters. We conducted experiments using the XPE tool to analyze thebehaviorofbrambasedonthesizeandthefrequency. The observation was that BRAM power monotonically increased with both size and frequency. However, it should be noted that the behavior of 18 Kb and 36 Kb modules were different. The increase with respect to size was predictable as each BRAM block is an independent component. Figure 2 illustrate the power variation for a single BRAM. Fig. 2. BRAM power variation with operating frequency(note: The number within parenthesis denotes the speed grade) Using these details, we generate a power model for BRAM under different scenarios. The model is summarized in Table III. The notations used in the table are M - memory requirement in bits, f - operating frequency in MHz. These resultscanbeusedtopredictthep(m i,j )valuesinthemodels proposed in Section IV. Setup C. Power consumed by logic TABLE III BRAM POWER MODEL Power(µW) 18Kb(-2) M/18K f 36Kb(-2) M/36K f 18Kb(-1L) M/18K f 36Kb(-1L) M/36K f In most studies related to networking, the power consumed by logic is considered negligible compared to that of memory. However, in our study, we identified that logic power (including signaling power) can become comparably significant. Logic power is distributed among Look-Up Tables (LUT), shift registers, distributed RAM and flip-flops. Signal power includes the power dissipated when communicating among the aforementioned logic resources as well as memory components. In order to avoid the clutter, we treat both logic andsignalpowerasawholeandpresenttheresults. Inordertoevaluatelogicpower,westayatthegranularity of a single processing element(pe) of a pipeline stage. This includes the stage registers and any type of logic resources that are required to perform the memory access and computations requiredateachstage.inthecaseofouruni-bittrie,thelogic resource consumption was as follows: Slice registers as flip-flops: 1689 SliceLUTsaslogic:336 SliceLUTsasmemory:126 SliceLUTsasrouting:376 The power consumed depends on the frequency of operation and the amount of resources used. The observation was that logic power linearly increases with the number of pipeline stages. The variation with frequency is illustrated in Figure 3. Further, for a trie based IP lookup implementation, per stage logic power dissipation as a function of operating frequency, inmhz,canbeexpressedas: Speedgrade-2: f µw Speedgrade-1L: f µw Fig. 3. Per stage logic and signal power consumption(note: The value inside parenthesis denotes the speed grade) D. Pipelined IP lookup Algorithmic (i.e. trie/tree based) IP lookup have become popular over TCAM based IP lookup due to their flexibility and scalability. Mapping such trie/tree based solutions to FPGA platforms can be done efficiently. Most router virtualizationsolutionsaretriebased[5],[6],[4],[17].henceweuse trie as the representative example. Each trie level is mapped ontoapipelinestageandeachstageisassociatedwithanindependently accessible memory[7],[11],[8]. When a lookup request is received, the packet traverses the pipeline similar to the trie traversal and at the end of the pipeline, outputs the appropriate next-hop port information (NHI). Generally, the NHIinformationisstoredattheleafnodesofthetrie(nodes that do not possess any children nodes) using techniques such as leaf pushing [16], in order to reduce the memory consumption for the trie storage. In the case of virtualization, a leaf node is simply a vector that has routing information corresponding to all the considered virtual networks. And the vector is indexed using the VNID to extract the forwarding information[5],[4]. The three cases considered here (non-virtualized, virtualized-separate and virtualized-merged) have the same architecture with the following distinctions:
6 Fig.4. PointerandNHImemoryrequirements formerged(α = 80%and α = 20%)andseparateapproaches Non-virtualized implements a single lookup engine on a single device and all the devices are dedicated on a pernetworkbasis.forak virtualnetworkscenario, K devices are required. Virtualized-separate implements multiple lookup engines on a single device and between two lookup engines, there is no resource sharing except for the FPGA fabric itself. Virtualized-merged implements a single shared lookup engine and all the virtual networks share the same memory and logic in the lookup engine. The amount of logic used, remains almost the same as in the two other cases, however, the amount of memory required may significantly increase depending on the merging efficiency, α. E. Routing tables Router virtualization is most effective at edge level of the network since the problem of underutilization is most prevalent at the edge network level. In order to demonstrate the results for a more realistic scenario, we use routing tables from real networks obtained from[15]. To simplify the implementation, we assume all the routing tables to be of same size and we use the largest routing table we obtained from[15] to report the results for the worst case scenario. This particular routing table consisted of 3725 prefixes and when the corresponding trie had 9726 nodes without leaf pushing and nodes with leaf pushing. Figure 4 demonstrates the effect of virtualization on memory under different scenarios and illustrates the amount of memory used for pointers(non-leaf nodes) and for NHI/forwarding information(leaf nodes). It can be seen that the memory saving achieved by the merged schemes is highly dependent on the node overlap percentage or merging efficiency α. Also it is clear that pointer saving becomes less and less effective as the number of virtual routers increase and α decreases. Since we cannot assume any particular structure for the considered routing tables, merging efficiency cannot be determined in advance and leads to indeterministic memory requirements whereas in separate (even non-virtualized) approach, the memory requirement is deterministic. Also, it should be noted that merging schemes are appropriate(from a memory standpoint) when the number of virtual routers is small. VI. VIRTUALIZED ROUTER: POWER PERFORMANCE In the previous section, we observed how each component behaves for the two scenarios and derived relationships in terms of operating frequency and amount of logic resources required. However, the standard metric used for power measurements in routers is Watts per Gbps(Giga bits per second). This describes how much energy is spent to provide a unit throughput. In order to evaluate the two virtualized routers against the non-virtualized router, we analyzed the post placeand-route behavior of these architectures on the device we considered earlier(virtex 6 XC6VLX760) under the two speed grades(-2 and-1l). Without loss of generality, for all pipelines weassumealengthof 28stages. A. Total power dissipation: Experimental vs. estimation Here, we validate the models proposed in Section V against the experimental results we obtain. To clearly illustrate the performance of difference schemes, we first compare the total power utilized by all the schemes(non-virtualized, virtualizedseparateandvirtualized-merged(α = 20%and α = 80%)and then we show the comparison of all the virtualized schemes. These results are shown in Figure 5 and Figure 6, respectively. It can be observed clearly that the non-virtualized router consumes power proportional to the number of(virtual) networks. In contrast, virtualized routers consume very small amount of power since the static power consumed by the lookup engine is shared among the considered virtual networks. Another interesting observation is that in Figure 6, the total power dissipation decreases with the increasing number of virtual networks. According to the model(eq. 4), the power consumption must remain the same since only one lookup engine is active at a given time (Assumption 1). However, the experimental value decreases due to various hardware optimizations applied when implementing multiple parallel architectures. We limited the maximum number of virtual networks to 15 since in the case of virtualized-separate, the I/O pin requirement exceeded when the number of virtual networks was increased. In a complete router implementation(parsing, lookup, editing, scheduling, etc.), this number may become even less when other inputs and outputs are considered. The goal of this work is to analyze the power behavior of the lookup portion of a router. Therefore, the above implementation stand as an accurate prototype for the considered purpose
7 Fig. 5. Comparison of total power consumption in virtualized and non-virtualized schemes for speed grades-2(left) and-1l(right) Fig. 6. Comparison of total power consumption in different virtualized schemes for speed grades-2(left) and-1l(right) Fig. 7. Percentage error of the model estimation compared with the experimental results for speed grades-2(left) and-1l(right) Figure 7 shows the percentage error of the models we proposed in Section V. The percentage error is calculated as follows: Percentage error = P Model P Experimental P Experimental 100% Itcanbeseenthatthemodelestimationis highlyreliable with a maximum error of ±3%. The cause for this error is the various hardware optimizations that are performed, by the synthesis tool, when the amount of resources used, increases. As shown in the figure, for non-virtualized and virtualizedseparate, the error is much less compared to that of virtualizedmerged. In the merged approach, we use more BRAM per pipeline stage, to accommodate the increasing number of virtual routers. The synthesis tool performs various routing and placement optimizations to improve the performance of the design which causes our predictions to deviate slightly from the exact measurement. Nevertheless, the model we proposed here provide an accurate means by which the power consumption of virtualized routers can be estimated. B. Power efficiency In the context of lookup engines, one important metric is the packet handling rate. In this work, we use Giga bits per second as the metric to measure packet handling rate and to compute this, we use minimum packet size as 40 bytes. A router may use more and more power to support higher throughput. In order to compare such architectures with power efficient architectures, we use the power dissipated per unit throughput as the metric for our comparisons. Figure 8 illustrates the comparison of the three approaches with respect to the considered metric. The lower the mw/gbps number is, the better the architecture. Therefore, by analyzing the results in Figure 8, the virtualized separate approach yields the best power efficiency. The conventional router is the second best while merged approach shows the worst performance. The main reason behind the poor performance of the merged approach is the reduction in operating frequency(hence, throughput) with the increasing number of virtual routers. Due to the higher resource consumption, the operating frequency decreases significantly. The
8 Fig. 8. Power dissipated per unit throughput for speed grades-2(left) and-1l(right) power consumed by the resources increase, but the throughput drops. As a result, the power per unit throughput increases. Also, the performance difference between the two cases, α = 20%and α = 80%isalso intuitive.whenthemerging efficiency is much less, the amount of resources consumed by the router increases, while the throughput decreases. In both total power consumed and power per unit throughput, we have presented the results for the two speed grades (-2 and -1L). We observed a 30% less power consumption when speed grade-1l was chosen compared to speed grade -2. However, the power saving comes at the expense of throughput. This fact becomes clear when comparing the two speed grades with respect to mw/gbps in Figure 8. The two speed grades perform almost the same way with same variation and performance numbers. Hence, low power FPGAs are suitable in environments where throughput is not the major concern. VII. CONCLUSION Due to the proliferation of networking devices, power dissipated in the network is drastically increasing. Router virtualization was proposed to overcome the complications and disadvantages of the conventional networks. In order to achieve these benefits, the underlying hardware must be capable of supporting virtualization. With its reconfigurability and abundant resources, Field Programmable Gate Arrays (FPGAs) become an attractive platform. In this work, we demonstrated how various virtualized router architectures perform on state-of-the-art FPGA platforms and highlighted the benefits of using low power FPGA families for networking applications. The experimental results revealed that for maximum power efficiency, the virtualized separate approach gives the best results while the results of the merged approach varied depending on the merging efficiency. However, it should be noted that the separate approach suffers from scalability issues with increasing number of virtual routers because of resource exhaustion. We demonstrated the benefits of using low power families of FPGA and discovered that they give same power efficiency as the high-speed platforms while consuming low power and yielding lower throughput. REFERENCES [1] J. Carapinha and J. Jiménez. Network virtualization: a view from the bottom. In Proceedings of the 1st ACM workshop on Virtualized infrastructure systems and architectures, VISA 09, pages 73 80, New York, NY, USA, ACM. [2] Cisco. Cisco catalyst 6500 virtual switching system cisco.com/en/us/products/ps9336/index.html. [3] Cisco. Hardware and software virtualized routers. com/en/us/solutions/collateral/ns341/ns524/ns562/ns573/white paper c ns573 Networking Solutions White Paper.html. [4] J. Fu and J. Rexford. Efficient ip-address lookup with a shared forwarding table for multiple virtual routers. In Proceedings of the 2008 ACM CoNEXT Conference, CoNEXT 08, pages 21:1 21:12, New York, NY, USA, ACM. [5] T. Ganegedara, W. Jiang, and V. Prasanna. Multiroot: Towards memoryefficient router virtualization. In Communications (ICC), 2011 International Conference on, [6] T. Ganegedara, H. Le, and V. Prasanna. Towards on-the-fly incremental updates for virtualized routers on fpga - unpublished article. In Field Programmable Logic and Applications(FPL), 2011 International Conference on, [7] W. Jiang and V. Prasanna. Multi-way pipelining for power-efficient ip lookup. In Global Telecommunications Conference, IEEE GLOBECOM2008.IEEE,pages1 5, dec [8] W. Jiang and V. Prasanna. Towards green routers: Depth-bounded multipipeline architecture for power-efficient ip lookup. In Performance, Computing and Communications Conference, IPCCC IEEE International, pages , dec [9] Juniper. Jcs1200 control system. whitepapers/ en.pdf. [10] S. Kaxiras and G. Keramidas. Ipstash: a power-efficient memory architecture for ip-lookup. In Microarchitecture, MICRO-36. Proceedings. 36th Annual IEEE/ACM International Symposium on, pages ,dec [11] H. Le, T. Ganegedara, and V. Prasanna. Memory-efficient and scalable virtual routers using fpga. In Field Programmable Gate Arrays(FPGA), 2011 International Symposium on, [12] A. M. Lyons, D. T. Neilson,, and T. R. Salamon. Energy efficient strategies for high density telecom applications. Connected Devices About to Pass the 5 Billion Milestone&from=all pr. [13] A. M. Lyons, D. T. Neilson,, and T. R. Salamon. Energy efficient strategies for high density telecom applications. Princeton University, Supelec, Ecole Centrale Paris and Alcatel-Lucent Bell Labs Workshop on Information, Energy and Environment, June [14] NetFPGA. Netfpga boards. [15] Potaroo. Bgp analysis reports. [16] M. Ruiz-Sanchez, E. Biersack, and W. Dabbous. Survey and taxonomy of ip address lookup algorithms. Network, IEEE, 15(2):8 23, [17] H. Song, M. Kodialam, F. Hao, and T. Lakshman. Building scalable virtual routers with trie braiding. In INFOCOM, 2010 Proceedings IEEE, pages1 9,2010. [18] D. Unnikrishnan, R. Vadlamani, Y. Liao, A. Dwaraki, J. Crenne, L. Gao, and R. Tessier. Scalable network virtualization using fpgas. In Proceedings of the 18th annual ACM/SIGDA international symposium on Field programmable gate arrays, FPGA 10, pages , New York, NY, USA, ACM. [19] Xilinx. Xilinx xcell journal. xcellonline/. [20] K.Zheng,C.Hu,H.Liu,andB.Liu.Anultrahighthroughputandpower efficient tcam-based ip lookup engine. In INFOCOM Twenty-third AnnualJoint Conference of the IEEE Computer and Communications Societies, volume 3, pages vol.3, march
Hybrid Data Structure for IP Lookup in Virtual Routers Using FPGAs
Hybrid Data Structure for IP Lookup in Virtual Routers Using FPGAs O guzhan Erdem Electrical and Electronics Engineering Middle East Technical University Ankara, TURKEY 68 Email: [email protected] Hoang
Networking Virtualization Using FPGAs
Networking Virtualization Using FPGAs Russell Tessier, Deepak Unnikrishnan, Dong Yin, and Lixin Gao Reconfigurable Computing Group Department of Electrical and Computer Engineering University of Massachusetts,
CHAPTER 5 FINITE STATE MACHINE FOR LOOKUP ENGINE
CHAPTER 5 71 FINITE STATE MACHINE FOR LOOKUP ENGINE 5.1 INTRODUCTION Finite State Machines (FSMs) are important components of digital systems. Therefore, techniques for area efficiency and fast implementation
How Router Technology Shapes Inter-Cloud Computing Service Architecture for The Future Internet
How Router Technology Shapes Inter-Cloud Computing Service Architecture for The Future Internet Professor Jiann-Liang Chen Friday, September 23, 2011 Wireless Networks and Evolutional Communications Laboratory
Reconfigurable Architecture Requirements for Co-Designed Virtual Machines
Reconfigurable Architecture Requirements for Co-Designed Virtual Machines Kenneth B. Kent University of New Brunswick Faculty of Computer Science Fredericton, New Brunswick, Canada [email protected] Micaela Serra
Open Flow Controller and Switch Datasheet
Open Flow Controller and Switch Datasheet California State University Chico Alan Braithwaite Spring 2013 Block Diagram Figure 1. High Level Block Diagram The project will consist of a network development
Scaling 10Gb/s Clustering at Wire-Speed
Scaling 10Gb/s Clustering at Wire-Speed InfiniBand offers cost-effective wire-speed scaling with deterministic performance Mellanox Technologies Inc. 2900 Stender Way, Santa Clara, CA 95054 Tel: 408-970-3400
7a. System-on-chip design and prototyping platforms
7a. System-on-chip design and prototyping platforms Labros Bisdounis, Ph.D. Department of Computer and Communication Engineering 1 What is System-on-Chip (SoC)? System-on-chip is an integrated circuit
Distributed Elastic Switch Architecture for efficient Networks-on-FPGAs
Distributed Elastic Switch Architecture for efficient Networks-on-FPGAs Antoni Roca, Jose Flich Parallel Architectures Group Universitat Politechnica de Valencia (UPV) Valencia, Spain Giorgos Dimitrakopoulos
Configurable String Matching Hardware for Speeding up Intrusion Detection. Monther Aldwairi*, Thomas Conte, Paul Franzon
Configurable String Matching Hardware for Speeding up Intrusion Detection Monther Aldwairi*, Thomas Conte, Paul Franzon Department of Electrical and Computer Engineering, North Carolina State University,
Exploiting Stateful Inspection of Network Security in Reconfigurable Hardware
Exploiting Stateful Inspection of Network Security in Reconfigurable Hardware Shaomeng Li, Jim Tørresen, Oddvar Søråsen Department of Informatics University of Oslo N-0316 Oslo, Norway {shaomenl, jimtoer,
The Software Defined Hybrid Packet Optical Datacenter Network SDN AT LIGHT SPEED TM. 2012-13 CALIENT Technologies www.calient.
The Software Defined Hybrid Packet Optical Datacenter Network SDN AT LIGHT SPEED TM 2012-13 CALIENT Technologies www.calient.net 1 INTRODUCTION In datacenter networks, video, mobile data, and big data
Best Practises for LabVIEW FPGA Design Flow. uk.ni.com ireland.ni.com
Best Practises for LabVIEW FPGA Design Flow 1 Agenda Overall Application Design Flow Host, Real-Time and FPGA LabVIEW FPGA Architecture Development FPGA Design Flow Common FPGA Architectures Testing and
Design and Implementation of an On-Chip timing based Permutation Network for Multiprocessor system on Chip
Design and Implementation of an On-Chip timing based Permutation Network for Multiprocessor system on Chip Ms Lavanya Thunuguntla 1, Saritha Sapa 2 1 Associate Professor, Department of ECE, HITAM, Telangana
DRAFT 18-09-2003. 2.1 Gigabit network intrusion detection systems
An Intrusion Detection System for Gigabit Networks (Working paper: describing ongoing work) Gerald Tripp Computing Laboratory, University of Kent. CT2 7NF. UK e-mail: [email protected] This draft
A Comparison of Ruleset Feature Independent Packet Classification Engines on FPGA
A Comparison of Ruleset Feature Independent Packet Classification Engines on FPGA Andrea Sanny, Thilan Ganegedara, Viktor K. Prasanna Ming Hsieh Dept. of Electrical Engineering University of Southern California
FPGA Implementation of IP Packet Segmentation and Reassembly in Internet Router*
SERBIAN JOURNAL OF ELECTRICAL ENGINEERING Vol. 6, No. 3, December 2009, 399-407 UDK: 004.738.5.057.4 FPGA Implementation of IP Packet Segmentation and Reassembly in Internet Router* Marko Carević 1,a,
DESIGN AND VERIFICATION OF LSR OF THE MPLS NETWORK USING VHDL
IJVD: 3(1), 2012, pp. 15-20 DESIGN AND VERIFICATION OF LSR OF THE MPLS NETWORK USING VHDL Suvarna A. Jadhav 1 and U.L. Bombale 2 1,2 Department of Technology Shivaji university, Kolhapur, 1 E-mail: [email protected]
基 於 SDN 與 可 程 式 化 硬 體 架 構 之 雲 端 網 路 系 統 交 換 器
基 於 SDN 與 可 程 式 化 硬 體 架 構 之 雲 端 網 路 系 統 交 換 器 楊 竹 星 教 授 國 立 成 功 大 學 電 機 工 程 學 系 Outline Introduction OpenFlow NetFPGA OpenFlow Switch on NetFPGA Development Cases Conclusion 2 Introduction With the proposal
IMPLEMENTATION OF FPGA CARD IN CONTENT FILTERING SOLUTIONS FOR SECURING COMPUTER NETWORKS. Received May 2010; accepted July 2010
ICIC Express Letters Part B: Applications ICIC International c 2010 ISSN 2185-2766 Volume 1, Number 1, September 2010 pp. 71 76 IMPLEMENTATION OF FPGA CARD IN CONTENT FILTERING SOLUTIONS FOR SECURING COMPUTER
Case Study: Improving FPGA Design Speed with Floorplanning
Case Study: Improving FPGA Design Speed with Floorplanning - An introduction to Xilinx PlanAhead 10.1 by Consultant Kent Salomonsen ([email protected]) Picture this: the RTL is simulating
Hardware Implementation of Improved Adaptive NoC Router with Flit Flow History based Load Balancing Selection Strategy
Hardware Implementation of Improved Adaptive NoC Rer with Flit Flow History based Load Balancing Selection Strategy Parag Parandkar 1, Sumant Katiyal 2, Geetesh Kwatra 3 1,3 Research Scholar, School of
Achieving Nanosecond Latency Between Applications with IPC Shared Memory Messaging
Achieving Nanosecond Latency Between Applications with IPC Shared Memory Messaging In some markets and scenarios where competitive advantage is all about speed, speed is measured in micro- and even nano-seconds.
Juniper Networks QFabric: Scaling for the Modern Data Center
Juniper Networks QFabric: Scaling for the Modern Data Center Executive Summary The modern data center has undergone a series of changes that have significantly impacted business operations. Applications
Binary search tree with SIMD bandwidth optimization using SSE
Binary search tree with SIMD bandwidth optimization using SSE Bowen Zhang, Xinwei Li 1.ABSTRACT In-memory tree structured index search is a fundamental database operation. Modern processors provide tremendous
Consolidating Multiple Network Appliances
October 2010 Consolidating Multiple s Space and power are major concerns for enterprises and carriers. There is therefore focus on consolidating the number of physical servers in data centers. Application
Index Terms Domain name, Firewall, Packet, Phishing, URL.
BDD for Implementation of Packet Filter Firewall and Detecting Phishing Websites Naresh Shende Vidyalankar Institute of Technology Prof. S. K. Shinde Lokmanya Tilak College of Engineering Abstract Packet
FlexPath Network Processor
FlexPath Network Processor Rainer Ohlendorf Thomas Wild Andreas Herkersdorf Prof. Dr. Andreas Herkersdorf Arcisstraße 21 80290 München http://www.lis.ei.tum.de Agenda FlexPath Introduction Work Packages
- Multiprotocol Label Switching -
1 - Multiprotocol Label Switching - Multiprotocol Label Switching Multiprotocol Label Switching (MPLS) is a Layer-2 switching technology. MPLS-enabled routers apply numerical labels to packets, and can
White Paper FPGA Performance Benchmarking Methodology
White Paper Introduction This paper presents a rigorous methodology for benchmarking the capabilities of an FPGA family. The goal of benchmarking is to compare the results for one FPGA family versus another
A Hybrid Electrical and Optical Networking Topology of Data Center for Big Data Network
ASEE 2014 Zone I Conference, April 3-5, 2014, University of Bridgeport, Bridgpeort, CT, USA A Hybrid Electrical and Optical Networking Topology of Data Center for Big Data Network Mohammad Naimur Rahman
CS6204 Advanced Topics in Networking
CS6204 Advanced Topics in Networking Assoc Prof. Chan Mun Choon School of Computing National University of Singapore Aug 14, 2015 CS6204 Lecturer Chan Mun Choon Office: COM2, #04-17 Email: [email protected]
9/14/2011 14.9.2011 8:38
Algorithms and Implementation Platforms for Wireless Communications TLT-9706/ TKT-9636 (Seminar Course) BASICS OF FIELD PROGRAMMABLE GATE ARRAYS Waqar Hussain [email protected] Department of Computer
FPGA area allocation for parallel C applications
1 FPGA area allocation for parallel C applications Vlad-Mihai Sima, Elena Moscu Panainte, Koen Bertels Computer Engineering Faculty of Electrical Engineering, Mathematics and Computer Science Delft University
Architecture of distributed network processors: specifics of application in information security systems
Architecture of distributed network processors: specifics of application in information security systems V.Zaborovsky, Politechnical University, Sait-Petersburg, Russia [email protected] 1. Introduction Modern
Latency in High Performance Trading Systems Feb 2010
Latency in High Performance Trading Systems Feb 2010 Stephen Gibbs Automated Trading Group Overview Review the architecture of a typical automated trading system Review the major sources of latency, many
Implementation and Design of AES S-Box on FPGA
International Journal of Research in Engineering and Science (IJRES) ISSN (Online): 232-9364, ISSN (Print): 232-9356 Volume 3 Issue ǁ Jan. 25 ǁ PP.9-4 Implementation and Design of AES S-Box on FPGA Chandrasekhar
Header Parsing Logic in Network Switches Using Fine and Coarse-Grained Dynamic Reconfiguration Strategies
Header Parsing Logic in Network Switches Using Fine and Coarse-Grained Dynamic Reconfiguration Strategies by Alexander Sonek A thesis presented to the University of Waterloo in fulfillment of the thesis
NetFlow Performance Analysis
NetFlow Performance Analysis Last Updated: May, 2007 The Cisco IOS NetFlow feature set allows for the tracking of individual IP flows as they are received at a Cisco router or switching device. Network
Performance Evaluation of Linux Bridge
Performance Evaluation of Linux Bridge James T. Yu School of Computer Science, Telecommunications, and Information System (CTI) DePaul University ABSTRACT This paper studies a unique network feature, Ethernet
IP address lookup for Internet routers using cache routing table
ISSN (Print): 1694 0814 35 IP address lookup for Internet routers using cache routing table Houassi Hichem 1 and Bilami Azeddine 2 1 Department of Computer Science, University Center of Khenchela, Algeria
Network Virtualization
Network Virtualization Jennifer Rexford Advanced Computer Networks http://www.cs.princeton.edu/courses/archive/fall08/cos561/ Tuesdays/Thursdays 1:30pm-2:50pm Introduction Motivation for network virtualization
PART III. OPS-based wide area networks
PART III OPS-based wide area networks Chapter 7 Introduction to the OPS-based wide area network 7.1 State-of-the-art In this thesis, we consider the general switch architecture with full connectivity
DDR subsystem: Enhancing System Reliability and Yield
DDR subsystem: Enhancing System Reliability and Yield Agenda Evolution of DDR SDRAM standards What is the variation problem? How DRAM standards tackle system variability What problems have been adequately
Network (Tree) Topology Inference Based on Prüfer Sequence
Network (Tree) Topology Inference Based on Prüfer Sequence C. Vanniarajan and Kamala Krithivasan Department of Computer Science and Engineering Indian Institute of Technology Madras Chennai 600036 [email protected],
Principle and Implementation of. Protocol Oblivious Forwarding
Principle and Implementation of Protocol Oblivious Forwarding PUBLICATION DATE DECEMBER 27, 2012 Contents Principle and Implementation of... 1 Protocol Oblivious Forwarding... 1 AUTHORS... 错 误! 未 定 义 书
Analysis of traffic engineering parameters while using multi-protocol label switching (MPLS) and traditional IP networks
Analysis of traffic engineering parameters while using multi-protocol label switching (MPLS) and traditional IP networks Faiz Ahmed Electronic Engineering Institute of Communication Technologies, PTCL
40G MACsec Encryption in an FPGA
40G MACsec Encryption in an FPGA Dr Tom Kean, Managing Director, Algotronix Ltd, 130-10 Calton Road, Edinburgh EH8 8JQ United Kingdom Tel: +44 131 556 9242 Email: [email protected] February 2012 1 MACsec
Reconfigurable Low Area Complexity Filter Bank Architecture for Software Defined Radio
Reconfigurable Low Area Complexity Filter Bank Architecture for Software Defined Radio 1 Anuradha S. Deshmukh, 2 Prof. M. N. Thakare, 3 Prof.G.D.Korde 1 M.Tech (VLSI) III rd sem Student, 2 Assistant Professor(Selection
Agenda. Michele Taliercio, Il circuito Integrato, Novembre 2001
Agenda Introduzione Il mercato Dal circuito integrato al System on a Chip (SoC) La progettazione di un SoC La tecnologia Una fabbrica di circuiti integrati 28 How to handle complexity G The engineering
Optical interconnection networks for data centers
Optical interconnection networks for data centers The 17th International Conference on Optical Network Design and Modeling Brest, France, April 2013 Christoforos Kachris and Ioannis Tomkos Athens Information
Power-efficient Virtual Machine Placement and Migration in Data Centers
2013 IEEE International Conference on Green Computing and Communications and IEEE Internet of Things and IEEE Cyber, Physical and Social Computing Power-efficient Virtual Machine Placement and Migration
Static-Noise-Margin Analysis of Conventional 6T SRAM Cell at 45nm Technology
Static-Noise-Margin Analysis of Conventional 6T SRAM Cell at 45nm Technology Nahid Rahman Department of electronics and communication FET-MITS (Deemed university), Lakshmangarh, India B. P. Singh Department
3D On-chip Data Center Networks Using Circuit Switches and Packet Switches
3D On-chip Data Center Networks Using Circuit Switches and Packet Switches Takahide Ikeda Yuichi Ohsita, and Masayuki Murata Graduate School of Information Science and Technology, Osaka University Osaka,
ON SUITABILITY OF FPGA BASED EVOLVABLE HARDWARE SYSTEMS TO INTEGRATE RECONFIGURABLE CIRCUITS WITH HOST PROCESSING UNIT
216 ON SUITABILITY OF FPGA BASED EVOLVABLE HARDWARE SYSTEMS TO INTEGRATE RECONFIGURABLE CIRCUITS WITH HOST PROCESSING UNIT *P.Nirmalkumar, **J.Raja Paul Perinbam, @S.Ravi and #B.Rajan *Research Scholar,
High-Performance IP Service Node with Layer 4 to 7 Packet Processing Features
UDC 621.395.31:681.3 High-Performance IP Service Node with Layer 4 to 7 Packet Processing Features VTsuneo Katsuyama VAkira Hakata VMasafumi Katoh VAkira Takeyama (Manuscript received February 27, 2001)
MIMO detector algorithms and their implementations for LTE/LTE-A
GIGA seminar 11.01.2010 MIMO detector algorithms and their implementations for LTE/LTE-A Markus Myllylä and Johanna Ketonen 11.01.2010 2 Outline Introduction System model Detection in a MIMO-OFDM system
OKTOBER 2010 CONSOLIDATING MULTIPLE NETWORK APPLIANCES
OKTOBER 2010 CONSOLIDATING MULTIPLE NETWORK APPLIANCES It is possible to consolidate multiple network appliances into a single server using intelligent flow distribution, data sharing and virtualization
Innovative improvement of fundamental metrics including power dissipation and efficiency of the ALU system
Innovative improvement of fundamental metrics including power dissipation and efficiency of the ALU system Joseph LaBauve Department of Electrical and Computer Engineering University of Central Florida
All Programmable Logic. Hans-Joachim Gelke Institute of Embedded Systems. Zürcher Fachhochschule
All Programmable Logic Hans-Joachim Gelke Institute of Embedded Systems Institute of Embedded Systems 31 Assistants 10 Professors 7 Technical Employees 2 Secretaries www.ines.zhaw.ch Research: Education:
D1.2 Network Load Balancing
D1. Network Load Balancing Ronald van der Pol, Freek Dijkstra, Igor Idziejczak, and Mark Meijerink SARA Computing and Networking Services, Science Park 11, 9 XG Amsterdam, The Netherlands June [email protected],[email protected],
SDN. WHITE PAPER Intel Ethernet Switch FM6000 Series - Software Defined Networking. Recep Ozdag Intel Corporation
WHITE PAPER Intel Ethernet Switch FM6000 Series - Software Defined Networking Intel Ethernet Switch FM6000 Series - Software Defined Networking Recep Ozdag Intel Corporation Software Defined Networking
Scalable Prefix Matching for Internet Packet Forwarding
Scalable Prefix Matching for Internet Packet Forwarding Marcel Waldvogel Computer Engineering and Networks Laboratory Institut für Technische Informatik und Kommunikationsnetze Background Internet growth
Architectures and Platforms
Hardware/Software Codesign Arch&Platf. - 1 Architectures and Platforms 1. Architecture Selection: The Basic Trade-Offs 2. General Purpose vs. Application-Specific Processors 3. Processor Specialisation
Increase Simplicity and Improve Reliability with VPLS on the MX Series Routers
SOLUTION BRIEF Enterprise Data Center Interconnectivity Increase Simplicity and Improve Reliability with VPLS on the Routers Challenge As enterprises improve business continuity by enabling resource allocation
International Workshop on Field Programmable Logic and Applications, FPL '99
International Workshop on Field Programmable Logic and Applications, FPL '99 DRIVE: An Interpretive Simulation and Visualization Environment for Dynamically Reconægurable Systems? Kiran Bondalapati and
LogiCORE IP AXI Performance Monitor v2.00.a
LogiCORE IP AXI Performance Monitor v2.00.a Product Guide Table of Contents IP Facts Chapter 1: Overview Target Technology................................................................. 9 Applications......................................................................
Software Defined Networking
Software Defined Networking Richard T. B. Ma School of Computing National University of Singapore Material from: Scott Shenker (UC Berkeley), Nick McKeown (Stanford), Jennifer Rexford (Princeton) CS 4226:
Improved Method for Parallel AES-GCM Cores Using FPGAs
Improved Method for Parallel -GCM Cores Using FPGAs Karim Moussa Ali Abdellatif, Roselyne Chotin-Avot, abib Mehrez To cite this version: Karim Moussa Ali Abdellatif, Roselyne Chotin-Avot, abib Mehrez.
Analysis of Basic Quality of Service Mechanism for Voice over IP In Hamdard University Network Campus
Analysis of Basic Quality of Service Mechanism for Voice over IP In Hamdard University Network Campus Shahbaz Akhatar Siddiqui Student MS (Telecom) Hamdard University Karachi Junior Lecturer in National
SummitStack in the Data Center
SummitStack in the Data Center Abstract: This white paper describes the challenges in the virtualized server environment and the solution Extreme Networks offers a highly virtualized, centrally manageable
High Performance Cluster Support for NLB on Window
High Performance Cluster Support for NLB on Window [1]Arvind Rathi, [2] Kirti, [3] Neelam [1]M.Tech Student, Department of CSE, GITM, Gurgaon Haryana (India) [email protected] [2]Asst. Professor,
Hardware Trojans Detection Methods Julien FRANCQ
DEFENDING WORLD SECURITY Hardware Trojans Detection Methods Julien FRANCQ 2013, December the 12th Outline c 2013 CASSIDIAN CYBERSECURITY - All rights reserved TRUDEVICE 2013, December the 12th Page 2 /
Application Performance Testing Basics
Application Performance Testing Basics ABSTRACT Todays the web is playing a critical role in all the business domains such as entertainment, finance, healthcare etc. It is much important to ensure hassle-free
4 Internet QoS Management
4 Internet QoS Management Rolf Stadler School of Electrical Engineering KTH Royal Institute of Technology [email protected] September 2008 Overview Network Management Performance Mgt QoS Mgt Resource Control
From Bus and Crossbar to Network-On-Chip. Arteris S.A.
From Bus and Crossbar to Network-On-Chip Arteris S.A. Copyright 2009 Arteris S.A. All rights reserved. Contact information Corporate Headquarters Arteris, Inc. 1741 Technology Drive, Suite 250 San Jose,
Cisco Integrated Services Routers Performance Overview
Integrated Services Routers Performance Overview What You Will Learn The Integrated Services Routers Generation 2 (ISR G2) provide a robust platform for delivering WAN services, unified communications,
UNITE: Uniform hardware-based Network Intrusion detection Engine
UNITE: Uniform hardware-based Network Intrusion detection Engine S. Yusuf and W. Luk and M. K. N. Szeto and W. Osborne Department of Computing, Imperial College London, 180 Queen s Gate, London SW7 2BZ
Network Traffic Monitoring an architecture using associative processing.
Network Traffic Monitoring an architecture using associative processing. Gerald Tripp Technical Report: 7-99 Computing Laboratory, University of Kent 1 st September 1999 Abstract This paper investigates
Seeking Opportunities for Hardware Acceleration in Big Data Analytics
Seeking Opportunities for Hardware Acceleration in Big Data Analytics Paul Chow High-Performance Reconfigurable Computing Group Department of Electrical and Computer Engineering University of Toronto Who
Dynamic Resource Allocation in Software Defined and Virtual Networks: A Comparative Analysis
Dynamic Resource Allocation in Software Defined and Virtual Networks: A Comparative Analysis Felipe Augusto Nunes de Oliveira - GRR20112021 João Victor Tozatti Risso - GRR20120726 Abstract. The increasing
DDS. 16-bit Direct Digital Synthesizer / Periodic waveform generator Rev. 1.4. Key Design Features. Block Diagram. Generic Parameters.
Key Design Features Block Diagram Synthesizable, technology independent VHDL IP Core 16-bit signed output samples 32-bit phase accumulator (tuning word) 32-bit phase shift feature Phase resolution of 2π/2
Horst Görtz Institute for IT-Security
Horst Görtz Institute for IT-Security On the Vulnerability of FPGA Bitstream Encryption against Power Analysis Attacks Extracting Keys from Xilinx Virtex-II FPGAs Amir Moradi, Alessandro Barenghi, Timo
OpenFlow Switching: Data Plane Performance
This full text paper was peer reviewed at the direction of IEEE Communications Society subject matter experts for publication in the IEEE ICC 21 proceedings OpenFlow : Data Plane Performance Andrea Bianco,
Implementation of Virtual Local Area Network using network simulator
1060 Implementation of Virtual Local Area Network using network simulator Sarah Yahia Ali Department of Computer Engineering Techniques, Dijlah University College, Iraq ABSTRACT Large corporate environments,
Architectural Level Power Consumption of Network on Chip. Presenter: YUAN Zheng
Architectural Level Power Consumption of Network Presenter: YUAN Zheng Why Architectural Low Power Design? High-speed and large volume communication among different parts on a chip Problem: Power consumption
On the effect of forwarding table size on SDN network utilization
IBM Haifa Research Lab On the effect of forwarding table size on SDN network utilization Rami Cohen IBM Haifa Research Lab Liane Lewin Eytan Yahoo Research, Haifa Seffi Naor CS Technion, Israel Danny Raz
Migrate from Cisco Catalyst 6500 Series Switches to Cisco Nexus 9000 Series Switches
Migration Guide Migrate from Cisco Catalyst 6500 Series Switches to Cisco Nexus 9000 Series Switches Migration Guide November 2013 2013 Cisco and/or its affiliates. All rights reserved. This document is
White Paper. Requirements of Network Virtualization
White Paper on Requirements of Network Virtualization INDEX 1. Introduction 2. Architecture of Network Virtualization 3. Requirements for Network virtualization 3.1. Isolation 3.2. Network abstraction
RSVP- A Fault Tolerant Mechanism in MPLS Networks
RSVP- A Fault Tolerant Mechanism in MPLS Networks S.Ravi Kumar, M.Tech(NN) Assistant Professor Gokul Institute of Technology And Sciences Piridi, Bobbili, Vizianagaram, Andhrapradesh. Abstract: The data
Software Defined Networking & Openflow
Software Defined Networking & Openflow Autonomic Computer Systems, HS 2015 Christopher Scherb, 01.10.2015 Overview What is Software Defined Networks? Brief summary on routing and forwarding Introduction
