Construction of a Video Transmission Scenario in Software-Defined Networks for QoE Estimation

Quizza-Hernandez, Vicko; Arango-Colorado, Juan-Camilo; Campo-Muñoz, Wilmar-Yesid; Quizza-Hernandez, Vicko; Arango-Colorado, Juan-Camilo; Campo-Muñoz, Wilmar-Yesid

doi:10.19053/01211129.v31.n61.2022.14807

Services on Demand

Journal

Article

Indicators

Cited by SciELO
Access statistics

Revista Facultad de Ingeniería

Print version ISSN 0121-1129On-line version ISSN 2357-5328

Rev. Fac. ing. vol.31 no.61 Tunja July/Sept. 2022 Epub Oct 27, 2022

https://doi.org/10.19053/01211129.v31.n61.2022.14807

Artículos

Construction of a Video Transmission Scenario in Software-Defined Networks for QoE Estimation

Construcción de un escenario de transmisión de video en redes definidas por software para la estimación de la QoE

Construção de um Cenário de Transmissão de Vídeo em Redes Definidas por Software para Estimativa de QoE

Vicko Quizza-Hernandez¹
http://orcid.org/0000-0003-4142-0733

Juan-Camilo Arango-Colorado²
http://orcid.org/0000-0003-2911-7053

Wilmar-Yesid Campo-Muñoz³
http://orcid.org/0000-0001-8585-706X

^¹ Universidad del Quindío (Armenia-Quindío, Colombia). vquizzah@uqvirtual.edu.co.

^² Universidad del Quindío (Armenia-Quindío, Colombia). jcarangoc@uqvirtual.edu.co.

^³ Ph. D, Universidad del Quindío (Armenia-Quindío, Colombia). wycampo@uniquindio.edu.co

Abstract

The services supported by data networks have become widespread, so the architectures of the new data networks are service-oriented. They are endowed with intelligence, flexibility, and programmability. The preceding is with the aim of providing acceptability by users. Thus, this paper presents the construction of a video transmission scenario over a software-defined network (SDN, Software-Defined Networking) using free software and modifying its behavior with background traffic, on which the Quality of Experience (QoE) is estimated. Subjective and objective metrics were used for the QoE estimation. For the first one, the Mean Opinion Score (MOS) was used, while the second one was studied with the Full Reference Image Quality Assessment (FR-IQA). Finally, a correlation between the two types of metrics was proposed.

Keywords: IQA-FR; MOS; QoE; SDN; Video streaming

Resumen

Los servicios soportados por las redes de datos se han masificado por lo que las arquitecturas de las nuevas redes de datos están orientadas a servicios, dotadas de inteligencia, flexibilidad y programabilidad. Lo anterior con el objetivo de brindar la aceptabilidad por parte de los usuarios de los servicios. Así, en este artículo se presenta la construcción de un escenario de transmisión de video sobre una red definida por software (SDN, Software-Defined Networking) utilizando software libre y modificando su comportamiento con tráfico de fondo, sobre el que se estima la calidad de experiencia (QoE, Quality of Experience). Para la estimación de la QoE se usaron métricas subjetivas y objetivas. Para la primera de ellas se usa la puntuación de opinión media (MOS, Mean Opinion Score), mientras que las segundas se estudian a partir de las mediciones de calidad de imagen con referencia completa (IQA-FR, Image Quality Assessment Full-Reference). Finalmente, se propone una correlación entre los dos tipos de métricas.

Palabras clave: IQA-FR, MOS, QoE; SDN; Videostreaming

Resumo

Os serviços suportados pelas redes de dados se difundiram, de modo que as arquiteturas das novas redes de dados são orientadas a serviços, dotadas de inteligência, flexibilidade e programabilidade. O anterior com o objetivo de fornecer aceitabilidade pelos usuários dos serviços. Assim, este artigo apresenta a construção de um cenário de transmissão de vídeo sobre uma rede definida por software (SDN, Software-Defined Networking) utilizando software livre e modificando -seu comportamento com tráfego de fundo, no qual se estima a qualidade da transmissão. Qualidade da Experiência). Métricas subjetivas e objetivas foram utilizadas para a estimativa de QoE. Para o primeiro deles, utiliza-se o Mean Opinion Score (MOS), enquanto os segundos são estudados a partir das medidas de referência completa da avaliação da qualidade da imagem (IQA-FR, Image Quality Assessment Full-Reference). Por fim, é proposta uma correlação entre os dois tipos de métricas.

Palavras-chave: IQA-FR; MOS; QoE; SDN; Transmissão de vídeo

I. INTRODUCTION

The applications and services supported by data networks are constantly growing, so the architectures of the new networks must be service-oriented and equipped with intelligence, flexibility, and programmability. The above with the aim of providing the global acceptability of an application or service. This is the quality of experience (QoE) [¹]. To achieve this vision, the industry and the academy have established that the softwarization of the network constitutes the fundamental basis for achieving a multi-client architecture capable of satisfying the diverse current and future requirements [²]. From this situation, a proposal that takes advantage of the technological advance of recent years arose; that is, the Software-Defined Network (SDN) and the Network Function Virtualization (NFV) [³], [⁴].

The QoE is based on subjective parameters; that is, it measures the interaction between the content presented and the user's perception (color, light intensity, or damaged pixels), and it is expressed through concepts such as: excellent, good, acceptable, regular, or bad [⁵]. Other factors that could impact the QoE are usability, technological performance, subjective evaluation, expectations, context, screen size, lighting, viewing distance, and user movement [⁶]. The SDN, on the other hand, separates forwarding and controlling functions to design a network that can be centrally scheduled and managed. In contrast, the NFV isolates network functions from hardware and provides the infrastructure on which the SDN software can run. Both are based on virtualization and use network isolation, but they differ in how they separate functions and extract resources [⁷].

Thus, this paper presents the construction of a video scenario since this service generates the most traffic in both mobile and wired data networks [⁸], [⁹]. Moreover, the consumption of this service has dramatically increased with the arrival of OTTs, social networks, and the capabilities of devices such as smart TVs and smartphones [¹⁰]. Therefore, the functional scenario that supports the video streaming service is built using SDN and NFV networks; background traffic is used to vary its behaviour. For each scenario, the QoE is estimated using the subjective metric of the Mean Opinion Score (MOS), and the objective metrics are also calculated using image quality measurements with complete reference (FR-IQA). The objective metrics used are the Peak Signal-to-Noise Ratio (PSNR), the Structural Similarity Index Measure (SSIM), and the Video Multimethod Assessment Fusion (VMAF). The PSNR is the relationship between the maximum possible power value of a signal and the power of the noise that affects or distorts the quality of its representation. The SSIM is used to measure the similarity between two images and predict the perceived quality of the images. The VMAF predicts subjective video quality by comparing the reference video stream and the one received by the client.

The paper is organized as follows: section II presents the methodology, section III presents the results and their discussion, and finally, section IV presents the conclusions and future works derived from this research.

II. METHODOLOGY

For the development of this research, an adaptation of the methodology considered in [¹¹] was carried out, where three methodological phases were defined. In the first phase, the tools identified were capable of supporting software networks under the SDN and NFV paradigms, as well as the server and the client for the video transmission service. In the second phase, we proceeded to implement and integrate the tools in a functional scenario. In phase three, the QoE estimation experiments were carried out using measurement instruments and the analysis of the results (Figure 1).

Fig.1. Construction phases of the video transmission scenario in SDN.

For the implementation of the scenario, open-source tools were chosen as follows: for the construction of the SDN, the Mininet software was used due to its ability to emulate the behavior of this type of network [¹²]. For virtualization, docker was chosen due to its low resource consumption and ease of creating containers. Then, the NGINX server was selected as the server for video transmission due to its ability to support a high number of simultaneous requests. Lastly, VLC was selected as the client due to its ability to play many video formats and codecs and its reliability in the graphical environments available for Linux systems. As a video encoding tool, the FFmpeg multimedia framework was used. As a tool for background traffic generation, D-ITG (Distributed Internet Traffic Generator) was used because it allows the handling of packets at a low level; that is, the ability to select the number, size, or speed. The video used was Big Buck Bunny, which is classified as category AE due to its movement [¹³].

Figure 2 shows the experimentation scenario. In addition, the OpenDayLight (ODL) remote controller was used, which allowed viewing the network topology and the registration of the devices. The video streaming server and the client were hosted in separate containers which are built using Docker. To make the connection between the SDN network, the server, and the client hosted in the containers, the Mininet-based NFV tool called Containernet [¹⁴] was used. This tool helps to virtualize Docker containers as hosts in a Mininet-emulated network topology.

Fig. 2 Experimentation scenario.

To deliver the video to the NGINX server, the OBS Studio software was used so that it could carry out the transmission. The topology used in the SDN is a linear network made up of two OVS switches, which run with the Linux kernel to run the OpenFlow 1.3 protocol necessary for its compatibility with the ODL driver. The switches are connected to each other through an Ethernet interface, and each of them has a host. In this case, the Docker containers with the server and the client are generated by the Containernet API. The Python code and the topological dictionary of the SDN network are shown in Figure 3.

Fig. 3 SDN network.

The docker container operating system for the server is Ubuntu Trusty, this being the official image, which is ready to add and install whatever application or service is needed, in this case, the NGINX server. For the client, the image used by the Docker container is dockerfile-ubuntu-gnome provided in the official Docker image repository [¹⁵].

Up to this point, we have the elements of the experimentation scenario. In addition, to build different scenarios, the D-ITG platform was used to congest the network with background traffic; thus, observe how this traffic affects the quality of the transmitted video. This platform is capable of producing traffic for IPv4 and IPv6 accurately replicating the workload of current internet applications [¹⁶], allowing the variation of different characteristics in the traffic to be introduced, such as the type of packet, the size in bytes of the payload, the packets sent per second, and the time in milliseconds in which the traffic will be sent.

As instruments for the estimation of the QoE, the MOS was used. For this, 15 users were selected through the Ishihara test [¹⁷]. Thus, users watched 40-second video clips that qualify for the following 15 seconds. The quality criterion considered the color of the image, the resolution, the blurriness, and the continuity of the movement if there is flickering. On the contrary, the content of the video was not considered. The following scale is used for the qualification: Excellent = 5, Good = 4 - 4.9, Acceptable = 3 - 3.9, Mediocre = 2 - 2.9, and Poor = 1 - 1.9 [¹⁸].

The test was designed to be carried out with three different scenarios: scenario 1, without congestion; scenario 2, with medium congestion of 90 Kbps; and scenario 3, with high congestion of 200 Kbps in the SDN network. The congestion was performed using the D-ITG tool using the ITGSend command from the server and ITGRecv on the client, using TCP (Transmission Control Protocol) packets with a payload size of 100 bytes. Each scenario was repeated three times, giving a total of 9 different visualizations.

III. RESULTS AND DISCUSSION

A. Subjective Estimation of the QoE

The surveys carried out were processed in such a way that, for each user, the average of their scores was obtained in each of the visualizations by experimentation environment with a relative error of 5%. Subsequently, the standard deviation and the mean for each experimental environment were found among all the calculated means.

In Figure 4 (a), the results obtained for Scenario 1 are observed; that is, without congestion. The quality of experience expressed by users was, on average, in the range between Excellent and Good. It is understood that this estimate is given since the video display was the best by not having any network congestion.

Fig. 4 MOS (a) no congestion, (b) medium congestion, (c) high congestion.

Scenario 2 employed medium congestion where users perceived that the quality of the video received by the client was in the range between Acceptable and Mediocre. The result is observed in Figure 4 (b). On the other hand, for Scenario 3, which had high congestion, the users rated the video quality as Poor. This can be seen in Figure 4 (c).

The results of the general mean for each of the experimental environments shown in Table 1 establish that, on average, the users had a perception of Good, Mediocre, and Bad quality for the scenarios without congestion, with medium congestion, and with high congestion, respectively.

Table 1 Statistics of the QoE for each scenario.

Scenario	Average QoE	Standard deviation
1	4.48	0.4723239853
2	2.55	0.5017219028
3	1.32	0.2996823715

However, to corroborate the QoE assertiveness of the scenarios, it is also necessary to analyze the standard deviation of the general mean. This measure defines the dispersion between each data obtained for each experimental scenario; so that, having a high standard deviation, there is a greater distance between the data and having a low standard deviation, the data is closer to each other. Therefore, it can be inferred that in the scenario with high congestion, the surveyed users gave a very close value to the video quality since it has a low standard deviation value, see Table 1; while in the scenarios without congestion and with average congestion, the standard deviation value is in a medium range, see Table 1, concluding that the users surveyed gave a more dispersed rating.

B. Objective Estimation

The measures for the objective Video Quality Assessment (VQA,) are studied from the measurements of the Image Quality Assessment (IQA). Since this research employs a controlled environment, the FR-IQA [¹⁹] is used. In FR-IQA algorithms, complete reference image information is needed to predict the quality of degraded or distorted images; these algorithms generally perform pixel disparity measurements between reference and distorted images. These FR-IQA objective metrics were obtained with the help of the FFmpeg framework [²⁰] and an additional library called ffmpeg-quality-metrics, which allowed to obtain comma-separated value files for the following metrics [²¹]: PSNR, this metric calculates the mean square value of the difference between the original video and the received frames from the pixel distortions [²²]; SSIM, which is based on perception to determine how much degradation an image/video has compared to a reference; it is expressed in a decimal from 0 to 1, being 1 the total similarity between the compared elements [²³]; and VMAF, which is a metric based on perception through machine learning, developed by Netflix for streaming systems, which uses different algorithms and measures through Vector support machines (SVM) to provide a quality measure for the video for the high correlation to the subjective opinions that a human being can have, allowing in some video streaming architectures the correction in real time of the video quality [²⁴], its measurement is a decimal from 0 to 100, with 100 being the maximum measurement fidelity.

Next, the curves of the frame-by-frame results of each objective metric for each of the three experimentation scenarios are presented.

1) PSNR. For the PSNR in Figure 5 (a), it is observed that 91.5% of the frames exceed the value of 20 dB and that, according to [²⁵], they are in the range of general PSNR. In Figure 5 (b), the PSNR is presented for Scenario 2, where 52% of the frames exceed the value of 20 dB. In Figure 5 (c), the PSNR is presented for Scenario 3, where only 40% of the frames exceed the value of 20 dB. According to these results, only Scenario 1 meets the general range of the PSNR.

Fig. 5 PSNR (a) No congestion, (b) Medium congestion, (c) High congestion.

2) SSIM.Figure 6 (a) shows the behavior of the SSIM metric of the video when it is transmitted in Scenario 1, where 92% of the frames exceed the value of 0.8; that is, they present a high similarity. In Figure 6 (b), the SSIM for Scenario 2 is presented, where only 58% of the frames exceed the value of 0.8. In Figure 6 (c), the SSIM for Scenario 3 is presented, where only 49% of the frames exceed the value of 0.8.

Fig. 6 SSIM (a) No congestion, (b) Medium congestion, (c) High congestion.

3) VMAF. In Figure 7 (a) - (c), the curves of the behavior of the frames-by-frames VMAF metric for scenarios 1, 2, and 3 are presented, respectively. Figure 7 (a) shows the VMAF for Scenario 1, where 48.7% of the frames exceed the value of 40, and 2% of the frames have a VMAF value of 0. Figure 7 (b) shows the VMAF for Scenario 2, where only 20% of the frames exceed the VMAF value of 40, and 15% of the frames have a VMAF value of 0. Figure 7 (c) shows Scenario 3, where only 16% of the frames exceed the VMAF value of 40, and 20% of the frames have a VMAF value of 0. Therefore, there is a greater loss of frames in scenarios with greater congestion; that is, VMAF equal to 0. In addition, note that there is a greater number of lost frames in the scenario with greater congestion; this is a lower VMAF frame by frame.

Fig. 7 VMAF (a) No congestion, (b) Medium congestion, (c) High congestion.

According to the above and relating the subjective metrics of the MOS with the objective metrics PSNR, SSIM, and VMAF, it can be said that the estimate of the QoE will be greater than or equal to 4; that is, a good MOS if the objective value of VMAF is above 40 in at least 50% of the frames, the objective value of the PSNR is greater than or equal to 20 dB in at least 90% of frames, and the SSIM target value is greater than or equal to 0.8 in at least 90% of frames.

IV. CONCLUSIONS

The chosen free tools showed their functionality and allowed their integration for constructing an emulated scenario of an SDN network to support the video service. Thus, an NGINX server and a VLC video client supported by NFV technology represented by Docker containers were integrated into a single scenario. The data network is based on SDN technology for which the Mininet emulation tool and the ODL external controller were used.

Subjective estimation using the MOS metric is a time-consuming process, which requires the prior selection of users; this is costly in terms of time and human resources. The QoE estimation was only good for Scenario 1, which is the one without congestion. It is worth mentioning that the influence of computational resources on the results obtained in each scenario cannot be ruled out. However, the contribution lies in the construction of the scenario and how it can be modified through background traffic.

Objective metrics for QoE estimation showed agreement with MOS and have the advantage of not being costly in terms of human resources or time. Furthermore, being complete reference metrics, they adapt well to controlled environments, and it is also possible to automate the process thanks to the capabilities offered by the FFmpeg framework.

In future work, the extrapolation of the scenario to the cloud and the study of other types of metrics that do not require a complete reference for the estimation of QoE is proposed.

ACKNOWLEDGMENTS

To Universidad del Quindío for the support on this research Project.

REFERENCES

[1] J. Nightingale, P. Salva-Garcia, J. M. A. Calero, Q. Wang, “5G-QoE: QoE modeling for ultra-HD video streaming in 5G networks,” IEEE Transactions on Broadcasting, vol. 64, no. 2, pp. 621-634, 2018. https://doi.org/10.1109/TBC.2018.2816786 [ Links ]

[2] O. Sami Oubbati, M. Atiquzzaman, T. Ahamed Ahanger, A. Ibrahim, “Softwarization of UAV networks: A survey of applications and future trends,” IEEE Access, vol. 8, pp. 98073-98125, 2020. https://doi.org/10.1109/ACCESS.2020.2994494 [ Links ]

[3] R. Souza, K. Dias, S. Fernandes, “NFV Data Centers: A Systematic Review,” IEEE Access, vol. 8, pp. 51713-51735, 2020. https://doi.org/10.1109/ACCESS.2020.2973568 [ Links ]

[4] L. Skorin-Kapov, M. Varela, T. Hoßfeld, K.-T. Chen, “A Survey of Emerging Concepts and Challenges for QoE Management of Multimedia Services,” ACM Transactions on Multimedia Computing, Communications, and Applications, vol. 14, no. 2s, pp. 1-29, Apr. 2018. https://doi.org/10.1145/3176648 [ Links ]

[5] ITU, “P.10: Vocabulary for performance, quality of service and quality of experience,” 2017. [ Links ]

[6] O. B. Maia, H. C. Yehia, L. De Errico, “A concise review of the quality of experience assessment for video streaming,” Computer Communications, vol. 57, pp. 1-12, Feb. 2015. https://doi.org/10.1016/j.comcom.2014.11.005 [ Links ]

[7] R. Mijumbi, J. Serrat, J. L. Gorricho, N. Bouten, F. De Turck, R. Boutaba, “Network function virtualization: State-of-the-art and research challenges,” IEEE Communications Surveys & Tutorials, vol. 18, no. 1, pp. 236-262, Jan. 2016. https://doi.org/10.1109/COMST.2015.2477041 [ Links ]

[8] K. Sun, H. Zhang, Y. Gao, D. Wu, “Delay-aware fountain codes for video streaming with optimal sampling strategy,” Journal of Communications and Networks, vol. 21, no. 4, pp. 339-352, Aug. 2019. https://doi.org/10.1109/JCN.2019.000024 [ Links ]

[9] L. M. Castaneda Herrera, A. Duque Torres, W. Y. Campo Munoz, “An Approach Based on Knowledge-Defined Networking for Identifying Video Streaming Flows in 5G Networks,” IEEE Latin America Transactions, vol. 19, no. 10, pp. 1737-1744, 2021. https://doi.org/10.1109/TLA.2021.9477274 [ Links ]

[10] A. Hammershoj, A. Nowak, J. K. B. Hansen, C. Stefanovic, “Next-Generation OTT Distribution Architecture Supporting Multicast-Assisted ABR (mABR) and HTTP/3 over QUIC,” in SMPTE 2020 Annual Technical Conference and Exhibition, pp. 31-39, 2022. https://doi.org/10.5594/M001928 [ Links ]

[11] N. C. Robinson, Research Methodology a step-by-step guide for beginners. Ranjit Kumar, 2021. [ Links ]

[12] M. Erel, E. Teoman, Y. Özçevik, G. Seçinti, B. Canberk, “Scalability analysis and flow admission control in mininet-based SDN environment,” in IEEE Conference on Network Function Virtualization and Software Defined Network, pp. 18-19, 2016. https://doi.org/10.1109/NFV-SDN.2015.7387396 [ Links ]

[13] Big Buck Bunny Movie, http://www.bigbuckbunny.org [ Links ]

[14] M. Peuster, H. Karl, S. V. Rossem, “MeDICINE: Rapid Prototyping of Production-Ready Network Services in Multi-PoP Environments,” in IEEE Conference on Network Function Virtualization and Software Defined Networks, pp. 148-153, 2016. https://doi.org/10.1109/NFV-SDN.2016.7919490 [ Links ]

[15] Garland, Docker Image | Docker Hub, 2022. https://hub.docker.com/r/garland/dockerfile-ubuntu-gnome/. [ Links ]

[16] S. Avallone, S. Guadagno, D. Emma, A. Pescape, G. Ventre, "D-ITG distributed Internet traffic generator," in First International Conference on the Quantitative Evaluation of Systems, 2004, pp. 316-317. https://doi.org/10.1109/QEST.2004.1348045 [ Links ]

[17] D. van Staden, F. N. Mahomed, S. Govender, L. Lengisi, B. Singh, O. Aboobaker, “Comparing the validity of an online Ishihara colour vision test to the traditional Ishihara handbook in a South African university population,” African Vision and Eye Health, vol. 77, no. 1, e370, Feb. 2018. https://doi.org/10.4102/aveh.v77i1.370 [ Links ]

[18] TU-T, P.910: Subjective video quality assessment methods for multimedia applications, 1999. https://www.itu.int/rec/T-REC-P.910-200804-I/en [ Links ]

[19] D. Narsaiah, R. S. Reddy, A. Kokkula, P. A. Kumar, A. Karthik, "A Novel Full Reference-Image Quality Assessment (FR-IQA) for Adaptive Visual Perception Improvement," in 6th International Conference on Inventive Computation Technologies, 2021, pp. 726-730, https://doi.org/10.1109/ICICT50816.2021.9358610 [ Links ]

[20] FFMPEG, Ffmpeg.org, 2022. https://ffmpeg.org/documentation.html [ Links ]

[21] W. Robitza, ffmpeg_quality_metrics., 2022. https://vqeg.github.io/software-tools/quality%20analysis/ffmpeg-quality-metrics/ [ Links ]

[22] Q. Huynh-Thu, M. Ghanbari, “The accuracy of PSNR in predicting video quality for different video scenes and frame rates,” Telecommunication Systems, vol. 49, pp. 35-48, 2012. https://doi.org/10.1007/s11235-010-9351-x [ Links ]

[23] Z. Wang, A. C. Bovik, H. R. Sheikh, E. P. Simoncelli, "Image quality assessment: from error visibility to structural similarity". IEEE Transactions on Image Processing, vol. 13, no. 4, pp. 600-612, Apr. 2004. https://doi.org/10.1109/TIP.2003.819861 [ Links ]

[24] F. Zhang, A. Katsenou, C. Bampis, L. Krasula, Z. Li, D. Bull, "Enhancing VMAF through New Feature Integration and Model Combination," in Picture Coding Symposium (PCS), 2021, pp. 1-5, https://doi.org/10.1109/PCS50896.2021.9477458 [ Links ]

[25] A. Leixi, Full Reference Video Quality Evaluation Method (PSNR, SSIM) and Conversion Model with MOS, 2013. https://blog.csdn.net/leixiaohua1020/article/details/11694369 [ Links ]

Citation: V. Quizza-Hernandez, J.-C. Arango-Colorado, W.-Y. Campo-Muñoz, “Construction of a Video Transmission Scenario in Software-Defined Networks for QoE Estimation”, Revista Facultad de Ingeniería, vol. 31 (61), e14807, 2022. https://doi.org/10.19053/01211129.v31.n61.2022.14807

AUTHORS’ CONTRIBUTION

Vicko Quizza-Hernandez: Conceptualization, formal analysis, writing-review and editing.

Juan-Camilo Arango-Colorado: Conceptualization, formal analysis, writing-review and editing.

Wilmar-Yesid Campo-Muñoz: Supervision, methodology, investigation, writing-review and editing.

Received: May 10, 2022; Accepted: September 22, 2022; Published: September 28, 2022

This is an open-access article distributed under the terms of the Creative Commons Attribution License