Format Archives - epiclabs.io

Assessing metrics for video quality verification in Livepeer’s ecosystem (I)

Livepeer is creating a scalable and decentralized PaaS for developers who want to add live or on-demand video to their projects.

Epic Labs, a Software Innovation Center, specialized in the media industry, is contributing to the project, an effort funded by Livepeer’s grant program.

This is the first of a series of posts. The aim of these is to share with the community the progress of our research effort to gather feedback and ideas. Please share your comments and/or fork the repo to contribute. Let’s get to it!

As explained here:

Livepeer is a protocol for video transcoding. Transcoding is a compute-heavy process that has traditionally carried a high technical and monetary cost. Livepeer aims to slash the cost of transcoding with an open network that incentivizes transparent competition among suppliers of transcoding capacity. The rules of the Livepeer protocol are backed by smart contracts on the Ethereum blockchain, an immutable ledger.

The combination of an open, permissionless network and immutable non-reversibility attracts byzantine (adversarial) behavior: any participant can attempt to “break the protocol” without immediate consequences from within the Livepeer network. A verification mechanism is necessary to decide whether the transcoding work was done correctly.

Part of this contribution involves researching and attempting to tackle the problem of verifying that the transcoded content itself is a reasonable match for the original source, given a good-faith effort at transcoding.

Eventually, the mission consists of developing a verification classifier that gives a pass / fail output, along with a confidence score, for a given segment of a given asset.

Assessing the quality of a single asset

The literature on Video Quality Assessment (VQA) is vast, and going into deep detail about it is beyond the scope of this article. Interested readers are referred to other Medium publications here (by Netflix) and here.

As a foundation for our discussion, let’s agree that when a transcoded copy of a video asset is made using lossy compression, there will be approximations and data omissions. The evaluation of the effect of those omissions by humans is termed Subjective Quality Assessment, and is based on the concept of Mean Opinion Score (MOS). However, these evaluations can be terribly tedious and time-consuming, and isn’t progress about improving people’s lives?

Above: two different lossily compressed versions of an original picture (a frame from Big Buck Bunny). Defining a criterion for what counts as an acceptable quality level is not an easy task, hence the plethora of objective quality methods attempting to measure quality.

Enter Objective Quality Methods. These methods, as their name implies, attempt to model the distortions incurred during the encode / decode process and predict the most likely MOS a piece of lossily compressed video will receive. They can be based either on the Human Visual System (HVS) or on purely mathematical / engineering metrics, the latter being the most widespread kind, with a list of variations growing by the year: MSE, PSNR, SSIM, MS-SSIM, PEVQ, VMAF, …

These engineering / traditional metrics can be further categorized depending on whether they are based on a full comparison between the original and the transcoded version (FR metrics), on a comparison between a reduced representation of the original and the copy (RR metrics) or on a no-reference model that looks for artifacts and other degradations only in the copy (NR metrics).

In this article and the ones that follow, we will elaborate on how to use these metrics to approve / reject a piece of encoded video.

A first experiment to build some intuition about the problem at hand is to plot the evolution over time of one of the metrics (say, SSIM) for a given movie. The charts below show the frame-by-frame values over the first 50 seconds (1240 frames) of the Big Buck Bunny movie (720p@1.3Mbps), encoded to 500 kbps and 250 kbps, respectively.

As expected, the lower bitrate introduces more artifacts and distortions: the 250 kbps curve runs almost parallel to the 500 kbps one, just with slightly lower values, most visibly in some specific scenes.

SSIM time evolution for the first 1240 frames (50 seconds) of Big Buck Bunny, encoded at 500 kbps (above) and 250 kbps (below)
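For readers who want to reproduce this kind of curve, here is a minimal sketch of a per-frame SSIM series in plain Python. It is not the code used for the charts above: it treats each frame as one single window, whereas standard SSIM implementations average the index over many small local windows, so the absolute values will differ. The function names and the flat-list frame representation are assumptions made for illustration.

```python
from statistics import fmean


def global_ssim(ref, dist, max_value=255.0):
    """SSIM of two frames, computed over the whole frame as one window.

    ref and dist are equal-length flat sequences of luma samples.
    Simplification: real implementations use sliding local windows.
    """
    if len(ref) != len(dist) or not ref:
        raise ValueError("frames must be non-empty and the same size")
    # Stabilising constants from the SSIM definition (K1 = 0.01, K2 = 0.03).
    c1 = (0.01 * max_value) ** 2
    c2 = (0.03 * max_value) ** 2
    mu_x, mu_y = fmean(ref), fmean(dist)
    var_x = fmean([(x - mu_x) ** 2 for x in ref])
    var_y = fmean([(y - mu_y) ** 2 for y in dist])
    cov = fmean([(x - mu_x) * (y - mu_y) for x, y in zip(ref, dist)])
    numerator = (2 * mu_x * mu_y + c1) * (2 * cov + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


def ssim_series(ref_frames, dist_frames):
    """One SSIM value per frame pair: the time series plotted in the charts."""
    return [global_ssim(r, d) for r, d in zip(ref_frames, dist_frames)]


if __name__ == "__main__":
    original = [10, 50, 90, 130, 170, 210]   # toy 6-pixel "frame"
    encoded = [12, 47, 95, 128, 160, 215]    # same frame after lossy coding
    print(ssim_series([original, original], [original, encoded]))
```

An identical frame scores 1, and any distortion pulls the value below 1, which is why the 250 kbps curve sits under the 500 kbps one.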

Well, this looks promising. Let’s see how much the two SSIM curves diverge by measuring the ratio between them:

Time evolution of the ratio between SSIMs encoded at 500 kbps and 250 kbps. First 1240 frames (50 seconds) of Big Buck Bunny

Aha, a mean ratio of 3%. One would expect a fairly flat line, not ramps and spikes, though. In future articles we may dig further into these glitches at frames 95, 305, 405, 605, 880 and 986. Ignore them for now if you can. Otherwise, this ratio gives a fairly decent indication of how much more distortion the 250 kbps rendition introduces with respect to the 500 kbps one.
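The ratio curve itself is simple to compute once two per-frame series are available. The sketch below is an illustration under assumptions: the helper names are hypothetical, the 3% figure is read as a ratio averaging about 1.03, and the spike tolerance is an arbitrary value chosen for the toy data, not one taken from the experiment.

```python
from statistics import fmean


def ratio_series(reference_scores, candidate_scores):
    """Frame-by-frame ratio between two metric series (e.g. 500 vs 250 kbps)."""
    return [r / c for r, c in zip(reference_scores, candidate_scores)]


def mean_excess_percent(ratios):
    """How far the mean ratio sits above 1, expressed as a percentage."""
    return 100.0 * (fmean(ratios) - 1.0)


def spike_frames(ratios, tolerance=0.05):
    """Indices of frames whose ratio strays from the mean by more than tolerance.

    The tolerance is an arbitrary illustrative choice.
    """
    centre = fmean(ratios)
    return [i for i, r in enumerate(ratios) if abs(r - centre) > tolerance]


if __name__ == "__main__":
    ssim_500 = [0.96, 0.97, 0.95, 0.98]   # made-up values, not measured data
    ssim_250 = [0.93, 0.94, 0.80, 0.95]
    ratios = ratio_series(ssim_500, ssim_250)
    print(mean_excess_percent(ratios), spike_frames(ratios))
```

Listing the frames that stray from the mean is one way to locate glitches like the ones mentioned above without reading them off the chart.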

Let’s see now what happens if, in a display of mischief, one includes a watermark in the 500 kbps rendition:

SSIM time evolution for the first 1240 frames (50 seconds) of Big Buck Bunny, encoded to 500 kbps with a watermark

Ok, so the time series is now completely different. It would be very difficult to use SSIM to distinguish a lower-bitrate rendition from a watermarked rendition. The 250 kbps rendition has a mean SSIM of 0.93, whereas the watermarked version shows a mean SSIM of 0.95.

Taking a look at the ratio between the 500 kbps and the watermarked rendition, we find something even more astonishing. Watermarks improve the SSIM!! (Well, in some segments at least).

Time evolution of the ratio between SSIMs encoded at 500 kbps and with a watermark. In some segments (ratios below 1), watermarks seem to give even better SSIM results than the ‘*bona fide*’ rendition at 500 kbps.

Blast. Encodings with watermarks score better than those with lower bitrates.

In practice, this means that if one were to use SSIM to establish a prescriptive threshold, that threshold would be arbitrary and subject to debate, even when using statistics such as averages, histograms or the like. Where is the limit? 1% ratio with respect to the mean? Too strict. 10%? Too loose.

Bringing in more metrics

Let’s use yet another metric, just to evaluate if this issue only affects SSIM. We summon PSNR, another VQA champion!

PSNR values are on a different scale from SSIM values. To begin with, they are expressed in dB (logarithmic) rather than as linear scores. Nonetheless, some experts have defined tables that map dB to MOS. It is more or less accepted that anything below 20 dB is the worst quality and that more than 37 dB is the highest quality one can perceive with the naked eye.
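To make the dB scale concrete, here is a small sketch of the textbook definitions of MSE and PSNR for 8-bit samples. The function names and the flat-list frame representation are our own choices for illustration, not the implementation used for the figures.

```python
import math


def mse(ref, dist):
    """Mean squared error between two equal-length sequences of samples."""
    return sum((r - d) ** 2 for r, d in zip(ref, dist)) / len(ref)


def psnr(ref, dist, max_value=255.0):
    """Peak signal-to-noise ratio in dB; infinite for identical frames."""
    error = mse(ref, dist)
    if error == 0:
        return math.inf
    return 10.0 * math.log10(max_value ** 2 / error)


if __name__ == "__main__":
    original = [100, 100, 100, 100]
    encoded = [101, 99, 101, 99]     # every sample is off by one level
    print(round(psnr(original, encoded), 2))
```

Because of the logarithm, halving the mean squared error adds only about 3 dB, which is why PSNR curves cannot be compared directly with SSIM scores and we work with ratios instead.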

For illustrative purposes, we will compare metrics as a ratio between renditions. The figures below repeat the experiment we did before: the ratio between the instantaneous SSIM of the 500 kbps rendition and that of the 250 kbps rendition, and the same ratio between the 500 kbps rendition and its watermarked version. SSIM and PSNR are superimposed to show their respective contributions.

Time evolution of the ratio between SSIMs and PSNRs for encodings at 500 kbps and 250 kbps.

Time evolution of the ratio between SSIMs and PSNRs, encoded at 500 kbps and at 500 kbps with a watermark.

Hmm. Well. It seems that PSNR is not adding much more information. It is a bit more sensitive than SSIM to some scenes, amplifying the signal in both directions, but other than that, if we were to blindly use the average we would again end up concluding that watermarked renditions are better than those without (which is wrong).

Generalizing to several assets

But let’s see to what extent we could still use them. Let’s leave watermarks aside. Perhaps there are upper and lower bounds to how much these metrics vary, and perhaps they can still be of use to assess the validity of an encoding, if only to verify that it is a decent copy of the original.

We have taken a number of assets from YouTube’s YT8M dataset and encoded a few renditions of them. Specifically, we took about 140 videos from this dataset, treated the 1080p rendition as the original, and encoded 10 seconds of each to 720p, 480p, 360p and 240p. For the sake of simplicity, we set the respective bitrates to those used by YouTube for each rendition (you can find a more detailed article on how this can be done here).

We have also invited a few more full reference metrics to the party, namely cosine, euclidean and Hamming distances, to add more diversity to the analysis.
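For readers unfamiliar with these three distances, the sketch below shows their textbook definitions on small vectors. How the repo derives the vectors or hashes from video frames is not covered here, so the inputs are placeholders and the function names are ours.

```python
import math


def euclidean_distance(a, b):
    """Straight-line distance between two equal-length numeric vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cosine_distance(a, b):
    """1 minus the cosine of the angle between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def hamming_distance(bits_a, bits_b):
    """Number of positions at which two equal-length bit sequences differ."""
    return sum(x != y for x, y in zip(bits_a, bits_b))


if __name__ == "__main__":
    print(euclidean_distance([0, 0], [3, 4]))
    print(cosine_distance([1, 0], [0, 1]))
    print(hamming_distance([1, 0, 1, 1], [1, 1, 1, 0]))
```

All three grow as the copy drifts away from the original, the opposite direction to SSIM or PSNR, where higher means closer.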

Once we had gathered our renditions, we iterated video by video (4 renditions x 140 videos = 560 specimens) and extracted their mean PSNR, SSIM, MS-SSIM and VMAF, as well as their cosine, Hamming and euclidean hash distances, with respect to the original 1080p rendition. Full details about this implementation can be explored in the GitHub repo we have set up here. One caveat: the segments were chosen rather arbitrarily, as 10 seconds (240 to 300 frames) from the middle of the video’s timeline. What if those segments happen not to be representative of the whole video? They could depict a static scene with very little color variation, or fall in the middle of an epic battle with lots of motion. Scene complexity is known to severely affect output quality when the bitrate is constant, and with it the likelihood of an encoding job being rejected even if properly done.

With this substantial amount of data, we can proceed to evaluate how the different metrics relate to each other. A bit of EDA (Exploratory Data Analysis) might bring some insight here. The matrix below depicts a pairs plot of our newly generated dataset. It builds on two elementary plots: scatter plots of each metric against every other, and histograms of each metric on the diagonal. We can see that all distances (euclidean, cosine and Hamming) are linearly related, meaning basically that they could be used almost interchangeably. On the other hand, SSIM and PSNR are also somewhat correlated, in a logarithmic / exponential manner, but inversely with regard to the hash distance metrics. In a world apart, the more sophisticated MS-SSIM and VMAF present some degree of connection between them, and display a pattern similar to SSIM’s in their lower bound with regard to the hash distance metrics.

Pairs plot for our dataset of 140 YouTube clips. Distance metrics (cosine, euclidean and Hamming) are fairly linearly correlated, whereas SSIM and PSNR show some logarithmic / exponential relationship with each other. VMAF and MS-SSIM also have some degree of linear correlation with each other. MS-SSIM appears as the one with the most compact distribution.

The strength of these correlations is given by Spearman’s correlation coefficient, displayed in the table below. Without going into details, Spearman’s coefficient gives the same kind of information as Pearson’s, but is calculated on ranks instead of actual data values. The table shows both positive (blue) and negative (red) correlations, where +1 means total positive correlation (when one feature grows, so does the other) and -1 means total negative correlation (when one feature grows, the other decreases).

Spearman correlation table for all seven metrics used in our analysis

From Wikipedia:

The sign of the Spearman correlation indicates the direction of association between X (the independent variable) and Y (the dependent variable). If Y tends to increase when X increases, the Spearman correlation coefficient is positive. If Y tends to decrease when X increases, the Spearman correlation coefficient is negative. A Spearman correlation of zero indicates that there is no tendency for Y to either increase or decrease when X increases. The Spearman correlation increases in magnitude as X and Y become closer to being perfect monotone functions of each other.
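The "Pearson on ranks" idea can be sketched in a few lines of plain Python. This is an illustration of the definition rather than the code used to build the table above; the function names are ours, and ties receive the average of the ranks they span.

```python
from statistics import fmean


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = fmean(x), fmean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman correlation: Pearson applied to the ranks of x and y."""
    return pearson(average_ranks(x), average_ranks(y))


if __name__ == "__main__":
    x = [1, 2, 3, 4, 5]
    y = [1, 4, 9, 16, 25]            # monotone but not linear in x
    print(pearson(x, y), spearman(x, y))
```

In the usage example the relationship is monotone but curved, so Spearman reports a perfect correlation while Pearson stays below 1. That is the reason a rank-based coefficient suits metrics related in a logarithmic / exponential way.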

Summa summarum, the above tells us that, to some extent, all seven metrics would provide fairly similar information once those relationships are substantiated and disentangled by means of some mathematical sorcery.

On the other hand, one might want to see which metric has the lowest variance, hence giving a better basis for defining strict thresholds. The histograms displayed on the diagonal of the pairs plot point to MS-SSIM as the best candidate, with a lower bound somewhere around 0.925 MS-SSIM points. There are some outliers, though, worth checking in order to further understand why they show such “bad” behavior.

To find out which assets these outliers are and why some of them present MS-SSIM values below 0.975, we can plot the values of the different metrics against their rendition identifier. Each of the following charts displays the mean value of the respective metric over 10 seconds for all the different encodings we made. So, basically, for each asset we have four points, one for each rendition. Note that not all of them have renditions at exactly 426:240, 640:360, 854:480 and 1280:720, but that doesn’t make them outliers. We are interested in the metric value, not the dimensions.

This allows us to locate outliers and inspect them further. In the Jupyter notebook provided in the repository, this can be easily done by means of Plotly and hovering the cursor over the points.
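Outside the notebook, the same outliers can be listed programmatically. The sketch below is not taken from the repository: it assumes the per-rendition means are held in a plain dictionary keyed by hypothetical (asset, rendition) pairs, and it uses the 0.975 MS-SSIM figure mentioned above as the cut-off. The sample values are made up.

```python
def find_outliers(mean_scores, threshold=0.975):
    """Return the (asset, rendition) keys whose mean score is below threshold.

    mean_scores maps (asset_id, rendition) to the mean metric value over the
    10-second segment. The keys and values used here are illustrative only.
    """
    return sorted(key for key, score in mean_scores.items() if score < threshold)


if __name__ == "__main__":
    ms_ssim_means = {                      # made-up sample values
        ("asset_a", "1280:720"): 0.995,
        ("asset_a", "426:240"): 0.981,
        ("asset_b", "1280:720"): 0.972,
        ("asset_b", "426:240"): 0.931,
    }
    for asset, rendition in find_outliers(ms_ssim_means):
        print(asset, rendition)
```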

…and here we go. One frame of one of the samples that lie outside the crowd is presented below. This video shows people jumping from the rocks into the water and having fun in what is seemingly a very complex asset, both in its dynamics through time and in its spatial relationships and pixel textures.

Frame from one of the outliers. The video presents a high amount of motion and fine detail that is hard for the codec to deal with at a reasonable compression rate.

So, what does this mean? Only simple videos would pass a filter based on the average of a metric? We need to establish a somewhat more sophisticated metric that takes into account both the relative reduction in quality as well as the complexity in time and space.

Conclusions and further work

Ok, so. Objective Full Reference metrics cannot be used, either in isolation or grouped, to define thresholds for acceptance / rejection. There are several problems associated with them.

In the first place, it appears that all of them supply the same, or almost the same, information in one way or another (see the correlation table above).

Second, even once we have agreed on a threshold under which measured distortion is not acceptable (for example, 0.97 MS-SSIM points), other forms of attack are still possible (e.g. watermarks).

Third, for complex videos the threshold might prove too restrictive for acceptably well-encoded renditions.

How do we proceed then?

One potential option is to tackle the problem from another angle, applying No Reference metrics to both the original and the copy, then comparing the results. Another line of research would involve creating adaptive thresholds that take into account the spatial and temporal activity levels of an asset. Yet another thread might lead us to simply train a neural network on annotated assets, defining the different kinds of attack ourselves (poor resolution, watermark, etc.).
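To give a feel for what an adaptive threshold could look like, here is a deliberately simplified sketch. Everything in it is an assumption made for illustration: the activity measures use plain finite differences rather than the Sobel-based definitions found in standards, and the linear relaxation rule, its slope and its floor are invented numbers, not results from this research.

```python
from statistics import pstdev


def spatial_activity(frame):
    """Std-dev of gradient magnitudes over a 2-D luma frame (list of rows)."""
    grads = []
    for y in range(len(frame) - 1):
        for x in range(len(frame[0]) - 1):
            gx = frame[y][x + 1] - frame[y][x]
            gy = frame[y + 1][x] - frame[y][x]
            grads.append((gx * gx + gy * gy) ** 0.5)
    return pstdev(grads)


def temporal_activity(prev_frame, frame):
    """Std-dev of pixel differences between two consecutive frames."""
    diffs = [b - a
             for row_a, row_b in zip(prev_frame, frame)
             for a, b in zip(row_a, row_b)]
    return pstdev(diffs)


def adaptive_threshold(base, spatial, temporal, slack_per_unit=0.0005, floor=0.90):
    """Relax a base threshold as activity grows (hypothetical linear rule)."""
    return max(floor, base - slack_per_unit * (spatial + temporal))


if __name__ == "__main__":
    flat = [[5] * 4 for _ in range(4)]
    edged = [[0, 0, 0, 0], [0, 0, 0, 0], [0, 0, 255, 255], [0, 0, 255, 255]]
    si = spatial_activity(edged)
    ti = temporal_activity(flat, edged)
    print(adaptive_threshold(0.97, si, ti))
```

A static, flat asset keeps the strict base threshold, while a busy one such as the diving clip above would be judged against a looser bound. Finding a rule that is actually justified by data is precisely the open question.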

This is getting interesting! Stay tuned for follow-up posts on this exciting research project!

References

Livepeer (github.com)

epiclabs-io/YT8M: (Machine) Learning from YouTube8M dataset (github.com)

Toward A Practical Perceptual Video Quality Metric: measuring video quality accurately at scale (medium.com)

Video Quality Assessment (medium.com)

Human visual system model, Wikipedia (en.wikipedia.org)

About the authors

Rabindranath holds a PhD in Computational Physics from the UPC and is an AI researcher. Dionisio is a Computer Science Engineer from the UPM specialized in Media. Ignacio is a Telecommunications Engineer from the UPM specialized in Data Science and Machine Learning. They are part of Epic Labs, a software innovation center for Media and Blockchain technologies.

Livepeer is sponsoring this project with Epic Labs to research ways to evaluate the quality of video transcoding happening throughout the Livepeer network. This article series helps summarize our research progress.

To learn more about Epic, visit www.epiclabs.io or @epiclabs_io on Twitter.

III EL ESPAÑOL Forum – Blockchain and Cryptocurrencies

Epic Labs took part in the III EL ESPAÑOL Forum, Blockchain and Cryptocurrencies: How Business Models Will Change.

The III EL ESPAÑOL Forum, Blockchain and Cryptocurrencies: How Business Models Will Change, was held on June 18. The event was attended by Epic Labs CEO and Founder Alfonso Peletier, who noted: “We can define a community as a group of people and companies that come together for a common good. Blockchain is going to help define how the communities of the future achieve their common goals.”

The event, backed by eToro, Banco Sabadell, Ethereum Madrid, KPMG, Universidad CEU San Pablo and Innsomnia, was hosted by Arturo Criado, editor-in-chief of the Business and Media section of El Español.

The first panel addressed the overall view of the market as well as its future prospects. Taking part were Luis Olmedo de Cáceres, Senior Manager of Global Strategy at KPMG; Javier Molina, eToro’s presenter in Spain; Antonio Gonzalo, co-founder of the think tank Ethereum Madrid; Javier Domínguez, of the programming node of Blockchain España; and Gabriel Montalto, a financial derivatives broker.

Olmedo de Cáceres broke the ice with a review of the current regulatory landscape. “In Europe, in 2017 we decided the time had come to study Blockchain and its applications seriously. In Spain, to begin with, we are present in all of the European Union’s initiatives. In addition, the CNMV is issuing clarifying notes regularly. So far there have been two: a warning and a guide for ICOs. Then there is Switzerland, which is the example to follow. Its Crypto Valley initiative, in an area near Zurich, is a major bet.”

Asked about Facebook’s initiative, the KPMG Global Strategy manager explained that, even before its launch, it is already having positive effects on the rest of the cryptocurrencies. “We are coming off a very good year. Since the beginning of the year, Bitcoin has nearly tripled in value. The launch of Libra is going to help this improve even further. The bet they have made is very significant; they have hired part of their team from Paypal and Coinbase. Even though Facebook’s project is not a crypto as such, it will let people use a new means of payment, and that is going to be very beneficial for the rest.”

For his part, Javier Molina began by stressing that Facebook’s initiative cannot be considered a cryptocurrency as such. “Facebook’s project is not going to be a cryptocurrency because it is not decentralized.”

eToro’s presenter in Spain explained that he considers this type of asset an investment opportunity, albeit one with a very high level of risk. “You have to be in these products, but they should represent 1% or 2% of a portfolio. The outlook is very good. The problem is not the asset, it is the use made of it. We could be looking at digital gold.”

Antonio Gonzalo, for his part, emphasized the important steps the sector has taken recently. “Last year we went through a very bearish period in terms of value. In terms of infrastructure, however, it was a very good year for work on scalability.”

The co-founder of Ethereum Madrid also rated the birth of Libra very positively. “Now, with the social network entering the sector, the outlook is excellent. Facebook’s network effect is going to be very good for the whole ecosystem.”

Montalto stressed that currently only a minority of citizens know and understand what Blockchain means. “The vast majority of people still don’t know what we are talking about. When people understand crypto at the same level as stock market shares, we are going to see a revolution. In a decade, the kids who are 13 today will be paying for their Coca-Cola or their cinema ticket with cryptocurrencies in Madrid and in Beijing.”

To close the panel, the financial derivatives broker shared some lesser-known activity based on cryptocurrencies. “There are already companies for buying homes in Bitcoin. Cash as such is going to disappear. High-denomination banknotes are being withdrawn, and cash payments above certain amounts are not allowed. I would not rule out that very soon we will be paying with crypto-based Visa cards.”

A vision of the future

The second panel focused on trying to give a view of how businesses will be reshaped thanks to these new initiatives. Taking part were Carlos Trevijano, Partner in charge of Fintech and Payments at KPMG Spain; Roger Olivet, Blockchain Director at Banco Sabadell; Francisco Estevan Vitores, CEO of Innsomnia; Ricardo J. Palomo, Professor at Universidad CEU San Pablo; Jesús Pérez, Director of CryptoPlaza at Finnovating; and Alfonso Peletier, CEO of Epic Labs.

Trevijano began by answering a question from the audience asking for a definition of Blockchain. “It is a decentralized database based on a chain of blocks. In Blockchain, the network itself validates each operation. One of the most interesting innovations it has brought is smart contracts, which make it possible to automate the events that govern the contract when the conditions are met.”

As for the keys to the future, the head of Fintech and Payments at KPMG Spain pointed out that the main difficulty Blockchain will face is incentives to participate. “Incentives are going to play a fundamental role in the adoption of this technology among the general public.”

In his remarks, Roger Olivet analyzed the future effects of Facebook’s new initiative. “Today’s news is positive in many respects. It is going to give many people access to this asset and let them get to know the rest of the ecosystem. It will also help us financial institutions get our act together. In this environment, banks have an advantage: we are trusted institutions, and we have to make the most of it. Collaboration is going to be very important. This is not about islands; it is about archipelagos and bridges.”

Banco Sabadell’s head of Blockchain had the chance to give details about the initiative his company is taking part in. “Companies from different sectors have agreed to identify and accredit our suppliers. When a partner company wants to work with a company like ours, it loses a lot of time. Now, with the initiative we presented today, once it is validated, it is validated for all members.”

Estevan Vitores used his turn to offer his own definition of Blockchain. “When I tried to explain it to my son, I ended up concluding that Blockchain is the technology of truth. You have to get everyone to agree in order for the system to lie. It is a great opportunity to do things differently. Blockchain is much more than Bitcoin. If I were a notary, I would be worried and trying to get on board with this technology.”

The CEO of Innsomnia wanted to highlight the role of entrepreneurs. “We are at a thrilling moment. There is still a lot of fluctuation, and we have to keep learning. Entrepreneurs are going to play a fundamental role in the development of this industry.”

Among the different professional perspectives, Ricardo J. Palomo contributed the experience educational institutions are having with the Blockchain phenomenon. “This revolution is causing great fascination among students. These technologies and artificial intelligence are generating a lot of excitement about the changes. We are in a paradigm shift in which we will be deciding whether or not this technology gets used. Facebook’s initiative is going to further popularize everything related to this new industry.”

As for future opportunities, the Universidad CEU San Pablo professor predicted that we will see more and more initiatives that add value using Blockchain or cryptocurrencies. “We are moving from a theoretical phase to one in which we are already beginning to see use cases. More and more companies are going to rely on this technology, using it for more solutions. This requires time, training, receptiveness and enthusiasm. Every so often the world needs something new, and this is a very big one.”

With most opinions extolling the opportunities of Blockchain, Jesús Pérez wanted to speak up for cryptocurrencies. “Blockchain is a database with a set of attributes. Bitcoin, on the other hand, is already generating much more value. When strategies are built on Blockchain alone, business opportunities are lost. Venezuela transacts more Bitcoin than dollars. There are already companies with 600 million in profit. We are talking about a real business.”

Alfonso Peletier emphasized the collaborative potential of this technology. “We can define a community as a group of people and companies that come together for a common good. Blockchain is going to help define how the communities of the future achieve their common goals. Blockchain is here to stay. Over time the technology is going to evolve, because right now it is very immature. We don’t have the operating system fully finished, and we already want to put apps on top of it.”

The CEO of Epic Labs concluded by highlighting how exceptional the present moment is. “This technology still requires time and investment. Investment has been coming at a very good pace, but we are at a moment of immaturity. We have something special in our hands. With the Internet we saw the information revolution; now we will see the revolution of trust.”

A trust that all the forum’s participants showed in a future in which Blockchain and cryptocurrencies will play a central role in the new economy.

The event was held in collaboration with BLOCKCHAIN España.

Source: El Español

Photographs: Jesús Umbría – EL ESPAÑOL

LightFlow broadens user-context scope to further improve efficiencies

Cost, quality of experience and smooth operation all benefit from factoring in extended codec support, device-specific optimization and network QoS.

These are the new features added:

Content preparation supporting the VP9 codec, in addition to LightFlow’s current support for H.264 and H.265. This lets you optimize each playback on any device with the most efficient version of your video assets, using an intelligent multi-codec approach.

Smart bitrate management based on device, screen size, and viewing distance. LightFlow is able to learn from your users’ habits and context, and apply optimizations accordingly.

LightFlow can now apply optimizations that take into account the network conditions of your user base. Ensure your videos are always prepared to maximize your users’ quality of experience while remaining cost-effective.

Epic Labs to exhibit at IBC 2018 introducing LightFlow by Epic Labs.

August 3, 2018
EpicLabs
#BeEpic!, Epic Labs, Media, Press Release, Promo

EPIC LABS TO EXHIBIT AT IBC 2018 INTRODUCING FIRST-IN-CLASS VIDEO STREAM OPTIMIZER FOR QUALITY OF EXPERIENCE (QoE): LIGHTFLOW by Epic Labs

Epic Labs will exhibit at IBC 2018 introducing LightFlow: a QoE enhancer leveraging advanced machine learning technologies, smart encoding algorithms and cognitive services for better user engagement. Visit us at Stand 14.A36 to learn about the LightFlow Video Stream Optimizer for QoE.

Madrid- 02/08/2018 – Epic Labs (www.epiclabs.io), proud member of the DASH Industry Forum and Digital Production Partnership (DPP), is pleased to announce its presence at IBC 2018 introducing a new QoE enhancer media platform known as LightFlow.

LightFlow is an easy-to-use platform that enhances the user experience by combining advanced encoding techniques with machine learning. LightFlow also introduces ultrafast (real-time) encoding and cognitive services to optimize video streams, bringing the best quality of experience to customers. It works for both VoD (file-based) and live streams over DASH and HLS, on any device player.

LightFlow improves media through proprietary Machine Learning algorithms that analyse content to deliver the best video quality at a lower bitrate, generating an optimal ABR ladder and encoding profiles. Some of the benefits are: fewer rebuffering events, improved video start time (VST), fewer exits before video starts (EBVS) and fewer video start failures (VSF), achieving a best-in-class stream that flows perfectly over any internet connection.

A more efficient use of video bandwidth means lower storage costs and better CDN performance, while also reducing transcoding costs.

LightFlow can be easily integrated into any OTT or live TV channel workflow through a light, intuitive API and an architecture that allows multiple deployment models (SaaS, on-prem, cloud, hybrid).

Customers will get an enhanced QoE, better user engagement and better user retention by offering a TV-like experience.

LightFlow main features:

– Per title smart encoding.
– Per scene smart encoding.
– Live smart encoding.
– Low-Latency smart encoding.
– Advanced media cognitive services: face recognition, auto-clipping, enriched archiving, contextual advertising and content moderation.

LightFlow can be deployed on any media workflow and any existing infrastructure either on-premises or on any cloud provider. Epic Labs can also provide the encoding, storage and delivery for any workflow. LightFlow enables a 100% out-of-the-box solution to fit on any workflow or requirement.

Alfonso Peletier, Epic Labs CEO and Founder: “We have identified a rising demand from OTT providers and broadcasters to improve viewers’ experience: great image quality and a lighter stream that loads fast, with no rebuffering events no matter the network condition or device brand. At Epic Labs, we have put together an incredible team of media software engineers and encoding and machine learning experts to create LightFlow, with the mission to boost viewers’ QoE while keeping media workflow costs at bay. We are proud to be at IBC, the best place to be when launching a new product like LightFlow and to stay in close touch with decision makers from all over the world.”

In addition to offering attendees the chance for a hands-on demonstration and in-depth information about LightFlow, Alfonso Peletier, Epic Labs CEO & Founder, will be part of the panel “Artificial Intelligence and Machine Learning: Status update”, to be held on Saturday 15th at 15:30 in the Hall 14 Content Everywhere Hub.
Epic Labs will be at stand A36 in Hall 14 at Amsterdam RAI. To schedule a meeting with Epic Labs at IBC 2018, please send an email to sales@epiclabs.io or visit www.epiclabs.io

Press and Media contact (Content and advertisement):
Jesús Carrillo
Marketing and Communications Manager
M.: +34 649 801 593
E.: jcarrillo@epiclabs.io
W.: www.epiclabs.io

NLP for Computer Vision – Having breakfast with Terminator

In this video, we talk about another Machine Learning tool, Natural Language Processing, which can be used to enhance the accuracy and power of computer vision. Natural Language Processing, or NLP, is used to understand the meaning of human language by training an algorithm that looks for relationships between words. One of these relationships could be “belongs to” or “contains”, so we can relate several concrete terms (fruit, cereals, milk) to an abstract concept that contains them all (breakfast, morning). We can use this feature to increase our accuracy in detecting concepts in a scene. We also propose this method for the “Contextual Advertising” use case, using the context of a video to avoid wrong detections:

0:00 – Intro
0:37 – Object Detection
1:07 – NLP definition
1:16 – Context example 1: BREAKFAST
2:00 – Context example 2: BASEBALL
2:23 – Contextual advertising USE CASE
3:00 – Recap
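
The “belongs to” idea described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method used in the video: the `CONCEPTS` map is hand-written instead of being learned by an NLP model, the terms other than fruit, cereals and milk are hypothetical, and the `boost` and `penalty` values are arbitrary.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical, hand-written "belongs to" relations. A real system would
# learn these from text with an NLP model instead of hard-coding them.
CONCEPTS: Dict[str, Set[str]] = {
    "breakfast": {"fruit", "cereals", "milk", "coffee"},
    "baseball": {"bat", "glove", "ball", "helmet"},
}

Detection = Tuple[str, float]  # (label, confidence between 0 and 1)


def infer_context(detections: List[Detection]) -> str:
    """Return the concept whose member terms gather the most confidence."""
    scores = {
        concept: sum(conf for label, conf in detections if label in members)
        for concept, members in CONCEPTS.items()
    }
    return max(scores, key=lambda concept: scores[concept])


def rescore(
    detections: List[Detection], boost: float = 0.2, penalty: float = 0.2
) -> Tuple[str, List[Detection]]:
    """Raise detections that fit the inferred context and lower the others."""
    context = infer_context(detections)
    members = CONCEPTS[context]
    rescored = []
    for label, conf in detections:
        if label in members:
            conf = min(1.0, conf + boost)
        else:
            conf = max(0.0, conf - penalty)
        rescored.append((label, round(conf, 2)))
    return context, rescored


scene = [("fruit", 0.9), ("cereals", 0.8), ("milk", 0.7), ("glove", 0.4)]
context, adjusted = rescore(scene)
print(context, adjusted)
```

In this toy scene the low-confidence “glove” does not belong to the dominant context, so its score is lowered, which is the kind of wrong detection that contextual advertising would want to filter out.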

Machine Learning applications in Media and Broadcast industry

Interview with Alfonso Peletier, Epic Labs CEO and Founder, at Bit Audiovisual 2018. Epic Labs was part of a discussion panel focused on Artificial Intelligence for Media.

Dash.js V 2.6.8 released with relevant features.

May 9, 2018
EpicLabs
#BeEpic!, Case Study, Epic Labs, Media, Press Release

Epic Labs, a proud member of the DASH Industry Forum and Digital Production Partnership (DPP), announces the release of version 2.6.8 of dash.js that incorporates major improvements such as low latency Chunked CMAF support.

This new version incorporates several enhancements, such as fixes that improve captioning and DRM handling. Above all, two additions stand out: support for persistent licenses (DRM) and the introduction of low-latency chunked CMAF over HTTP, which enables 2-4 second latency for live streams while keeping a smooth playback experience. Major CDNs such as Akamai make it possible to support this feature in a scalable manner.

The development aims to provide an alternative to older protocols like RTMP, whose main advantage over HTTP streaming protocols was lower latency.

On the client side, part of the solution is based on the Fetch API offered by browsers, which allows reading incoming chunks as they arrive instead of waiting for them to complete. Dash.js analyzes the byte stream representing each chunk and pushes it to the video decoder as soon as it has enough information to start playback stably. Every player based on dash.js can benefit from this new feature simply by updating to the latest version.
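
The idea of analyzing a byte stream while it is still arriving can be illustrated with a minimal Python sketch. It is not dash.js code and does not use the browser Fetch API: it only assumes that the stream is a sequence of ISO BMFF boxes (a CMAF chunk is a `moof` box followed by an `mdat` box), each starting with a 32-bit big-endian size and a four-character type. The `BoxAccumulator` and `make_box` names are hypothetical, and the special box sizes 0 and 1 are deliberately left unsupported.

```python
import struct
from typing import List, Tuple


class BoxAccumulator:
    """Buffers bytes as they arrive and emits every box that is complete."""

    def __init__(self) -> None:
        self._buffer = bytearray()

    def feed(self, data: bytes) -> List[Tuple[str, bytes]]:
        """Append newly received bytes and return the boxes completed so far."""
        self._buffer.extend(data)
        complete = []
        while len(self._buffer) >= 8:
            # Box header: 32-bit big-endian size (header included) + 4-char type.
            (size,) = struct.unpack(">I", self._buffer[:4])
            box_type = bytes(self._buffer[4:8]).decode("ascii", errors="replace")
            if size < 8:
                # Sizes 0 and 1 have special meanings not handled in this sketch.
                raise ValueError(f"unsupported box size {size}")
            if len(self._buffer) < size:
                break  # The box is still incomplete: wait for more data.
            complete.append((box_type, bytes(self._buffer[:size])))
            del self._buffer[:size]
        return complete


def make_box(box_type: str, payload: bytes) -> bytes:
    """Build a box for the demo: size + type + payload."""
    return struct.pack(">I", 8 + len(payload)) + box_type.encode("ascii") + payload


# Simulate a chunk arriving from the network in arbitrary 7-byte pieces.
stream = make_box("moof", b"\x00" * 10) + make_box("mdat", b"\x01" * 20)
accumulator = BoxAccumulator()
for start in range(0, len(stream), 7):
    for box_type, box in accumulator.feed(stream[start:start + 7]):
        # A player would hand the completed data to the decoder at this point.
        print(box_type, len(box))
```

The point of the sketch is that each box is handed over as soon as its last byte arrives, without waiting for the rest of the segment to download.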

Jesús Oliva, Epic Labs Media Engineer Lead & Founder: “This is an exciting achievement because it provides a low latency chunked solution based on open standards, protocols and source code. A real alternative to proprietary or custom solutions that require packaging content in a strange manner or exceed the norm. This is a great deal for the industry.”

Dash Industry Forum Dash.js

The DASH Industry Forum (DASH-IF) establishes interoperability guidelines on the usage of the MPEG-DASH streaming standard. MPEG-DASH simplifies and converges the delivery of IP video, to provide a rich and enjoyable user experience, to help drive down costs and ultimately to enable a better content catalogue to be offered to consumers.

Epic Labs, proud member of the DASH Industry Forum, is a software engineering center that helps companies to innovate in Media, offering advanced video solutions and several collaboration proposals in digital transformation.

Machine Learning for Business as Unusual. Álvaro González and Ignacio Peletier.

Álvaro González. Epic Labs AI Lead Engineer.
Ignacio Peletier. Epic Labs AI Engineer.

In 2004, Chris Anderson presented his article “The Long Tail” in Wired magazine, explaining how the future of digital businesses would focus on many niches rather than on mass markets. This is one of the most important things that video enables in the internet era, where each person can search for and find content that fits their tastes perfectly, including within user-generated content. And that is exactly where the qualities of artificial intelligence can help.

“It is not about doing the same things by replacing people with machines, it is about doing business as unusual, doing something that is not usually done.” Álvaro González. Epic Labs AI Lead Engineer.

There is a lot of talk these days about how AI can simplify workflows and perform repetitive, automatic tasks more efficiently. But what if, instead of focusing on resource optimization, we think about the things that are not done because there are not enough people or budget for them? That is when machine learning, video and the long tail enable new and interesting use cases that can be undertaken with artificial intelligence tools, either in a 100% automated process or with some kind of supervision.

In sports, there are those of local or minority interest, as well as smaller leagues. Thanks to the commoditization of content generation and delivery technologies, it is possible to organize productions operated by artificial intelligence. Focusing on football, a machine learning system can enable a local football match to be produced automatically and broadcast over a streaming channel. It can even be trained to produce replays and include graphics. It is not a question of whether the production is as good as one made by a handful of technicians; it is about having a product where there was nothing before, which, among other things, allows monetizing that long tail through dynamic advertising or a unique sponsorship.

This technology can also be applied to recognize and cross-reference material to create content. Staying with football, it can be applied not only to the cameras that are on air but also, in parallel, to all the feeds being ingested on a server, taking advantage of material that is usually discarded because no staff is available to review and classify it. In this way, new content can be created for yet another niche, such as a summary of everything a player chosen by the user has done in a match.

Álvaro González, Epic Labs AI Lead Engineer: “This also applies to any other sport, to personal hobbies and to many other cases. We are living in an era where end users are able to generate their own content if they cannot find it, and artificial intelligence can help us attract and retain those people. More than ever, unusual content requires unusual business.”

III Foro EL ESPAÑOL – Blockchain and Cryptocurrencies

Epic Labs took part in the III Foro EL ESPAÑOL on Blockchain and Cryptocurrencies: How Business Models Will Change.

A vision of the future

LightFlow broadens user-context scope to further improve efficiencies

Cost, quality of experience and smooth operation benefit from factoring in extended codec support, device-specific optimization and network QoS.

EPIC LABS TO EXHIBIT AT IBC 2018 INTRODUCING FIRST-IN-CLASS VIDEO STREAM OPTIMIZER FOR QUALITY OF EXPERIENCE (QoE): LIGHTFLOW by Epic Labs

Epic Labs will exhibit at IBC 2018 introducing LightFlow: a QoE enhancer leveraging advanced machine learning technologies, smart encoding algorithms and cognitive services for better user engagement. Visit us at Stand 14.A36 to learn about the LightFlow Video Stream Optimizer for QoE.

Artificial intelligence applied to television archives
