IMPACT CASE STUDY
Audio quality plays a crucial role in our daily lives, significantly impacting the human experience. High-quality audio enhances our enjoyment of music, movies, and online content, making entertainment more immersive and enjoyable. Audio quality is also the cornerstone of communication, ensuring that video calls and voice messages are clear and intelligible. Bad audio can cause issues such as miscommunication of important information and mental listening fatigue. For a brand to remain trusted, it must be able to deliver high-quality audio on a large scale. This requires new technologies to continuously monitor audio quality.
Dr Hines has developed a suite of software tools for testing audio quality at scale: ViSQOL (2015), AMBIQUAL (2018) and Go Listen (2020). These tools have been widely adopted by some of the largest companies in the world including Google, Meta and many others. ViSQOL now processes the top 30k YouTube videos every day to ensure audio quality meets the required standards before 1 billion YouTube viewers consume the content. In 2018, ViSQOL won the US-Ireland Research Innovation Award for this collaboration with Google. Dr Hines's subjective listening test platform, Go Listen, has also seen wide adoption from researchers in industry and academia. Dr Hines’s internal metrics show that over 1000 experiments using this technology have now been conducted by more than 300 researchers involving greater than 6000 subjective listeners.
In 2012, Dr Hines began a collaboration with Google that led to the development of ViSQOL. ViSQOL is a software algorithm that evaluates the quality of speech or music and estimates how a human listener would perceive the audio quality. As a result, millions of hours of audio content can be assessed without the need for human evaluations. Until 2018, no technology existed to evaluate immersive spatial audio, so the team developed AMBIQUAL , a first-of-its-kind model to predict audio quality for surround sound spatial audio.
To validate models like ViSQOL or any audio enhancement technologies, it is necessary to conduct subjective listening tests with human listeners. In 2020, the team developed Go Listen, a web-based subjective listening test platform which allows audio researchers anywhere in the world to create and share subjective listening tests in minutes. Dr Hines’s future work around audio quality will focus on audio enhancement for hearing-impaired listeners and augmented reality systems.
Dr Hines’s audio quality testing tools have now been widely adopted by some of the largest companies in the world. Some recognisable names actively using the technology in research and product development include: Google, Meta, Apple, Microsoft, Sony, Dolby (Spatial Audio), Nvidia (Consumer Electronics), Starkey (Hearing Aids), Mathworks (MATLAB R&D Software) and many others from startups to multinationals.
Google is the most prolific user of the technology and ViSQOL has been adopted as a quality metric across multiple product groups within the company. It is used for a variety of purposes from developers creating new ways to compress and transmit audio files so they sound clear and take up less space for Google Meet, to quality testing on YouTube and designing codecs for Pixel phones. More than half of YouTube’s billion daily views come from mobile devices and uploaded content needs to be converted into lots of different formats to accommodate the various devices and network conditions. This process can introduce reduced quality and content must be monitored daily to ensure quality standards.
ViSQOL is deployed within YouTube as an automated regression testing tool that can provide quality warning alerts. Essentially, ViSQOL ‘listens’ to the top 30k YouTube videos each day, in all formats, to ensure audio quality meets the required standards before 1 billion + YouTube viewers consume the content.
ViSQOL has become a crucial tool for both development and production. We are utilising the metric extensively in the development of speech and audio enhancement... Every day, YouTube probes the quality of the 30K most popular videos worldwide using ViSQOL as the measure... We have heard from [...] Microsoft and Apple that it is frequently used in [... ] their organisations.
— Adrian Grange, Senior Technical Program Manager, Google
Another example of industry adoption of the tools is MATLAB, a computing platform that is used for engineering and scientific applications like data analysis, audio and image processing, machine learning and robotics. It is used by over 4 million researchers worldwide. In 2024, ViSQOL was included in Mathworks' MATLAB 2024a release as their standard speech and audio quality metric.
ViSQOL has received several awards for its novelty and impact including: the SFI Connect 2018 Impact Award, QoMEX Best Paper Award 2013 and the 2018 RIA/Amcham US-Ireland Research Innovation Award. ViSQOL and AMBIQUAL have also been granted 3 US patents. The project’s quality metrics and tools have had a significant impact in terms of academic contribution to knowledge, with over 600 citations across the published papers on the work.
Dr Hines’s subjective listening test platform, Go Listen, has also seen wide adoption from researchers in industry and academia. Internal metrics show that more than 300 researchers and 6000 subjective listeners have now conducted over 1000 tests using the platform. The platform continues to grow with new users and organisations joining daily.
Dr Hines continues to push the boundaries in audio quality research. The team’s latest project, Hear AI, is developing music enhancement algorithms for people with hearing impairments. This research can have a significant impact in addressing this global health issue affecting over 1.5 billion people globally.
"GoListen makes listening tests easy. User-friendly, simple setup, and free to use. They also store your data, making it a fantastic platform! Carrying out tests on GoListen saves us time and money.”
— Cyran Aouameur, Research Engineer, Sony
“MATLAB is used by more than 5 million engineers and scientists [...] The addition of ViSQOL to Audio Toolbox allowed us to respond to growing requests for easy access to objective audio quality estimation. ViSQOL can match and outperform other well-established audio quality algorithms”.
— Jimmy Lapierre, Principal Engineer, Mathworks