Werner Bailer | Joanneum Research, Graz | Friday, June 25, 2021 | 10:00 (CET, 08:00 UTC) | online
Abstract:
Artificial neural networks have been adopted for a broad range of tasks in multimedia analysis and processing, such as visual and acoustic classification, extraction of multimedia descriptors or image and video coding. The trained neural networks for these applications contain a large number of parameters (weights), resulting in a considerable size. Thus, transferring them to a number of clients using them in applications (e.g., mobile phones, smart cameras) benefits from a compressed representation of neural networks.
MPEG Neural Network Coding and Representation is the first international standard for efficient compression of neural networks (NNs). The standard is designed as a toolbox of compression methods, which can be used to create coding pipelines. It can be either used as an independent coding framework (with its own bitstream format) or together with external neural network formats and frameworks. For providing the highest degree of flexibility, the network compression methods operate per parameter tensor in order to always ensure proper decoding, even if no structure information is provided. The standard contains compression-efficient quantization and an arithmetic coding scheme (DeepCABAC) as core encoding and decoding technologies, as well as neural network parameter pre-processing methods like sparsification, pruning, low-rank decomposition, unification, local scaling and batch norm folding. NNR achieves a compression efficiency of more than 97% for transparent coding cases, i.e. without degrading classification quality, such as top-1 or top-5 accuracies.
This talk presents an overview of the context, technical features and characteristics of NN coding standard, and discusses ongoing topics such as incremental neural network representation.
Bio:
Werner Bailer is a Key Researcher at DIGITAL – Institute for Information and Communication Technologies at JOANNEUM RESEARCH in Graz, Austria. He received a degree in Media Technology and Design in 2002 for his diploma thesis on motion estimation and segmentation for film/video standards conversion. His research interests include audiovisual content analysis, multimedia retrieval and machine learning. He regularly contributes to standardization, among others in MPEG, where he co-chairs the ad-hoc group on neural network compression.




Bio: Laura Toni received the M.S. and Ph.D. degrees, both in electrical engineering, from the University of Bologna, Bologna, Italy, in 2005 and 2009, respectively. In 2007, she was a Visiting Scholar at the University of California at San Diego (UCSD), San Diego, CA, USA, and since 2009, she has been a frequent visitor to the UCSD, working on media coding and streaming technologies. Between 2009 and 2011, she was with the Tele-Robotics and Application Department, Italian Institute of Technology, investigating wireless sensor networks for robotics applications. In 2012, she was a Postdoctoral Fellow at UCSD, and between 2013 and 2016, she was a Postdoctoral Fellow in the Signal Processing Laboratory (LTS4) at École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland. Since July 2016, she has been a Lecturer in the Electronic and Electrical Engineering Department, University College London (UCL), U.K. Her research mainly involves interactive multimedia systems, decision-making strategies under uncertainty, large-scale signal processing, and communications. She received the UCL Future Leadership Award in 2016, the ACM Best 10% Paper Award in 2013, and the IEEE/IFIP Best Paper Award in 2012.
Bio: Lucia D’Acunto received her PhD in 2012 from Delft University of Technology, the Netherlands, with a thesis on video streaming over peer-to-peer networks. She now works as a senior research scientist at TNO, focusing on video distribution and on the impact of future internet architectures (e.g. ICN, SDN and 5G) on it. She has led and is leading various European research projects on these topics, most notably the open call projects from the European Projects TRIANGLE, 5GINFIRE and FLAME. Since 2016, Lucia is an active participant and contributor to the 3GPP SA4 group, which focusses on mobile and 5G standardization for media applications. Lucia also serves in the organizing committees of several international conferences, usually in the roles of program chair or demo chair, and in the program committees. Lucia also regularly advises European operators on network and TV technologies and contributes to 5GPPP and NEM visions on the 5G Media Vertical and pilots. Lucia has published her research in several papers and journals and holds more than 15 patent applications.