Smart neural network optimization

Running a computer vision model is an expensive operation. You have invested a lot of resources and time building your model. Now it turns out keeping the lights on might be even more costly. And then you realise this will cost a lot of energy as well.

During this talk Steven van Blijderveen will explain Convolutional Neural Networks (CNN) and different types of compression methods available for CNNs, such as quantization and pruning. Quantization refers to the process of reducing the number of bits that represent a number. In the context of neural networks, this means using lower-precision formats to represent weights and activations, which can lead to significant reductions in model size. Pruning is a compression technique that involves eliminating unnecessary connections or weights in a neural network. For example, if we imagine a neural network as a vast web of interconnected neurons, pruning can be likened to trimming off the less important connections, allowing the network to focus on the more significant ones.

Besides explaining about these types of compression, Steven will also talk about knowledge distillation, which is a technique where a compact neural network, known as the student, is trained to imitate a larger, more complex network or ensemble of networks, known as the teacher. The student network learns from the output of the teacher network rather than the raw data, enabling it to achieve comparable performance with a fraction of the resources.

Matthijs Plat will explain about the solution that AIminify has built for the high energy consumption and how this can be used in the world of AI.

Date

27 March 2024, 11:00-12:00 CET

Venue

ITC Langezijds Building, Room LA 2211
Hallenweg 8, 7522 NH Enschede

Speaker

Steven van Blijderveen

Steven van Blijderveen is Artificial Intelligence Engineer at AIminify. After getting his Artificial Intelligence master's in Utrecht he started working for Tinify, the world leader in image compression. In this role he developed an upscaler, which makes it possible to resize images into bigger formats with minimal loss of quality. He did this by improving upon an existing GAN, a type of generative neural network. After that he investigated heavily in different techniques of neural network compression and the fully automated pipeline to get this up and running. This results in AIminify which offers an on-premise optimization solution with zero configuration. Aiminify is able to compress PyTorch as well as Tensorflow models, without any settings. This will increase the speed of use and deployability with minimal loss of quality.

stevenvanblijderveen@aiminify.com

Connect with me on LinkedIn

Matthijs Plat

Matthijs Plat is the founder of AIminify. After his BIT study at the University of Twente Matthijs has been working in manufacturing for 15 years. In 2018 he started his own business and as of 2019 he is running Tinify, the world leader in image compression. In 2023 the decision was made to start investing in neural network compression, which resulted in AIminify.

matthijsplat@aiminify.com

Connect with me on LinkedIn

Smart neural network optimization

Date

Venue

Speaker

Video

Presentation

Questions and Answers