Neural Network Quantization Methods

AI Model Compression for $1,000: Ora Computing Uses Quantum Physics to Beat Hardware Lock-In

Vienna startup Ora Computing raised €3.5M and proved a 70-billion-parameter large language model can be compressed for under ...

New framework renders AI more trustworthy for cancer subtyping

Medical artificial intelligence (AI) faces a fundamental challenge: uncertainty quantification. Artificial neural networks ...

OpenCV 5.0 brings LLMs to the Computer Vision Library

Version 5.0 Modernizes DNN Engine, Adds LLM/VLM Support, and Enhances Core, Hardware Acceleration, and 3D Stack.

Nature

Quantization Techniques in Neural Network Inference

Quantization in neural network inference refers to the process of mapping high-precision parameters and activations to lower-precision representations, typically using integer or even binary values.
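As an illustration of the mapping described above, here is a minimal sketch of uniform affine quantization to signed 8-bit integers, one common scheme among several. It is not taken from the listed article; the function names `quantize` and `dequantize` and the sample values are assumptions chosen for illustration.

```python
# Illustrative sketch only: uniform affine (asymmetric) quantization.
# Function names and sample values are hypothetical, not from the article.

def quantize(values, num_bits=8):
    """Map floats to signed integers of width num_bits."""
    qmin = -(2 ** (num_bits - 1))
    qmax = 2 ** (num_bits - 1) - 1
    # Extend the range to include 0.0 so that zero is exactly representable.
    lo = min(min(values), 0.0)
    hi = max(max(values), 0.0)
    # Fall back to a scale of 1.0 when every value is zero.
    scale = (hi - lo) / (qmax - qmin) or 1.0
    zero_point = round(qmin - lo / scale)
    quantized = [
        max(qmin, min(qmax, round(v / scale) + zero_point)) for v in values
    ]
    return quantized, scale, zero_point


def dequantize(quantized, scale, zero_point):
    """Recover approximate floats from the integer representation."""
    return [(q - zero_point) * scale for q in quantized]


weights = [-1.0, 0.0, 0.5, 2.0]
q, scale, zero_point = quantize(weights)
restored = dequantize(q, scale, zero_point)
print(q, scale, zero_point)
print(restored)
```

Each restored value differs from the original by roughly half a quantization step at most, which is the precision traded away for the smaller integer representation.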

EurekAlert!

A new method for training optical neural networks based on Pavlov’s experiment

A research team led by Professor Han Zhang at Shenzhen University has pioneered a novel optical neural network that learns like a living organism—without relying on traditional computing algorithms.

IEEE

Single-Step Hardware-Aware Neural Network Quantization With Mixed Precision

Abstract: Quantization is a neural network compression technique that effectively improves the deployment performance on inference hardware. Fixed-point quantization methods use the same bit-width for ...

Phys.org

Adaptive method helps light-based quantum processors act more like neural networks

Machine learning models called convolutional neural networks (CNNs) power technologies like image recognition and language translation. A quantum counterpart—known as a quantum convolutional neural ...

Scientific Research Publishing

Zhou, J., Cui, G., Hu, S., Zhang, Z., Yang, C., Liu, Z., et al. (2020) Graph Neural Networks: A Review of Methods and Applications. AI Open, 1, 57-81.

ABSTRACT: The accurate prediction of backbreak, a crucial parameter in mining operations, has a significant influence on safety and operational efficiency. The occurrence of this phenomenon is ...

VentureBeat

Huawei's new open source technique shrinks LLMs to make them run on less powerful, less expensive hardware

Huawei’s Computing Systems Lab in Zurich has introduced a new open-source quantization method for large language models (LLMs) aimed at reducing memory demands without sacrificing output quality.
