About 378,000 results
Open links in new tab
  1. Quantization (signal processing) - Wikipedia

    In mathematics and digital signal processing, quantization is the process of mapping input values from a large set (often a continuous set) to output values in a (countable) smaller set, often …

  2. What is Quantization - GeeksforGeeks

    Nov 6, 2025 · Quantization is a model optimization technique that reduces the precision of numerical values such as weights and activations in models to make them faster and more …

  3. What Is Quantization? | How It Works & Applications

    Quantization is the process of mapping continuous infinite values to a smaller set of discrete finite values. In the context of simulation and embedded computing, it is about approximating real …

  4. What is quantization? - IBM

    Quantization is the process of reducing the precision of a digital signal, typically from a higher-precision format to a lower-precision format. This technique is widely used in various fields, …

  5. A Visual Guide to Quantization - by Maarten Grootendorst

    Jul 22, 2024 · Explore the quantization of Large Language Models (LLMs) with 60+ illustrations.

  6. What is quantization in machine learning? - Cloudflare

    What is quantization in machine learning? Quantization is a technique for lightening the load of executing machine learning and artificial intelligence (AI) models. It aims to reduce the …

  7. What is Quantization and Why It Matters for AI Inference?

    Jul 20, 2025 · Among many optimization techniques to improve AI inference performance, quantization has become an essential method when deploying modern AI models into real …

  8. Digital Communication - Quantization - Online Tutorials Library

    Quantization is representing the sampled values of the amplitude by a finite set of levels, which means converting a continuous-amplitude sample into a discrete-time signal.

  9. Quantization - Hugging Face

    The optimum.fx package provides wrappers around the PyTorch quantization functions to allow graph-mode quantization of 🤗 Transformers models in PyTorch. This is a lower-level API …

  10. Quantization and performance optimization | How-to guides

    What is quantization? Quantization is a technique used in machine learning to reduce the computational and memory requirements of models, making them more efficient for …