Scaling laws for Haralick texture features of linear gradients

PeerJ Computer Science

Introduction

Researching image texture presents a fundamental challenge: it lacks a universally accepted definition. Texture can be perceived through tactile means (Manjunath & Ma, 1996) and optical methods (Tuceryan & Jain, 1999). Humans recognize texture in images (Papathomas, Kashi & Gorea, 1997; Aviram & Rotman, 2000; Jagadeesh & Gardner, 2022), distinguishing it by attributes such as coarseness and roughness. The human visual system relies on local contrast ratios and intensity differences, rather than absolute pixel intensity values, to interpret image patterns such as intensity gradients (Werner, 1935; Land & McCann, 1971; Attneave, 1954; Barten, 1999). In non-human primates, neurons selectively respond to surface luminance gradients and utilize linear shading gradients to infer three-dimensional (3D) structure (Hanazawa & Komatsu, 2001). While previous experimental findings established that the primate visual cortex prioritizes luminance gradients over absolute luminosity as a key visual feature for pattern classification (Correani, Scott-Samuel & Leonards, 2006; Keil, 2007), more recent research has demonstrated that image gradients also facilitate the neural encoding of 3D representations of textured objects (Gomez & Neumann, 2016).

Furthermore, MRI studies in humans have shown that luminance gradients along the vertical axis of an image elicit stronger neural responses in scene-selective brain regions compared to horizontal gradients (Cheng, Chen & Dilks, 2023). This directional selectivity suggests that the human brain assigns different levels of importance to intensity gradients depending on their orientation within natural scenes. Experimental evidence also suggests that vertical intensity gradients are processed by neural pathways in the early visual cortex distinct from those used for gradients in other orientations (Vaziri et al., 2014).

Computer applications have leveraged human visual perception by incorporating gradients as fundamental visual features to enhance the informational content of images. For instance, geographic information system (GIS) tools utilize color gradients to represent variations in elevation and population density (DeMers, 2008). In image processing, gradients serve as essential components for various tasks, including edge detection (Canny, 1986), correcting for different lighting or camera properties (Marchand, 2007), and distinguishing between digital camera images and scanned images (Mettripun & Amornraksa, 2014). Additionally, reducing gradient magnitudes at transitions within mosaic images helps create visually cohesive scenes, which human observers perceive as single, unified images (Perez, Gangnet & Blake, 2003).

Natural-scene images depict nature-made objects, such as landscapes, animals, and plants. At the initial stage of an image processing pipeline, basic image enhancement tasks must make assumptions about the image through interpolation methods like smoothing and filtering or model fitting techniques such as Bayesian inference. Although prior knowledge is essential for image processing, it can also introduce bias by favoring expected outcomes. Spectral priors do not directly encode information about an image's specific properties but instead influence its histogram (the spectrum). Many image features, including color and texture, can be derived from image gradients or spectral priors, as they exhibit remarkable invariance across images (Long & Purves, 2003; Tward, 2021; Dresp-Langley & Reeves, 2024). Each pixel in a gradient image contains two values corresponding to the gradient components at that location. The gradient distribution represents these values' histogram or probability distribution across all pixels or multiple images. This study focused on one-dimensional gradients in two-dimensional images to explore how Haralick statistical features relate to image gradients. Significant discrepancies exist between human and machine vision in classifying the same textures (Tamura, Mori & Yamawaki, 1978). Efforts to enhance machine-based texture recognition have included detailed models of human visual perception of luminance differences (Chan, Golub & Mulet, 1970; Miao & Shaohui, 2017) and techniques that focus on grouping similar image regions (Rosenfeld & Kak, 1982) or analyzing semi-repetitive pixel arrangements in natural scenes (Pratt, 1978, 2006).

Efficient computer vision and "big data" algorithms driven by machine learning (ML) and artificial intelligence (AI) have rapidly expanded into the medical imaging field in healthcare. Despite its significance, over 97% of recorded medical images remain unused due to inadequate feature extraction and classification methods (Murphy, 2019). With the emergence of ML and AI, several automated systems for medical image analysis have been developed. These include tools for bone age estimation (Kim et al., 2017), detection of pulmonary tuberculosis and lung nodules (Hwang et al., 2018; Singh et al., 2018), and AI-based lobe segmentation in CT images (Fischer et al., 2020). Texture analysis is crucial in such applications, including diagnosing microcalcifications in breast tissue (Karahaliou et al., 2007) and detecting cancer from ultrasound images of various organs (Faust et al., 2018).

Texture analysis has been applied to improve the quality of life for individuals with visual impairments. For example, it has enhanced handwriting digit identification accuracy (Sanchez Sanchez et al., 2024) and improved the performance of classification algorithms (Alshehri et al., 2024). In nondestructive material testing, texture analysis helps characterize changes in microstructure caused by mechanical, thermal, and operational stresses. By analyzing microstructural features, researchers gain a deeper understanding of bulk material properties and their macroscopic mechanical behavior. Microstructure texture classification has been widely used in metallurgical studies, based on second-order statistical features such as Haralick features (Haralick, Shanmugam & Dinstein, 1973; Haralick, 1979). Applications include identifying constituent metallurgical phases in steel microstructures (Naik, Sajid & Kiran, 2019), assessing surface hardening during cooling (Fuchs, 2005), detecting phase transitions in two-phase steel systems (Liu, 2014), and analyzing the effects of tempering parameters on steel microstructure (Dutta et al., 2014). Additionally, texture analysis has been utilized to quantify corrosion in steam piping systems (Fajardo et al., 2022). In soft condensed matter, texture classification has been used for identifying phase transitions in polymers and liquid crystals (Pieprzyk et al., 2022; Sastry et al., 2012) and measuring shear modulus, failure temperature, and zero shear viscosity in polymeric colloids (Xu et al., 2024).

Texture-based image analysis often utilizes advanced statistical methods, such as discriminative binary and ternary pattern features (Midya et al., 2017), wavelet-based techniques (Wan & Zhou, 2010; Karahaliou et al., 2007), and matrix-based approaches such as gray-level run length (Raghesh Krishnan & Sudhakar, 2013), autocovariance (Huang, Lin & Chen, 2005), and spatial gray-level dependence matrices (Kyriacou et al., 1997; Pavlopoulos et al., 2000).

One widely used approach to texture analysis is the Gray Level Co-occurrence Matrix (GLCM), a statistical method that captures spatial relationships between pixel intensities (Oprisan & Oprisan, 2023). GLCM, which belongs to second-order statistical methods (Humeau-Heurtier, 2019), quantifies occurrences of pixel pairs that exhibit specific spatial relationships. Haralick, Shanmugam & Dinstein (1973) and Haralick (1979) identified 14 texture features derived from the GLCM; however, many have been critiqued for redundancy (Conners & Harlow, 1980) and computational complexity. Advanced methods, including higher-order statistics and fractal dimensions (Pavlopoulos et al., 2000; Kyriacou et al., 1997), have further enriched the field but remain limited in practical application due to high computational demands.

The primary objective of this study is to derive analytical expressions for the GLCM and its related features in order to better understand how they depend on the gray-level quantization (Ng), the image gradient magnitude (∇), and the displacement vector (d). The secondary objective is to use these newly derived expressions, particularly those for the GLCM of linear gradients, to establish scaling laws that govern the dependence of Haralick features on Ng, ∇, and d. These scaling laws help determine the asymptotic behavior of Haralick features and identify data-driven normalization factors, ensuring that results remain independent of the image quantization scheme. Previous studies primarily relied on empirical methods to estimate normalization factors that could make Haralick features invariant to the number of gray levels (Ng). For instance, Clausi (2002) proposed normalizing gray-level intensities by the total number of gray levels in the GLCM, but applied this only to two features, inverse difference and inverse difference moment. Similarly, Shafiq-ul Hassan et al. (2017, 2018) aimed to enhance the reproducibility of MRI-based Haralick features across different voxel volumes and scanner models (Philips, Siemens, and GE). However, their empirical approach identified only two reproducible GLCM-based features, and they noted that "for some features, their relationship with gray levels appeared to be random, therefore, no normalizing factor could be identified" (Shafiq-ul Hassan et al., 2017). Lofstedt et al. (2019) also investigated methods to reduce the sensitivity of Haralick features to image size, noise levels, and different quantization schemes. Their approach involved normalizing each gray level by Ng together with additional empirical normalization factors, effectively transforming the GLCM into an equivalent normalized Riemann sum. While this normalization improved consistency for many texture features, it did not work universally, although "most of the modified texture features quickly approach a limit." This study introduces a systematic methodology for deriving scaling laws that explain how Haralick features evolve with changes in the number of gray levels (Ng). By establishing these scaling laws analytically, we aim to provide a more rigorous foundation for normalization strategies, reducing the reliance on empirical estimation.

This study demonstrates the derivation methodology for feature dependencies on Ng, d, and ∇ for four Haralick features: sum average (SA), sum variance (SV), difference variance (DV), and entropy. We chose these four Haralick features because they have received significantly less attention than those based directly on calculating various moments of the GLCM. Examples include Angular Second Moment or Energy f1 (over 19,300 publications in Google Scholar), Contrast f2 (22,500 publications), Correlation f3 (21,000 publications), Sum of Squares Variance f4 (20,100 publications), Inverse Difference Moment or Local Homogeneity f5 (16,000 publications), and Entropy f9 (19,100 publications) (Haralick, Shanmugam & Dinstein, 1973). The remaining Haralick features are used significantly less often because they depend on marginal probabilities derived from the GLCM and require extra computational steps. For instance, SA f6 (3,080 publications), SV f7 (2,720 publications), and the difference variance f10 (2,600 publications) are cited about one order of magnitude less often than the previous category. Moreover, their meanings are harder to grasp. We included entropy in this study for two reasons: to demonstrate how a logarithmic moment of the GLCM is estimated and, more importantly, to illustrate that the derived marginal probabilities used for evaluating SA and SV apply immediately to calculating the sum entropy and difference entropy features. By advancing the theoretical understanding of these features, this work aims to enhance the applicability of Haralick features in machine learning and AI-driven texture analysis.

The manuscript is structured as follows. The Methods subsection "The Gray Level Co-occurrence Matrix (GLCM)" defines the meaning of and notation for the GLCM. Figure 1 shows a reference frame attached to the upper left corner of the image and the offset vector d = (Δx, Δy) between the reference (shaded) pixel and its set of neighbors. Descriptions of the x-direction px(i), y-direction py(j), sum px+y(k), and difference px−y(k) marginal distributions are provided in "Marginal Distributions Associated with the GLCM". A visual aid elucidating the meaning of the marginal distributions is included in Fig. 2. The numerical procedure used for generating synthetic images is detailed in "Synthetic Gradient Images". The Results section begins with a two-dimensional Ny × Ny gradient map for a periodic vertical gradient of length Ny in "Two-dimensional (2D) Gradient Maps", supporting the transition to the Ng × Ng GLCM by wrapping around the 2D map in "Wrap Around the 2D Gradient Map to get the GLCM". Utilizing the GLCM symmetry for periodic linear gradients enables us to estimate the number of nonzero GLCM entries for a given gradient in "On the Number of Nonzero GLCM Entries for a Linear Gradient", which is necessary for calculating the marginal distribution of gray level differences px−y(k) ("Marginal distribution of gray level differences px−y for linear gradients") and the marginal distribution of gray level sums px+y(k) ("Marginal distribution of gray level sums px+y for linear gradients"). The numerical procedure used for comparing analytic predictions against numerically computed Haralick features for synthetic one-dimensional gradients is detailed in "Analytic Scaling Laws for Haralick Features of Linear Gradients. Comparison with Numerical Results". The subsequent subsections of the Results section apply these findings to derive analytic expressions and scaling laws for the dependence of sum average, sum variance, difference variance, and entropy on Ng, ∇, and |d|. Side-by-side comparisons of analytical and numerical findings are summarized in the Discussion and Conclusions section.


Figure 1: Gray Level Co-occurrence Matrix (GLCM) displacement vectors.

(A) By convention, the x-direction runs horizontally to the right and the y-direction vertically downward, with the image's origin at the upper left corner. Pixel offsets are given by the displacement vector d = (Δx, Δy). (B) In a non-periodic Nx(=5) × Ny(=4) 2-bit image, there are Rx = (Nx−1)Ny = 16 horizontal pairs of pixels at a displacement d = (Δx=1, Δy=0) and Ry = Nx(Ny−1) = 15 vertical pairs of pixels at a displacement d = (Δx=0, Δy=1). (C) The GLCM for unit horizontal displacement is an Ng × Ng array whose entries count the Rx = 16 horizontal pairs of the 2-bit image. For example, the two horizontal pairs 0-1 highlighted with elliptic shades in panel B give the GLCM entry P(0,1) = 2. (D) The GLCM for unit vertical displacement counts the Ry = 15 vertical pairs. For example, the vertical pair 1-2 indicated with rectangular shades in panel B yields the GLCM entry P(1,2) = 1.

Figure 2: Marginal probability distributions from the GLCM.

(A) The probability of finding a gray level intensity i along the horizontal x-direction in the image is px(i), and along the vertical direction it is py(i). (B) The probability of finding a gray level difference of k = |i−j| units is px−y(k). It is determined by summing the GLCM elements parallel to the primary diagonal at a distance of k units above and below it, along the corresponding dashed lines. By summing GLCM elements parallel to its secondary diagonal, one obtains px+y(s).

Methods

The Gray Level Co-occurrence Matrix

A grayscale image is a two-dimensional matrix I(x,y) that stores gray-level intensities (see Fig. 1A). The bit depth of an image determines the number Ng of gray levels. For instance, an 8-bit image has Ng = 2^8 = 256 gray levels. By convention, a gray level of zero, I(x,y) = 0, represents black, while I(x,y) = Ng − 1 corresponds to white. Intermediate intensities represent various shades of gray. Figure 1A shows the upper left corner reference frame attached to an image, with the x-direction pointing horizontally to the right and the y-direction vertically downward. Each square in Fig. 1A represents an image pixel. Arrows from the central highlighted pixel indicate the offset vectors d = (Δx, Δy) to its neighbors. The increment Δx represents the image column offset and Δy represents the image row offset.

Figure 1B illustrates a rectangular Nx(=5) × Ny(=4) image with a 2-bit depth (gray levels {0, 1, 2, 3}). For the same image, as shown in Fig. 1B, each displacement vector d defines a corresponding GLCM. For instance, a unit displacement along the horizontal direction, d = (Δx=1, Δy=0), produces Fig. 1C. Indeed, there are two pairs of pixels with starting point gray level i = 0 and endpoint intensity level j = 1 separated by a one pixel displacement along the horizontal direction. The array coordinates (1,4)-(1,5) and (2,2)-(2,3) are marked with elliptical shaded areas and connected by the two horizontal lines extending from the panel B image to the corresponding GLCM entry P(0,1) = 2 in panel C. Similarly, there is only one pair of pixels in the Fig. 1B image with starting point gray level i = 1 and endpoint intensity level j = 2 separated by a one pixel displacement along the vertical direction. The array coordinates (3,4)-(4,4) are marked with a rectangular shaded area and connected horizontally by a line extending from the panel B image to the corresponding GLCM entry P(1,2) = 1 in panel D. The unnormalized GLCM counts the number of occurrences of the (reference) gray level i at a distance specified by the displacement vector d = (Δx, Δy) from the (target) gray level j (Haralick, Shanmugam & Dinstein, 1973):

$$P_{\mathbf d}(i,j)=\#\left\{\big((x_i,y_i),(x_j,y_j)\big)\,:\,I(x_i,y_i)=i \;\wedge\; I(x_j,y_j)=j\right\}, \qquad (1)$$

where # denotes the number of elements in the set, (xi, yi) are the coordinates of the reference gray level i, and the neighbor (target) pixel with gray level j has coordinates (xj = xi + Δx, yj = yi + Δy).

In the GLCM Eq. (1), the first index i represents the intensity of the reference point, or the starting point of the displacement vector d, while the second index j corresponds to the intensity of the endpoint of the displacement vector. For instance, an offset d=(Δx=1,Δy=0) indicates that the row index (in the y-direction) remains unchanged since Δy=0, and the column index (in the x- or horizontal direction across the image) increases by one unit (Δx=1).

For simplicity, Fig. 1 only counts the pairs of gray levels one pixel apart along the horizontal (Fig. 1C) and vertical (Fig. 1D) directions, respectively. For example, only one pair of gray level intensities 1-0 is counted between the spatial coordinates (2,3) and (2,4) in Fig. 1B, which is shown as P(1,0) = 1 in the Fig. 1C GLCM. As a result, the Fig. 1C GLCM is not symmetric. In the original definition of the GLCM provided by Haralick (Haralick, Shanmugam & Dinstein, 1973), symmetry allows both P(1,2) and P(2,1) pairings to be counted as instances where the pixel value 1 is separated by the distance vector d from the pixel value 2. Mathematically, this is achieved by adding to the GLCMs in Figs. 1C and 1D their corresponding transposed arrays. In line with Haralick's definition, our implementation and all the results presented in this study use a symmetric GLCM.

The number of possible pairs in the image typically normalizes the GLCM. For instance, in an Nx × Ny image, there are Rx = (Nx−1)Ny horizontal pairs and Ry = (Ny−1)Nx vertical pairs. In the example depicted in Fig. 1, since the image has 5 × 4 pixels, the GLCM normalization factors are Rx = 16 and Ry = 15. The corresponding normalized GLCM values are, for example, pd(0,1) = Pd(0,1)/Rx = 2/16 for Fig. 1C and pd(1,2) = Pd(1,2)/Ry = 1/15 for Fig. 1D. The unnormalized GLCM is indicated with capital letters, such as Pd(i,j), while its normalized version is denoted pd(i,j):

$$p_{\mathbf d}(i,j)=\frac{P_{\mathbf d}(i,j)}{\sum_{i=0}^{N_g-1}\sum_{j=0}^{N_g-1}P_{\mathbf d}(i,j)}. \qquad (2)$$

The normalized GLCM indicates the likelihood of finding gray level j at a displacement d = (Δx, Δy) from the current location of the reference pixel with gray level i in an image. It adheres to the normalization condition $\sum_{i=0}^{N_g-1}\sum_{j=0}^{N_g-1}p_{\mathbf d}(i,j)=1$. More than half of the original 14 Haralick features rely on an additional step that involves computing marginal probability distributions from pd(i,j).
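For concreteness, the following is a minimal MATLAB sketch (ours; the function name glcm_normalized and the zero-based gray-level convention are assumptions, not the paper's supplemental code) that counts pairs per Eq. (1), symmetrizes the counts, and normalizes per Eq. (2):

```matlab
% Minimal sketch (ours, not the paper's supplemental code): unnormalized
% GLCM P_d(i,j) counted per Eq. (1), symmetrized as in Haralick's definition,
% then normalized per Eq. (2). Gray levels in img are integers in 0..Ng-1.
function p = glcm_normalized(img, dx, dy, Ng)
    [Ny, Nx] = size(img);                % rows = y (downward), columns = x
    P = zeros(Ng, Ng);
    for y = 1:Ny
        for x = 1:Nx
            xj = x + dx;                 % target column at offset Delta_x
            yj = y + dy;                 % target row at offset Delta_y
            if xj >= 1 && xj <= Nx && yj >= 1 && yj <= Ny
                i = img(y, x);           % reference gray level
                j = img(yj, xj);         % target gray level
                P(i + 1, j + 1) = P(i + 1, j + 1) + 1;  % MATLAB is 1-based
            end
        end
    end
    P = P + P.';                         % symmetric GLCM: count both orders
    p = P / sum(P(:));                   % Eq. (2): normalize by total pairs
end
```

Matlab's built-in graycomatrix() with the 'Symmetric' flag set to true produces the same counts; the explicit loop is shown only to make Eq. (1) concrete.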

The GLCM is a natural measure of image gradients, quantifying the change in light intensity from the reference intensity i to the target intensity j along the displacement vector d=(Δx,Δy). Since Haralick features are scalar measures defined by the two-point histogram represented by the GLCM, they also inherently measure light intensity gradients present in images.

Marginal distributions associated with the GLCM

Only three of the original Haralick features (Haralick, Shanmugam & Dinstein, 1973; Haralick, 1979) use the normalized GLCM pd(i,j) as defined in Eq. (2). All the others use one of the four marginal probability distributions derived from pd(i,j). To simplify the notation, we drop the subscript d from the normalized GLCM pd(i,j) in what follows. The x-direction marginal probability distribution can be obtained by summing along the rows of the GLCM p(i,j):

$$p_x(i)=\sum_{j=0}^{N_g-1}p(i,j),$$

as shown in Fig. 2A. For example, px(0) is the sum of all elements of the row with reference intensity i = 0 (see Fig. 1A), regardless of the intensity of the endpoint determined by the displacement vector. Therefore, px(i) gives the probability of finding gray level i in the image. The mean and variance of the GLCM along the marginal distribution px(i) are $\mu_x=\sum_{i=0}^{N_g-1}i\,p_x(i)$ and $\sigma_x^2=\sum_{i=0}^{N_g-1}(i-\mu_x)^2\,p_x(i)$.

The y-direction marginal probability distribution py(j) can be obtained by summing the columns of the GLCM p(i,j):

$$p_y(j)=\sum_{i=0}^{N_g-1}p(i,j).$$

For example, py(0) is the sum of all column elements with an endpoint intensity j = 0, regardless of the intensity of the reference (starting) point. These marginal probabilities are illustrated in Fig. 2: the horizontal dashed lines represent the row summations of p(i,j) that yield px, and the vertical dashed lines represent the column summations that yield py.

The marginal distribution of gray level differences k = |i − j| between the reference pixel intensity i and the endpoint intensity j determined by the displacement vector d is:

$$p_{x-y}(k)=\sum_{i=0}^{N_g-1}\sum_{j=0}^{N_g-1}\delta_{|i-j|,k}\;p(i,j), \qquad (3)$$

where δm,n is Kronecker's symbol. For example, px−y(0) represents the sum of all primary diagonal elements of the GLCM, as these elements exhibit no gray level difference between the reference point and the endpoint of the vector d, as illustrated in Fig. 2B. Similarly, the sum of the elements along the first line parallel to and above the primary diagonal reflects a gray level difference of k = +1 between the reference gray level i and the endpoint gray level j, and contributes to px−y(1). The sum p(0,1) + p(1,2) + p(2,3) of GLCM entries along the first line parallel to and above the primary diagonal in Fig. 2B corresponds to the fraction of px−y(1) with j − i = +1. The sum p(1,0) + p(2,1) + p(3,2) of GLCM entries along the first line parallel to and below the primary diagonal in Fig. 2B corresponds to the fraction of px−y(1) with j − i = −1. Since the definition of the gray level difference marginal distribution px−y(k) in Eq. (3) counts absolute differences k = |i − j|, the two partial sums must be added (see the Σ symbol in Fig. 2B) to produce px−y(1).

The marginal distribution of gray level sums k=i+j between the reference pixel intensity i and the endpoint neighbor intensity j is:

$$p_{x+y}(k)=\sum_{i=0}^{N_g-1}\sum_{j=0}^{N_g-1}\delta_{i+j,k}\;p(i,j). \qquad (4)$$

To prevent overcrowding in Fig. 2B, we only show px+y(3), which is the sum of the GLCM elements along its secondary diagonal with i + j = 3, i.e., p(3,0) + p(2,1) + p(1,2) + p(0,3). Other values of px+y(s) correspond to summations along lines parallel to the secondary diagonal in Fig. 2B.
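As an illustration, a minimal MATLAB sketch (ours, not the paper's supplemental code) evaluates the four marginal distributions from a normalized Ng-by-Ng GLCM p:

```matlab
% Minimal sketch (ours): the marginal distributions of Eqs. (3) and (4), plus
% p_x and p_y, from a normalized Ng-by-Ng GLCM p (zero-based gray levels, so
% MATLAB indices are shifted by +1).
Ng   = size(p, 1);
px   = sum(p, 2);                        % p_x(i): sum along each row (Fig. 2A)
py   = sum(p, 1).';                      % p_y(j): sum along each column (Fig. 2A)
pxmy = zeros(Ng, 1);                     % p_{x-y}(k), k = |i-j| = 0..Ng-1
for k = 0:Ng - 1
    pxmy(k + 1) = sum(diag(p, k));       % line k above the primary diagonal
    if k > 0                             % add the line k below it (Fig. 2B)
        pxmy(k + 1) = pxmy(k + 1) + sum(diag(p, -k));
    end
end
pxpy = zeros(2 * Ng - 1, 1);             % p_{x+y}(s), s = i+j = 0..2(Ng-1)
for s = 0:2 * (Ng - 1)                   % lines parallel to secondary diagonal
    for i = max(0, s - Ng + 1):min(Ng - 1, s)
        pxpy(s + 1) = pxpy(s + 1) + p(i + 1, s - i + 1);   % j = s - i
    end
end
```

The diag(p, k) and diag(p, -k) calls pick out the lines parallel to the primary diagonal shown in Fig. 2B.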

Synthetic gradient images

While the GLCM method described in “Methods” applies to any image, this study specifically focuses on computer-generated (synthetic) images with one-dimensional vertical gradients. This focus is motivated by the fact that image gradients are highly invariant across images (Long & Purves, 2003; Tward, 2021; Dresp-Langley & Reeves, 2024).

Image gradients have long been used as statistical (or spectral) priors for estimating image features (Gong & Sbalzarini, 2014, 2016). A gradient image G(x,y) is derived from the first-order spatial differences of the original image I(x,y), such that G(x,y) = (I(x−1,y) − I(x,y), I(x,y−1) − I(x,y)) (McCann & Pollard, 2008; Sevcenco & Agathoklis, 2021). The gradient image retains the same dimensions as the original but stores the x- and y-direction gradient values at each pixel.

Gradient spectral priors have been extensively applied in various image processing tasks, including denoising and deblurring (Chen, Yang & Wu, 2010), image restoration (Cho et al., 2012), range compression (Fattal, Lischinski & Werman, 2002), shadow removal (Finlayson, Hordley & Drew, 2002), and image compositing (Levin et al., 2004; Perez, Gangnet & Blake, 2003). Notably, deblurring in the gradient domain is often more computationally efficient than operating on raw pixel values (Cho & Lee, 2009; Shan, Jia & Agarwala, 2008; Wang & Cheng, 2016).

Traditionally, images are decomposed into 2D orthogonal gradient maps assuming that x- and y-direction gradients are statistically independent. One of the first studies to explore potential correlations between these gradient distributions in natural scene images found them "weakly negatively correlated in the training dataset (from edges in the images)" (Gong & Sbalzarini, 2016). Consistent with these findings, recent algorithms for image denoising and deblurring (Zheng et al., 2022; Zhangying et al., 2024), range compression (Yan, Sun & Davis, 2024), and pattern classification (Wang et al., 2025) continue to treat orthogonal gradients as independent and their spectral priors as uncorrelated. Based on this well-supported assumption, our study focuses exclusively on a vertical gradient for calculating Haralick texture features.

Figure 3A shows a b=3-bit depth grayscale image with dimensions Nx × Ny, featuring a vertical, linearly increasing, periodic intensity gradient of ∇=1 gray level per pixel. The array I(x,y) that represents the image is given by I(xi, yi) = yi, where yi = {0, 1, …, Ny−1}. Since image intensities do not depend on the xi = {0, 1, …, Nx−1} matrix index, the image appears as horizontal stripes with linearly increasing intensity (Fig. 3A). Furthermore, the vertical gradients are periodic, i.e., the intensity pattern repeats after reaching the maximum number of gray levels Ng = 2^b. In other words, the vertical coordinate yi and the pixel intensity are connected through I(xi, yi) = mod(yi, Ng). The modulo ("mod") operation along the vertical spatial indices yi ensures the gradient repeats periodically after Ng pixels. In the Fig. 3A example, the gray levels increase linearly from zero to Ng−1 with a step of ∇=1 gray level per pixel. The arrow next to the gradient in Fig. 3A indicates the gradient's direction. Similarly, Fig. 3E shows a synthetically generated image with a vertical, linearly increasing, and periodic gradient of ∇=2 gray levels per pixel. The grayscale images from Figs. 3A and 3E are numerically represented in Figs. 3B and 3F, respectively. The horizontal arrows between panels A and B indicate that each constant-intensity line of pixels is represented numerically by the corresponding integer value, with black mapped to 0. Following the procedure described above, we generated square synthetic images of 1024 × 1024 pixels containing periodic linear gradient patterns, as illustrated in Fig. 3 (a minimal generation sketch follows the list below). Our analysis focuses on three key variables:

  • (1) The number of gray levels in the image (Ng),

  • (2) The intensity of the image gradients (∇) in gray levels per pixel, and

  • (3) The displacement vector (d = (Δx, Δy)) in pixels, which determines the GLCM used to compute the Haralick features.
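A minimal MATLAB generation sketch (ours, not the paper's supplemental code) follows; the one-period ramp construction is an assumption consistent with the wrap-around described above, and reduces to I = mod(y, Ng) for ∇ = 1:

```matlab
% Minimal sketch (ours): a periodic vertical linear gradient image as in
% Fig. 3, constant along each row. The one-period ramp 0, nabla, 2*nabla, ...
% wraps back to zero after the largest gray level below Ng is reached; for
% nabla = 1 this reduces to I(x,y) = mod(y, Ng), as described in the text.
b        = 3;   nabla = 2;               % Fig. 3E parameters; the study uses b = 4..8
Ng       = 2^b; Nx = 1024; Ny = 1024;
Ng_tilde = 1 + floor((Ng - 1) / nabla);  % gray levels visited in one period
y        = (0:Ny - 1).';                 % vertical pixel coordinate (downward)
img      = repmat(nabla * mod(y, Ng_tilde), 1, Nx);  % 1-px-wide horizontal stripes
```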


Figure 3: Periodic and linear vertical gradients and their GLCM.

(A and E) Horizontal stripes of constant intensity with a periodic vertical gradient of ∇=1 (panel A) and ∇=2 (panel E) gray levels per pixel in a b=3-bit depth grayscale image. Each horizontal line is one pixel wide. (B and F) Numerical representation of the grayscale image with values ranging from zero to Ng−1. (C and G) The two-dimensional (2D) gradient map of the periodic gray level gradient displays nonzero entries at the coordinates (yi, yj) = (yi, yi + |d|), which maintain the absolute coordinates of pixels along the gradient. The first nonzero entry occurs at (i = 0, j = ∇|d|), with all nonzero entries separated by distances of ∇ both vertically and horizontally. (D and H) The shaded gray levels i = 0 and j = 1 at a vertical distance of one pixel, d = (0,1), in panel B determine the GLCM entry P(0,1) = 1. The GLCM can be obtained by wrapping the 2D gradient map around, modulo Ng+1, in both array dimensions.

We created images with a bit depth (b) ranging from 4 to 8, corresponding to Ng = 2^b ∈ {16, 32, 64, 128, 256}. These values represent a broad and realistic range for evaluating how Haralick features depend on Ng (see Figs. 4 and 5). For each bit depth we generated synthetic images with gradient intensities (∇) ranging from 1 to 8; however, to reduce visual clutter, only odd ∇ values are displayed in Figs. 4 and 5. Finally, for each combination of bit depth (b) and gradient intensity ∇, we computed GLCMs for vertical displacement vectors |d| = 1, …, 8.


Figure 4: Analytical vs numerically calculated features scaling with image bit depth.

Synthetic linear gradient images were used with Ng ∈ {16, 32, 64, 128, 256} gray levels. The GLCMs were numerically evaluated for a fixed integer vertical displacement |d| = 1 pixel and variable linear gradients of ∇=1 gray level per pixel (symbol " "), ∇=3 gray levels per pixel (symbol "o"), ∇=5 gray levels per pixel (symbol "+"), and ∇=7 gray levels per pixel (symbol "."). All Haralick features were computed numerically using Matlab's graycoprops() function. The continuous lines represent the analytically predicted scaling laws for the corresponding features. (A) The numerically computed sum average (SA) feature f6 increases linearly with Ng and is independent of the magnitude of the displacement vector and the gradient. (B) The numerically computed sum variance (SV) feature f7 exhibits a quadratic dependence on Ng and is independent of the magnitude of the displacement vector and the gradient, as predicted by Eq. (13). (C) The numerically computed difference variance (DV) f10 scales linearly with Ng, with a slope that increases linearly with the image gradient intensity ∇, as predicted by Eq. (16). (D) The experimental values of entropy f9 show the predicted logarithmic trend, but they are consistently and slightly shifted compared to the theoretical prediction from Eq. (20). The reason is that the numerically computed entropy feature uses log(p(i,j)+ε) with a small ε constant to prevent logarithm divergence for sparse GLCMs with many p(i,j) = 0.

Figure 5: Analytical vs numerically calculated features scaling with displacement vector magnitude.

All synthetic gradient images were 8-bit depth. The GLCMs were numerically evaluated for vertical displacements |d| = 1, …, 8 pixels and linear gradients of ∇=1 gray level per pixel (symbol " "), ∇=3 gray levels per pixel (symbol "o"), ∇=5 gray levels per pixel (symbol "+"), and ∇=7 gray levels per pixel (symbol "."). All features were computed numerically using Matlab's graycoprops() function. The continuous lines illustrate the analytically predicted scaling laws for the corresponding features. (A) The numerically computed sum average (SA) feature f6 remains independent of the magnitude of the displacement vector and exhibits a negligible gradient dependence due to the integer part function, as elaborated in the text. (B) The numerically computed sum variance (SV) feature f7 scales linearly with the magnitude of the displacement vector, with a slope proportional to the gradient ∇, as predicted by Eq. (13). (C) The numerically computed difference variance (DV) f10 scales linearly with the magnitude of the displacement vector, with a slope proportional to the gradient ∇, as predicted by Eq. (16). (D) The experimental values of entropy f9 are independent of the magnitude of the displacement vector and decrease slightly with the gradient ∇, as expected from Eq. (20). The slight systematic difference between the computed and predicted values is due to the actual entropy feature calculation using log(p(i,j)+ε) with a small ε constant to prevent logarithm divergence for sparse GLCMs with many p(i,j) = 0.

Results

Interpreting the GLCM and Haralick features is difficult because they encode second-order statistical information about image pixels. To understand the relationship between image gradients and GLCM symmetries, we calculate Haralick features for images with a single periodic, linear gradient.

Two-dimensional gradient maps

To count the pairs of pixels with a starting gray level i and an endpoint gray level j separated by d = (Δx, Δy) pixels, one can create a two-dimensional (2D) Ny × Ny gradient map whose (yi, yj) = (yi, yi + |d|) entry is 1 if I(xi, yi + |d|) − I(xi, yi) = ∇·d and zero otherwise, as shown in Fig. 3C. Here, ∇·d is the dot product, which accounts for the relative orientation of the gradient with respect to the displacement vector d. From Figs. 3B and 3F, one notices that the gray level intensity at spatial coordinate yi is always i = ∇yi, with yi = 0, …, Ny−1. The pixel intensity at a vertical coordinate yj, located a distance |d| from yi, is j = ∇yj = ∇(yi + |d|). As a result, the 2D maps in Figs. 3C and 3G are one-to-one correspondences between pixel locations and their corresponding gray level intensities. One notices in Fig. 3C, with ∇=1 gray level per pixel, and in Fig. 3G, with ∇=2 gray levels per pixel, that the vertical and horizontal distance between neighboring nonzero entries of the 2D gradient map is ∇. These displacements are marked in Figs. 3C and 3G, respectively. Additionally, one can observe from the 2D gradient maps in Figs. 3C and 3G that all nonzero entries are aligned with the primary diagonal of the 2D gradient map at a distance of ∇|d| from it. The distance of the gradient pattern from the primary diagonal of the 2D gradient maps is determined by the first gray level intensity, i.e., i = 0, which is always paired with the gray level j = ∇|d| for any displacement vector d and gradient intensity ∇. Finally, all nonzero entries (yi, yj) in the 2D gradient maps shown in Figs. 3C and 3G obey the condition k = |i−j| = ∇|d|, shown with a dashed line parallel to the primary diagonal. The primary diagonal elements are always zero because they correspond to a uniform image with no intensity changes from pixel to pixel.

Wrap around the 2D gradient map to get the GLCM

While illuminating, representing a periodic linear gradient of length Ny using a sparse Ny × Ny 2D gradient map, as shown in Figs. 3C and 3G, is not efficient. As a result, the GLCM removes the extra spatial information about pixel coordinates (yi, yj) retained by the 2D gradient map and only counts the co-occurrence of gray level intensities i and j at a relative distance d = (Δx, Δy), as illustrated in Figs. 3D and 3H. Consequently, for a specific displacement vector d, the GLCM is an Ng × Ng matrix that solely counts the co-occurrence of gray levels i and j at a relative distance d from each other, irrespective of their absolute spatial coordinates yi and yj. Because the absolute coordinates (yi, yj) of the pixel intensity pair i and j are no longer recorded, the GLCM is not a one-to-one mapping of the original gradient (unlike the 2D gradient map). For instance, in Fig. 3C, the pixel intensities i = 7 and j = 0 are located at a distance d = (0,1), and they are represented in the 2D gradient map by a value of "1" at spatial coordinates (yi = 7, yj = 8), as shown in Figs. 3A and 3B. However, the GLCM represents the same pair as an entry at (i = 7, j = 0), as it remaps all 2D gradient map entries from Figs. 3C and 3G modulo Ng in gray-level space. For example, the spatial coordinates (yi = 7, yj = 8) from Fig. 3C are mapped to the GLCM coordinates (7, 0), which correspond to gray levels (i = 7, j = 0). Although the Ng × Ng GLCM array can no longer be mapped back to the original image, it retains the essential second-order spatial correlations of gray level intensities.

On the number of nonzero GLCM entries for a linear gradient

For any linear gradient ∇, the starting point of the GLCM has an index i from the set {0, ∇, 2∇, …, (Ñg−1)∇}, where Ñg is the number of non-zero GLCM entries shown in Figs. 3D and 3H, i.e.:

$$\tilde N_g = 1+\left[\frac{N_g-1}{|\nabla|}\right]. \qquad (5)$$

In the above formula, [·] denotes the integer part. Each endpoint index j of the GLCM is expressed as j = i + ∇|d|. This relationship indicates that the increment of the endpoint indices in the GLCM, Δj, equals that of the starting point indices, Δi, meaning that Δj = Δi = ∇, as illustrated by the horizontal and vertical double arrows in Fig. 3C for ∇=1 and in Fig. 3G for ∇=2. For example, for Fig. 3D with ∇=1 gray level per pixel, Eq. (5) determines how many non-zero GLCM entries Ñg result from sampling the Ng = 8 gray levels of the image: Ñg = 1 + [(8−1)/1] = 8. Similarly, for Fig. 3H with ∇=2 gray levels per pixel, Eq. (5) yields Ñg = 1 + [(8−1)/2] = 4.
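A quick numerical check of Eq. (5) (ours; img, Ng, and nabla are taken from the synthetic-image sketch in "Synthetic Gradient Images") counts the distinct co-occurring pairs directly:

```matlab
% Minimal sketch (ours): Eq. (5) vs a direct count of the distinct (i,j)
% pairs one pixel apart vertically, d = (0,1). Only the first image column
% is needed because intensities are constant along each row.
Ng_tilde = 1 + floor((Ng - 1) / nabla);            % Eq. (5), [.] = integer part
pairs = unique([img(1:end-1, 1), img(2:end, 1)], 'rows');
assert(size(pairs, 1) == Ng_tilde);                % e.g., Ng = 8, nabla = 2 -> 4
```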

Note that Fig. 3 displays the GLCMs for positive gradients ∇ > 0 and positive displacement vectors, such as d = (Δx, Δy) = (0, 1). Reversing the direction of the gradient would merely shift all non-zero entries in the two-dimensional representation shown in Figs. 3C and 3G below the primary diagonal, at a distance j − i = ∇·d < 0.

Marginal distribution of gray level differences pxy for linear gradients

The marginal probability distribution px−y(k), defined by Eq. (3) and visually represented in Fig. 2B, accounts for the sum of GLCM entries with a specified gray level difference k = j − i. As observed in Figs. 3D and 3H, the lines parallel to the primary diagonal of the GLCM convey information about image gradients and represent the lines of constant gray level differences px−y(k). For example, the GLCM primary diagonal entries have zero gray level differences, i.e., k = j − i = 0. Consequently, the sum of the primary diagonal elements, $p_{x-y}(0)=\sum_{i=0}^{N_g-1}p(i,i)$, is a zero-gradient line because the difference between the gray level value i of the start (reference) point (xi, yi) and the endpoint gray level intensity j at (xj, yj) along the displacement vector d = (Δx, Δy) is k = |i−j| = 0. Figure 3D illustrates that the GLCM of a gradient ∇=1 gray level per pixel along the vertical unit displacement vector d = (0,1) contains all entries (except one) aligned along a line parallel to the primary diagonal at gray level difference k = j − i = ∇|d| = 1. The sole exception is the GLCM entry at the discontinuity between the first period and the subsequent gradient repeats (see Figs. 3A and 3E). For example, the first period of the gradient in Figs. 3E and 3F ends with a gray level of i = 6 in an image with Ng = 8 gray levels and a gradient intensity ∇ = 2. Therefore, its pair must have an intensity j = i + ∇ = 8, which is mapped modulo Ng to j = 0. It corresponds to P(6,0) = 1 (remember that the wrapping around of the GLCM in gray level space is done modulo Ng because the gray level indices start at zero, while the spatial coordinates wrap around modulo Ng+1 because they begin at index 1). Since accounting for another period of the same gradient increases all nonzero entries of the GLCM by one unit, from this point forward one only calculates the GLCM for a single period of the gradient. To compute Haralick's features, one uses the symmetry of the GLCM induced by periodic linear gradients such as those shown in Fig. 3.

One can observe from Fig. 3 that the nonzero GLCM entries parallel to the primary diagonal for a given gray level difference k = j − i = ∇|d| begin at a distance of ∇|d| from the first GLCM entry p(0,0). The line of constant gray level differences k = j − i = ∇|d| (dotted line parallel to the primary diagonal of the GLCM in Figs. 3D and 3H) starts at p(0, ∇|d|) and ends at p(i = (m1−1)∇, j = i + ∇|d|), where m1 is the number of GLCM entries along the gray level difference line k = j − i = ∇|d|:

$$m_1=\tilde N_g-|d|. \qquad (6)$$

In the example depicted in Figs. 3A–3D, the GLCM for a unit vertical displacement d = (0,1) in an image exhibiting a linear gradient of ∇=1 gray level per pixel and Ng = 8 gray levels has a total of Ñg = 8 nonzero entries (from Eq. (5)), of which m1 = 7 (see Eq. (6)) lie along the line of constant gray level differences k = j − i = ∇|d| = 1. This line starts at p(0, ∇|d|) = p(0,1) and ends at p(i = (m1−1)∇, j = i + ∇|d|) = p(6,7). Similarly, for the example shown in Figs. 3E–3H with ∇=2 gray levels per pixel and Ng = 8, one gets a total number of GLCM entries Ñg = 4 (from Eq. (5)), of which m1 = 3 (see Eq. (6)) lie along the line of constant gray level differences k = j − i = ∇|d| = 2 that starts at p(0, ∇|d|) = p(0,2) and ends at p(i = (m1−1)∇, j = i + ∇|d|) = p(4,6).

The GLCM always has exactly Ñg nonzero entries according to Eq. (5), of which, according to Eq. (6), m1 lie on the constant gray level difference line k = j − i = ∇|d|. The remaining m2 nonzero GLCM entries have the endpoint coordinate j always beginning at zero due to the wrapping around modulo Ng in gray level intensity space:

$$m_2=\tilde N_g-m_1=|d|. \qquad (7)$$

Such GLCM entries are p(i = m1∇, j = 0), p(i = (m1+1)∇, j = ∇), and so on. One notices that all these m2 = |d| GLCM entries align along the line of constant gray level differences k = j − i = −∇m1, as shown in Figs. 3D and 3H. To summarize, the (unnormalized) marginal distribution of gray level differences px−y for linear gradients represents the frequency of the various combinations of pixel intensities that yield a specific difference value k = j − i:

$$p_{x-y}(k)=\begin{cases}1, & k=j-i=\nabla|d| & \text{with } i=\{0,\nabla,\dots,(m_1-1)\nabla\},\\[2pt] 1, & k=j-i=-\nabla m_1 & \text{with } i=\{m_1\nabla,(m_1+1)\nabla,\dots,(\tilde N_g-1)\nabla\},\\[2pt] 0, & \text{otherwise.}\end{cases} \qquad (8)$$

Marginal distribution of gray level sums px+y for linear gradients

The previous section demonstrated that linear gradients are naturally represented by non-zero entries parallel to the primary diagonal of the GLCM. Thus, the marginal distribution of gray level differences px−y arises naturally from the GLCM symmetry. Other Haralick features require calculating the marginal distribution px+y(s) for a given sum of gray level intensities s = i + j, where s = {0, 1, …, 2(Ng−1)}. One can utilize the GLCM symmetries caused by linear gradients and the corresponding marginal distribution px−y(k), where k = j − i = ∇|d|, to streamline the calculation of the other marginal distribution px+y. Indeed, from px−y(k), the m1 nonzero endpoint gray level intensities are j = i + k = i + ∇|d| with i = {0, ∇, …, (m1−1)∇}. Therefore, the corresponding elements of the marginal distribution px+y(s) are s = i + j = 2i + ∇|d| with i = {0, ∇, …, (m1−1)∇}. Similarly, the second line of constant gray level differences is k = j − i = −∇m1, where j = i − ∇m1 and i = {m1∇, (m1+1)∇, …}, which determines the marginal distribution px+y(s) with s = i + j = 2i − ∇m1. In summary, the (unnormalized) marginal distribution of gray level sums px+y for linear gradients indicates the frequency of the various combinations of pixel intensities that total a specific value s = i + j:

$$p_{x+y}(s)=\begin{cases}1, & s=2i+\nabla|d| & \text{with } i=\{0,\nabla,\dots,(m_1-1)\nabla\},\\[2pt] 1, & s=2i-\nabla m_1 & \text{with } i=\{m_1\nabla,(m_1+1)\nabla,\dots,(\tilde N_g-1)\nabla\},\\[2pt] 0, & \text{otherwise.}\end{cases} \qquad (9)$$
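A minimal MATLAB sketch (ours) that enumerates the nonzero support of Eqs. (8) and (9) for given Ng, nabla, and displacement magnitude d_mag:

```matlab
% Minimal sketch (ours): the nonzero support of Eqs. (8) and (9) for one
% period of a linear gradient nabla with displacement magnitude d_mag.
% Signed differences k = j - i are used, as in the derivation above.
Ng_tilde = 1 + floor((Ng - 1) / nabla);    % Eq. (5)
m1 = Ng_tilde - d_mag;                     % Eq. (6)
m2 = d_mag;                                % Eq. (7)
i1 = nabla * (0:m1 - 1);                   % reference levels on the main line
i2 = nabla * (m1:Ng_tilde - 1);            % reference levels on the wrapped line
k_vals = [repmat(nabla * d_mag, 1, m1), repmat(-nabla * m1, 1, m2)];  % Eq. (8)
s_vals = [2 * i1 + nabla * d_mag, 2 * i2 - nabla * m1];               % Eq. (9)
```

For the Fig. 3H example (Ng = 8, nabla = 2, d_mag = 1), this yields k_vals = [2 2 2 -6] and s_vals = [2 6 10 6], matching the entries read off the GLCM.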

Analytic scaling laws for Haralick features of linear gradients. Comparison with numerical results

The previous subsections include all the elements needed to estimate analytically any Haralick feature. In the following subsections, we derive analytical formulas for SA, SV, difference variance (DV), and entropy based on the GLCM symmetries derived above. Anticipating those results, the analytic scaling laws for Haralick features take the general form

$$f\sim N_g^{\alpha}\,|\mathbf{d}|^{\beta}\,\nabla^{\gamma},$$

where the scaling exponents α, β, and γ are derived from the GLCM symmetries, as we prove below.

To validate the theoretically predicted scaling laws for Haralick features, we performed numerical calculations using synthetic (computer-generated) gradient images. The predictions are represented by continuous lines in Figs. 4 and 5, while the corresponding numerical simulation results, based on the synthetic images described in "Synthetic Gradient Images", are shown as discrete points with different symbols, as indicated in the figure legends.

To reduce plot clutter in Figs. 4 and 5, we present results only for odd gradient intensities ∇ ∈ {1, 3, 5, 7} gray levels per pixel. In Fig. 4, the displacement vector magnitude was fixed at |d| = 1 while the number of gray levels varied as Ng ∈ {16, 32, 64, 128, 256}. Conversely, in Fig. 5, the bit depth was set to b = 8 bits (Ng = 256) while the vertical displacement vector magnitude varied as |d| = 1, …, 8. For each synthetic image with a given bit depth b and gradient intensity ∇, we computed the GLCMs for each vertical displacement vector d using Matlab's graycomatrix() function. For instance, the GLCM shown in Fig. 1C was obtained using graycomatrix(img, 'Offset', [0 1], 'NumLevels', 4, 'GrayLimits', [], 'Symmetric', false). Additionally, when calculating all Haralick features, we consistently set the 'Symmetric' flag in graycomatrix() to true. Subsequently, we computed the Haralick features from the GLCMs using Matlab's graycoprops() function.
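A compact sketch of this sweep (ours; the loop structure and variable names are assumptions, the image construction follows "Synthetic Gradient Images", and feature evaluation is left to the supplemental functions):

```matlab
% Minimal sketch (ours) of the parameter sweep described above: for each bit
% depth b, gradient nabla, and vertical displacement dmag, build the image,
% compute a symmetric GLCM with graycomatrix(), and normalize it.
for b = 4:8
    Ng = 2^b;
    for nabla = 1:8
        Ng_tilde = 1 + floor((Ng - 1) / nabla);
        y   = (0:1023).';
        img = repmat(nabla * mod(y, Ng_tilde), 1, 1024);
        for dmag = 1:8                       % vertical displacement magnitude
            P = graycomatrix(img, 'Offset', [dmag 0], 'NumLevels', Ng, ...
                'GrayLimits', [0 Ng - 1], 'Symmetric', true);
            p = P / sum(P(:));               % normalized GLCM, Eq. (2)
            % ... evaluate the Haralick features f6, f7, f9, f10 from p ...
        end
    end
end
```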

For a single period of a linear gradient (see Fig. 3), all Ñg nonzero entries of the GLCM given by Eq. (5) have equal weight and align along only two lines parallel to the primary diagonal, as in Figs. 3D and 3H.

Sum average f6

The SA indicates the uniformity of intensity values across the image texture. A higher SA value represents an even distribution of intensity sums between neighboring pixels. SA is defined as:

$$f_6=\sum_{k=0}^{2(N_g-1)}k\,p_{x+y}(k). \qquad (10)$$

A high SA implies that most pixel pairs have similar intensity sums, indicating a relatively uniform texture, while a low SA suggests more significant variation in intensity sums between neighboring pixels, signifying a more textured appearance. From Eq. (10), with px+y given by Eq. (9), one obtains:

$$f_6=\frac{1}{\tilde N_g}\Bigg(\underbrace{\nabla|d|+(\nabla|d|+2\nabla)+\dots+\big(\nabla|d|+2\nabla(m_1-1)\big)}_{k=j-i=\nabla|d|}+\underbrace{\nabla m_1+(\nabla m_1+2\nabla)+\dots+\big(\nabla m_1+2\nabla(\tilde N_g-m_1-1)\big)}_{k=j-i=-\nabla m_1}\Bigg)=\nabla(\tilde N_g-1)=\nabla\left[\frac{N_g-1}{\nabla}\right]. \qquad (11)$$

To simplify the calculation of f6 above, we separated the contributions of the GLCM entries on the line parallel to the primary diagonal at k = j − i = ∇|d| from those on the line k = j − i = −∇m1. Each of the two terms in Eq. (11) is an arithmetic series with the sum $\sum_{q=0}^{Q}(a+2\nabla q)=a(Q+1)+\nabla Q(Q+1)$. For k = j − i = ∇|d| in Eq. (11) one uses a = ∇|d| and Q = m1 − 1; for k = j − i = −∇m1 one substitutes a = ∇m1 and Q = Ñg − m1 − 1.

The first observation is that the theoretically predicted SA value given by Eq. (11) is independent of the gradient intensity ∇ (see the continuous lines in Fig. 4A) and of the displacement vector d (see the continuous lines in Fig. 5A), as summarized in Table 1. The numerically computed SA feature confirms that its values are independent of gray level intensity gradients and increase linearly with the number of gray levels Ng, as shown in Fig. 4A. The exact formula in Eq. (11), which involves the discontinuous integer part function [·], is challenging to work with; however, by dropping the integer part operation, one finds the continuous approximation f̃6 ≈ Ng − 1. This approximation demonstrates that f6 scales linearly with Ng (see the continuous lines in Fig. 4A), which is also confirmed numerically by the linear increase of the computed feature with the number Ng of gray levels in Fig. 4A. The second observation is that the numerical simulations shown in Fig. 5A confirm our theoretical prediction, based on Eq. (11), that the SA feature is independent of the displacement vector magnitude |d|. One notices a slight error in approximating f6 by f̃6. For example, an Ng = 256 gray level image with a gradient ∇ = 2 gray levels per pixel gives f6 = 2[(256−1)/2] = 254, which is slightly less than the simplified approximation f̃6 = 255; the error is under 0.4%. Even for gradients as large as ∇ = 10 gray levels per pixel, the error of approximating f6 by f̃6 ≈ Ng − 1 is below 1%. This slight disagreement between the theoretical SA value from Eq. (11) and the numerically computed values is emphasized in Fig. 5A. One can conclude that the gradient ∇ slightly decreases the SA value f6, but the correction is negligible for small gradients ∇ < 10 gray levels per pixel. This fact is marked by the attribute "independent" with an asterisk in Table 1.
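As a quick cross-check (ours), the exact value of Eq. (11) can be compared with the SA evaluated from the px+y marginal of a synthetic gradient's GLCM (pxpy computed as in the sketch from "Marginal distributions associated with the GLCM"):

```matlab
% Minimal sketch (ours): exact SA of Eq. (11) vs its numerical estimate.
% Assumes Ng and nabla as in the synthetic-image sketch and pxpy computed
% from the image's normalized GLCM as in the marginal-distribution sketch.
Ng_tilde  = 1 + floor((Ng - 1) / nabla);
f6_exact  = nabla * (Ng_tilde - 1);          % Eq. (11): nabla*[(Ng-1)/nabla]
f6_approx = Ng - 1;                          % continuous approximation f6_tilde
s         = (0:2 * (Ng - 1)).';              % gray level sums s = i + j
f6_num    = sum(s .* pxpy);                  % Eq. (10); matches f6_exact up to
                                             % finite-image boundary effects
```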

Table 1:
Summary of feature scaling laws f ∼ Ng^α |d|^β ∇^γ.

Feature             | Ng          | |d|         | ∇
Sum average         | Linear      | Independent | Independent*
Sum variance        | Quadratic   | Linear      | Linear
Difference variance | Linear      | Linear      | Linear
Entropy             | Logarithmic | Independent | Independent*
DOI: 10.7717/peerj-cs.2856/table-1

Note:

The asterisk next to the "independent" attribute means the respective feature very slightly decreases with ∇; this effect can be neglected for ∇ < 10 gray levels per pixel.

Sum variance

The sum variance feature is defined as follows:

$$f_7=\sum_{k=0}^{2(N_g-1)}(k-f_6)^2\,p_{x+y}(k), \qquad (12)$$

and can be analytically estimated for the GLCM of linear gradients using the same strategy employed above to derive the explicit analytical expression for SA in Eq. (11):

$$f_7=\frac{1}{\tilde N_g}\Bigg(\underbrace{(\nabla|d|-f_6)^2+(\nabla|d|+2\nabla-f_6)^2+\dots+\big(\nabla|d|+2\nabla(m_1-1)-f_6\big)^2}_{k=j-i=\nabla|d|}+\underbrace{(\nabla m_1-f_6)^2+(\nabla m_1+2\nabla-f_6)^2+\dots+\big(\nabla m_1+2\nabla(\tilde N_g-m_1-1)-f_6\big)^2}_{k=j-i=-\nabla m_1}\Bigg)=\nabla^2\left(\frac{\tilde N_g^2}{3}-\tilde N_g|d|+|d|^2-\frac{1}{3}\right). \qquad (13)$$

To accurately predict the scaling law of the SV feature from the exact formula given by Eq. (13), one can eliminate the integer part function from the definition of Ñg and utilize the approximate estimate:

$$\tilde f_7=\frac{(N_g-1)^2}{3}-(N_g-1)\,\nabla|d|+\nabla^2|d|^2-\frac{\nabla^2}{3}. \qquad (14)$$

The discrepancy between the true f7 (Eq. (13)) and the approximate estimate f̃7 is minor but can reach several percentage points. For example, the largest error occurs for Ng = 256, ∇ = 7, and |d| = 8, and is approximately 3.33%.

Based on Eq. (14), one notices that SV scales quadratically with Ng. Indeed, the second term in Eq. (14), which is linear in Ng, is always smaller than the first term, which is quadratic in Ng, if ∇|d| < Ng. This condition is fulfilled because the product ∇|d| (|d| pixels times ∇ gray levels per pixel) is the gray level variation across the displacement, which cannot be larger than Ng. Numerical simulations confirmed our analytical prediction of a quadratic scaling law for f7 with Ng, as shown in Fig. 4B. One also notices from Fig. 4B that, for a fixed displacement vector magnitude |d|, the numerical values of SV are independent of the gradient intensity ∇, as predicted analytically by Eq. (14).

For an image with a fixed number of gray levels Ng and a gradient of ∇ gray levels per pixel, the second term in Eq. (14) dominates SV's dependence on |d|. This is because (Ng−1)∇|d| > ∇²|d|², which reduces to Ng−1 > ∇|d|, shown above to hold for all images. Furthermore, the second term (Ng−1)∇|d| in Eq. (14) is also larger than the fourth term ∇²/3 because (Ng−1)|d| > ∇/3 even for the smallest possible displacement vector, |d| = 1. As a result, the linear term in |d| is the primary influence in the scaling law of f7, which aligns with our numerical simulations shown in Fig. 5B. As seen in Fig. 5B, for a fixed displacement vector magnitude |d|, SV changes linearly with the gradient intensity ∇, as predicted analytically (see also Table 1).
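The corresponding numerical comparison (ours; Ng, nabla, and d_mag assumed given) reads:

```matlab
% Minimal sketch (ours): exact SV of Eq. (13) and its approximation Eq. (14).
Ng_tilde  = 1 + floor((Ng - 1) / nabla);
f7_exact  = nabla^2 * (Ng_tilde^2 / 3 - Ng_tilde * d_mag + d_mag^2 - 1 / 3);
f7_approx = (Ng - 1)^2 / 3 - (Ng - 1) * nabla * d_mag ...
          + nabla^2 * d_mag^2 - nabla^2 / 3;            % Eq. (14)
```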

Difference variance

The definition of difference variance is:

$$f_{10}=\sum_{k=0}^{N_g-1}(k-\mathrm{DA})^2\,p_{x-y}(k), \qquad (15)$$

where the difference average is $\mathrm{DA}=\sum_{k=0}^{N_g-1}k\,p_{x-y}(k)$. The evaluation of DA is straightforward and follows from Eq. (8), since all GLCM entries have equal weight and the signed gray level differences k = j − i of the two lines cancel:

$$\mathrm{DA}=\frac{1}{\tilde N_g}\big(m_1\,\nabla|d|+m_2\,(-\nabla m_1)\big)=0.$$

As a result, the difference variance reduces to

$$f_{10}=\sum_{k=0}^{N_g-1}k^2\,p_{x-y}(k)=\frac{1}{\tilde N_g}\Big(m_1(\nabla|d|)^2+m_2(\nabla m_1)^2\Big)=\nabla^2|d|\,\big(\tilde N_g-|d|\big). \qquad (16)$$

To infer the asymptotic scaling law exponents from the exact formula for DV given by Eq. (16), one drops the integer part function from Ñg (so that Ñg ≈ 1 + (Ng−1)/∇) and uses the approximate formula f̃10 = ∇|d|(Ng−1) + (1−|d|)∇²|d| ≈ ∇|d|Ng, which suggests the scaling law

$$f_{10}\sim \nabla\,N_g\,|\mathbf{d}|. \qquad (17)$$

The theoretically predicted linear scaling with Ng is confirmed by the numerical simulations shown in Fig. 4C for a fixed |d| = 1 pixel, with slopes that increase linearly with the gradient intensity ∇.

The scaling of the experimental f10 with the displacement vector magnitude |d| exhibits a linear dependence with a slope proportional to the gradient ∇. Additionally, the plot of the theoretical prediction from Eq. (16) shows some deviation from linearity for large gradients. This is expected because f̃10 neglects the contribution of the term (1−|d|)∇²|d| compared to ∇|d|(Ng−1). However, the contribution of the neglected term increases quadratically with the gradient ∇ and could become significant for images with large gradients (see Table 1).
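The corresponding numerical comparison (ours; Ng, nabla, and d_mag assumed given) reads:

```matlab
% Minimal sketch (ours): exact DV of Eq. (16) and the asymptotic law Eq. (17).
Ng_tilde  = 1 + floor((Ng - 1) / nabla);
f10_exact = nabla^2 * d_mag * (Ng_tilde - d_mag);   % Eq. (16)
f10_asym  = nabla * Ng * d_mag;                     % Eq. (17): f10 ~ nabla*Ng*|d|
```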

Entropy

The Haralick features discussed thus far are derived from different moments of the marginal distribution of either the difference intensity (see Eq. (8)) or the sum intensity (see Eq. (9)). In contrast, entropy employs a logarithmic scale to compute features from the GLCM. The definition of the entropy feature is:

$$f_9=-\sum_{i=0}^{N_g-1}\sum_{j=0}^{N_g-1}p(i,j)\,\log\big(p(i,j)\big).$$

Entropy reaches its maximum value when the probability distribution is uniform (an entirely random texture) and its minimum value of 0 when all grayscale values in the image are the same. If the entropy f9 is defined using the base-2 logarithm log2(·), then f9 is measured in bits. While we only examined the entropy feature f9, by employing the marginal distribution of pixel intensity sums from Eq. (9) along with the detailed calculation examples for the SA and SV features, one can easily deduce the scaling law of the sum entropy (SE), defined by:

$$f_8=-\sum_{k=0}^{2(N_g-1)}p_{x+y}(k)\,\log\big(p_{x+y}(k)\big).$$

Similarly, by using the marginal distribution of pixel intensity differences from Eq. (8) and the detailed calculation examples provided for the DA feature, one can easily derive the scaling law of the difference entropy (DE), defined by:

$$f_{11}=-\sum_{k=0}^{N_g-1}p_{x-y}(k)\,\log\big(p_{x-y}(k)\big).$$

Calculating the entropy is straightforward because all Ñg nonzero GLCM entries carry equal weight p(i,j) = 1/Ñg, leading to:

$$f_9=-\sum_{i=0}^{N_g-1}\sum_{j=0}^{N_g-1}p(i,j)\,\log\big(p(i,j)\big)=\log\big(\tilde N_g\big). \qquad (20)$$

As seen from the numerical simulation results presented in Fig. 4D, the theoretical scaling law derived from Eq. (20) captures the general logarithmic trend of the entropy but slightly underestimates it (see Table 1). Numerical simulations illustrated in Fig. 5D confirm that the entropy feature f9 is independent of the magnitude of the displacement vector, as predicted by Eq. (20); the prediction again slightly underestimates the actual values. The discrepancy arises from the offset constant ε used in estimating the entropy from images, where log(p(i,j)+ε) is employed instead of log(p(i,j)) to prevent the entropy singularity for sparse GLCMs with many zero entries.
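A one-line illustration (ours) of the ε offset responsible for this shift, applied to a normalized GLCM p:

```matlab
% Minimal sketch (ours): entropy of a sparse normalized GLCM p. Without the
% offset, 0*log(0) evaluates to NaN in MATLAB, so implementations add a small
% epsilon (the exact value is implementation-dependent; 1e-10 is assumed here).
epsilon  = 1e-10;
f9_eps   = -sum(p(:) .* log(p(:) + epsilon));     % numerically computed entropy
f9_exact = log(1 + floor((Ng - 1) / nabla));      % Eq. (20): log(Ng_tilde)
```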

Discussion and conclusions

Haralick's features are widely used in data dimensionality reduction and ML algorithms for image processing in a wide range of practical applications, such as MRI (Brynolfsson et al., 2017) and CT scan image processing (Cao et al., 2022; Chen et al., 2021; Park et al., 2020; Shafiq-ul Hassan et al., 2017, 2018; Tharmaseelan et al., 2022), cancer detection (Faust et al., 2018; Cook et al., 2013; Permuth et al., 2016; Soufi, Arimura & Nagami, 2018), liver disease (Acharya et al., 2012, 2016; Raghesh Krishnan & Sudhakar, 2013), mammographic mass classification (Midya et al., 2017), colon lesions (Song et al., 2014), prosthetic devices for disabled people (Alshehri et al., 2024), violent crowd detection (Lloyd et al., 2017), image forensics (Kumar, Pandey & Mishra, 2024), malware detection (Ahmed, Hammad & Jamil, 2024; Karanja, Masupe & Jeffrey, 2020), human face detection (Jun, Choi & Kim, 2013), and computer network intrusion detection (Baldini, Hernandez Ramos & Amerini, 2021). However, their interpretation poses challenges since they are second-order statistics that depend in a complicated and nonlinear manner on image characteristics such as the number of gray levels Ng, the intensity ∇ of image gradients, and the selected displacement vector d = (Δx, Δy) between pixels in the image. This study focused on extracting meaningful analytic expressions and deriving asymptotic scaling laws for Haralick's features of synthetic images containing only linear gradients. We focused on linear gradients for several reasons: (a) the human visual system efficiently decomposes and analyzes natural scenes using orthogonal gradients (Jagadeesh & Gardner, 2022; Barten, 1999; Bracci & Op de Beeck, 2023; Cheng, Chen & Dilks, 2023; Henderson, Tarr & Wehbe, 2023); (b) efficient computer vision algorithms leverage gradient spectral priors to extract image features (Gong & Sbalzarini, 2016; Zheng et al., 2022); (c) in 2D natural scene images, orthogonal gradients are uncorrelated (Gong & Sbalzarini, 2016); and (d) the entries of the GLCM serve as natural measures of image gradients. For instance, pd(i,j) is the frequency of the gradient intensity (j−i)/|d| in a given image along the displacement vector d = (Δx, Δy). We demonstrated that the GLCM of any linear gradient has nonzero entries solely along two lines parallel to its primary diagonal, as shown in Fig. 3. We found that for any GLCM associated with an image gradient, the total number of nonzero entries is Ñg, given by Eq. (5). The two lines parallel to the primary diagonal in Fig. 3 represent the gray level differences: (1) k = j − i = ∇|d|, with m1 = Ñg − |d| entries (see Eq. (6)), and (2) k = j − i = −∇m1, with |d| entries (see Eq. (7)). Due to the GLCM symmetry for linear gradients, we derived explicit analytical expressions for the marginal probabilities px−y(k) and px+y(s) that are used to compute some of Haralick's features. To our knowledge, this is the only study that has derived explicit mathematical expressions for Haralick's features in terms of the number of gray levels Ng, the magnitude ∇ of the linear gradient present in the image, and the displacement vector d used for calculating the GLCM of the image.

We found that the analytic formula for SA f6 in Eq. (11) scales linearly with the number of gray levels Ng in the image and is independent of both the image gradient ∇ and the displacement vector d. The numerically estimated dependence of f6 on Ng shown in Fig. 4A confirms the theoretical predictions. Similarly, numerical simulations confirm that f6 is independent of the image gradient magnitude ∇ and the vertical displacement vector d, as shown in Fig. 5A.

The theoretical formula for SV in Eq. (13) gives the asymptotic scaling law f7 ∼ Ng²|d|∇. As predicted theoretically, SV increases quadratically with Ng, which was confirmed numerically (see Fig. 4B). The analytically predicted SV increases linearly with |d|, which was confirmed numerically in Fig. 5B; the slope of SV vs |d| increases proportionally to the gradient intensity ∇.

We also predicted analytically that the DV feature given by Eq. (16) has the scaling law f10 ∼ ∇Ng|d|. Our numerical simulations confirmed that DV increases linearly with Ng, with a slope that itself increases linearly with the image gradient ∇, as shown in Fig. 4C. For a fixed Ng = 256, the DV increases linearly with the magnitude of the displacement vector (|d|), with a slope proportional to ∇ (see Fig. 5C).

As predicted theoretically, the entropy scales logarithmically with Ng and is independent of |d|, i.e., f9 ∼ log(Ñg).

We provided a detailed derivation of exact analytic formulas and asymptotic scaling laws for the four Haralick features associated with vertical image gradients.

Since natural scenes can be decomposed into orthogonal and uncorrelated gradients (Gong & Sbalzarini, 2016), our derivations can be extended to a multidimensional gradient-based Haralick feature space. In our synthetic images, we introduced a single gradient along the vertical direction (∇y = ∇) while setting the horizontal gradient to zero (∇x = 0), as shown in Fig. 3. This design simplified the identification of the general GLCM symmetries induced by the gradient, as described in "Methods". However, our derived formulas remain valid because, even in natural scenes, orthogonal image gradients are uncorrelated.

To generalize our findings, the scalar gradient ∇ must be replaced with the gradient vector (∇x, ∇y) for 2D images. The analytical formulas we derived for Haralick's features can be used to estimate image gradients from measured features. Another application involves deriving consistent normalization factors for Haralick features. Comparing the values of Haralick features across datasets from different scanners with varying resolutions is challenging, and empirical normalization algorithms have achieved only limited success (Clausi, 2002; Lofstedt et al., 2019; Shafiq-ul Hassan et al., 2017, 2018). Thus, identifying suitable normalization factors that render Haralick features invariant to the number of gray levels or the quantization scheme is crucial in radiomics, among other fields.

We demonstrated that the SA feature in Eq. (10) should be normalized by Ng to ensure asymptotic independence from the quantization scheme. This normalization allows for the consistent comparison of the Haralick SA feature across images obtained at different resolutions and with various imaging devices. Similarly, we analytically proved that the SV feature in Eq. (12) should be normalized by Ng2 to achieve invariance to the image quantization scheme. Unlike empirical trial-and-error approaches, our normalization factors are rigorously derived based on the symmetries of the GLCM, ensuring mathematical consistency and robustness.

Supplemental Information

The main MATLAB file used for computing the Haralick features.

This generates the figures from the paper. All for loops run through the dataset values and produce the discrete points shown in the paper.

DOI: 10.7717/peerj-cs.2856/supp-1

The Haralick function f6 required by the main MATLAB file.

The file generates the values of the Haralick features that produce the figures from the paper. All for loops run through the dataset values and produce the discrete points shown in the paper.

DOI: 10.7717/peerj-cs.2856/supp-2

The Haralick function f10 required by the main MATLAB file.

The file generates the values of the Haralick features that create the figures from the paper. All for loops run through the dataset values and produce the discrete points shown in the paper.

DOI: 10.7717/peerj-cs.2856/supp-3

The Haralick function f9 required by the main MATLAB file.

The file generates the values of the Haralick features that create the figures from the paper. All for loops run through the dataset values and produce the discrete points shown in the paper.

DOI: 10.7717/peerj-cs.2856/supp-4

The Haralick function f7 required by the main MATLAB file.

The file generates the values of the Haralick features that create the figures from the paper. All for loops run through the dataset values and produce the discrete points shown in the paper.

DOI: 10.7717/peerj-cs.2856/supp-5