Summary Brain-Inspired Efficient Pruning in Spiking Neural Networks arxiv.org
8,040 words - PDF document
One Line
A brain-inspired pruning method enhances spiking neural networks by extracting crucial information, resulting in improved performance, feature uniformity, and structure selection.
Key Points
- Researchers have developed a brain-inspired pruning method for spiking neural networks (SNNs) that efficiently extracts critical information while reducing computational and storage overhead.
- SNNs are attractive for deployment on devices with limited resources due to their event-driven computing characteristic.
- The proposed method uses a regeneration mechanism based on criticality to obtain critical pruned networks.
- The method outperforms state-of-the-art methods at the same time overhead, and achieves comparable or better performance with significant acceleration.
- The authors provide insights into the underlying mechanisms of their method, highlighting its effectiveness in selecting critical structures, improving feature uniformity, and reducing overfitting.
Summaries
19 word summary
A brain-inspired pruning method efficiently extracts critical information in spiking neural networks, improving performance, feature uniformity, and structure selection.
52 word summary
A brain-inspired pruning method for spiking neural networks has been developed to extract critical information efficiently. The method uses a regeneration mechanism based on criticality to obtain critical pruned networks. It achieves higher performance than state-of-the-art methods with the same time overhead, improves feature uniformity, reduces overfitting, and efficiently selects potential structures.
107 word summary
Researchers have developed a brain-inspired pruning method for spiking neural networks (SNNs) that efficiently extracts critical information while reducing computational and storage overhead. The method proposes a regeneration mechanism based on criticality to obtain critical pruned networks. It defines a low-cost metric for the criticality of pruning structures and re-ranks and regenerates the pruned structures with higher criticality. The method is evaluated on VGG-16 and ResNet-19 models for both unstructured and structured pruning, achieving higher performance compared to state-of-the-art methods with the same time overhead. It also achieves comparable or better performance with significant acceleration. The method improves feature uniformity, reduces overfitting, and efficiently selects potential structures.
637 word summary
Researchers have developed a brain-inspired pruning method for spiking neural networks (SNNs) that efficiently extracts critical information while reducing computational and storage overhead. SNNs are attractive for deployment on resource-limited devices because of their event-driven computation. However, pruning deep SNNs is challenging due to the binary, non-differentiable nature of spike signals, and existing methods incur high time overhead when making pruning decisions.
In this study, the researchers propose a regeneration mechanism based on criticality to obtain critical pruned networks. They first define a low-cost metric for the criticality of pruning structures, then re-rank the pruned structures and regenerate those with higher criticality. The method is evaluated with VGG-16 and ResNet-19 under both unstructured and structured pruning. The results show that the proposed method outperforms state-of-the-art methods at the same time overhead, and achieves comparable performance, even better on VGG-16, with significant acceleration.
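The pipeline described above (prune the weakest structures, then re-rank the pruned ones by criticality and regenerate the most critical fraction) can be sketched as follows. This is a minimal NumPy illustration, not the authors' implementation; the function name, the pruning and regeneration fractions, and the random stand-in criticality scores are all assumptions for demonstration.

```python
import numpy as np

rng = np.random.default_rng(0)

def prune_and_regenerate(w, criticality, prune_frac=0.3, regen_frac=0.1):
    """One pruning iteration with criticality-based regeneration:
    1. prune the smallest-magnitude weights,
    2. re-rank the pruned weights by their criticality score,
    3. regenerate (reactivate) the most critical fraction of them.
    Returns a boolean keep-mask with the same shape as w."""
    n = w.size
    n_prune = int(prune_frac * n)
    n_regen = int(regen_frac * n)

    # Step 1: indices of the weakest weights by magnitude.
    pruned_idx = np.argsort(np.abs(w).ravel())[:n_prune]
    mask = np.ones(n, dtype=bool)
    mask[pruned_idx] = False

    # Steps 2-3: among the pruned weights, reactivate those with
    # the highest criticality scores.
    order = np.argsort(criticality.ravel()[pruned_idx])[::-1]
    mask[pruned_idx[order[:n_regen]]] = True
    return mask.reshape(w.shape)

w = rng.normal(size=(8, 8))
crit = rng.random((8, 8))  # stand-in criticality scores (see metric below)
mask = prune_and_regenerate(w, crit)
```

In the paper the loop runs iteratively during training, with the criticality metric computed from the network's own dynamics rather than supplied externally as here.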
SNNs have gained attention as the third generation of neural networks due to their ability to emulate the behavior of biological neurons. They are particularly suitable for devices with limited computing resources and lower power consumption. However, the limited computing and storage capacity of these devices poses a challenge for implementing deep SNNs with large-scale parameters.
Network pruning has been widely explored as a solution to reduce the computing and storage overhead of SNNs. Some previous works have been inspired by the human brain and modeled synaptic plasticity and spine motility to optimize network structure and connections. Other works have used predefined thresholds to remove weak weights during the learning process. The lottery ticket hypothesis has also been explored in SNN pruning. However, the binary representation and non-differentiable property of spike signals make training deep SNNs challenging. Spike signals are easily corrupted by disturbances and suffer from spike vanishing or explosion, resulting in insufficient expression of feature information. Because spikes are non-differentiable, surrogate functions must be used to approximate gradients, which leads to gradient vanishing. These challenges become more prominent in pruned SNNs, since it is difficult to retain the most critical parameters of a network that is not yet fully trained. Current state-of-the-art methods typically require extended training or iteration times to obtain pruned networks, resulting in significant pruning costs.
To address these challenges, the researchers propose a regeneration mechanism based on criticality. Inspired by the critical brain hypothesis, which suggests that the brain operates at a critical state that is highly sensitive to inputs and facilitates information transmission, they define a metric for neuron criticality in SNNs. The criticality score depends on the distance between a neuron's membrane potential and its threshold voltage. They use the derivative of the surrogate function as the criticality metric, since it reflects how sensitive a neuron's firing decision is and can be obtained at minimal additional computational cost. After each pruning iteration, the criticality-based regeneration mechanism selects the pruned neurons with the highest criticality for reactivation and synapse regeneration.
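As a hedged illustration of such a metric: if spike generation compares the membrane potential against a threshold, the derivative of a sigmoid-shaped surrogate peaks exactly at the threshold, so neurons whose potential sits near the threshold score as most critical. The specific surrogate shape and its sharpness `alpha` below are illustrative choices, not taken from the paper.

```python
import numpy as np

def surrogate_grad(v, v_th=1.0, alpha=4.0):
    """Derivative of a sigmoid surrogate spike function.

    The surrogate s(v) = sigmoid(alpha * (v - v_th)) approximates the
    hard step used for spike generation; its derivative
    alpha * s * (1 - s) is largest when v is at the threshold v_th,
    making it a low-cost proxy for neuron criticality."""
    s = 1.0 / (1.0 + np.exp(-alpha * (v - v_th)))
    return alpha * s * (1.0 - s)

# Criticality is maximal for neurons whose membrane potential
# sits near the firing threshold, and falls off on both sides.
v = np.array([0.2, 0.9, 1.0, 1.8])
crit = surrogate_grad(v)
```

Because the surrogate derivative is already computed during backpropagation-through-time training of SNNs, reusing it as a criticality score adds essentially no extra cost, which matches the paper's "low-cost metric" claim.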
The proposed method is evaluated on VGG-16 and ResNet-19 models for both unstructured and structured pruning. The results show that the method outperforms state-of-the-art methods at the same time overhead and achieves comparable performance, even better on VGG-16, with significant acceleration. The authors investigate the underlying mechanisms of their method and find that it efficiently selects potential structures, improves feature uniformity, and reduces overfitting during the recovery phase.
In conclusion, the researchers have developed a brain-inspired pruning method for SNNs that efficiently extracts critical information while reducing computational and storage overhead. The method achieves higher performance compared to existing methods with the same time overhead and achieves comparable or better performance with significant acceleration. The authors also provide insights into the underlying mechanisms of their method, highlighting its effectiveness in selecting critical structures, improving feature uniformity, and reducing overfitting.