Reading TODO

📚 Priority Reading Queue

1. Weight Noise Injection-Based MLPs With Group Lasso Penalty: Asymptotic Convergence and Application to Node Pruning

  • Authors: Wang J, Chang Q, Chang Q, Liu Y, Pal NR
  • Journal: IEEE Transactions on Cybernetics
  • Year: 2019
  • Volume/Issue: Vol. 49, No. 12, pp. 4346-4364
  • DOI: 10.1109/TCYB.2018.2864142
  • Key Focus:
    • Shows why standard L2 weight decay is inadequate for producing sparse solutions
    • Proposes a group lasso penalty as an alternative regularizer
    • Node-pruning applications for fault-tolerant MLPs
  • Status: ⏳ To Read
  • Notes: Key paper on why traditional weight decay fails to produce sparsity; see the penalty sketch after this entry
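
Quick reference before reading: a minimal numpy sketch (my own, not from the paper) contrasting plain L2 weight decay with a group lasso penalty that groups each hidden node's incoming weights. The shapes and per-node grouping are illustrative assumptions; the thing to verify against the paper is that the L2 term only shrinks weights, while the group term can drive a whole node's group to exactly zero, which is what enables node pruning.

```python
import numpy as np

# Hidden-layer weight matrix: one row per hidden node (its incoming weights).
# The 8x4 shape and the per-node grouping are illustrative assumptions.
rng = np.random.default_rng(0)
W = rng.normal(size=(8, 4))   # 8 hidden nodes, 4 inputs each
lam = 1e-3

# Classical L2 weight decay: element-wise, shrinks but never yields exact zeros.
l2_penalty = lam * np.sum(W ** 2)

# Group lasso: sum of Euclidean norms of each node's weight group.
# Non-differentiable wherever a group norm hits zero, which is exactly
# where structured sparsity appears.
group_lasso_penalty = lam * np.sum(np.linalg.norm(W, axis=1))

print(f"L2 penalty:          {l2_penalty:.6f}")
print(f"Group lasso penalty: {group_lasso_penalty:.6f}")
```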

2. A Novel Pruning Algorithm for Smoothing Feedforward Neural Networks Based on Group Lasso Method

  • Authors: Wang J, Xu C, Yang X, Zurada JM
  • Journal: IEEE Transactions on Neural Networks and Learning Systems
  • Year: 2018
  • Volume/Issue: Vol. 29, No. 5, pp. 2012-2024
  • DOI: 10.1109/TNNLS.2017.2748585
  • Key Focus:
    • Four new backpropagation variants built on the group lasso penalty
    • Smoothing functions to handle the non-differentiability of the group lasso term
    • Direct comparison with weight decay and weight elimination
  • Status: ⏳ To Read
  • Notes: Comprehensive comparison with traditional weight decay methods; see the smoothed-penalty sketch after this entry
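
The non-differentiability flagged above sits at the origin of each group norm. A common smoothing replaces sqrt(s) with sqrt(s + ε²); I expect the paper's smoothing functions to be in this spirit, but the exact form (and any ε schedule) should be checked against the text. A sketch:

```python
import numpy as np

def smoothed_group_lasso(W: np.ndarray, eps: float = 1e-4) -> float:
    """Differentiable surrogate for sum_g ||w_g||_2, grouping by rows of W.

    Replaces sqrt(s) with sqrt(s + eps**2) so the gradient exists even when
    an entire group has been driven to zero. The row grouping and eps value
    are illustrative choices, not taken from the paper.
    """
    return float(np.sum(np.sqrt(np.sum(W ** 2, axis=1) + eps ** 2)))

def smoothed_group_lasso_grad(W: np.ndarray, eps: float = 1e-4) -> np.ndarray:
    """Gradient of the smoothed penalty: w_g / sqrt(||w_g||^2 + eps^2) per row."""
    norms = np.sqrt(np.sum(W ** 2, axis=1, keepdims=True) + eps ** 2)
    return W / norms
```

As eps approaches zero the surrogate recovers the exact group lasso term; a larger eps gives smoother gradients at the cost of less exact sparsity.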

3. Group Sparse Regularization for Deep Neural Networks

  • Authors: Scardapane S, Comminiello D, Hussain A, Uncini A
  • Conference/Journal: arXiv preprint
  • Year: 2016
  • arXiv ID: 1607.00485
  • Key Focus:
    • Joint optimization of weights, neuron count, and feature selection
    • Group lasso penalty over groups of network connections
    • Extensive comparison with classical weight decay
  • Status: ⏳ To Read
  • Notes: Foundational paper on group-sparse regularization vs weight decay; see the training-loss sketch after this entry
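
How the penalty might plug into ordinary training, sketched in PyTorch. The grouping below (one group per input feature via the first layer's columns, one group per hidden neuron via the second layer's columns) follows my reading of the abstract and is an assumption; all sizes, names, and hyperparameters are illustrative, and the paper's exact group definitions and scaling factors should be checked.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Tiny MLP; all sizes are arbitrary illustration values.
net = nn.Sequential(nn.Linear(10, 32), nn.ReLU(), nn.Linear(32, 1))
lam = 1e-3

def group_sparse_penalty(model: nn.Sequential) -> torch.Tensor:
    """Group lasso over outgoing-weight groups (my approximation of the
    paper's grouping): zeroing an input group removes that feature,
    zeroing a hidden group removes that neuron."""
    w1, w2 = model[0].weight, model[2].weight   # shapes (32, 10) and (1, 32)
    input_groups = torch.norm(w1, dim=0)        # one norm per input feature
    hidden_groups = torch.norm(w2, dim=0)       # one norm per hidden neuron
    return input_groups.sum() + hidden_groups.sum()

# One illustrative training step on random data.
x, y = torch.randn(64, 10), torch.randn(64, 1)
opt = torch.optim.SGD(net.parameters(), lr=0.01)
loss = nn.functional.mse_loss(net(x), y) + lam * group_sparse_penalty(net)
loss.backward()
opt.step()
```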

📝 Reading Notes Template

For each paper, capture:

  • Main Contribution: How does it extend/replace weight decay?
  • Methodology: What specific regularization technique is proposed?
  • Experimental Setup: What baselines are compared?
  • Key Results: Performance vs traditional weight decay
  • Theoretical Insights: Why does the proposed method work better?
  • Implementation Details: Any code or algorithmic specifics
  • Future Directions: What questions does this raise?

🔍 Key Questions to Address

  1. Fundamental: Why does traditional L2 weight decay fail to produce structured sparsity?
  2. Methodological: How do group-based penalties differ from element-wise penalties? (See the proximal-operator sketch after this list.)
  3. Practical: What are the computational trade-offs between methods?
  4. Theoretical: What convergence guarantees exist for these approaches?
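
A concrete illustration for questions 1 and 2 (standard proximal-operator math, my own example rather than anything from the three papers above): a proximal step on an L2 decay term only rescales a weight group, whereas the group lasso proximal step (block soft-thresholding) sets the entire group to exactly zero once its norm falls below the threshold. That is the mechanism behind structured sparsity.

```python
import numpy as np

def prox_l2_decay(w_g: np.ndarray, step: float, lam: float) -> np.ndarray:
    """Proximal step for (lam/2) * ||w||^2: pure rescaling, never exactly zero."""
    return w_g / (1.0 + step * lam)

def prox_group_lasso(w_g: np.ndarray, step: float, lam: float) -> np.ndarray:
    """Block soft-thresholding: zeros the whole group if ||w_g|| <= step * lam."""
    norm = np.linalg.norm(w_g)
    if norm <= step * lam:
        return np.zeros_like(w_g)
    return (1.0 - step * lam / norm) * w_g

w_g = np.array([0.03, -0.02, 0.01])              # a small weight group (made-up values)
print(prox_l2_decay(w_g, step=0.1, lam=1.0))     # shrunk, every entry still nonzero
print(prox_group_lasso(w_g, step=0.1, lam=1.0))  # the whole group is exactly zero
```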

✅ Completion Checklist

  • Paper 1: Weight Noise Injection-Based MLPs
  • Paper 2: Novel Pruning Algorithm for Smoothing Feedforward Neural Networks
  • Paper 3: Group Sparse Regularization for DNNs
  • Synthesis: Write summary comparing all three approaches
  • Implementation: Try reproducing key results from one paper

Last Updated: Add date when you start reading
Priority: High - Core understanding of weight decay limitations in structured sparsity