Online Convex Optimization and Accelerated Gradient Descent Methods for Efficient Training less than 1 minute read Published: February 04, 2026 TBC
Non-convex optimization for Over-parameterized Neural Nets: Reproducing Kernel Hilbert Space and Neural Tangent Kernel 3 minute read Published: February 01, 2026 This blog is based on Real Analysis by Elias M. Stein and Rami Shakarchi, and Learning Theory on First Principles by Francis Bach.
Note on Submodular Function Optimization, Minimization and Maximization, Lazy Greedy less than 1 minute read Published: November 20, 2025 This blog is based on week 10 of PKU Algorithms for Big Data Analysis.
Efficient Methods for Generative Models 3: Sparse and Adaptive Attention, Dynamic Token Pooling 2 minute read Published: November 20, 2025 Introduction to Recurrent Neural Networks (RNNs)