Hi, I’m Wei.
By day, I’m a Senior Staff SWE focused on AI/ML for payment risk—tackling everything from fraud detection to credit modeling. I use this space to log my learning notes, grounded in the belief that articulation is the ultimate form of reinforcement learning. With the rise of generative AI, I’m spending less time on the syntax of writing and more time on the substance of the ideas.
Posts
-
Knowledge Distillation - Revisit
-
LLM Inference Optimization: 2026 Update - MTP Deep Dive
-
LLM Inference Optimization: 2026 Update - Overview
-
Large Transformer Model - Inference Optimization
subscribe via RSS