Posts
Tags
Categories
About
English
English
简体中文
Light
Dark
Auto
Cancel
Posts
Tags
Categories
About
Light
Dark
Auto
English
English
简体中文
All Posts
Recently Updated
Why Applicative Functor
06-02
2025
Async + Token Bucket: How to Batch LLM API Calls Efficiently
06-18
Programming with Categories: Functor
06-01
Transformer architecture variation: Rotary Position Embedding (RoPE)
05-24
Transformer architecture variation: RMSNorm
05-11
Kosaraju's Algorithm Explained
04-26
One for all: the torch.einsum API
04-14
Two different APIs related to Process Pool in Python
03-30
Class Hierarchy Analysis: a quick way to generate call graph
03-19
Prefix Sum Array: the secret to fast range sum query and more
03-15
Weight Tying in Language Models: A Technique to Parameter efficiency
03-11
What is Multi-Head Attention (MHA)
03-04
An Explanation of Self-Attention mechanism in Transformer
03-02
From Basic Block to Control Flow Graph
02-20
What is Three-Address Code (3AC/TAC)
02-18
The Flow of GraphRAG
02-12
Reading Notes: Outrageously Large Neural Networks-The Sparsely-Gated Mixture-of-Experts Layer
02-02
What is the Python decorator really?
01-20
2024
Reading Notes: Generalization through Memorization: Nearest Neighbor Language Models
12-23
How KNN Algorithm Works
12-15
What is Phantom type in OCaml
12-08
1
2
3
…
5