Transformer🗒️Understanding transformer🗒️Understanding transformer—Statistics🗒️Understanding transformer—KV cacheLarge Model Lightweighting🗒️IntroductionEfficient LLM Inference🗒️Episode 1