TODO: GPT-1/2/3/4 演进、Decoder-Only 架构、In-Context Learning、Emergent Abilities