lukewoodcock
26 May 2025

A Quest to Tame Large Language Models

Tags: LLM, natural language processing, transformers, machine learning, AI language models, probabilistic text generation, statistical language models, n-gram, bigram, attention mechanisms, vector embeddings, tokenization, context windows, self-attention, neural networks, NLP fundamentals, AI architecture, language understanding, transformer architecture