Relevant Articles
Let’s Build A Simple Interpreter. Part 17: Call Stack and Activation Records
The Dual LLM pattern for building AI assistants that can resist prompt injection
Web LLM runs the vicuna-7b Large Language Model entirely in your browser, and it's very impressive