Andrej Karpathy
Slovak-Canadian computer scientist
Andrej Karpathy is a leading figure in the field of Artificial Intelligence with a strong academic background and professional experience in top tech companies.
He holds a PhD in Computer Science from Stanford University, an MSc in Computer Science from the University of British Columbia, and a BSc in Computer Science and Physics from the University of Toronto.
He went on to work at Tesla, first as Director of Artificial Intelligence and later as Senior Director of Artificial Intelligence.
Before his time at Tesla, Andrej was a Research Scientist at OpenAI and gained industry experience through internships at Google DeepMind and Google.
Throughout his career, Andrej Karpathy has made significant contributions to Artificial Intelligence, particularly in machine learning and deep learning.
Highlights
Remember the llm.c repro of the GPT-2 (124M) training run? It took 45 min on 8xH100. Since then, @kellerjordan0 (and by now many others) have iterated on that extensively in the new modded-nanogpt repo that achieves the same result, now in only 5 min! Love this repo 👏 600 LOC https://t.co/VTtpXbA5g8
Remember exercise pages from textbooks? Large-scale collection of these across all realms of knowledge now moves billions of dollars. Textbooks written primarily for LLMs, compressed to weights, emergent solutions served to humans, or (over time) directly enacted for automation. https://t.co/PjO97NeUdR