Course website for BT5153
Slides
Notebook
Large Language Models: A Survey
GPT1: Improving Language Understanding by Generative Pre-Training
GPT2: Language Models are Unsupervised Multitask Learners
GPT3: Language Models are Few-Shot Learners
InstructGPT: Training language models to follow instructions with human feedback
Llama2
Ilya Sutskever: “pretraining is done. we are now in the post training era”
How Scaling Laws Will Determine AI’s Future