"Do they shoot you in San Francisco if you use capital letters?"
tokenbender
tokenbenderAug 9, 22:30
is it possible to pretrain a language model using pure reinforcement learning from scratch? random weights, no cross-entropy loss pretraining. you may have many questions in your head.
135.98K