News | drihu.com

By jxmorris12, 2 days ago

URL: evjang.com

3 comments

By xg15, 10 hours ago

(2021), still very interesting. Especially the "post-overfitting" training strategy is unexpected.

By luckystarr, 6 hours ago

I remember vaguely that this was observed when training GPT-3 (probably?) as well. Just trained on and on, and the error went up and then down again. Like a phase transition in the model.

By esafak, 6 hours ago

The low sample efficiency of RL is well explained.

6 hours ago

[deleted]

Just Ask for Generalization (2021)