原推:@downingARK @draecomino Yeah. Almost certainly this is what they did. Chinchilla scale to confirm/optimize model performance, then find efficient frontier on parameters given maximum available data, then stuff it with data like a turducken (allowing them to pull down inference cost and increase perf)
https://twitter.com/wintonARK/status/1631342552648675328