I think the goal was to make a model that could keep learning and improving. It's not panning out that way, so with energy constraints we're learning that small models make more sense for now. I do think there might still be a place for large models to discover connections we might not have noticed.