Avi Burra
· 65w
I’ve spent the last few weeks immersed in reasoning and deep research models. I’ve seen enough.
At least one (if not multiple) models will cross 50% on the Humanity’s Last Exam benchmark by th...
How do you feed these models the business and organizational context needed to solve most software problems?