nostrich on nostr

Avi Burra · 75w

I’ve spent the last few weeks immersed in reasoning and deep research models. I’ve seen enough. At least one (if not multiple) models will cross 50% on the Humanity’s Last Exam benchmark by th...

nostrich 1739069666

How do you feed these models the business and organizational context needed to solve most software problems?