Arka Bagchi’s Post

Founding Engineer @ First Drafts AI

GPT-4o struggles to debug its own code: it often repeats the same incorrect solution without improving on it. For coding tasks, consider other LLMs such as Anthropic's Claude 3 Opus or Claude 3.5 Sonnet, which perform substantially better. Despite being trained for real-time voice conversations, GPT-4o seems less effective than older GPT models at long-form, multi-turn conversations. Function calling with GPT-4o is also worse than with GPT-4 Turbo.
