SWE-Agent, Octopus v2, Many-shot Jailbreaking, dynamic inference with Mixture-of-Depths, Apple's ReALM, GPT-4's bar exam marks questioned, CodeEditorBench.
AI Research Roundup 24.04.05