OpenAI Announces Benchmarks for AI Life Sciences Research. Its Best Model
Failed 63.9% of the Test
-
This week OpenAI announced a 750-task test to measure "whether AI
systems can support realistic life science research tasks, not just answer
biology que...
12 hours ago