I found mistakes in OpenAI's HealthBench using AI david-gilbertson.medium.com 1 points by Kuinox 6 hours ago