We are officially kicking off the Eval Science Workstream, building a shared scientific foundation for evaluating AI systems that is rigorous, open, and grounded in real-world and cross-disciplinary best practices. Read our blog post.