Scale AI and the Department of Defense’s Chief Digital and Artificial Intelligence Office have joined forces to establish a framework for testing and evaluating generative artificial intelligence.
In a statement Tuesday, Alexandr Wang, founder and CEO of Scale AI, said, “Testing and evaluating generative AI will help the DOD understand the strengths and limitations of the technology, so it can be deployed responsibly. Scale is honored to partner with the DOD on this framework.”
Under the partnership, Scale AI will develop tailored benchmark tests for DOD use cases, integrate them into its T&E platform and support CDAO’s strategy in leveraging large language models.
The framework will facilitate safe AI deployment by enabling continuous performance measurement, providing real-time feedback for warfighters and creating specialized evaluation sets for military support applications.
The Potomac Officers Club will host the 5th Annual Artificial Intelligence Summit on March 21 to discuss the transformative power of AI. Click here to register for the highly anticipated event.