OpenAI has had a big year, leading the generative AI race with ChatGPT. That success means all eyes are on the company to set the right precedent for future AI development, and OpenAI has taken a step forward with a new safety plan.
This week, OpenAI published the initial beta version of its Preparedness Framework, a safety plan delineating the different precautions the company has put in place to ensure the safety of its frontier AI models.
In the first element of the framework, the company commits to running consistent evaluations on its frontier models that push the models to their limits. OpenAI claims that these findings will help the company assess the risk of the models and measure the effectiveness of proposed mitigations.
The evaluations’ findings will then be shown in risk “scorecards” for OpenAI’s frontier models, which will be continually updated to track risk across four categories: cybersecurity, persuasion, model autonomy, and CBRN (chemical, biological, radiological, and nuclear) threats.
The risk thresholds will be classified into four risk safety levels: low, medium, high, and critical. That score will then determine how the company should proceed with the model.
Models that earn a post-mitigation score of “medium” or below can be deployed, while only models with a post-mitigation score of “high” or below can be developed further, according to the post.
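The gating rule described above can be sketched in a few lines of code. This is an illustrative reading of the published thresholds, not OpenAI's implementation; all function and variable names here are hypothetical.

```python
# Hypothetical sketch of the Preparedness Framework's gating rule.
# The four risk levels and the "medium"/"high" cutoffs come from the
# framework post; everything else is illustrative.

RISK_LEVELS = ["low", "medium", "high", "critical"]

def can_deploy(post_mitigation_score: str) -> bool:
    # Only models with a post-mitigation score of "medium" or below
    # can be deployed.
    return RISK_LEVELS.index(post_mitigation_score) <= RISK_LEVELS.index("medium")

def can_develop_further(post_mitigation_score: str) -> bool:
    # Only models with a post-mitigation score of "high" or below
    # can be developed further.
    return RISK_LEVELS.index(post_mitigation_score) <= RISK_LEVELS.index("high")

print(can_deploy("medium"))             # True
print(can_deploy("high"))               # False
print(can_develop_further("high"))      # True
print(can_develop_further("critical"))  # False
```

Note that a model scoring “high” falls into the gap the framework creates: it can keep being developed (presumably so mitigations can bring the score down), but it cannot ship.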
OpenAI is also restructuring how its internal teams make these safety decisions.
A dedicated Preparedness team will drive technical work to evaluate the frontier model’s capabilities, such as running evaluations and synthesizing reports. Then, a cross-functional Safety Advisory Group will review all the reports and send them to Leadership and the Board of Directors.
Lastly, leadership will remain the decision-maker; however, the Board of Directors will hold the right to reverse its decisions.
This addition is particularly noteworthy because it follows the turmoil that ensued early last month when Sam Altman was briefly ousted by the Board of Directors, only to be promptly reinstated as CEO with a new board.
Other framework elements include developing a protocol for added safety and outside accountability, collaborating with external parties and internal teams to track real-world misuse, and pioneering new research in measuring how risk evolves as models scale, according to the release.