OpenAI Launches "EVMbench": Testing AI's Ability to "Ensure Smart Contract Security"

ETH-1.89%

As the security risks in cryptocurrency continue to rise, OpenAI has officially entered the blockchain security field. Led by CEO Sam Altman, OpenAI announced the launch of a new testing framework called “EVMbench,” aimed at evaluating whether artificial intelligence has the practical ability to “understand, detect, and even repair” vulnerabilities in cryptocurrency smart contracts.

OpenAI states that EVMbench will focus on the security issues of smart contracts on Ethereum and other EVM-compatible chains. The ultimate goal is to establish a quantifiable and comparable evaluation standard for AI systems in the blockchain security domain.

Smart contracts refer to self-executing code deployed on the blockchain, widely supporting decentralized exchanges (DEX), lending protocols, derivatives protocols, and other on-chain financial applications. However, once deployed, these contracts are usually immutable or difficult to modify or roll back. If there are logical vulnerabilities, it often results in real financial losses, with high costs for remediation. Over the past few years, the DeFi sector has repeatedly experienced hacker attacks and fund losses due to code flaws, exemplifying this structural risk.

OpenAI points out that the core goal of EVMbench is to verify whether AI systems are mature enough to assist in preventing smart contract vulnerabilities within actual economic risk environments.

This testing framework was developed jointly by OpenAI and cryptocurrency investment firm Paradigm. The test data is not simulated but sourced from past smart contract vulnerabilities discovered during professional security audits and security competitions.

EVMbench primarily evaluates AI performance across three key capabilities: vulnerability identification; ability to reproduce attack paths in controlled environments (simulating hacker perspectives); and fixing vulnerable code without disrupting the original contract functionality.

OpenAI states that the ultimate purpose of launching EVMbench is to establish a clear evaluation standard for AI systems in the blockchain security field. As DeFi protocols now lock in billions of dollars in user funds, the defense of smart contracts has become a fundamental and urgent market priority.

View Original
Disclaimer: The information on this page may come from third parties and does not represent the views or opinions of Gate. The content displayed on this page is for reference only and does not constitute any financial, investment, or legal advice. Gate does not guarantee the accuracy or completeness of the information and shall not be liable for any losses arising from the use of this information. Virtual asset investments carry high risks and are subject to significant price volatility. You may lose all of your invested principal. Please fully understand the relevant risks and make prudent decisions based on your own financial situation and risk tolerance. For details, please refer to Disclaimer.

Related Articles

Blockchain Apps Have Failed to Win Over the Masses, Ethereum Builders Admit

In brief ETH Denver founder John Paller says Web3 has been “epically bad” at building usable consumer products. Aztec Network Zac Williamson argues crypto must beat Web2 on experience, not ideology. Both say adoption will stall unless blockchain becomes invisible to users. Crypto built t

Decrypt1h ago

Data: If ETH breaks through $2,046, the total liquidation strength of long positions on mainstream CEXs will reach $845 million.

ChainCatcher reports that, according to Coinglass data, if ETH breaks above $2,046, the total liquidation strength of long positions on major CEXs will reach $845 million. Conversely, if ETH drops below $1,868, the total liquidation strength of short positions on major CEXs will reach $333 million.

GateNewsBot3h ago
Comment
0/400
No comments
Trade Crypto Anywhere Anytime
qrCode
Scan to download Gate App
Community
English
  • 简体中文
  • English
  • Tiếng Việt
  • 繁體中文
  • Español
  • Русский
  • Français (Afrique)
  • Português (Portugal)
  • Bahasa Indonesia
  • 日本語
  • بالعربية
  • Українська
  • Português (Brasil)